ppo-LunarLander-v2 / results.json
harpomaxx's picture
first model using just 100 episodes
cfdabee
raw
history blame
159 Bytes
{"mean_reward": -266.3231953, "std_reward": 117.05282561986462, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-06-26T22:40:09.115820"}