Analysis of 24710562_24710205_24710493_24710549_24737276_24772634_25167875_25045009_25045205_25045219 xm/24710562_24710205_24710493_24710549_24737276_24772634_25167875_25045009_25045205_25045219

LAST SCORE



Door expert Door human Hammer expert Hammer human Pen expert Ant HalfCheetah Hopper Humanoid Walker2d
90% 0.721794 0.250011 1.08318 0.455461 0.74138 0.897576 1.0688 1.18446 0.505675 0.987299
95% 0.912446 0.826693 1.25923 1.14831 0.889886 0.98613 1.10639 1.19625 0.874158 1.00824
99% 1.04172 2.28742 1.37171 3.04371 1.11007 1.06566 1.17321 1.22692 1.01226 1.04206
Max 1.16335 3.72805 1.45151 5.5451 1.43742 1.17977 1.36555 1.33538 1.05911 1.21292
best wu 4359/19479/13699/6549/3499 24710/13220/13860/2460/10790 1457/15867/14827/6697/21337 9398/9278/2218/3588/8948 8996/6236/1126/18486/8236 24614/6234/19784/18914/5594 1101/1681/7851/10871/6011 3582/1482/15462/14522/1642 5375/13745/4195/4555/3565 6503/10783/1423/413/20103

MEAN SCORE



Door expert Door human Hammer expert Hammer human Pen expert Ant HalfCheetah Hopper Humanoid Walker2d
90% 0.419106 0.297493 0.590241 0.41657 0.555365 0.611435 0.81995 0.926778 0.291467 0.697951
95% 0.566474 0.556538 0.765931 0.701092 0.662994 0.722274 0.872992 0.984262 0.534006 0.758368
99% 0.73897 1.04071 0.959907 1.22675 0.838638 0.850376 0.935044 1.05537 0.788307 0.83516
Max 0.923225 2.0774 1.18123 3.41974 1.09319 0.962736 1.04752 1.10264 0.922889 0.921003
best wu 7699/22339/21919/11699/20349 15960/9410/1780/14540/23350 7087/9237/2497/9297/5537 1128/3498/9398/918/23348 1126/6696/686/18486/2716 7254/2724/1454/2274/5084 2361/3981/13371/3801/2061 6122/21272/9202/7322/3972 5085/5595/5355/4385/2845 14203/6343/9873/1353/1713

Analysis of direct_rl_algorithm



Analysis of sac_learning_rate



Analysis of sac_target_entropy_per_dimension



Analysis of tau



Analysis of max_replay_size



Analysis of num_policy_layers



Analysis of policy_layer_size



Analysis of num_critic_layers



Analysis of critic_layer_size



Analysis of activation



Analysis of discount



Analysis of pretrain_with_bc



Analysis of explicit_absorbing_state



Analysis of _gin.GAILBuilder.max_replay_size



Analysis of _gin.make_discriminator.discriminator_module



Analysis of _gin.discriminator_input__macro.value



Analysis of _gin.discriminator__MLP.num_layers



Analysis of _gin.discriminator__MLP.num_units



Analysis of _gin.discriminator__MLP.activation



Analysis of _gin.discriminator__MLP.last_layer_kernel_init_scale



Analysis of _gin.regularizer__macro.value



Analysis of _gin.discriminator__optax.adamw.weight_decay



Analysis of obs_normalization



Analysis of eval_behavior_policy_type



Analysis of _gin.discriminator__optax.adamw.learning_rate



Analysis of _gin.reward_function__macro.value



Analysis of td3_policy_learning_rate



Analysis of td3_critic_learning_rate



Analysis of td3_gradient_clipping



Analysis of sigma



Analysis of _gin.add_gradient_penalty.gradient_penalty_coefficient



Analysis of _gin.add_gradient_penalty.gradient_penalty_target



Analysis of _gin.gail_loss.mixup_alpha



Analysis of _gin.discriminator__MLP.input_dropout_rate



Analysis of _gin.discriminator__MLP.hidden_dropout_rate



Analysis of vmax



Analysis of d4pg_learning_rate



Analysis of n_step



Analysis of num_atoms



Analysis of _gin.gail_loss.entropy_coefficient



Analysis of _gin.pugail_loss.positive_class_prior



Analysis of _gin.pugail_loss.pugail_beta



Analysis of new_regularizer