1 parent d1ab3df commit 5faf27d
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_152.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_152.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_152_rolloutn4.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_152_rolloutn4.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_ablation.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_ablation.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_ablation_no_vllm.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_ablation_no_vllm.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_kl.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_kl.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_kl_pm.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_kl_pm.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset152_2node_0913.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset152_2node_0913.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset90.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset90.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset90_TRAJ_FILTER.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_maxstep30_trainset90_TRAJ_FILTER.sh
examples/osworld/async/run_trainer_debug_w_rollout_stepwise_multinode.sh renamed to examples/osworld/async/backup/run_trainer_debug_w_rollout_stepwise_multinode.sh