2 issues with the current way post_rollout is set up:
it does not have a try/except, so any exception can crash a run unless the env designer handles it very carefully. This should probably be handled more holistically. Example:
  File "/home/ubuntu/research-environments/environments/code_env/code_env/code_env.py", line 229, in post_rollout
    completion = state["trajectory"][-1]["completion"]
                 ~~~~~~~~~~~~~~~~~~~^^^^
IndexError: list index out of range
it does not have the completion field set in state yet, because post_rollout gets called before that field is rendered. This used to be different, and it is a bit unexpected given that e.g. the rubric can always access the completion field
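A rough sketch of what more holistic handling might look like, assuming post_rollout is an async hook that receives a dict-like state. The wrapper name `safe_post_rollout` and the keys `post_rollout_error` and `last_completion` are hypothetical and only for illustration; they are not part of the existing API.

```python
import asyncio
import logging
from typing import Any, Awaitable, Callable

logger = logging.getLogger(__name__)

State = dict[str, Any]


async def safe_post_rollout(
    post_rollout: Callable[[State], Awaitable[None]], state: State
) -> None:
    """Run a post_rollout hook without letting its exceptions end the run."""
    try:
        await post_rollout(state)
    except Exception as exc:  # deliberately broad: a hook failure should not be fatal
        logger.exception("post_rollout failed")
        # Hypothetical key: record the failure so later stages can inspect it.
        state["post_rollout_error"] = repr(exc)


async def post_rollout(state: State) -> None:
    """Env-side hook that tolerates an empty trajectory."""
    trajectory = state.get("trajectory") or []
    if not trajectory:
        # Nothing was generated, so there is no completion to read.
        state["last_completion"] = None
        return
    state["last_completion"] = trajectory[-1]["completion"]
```

With something like this on the framework side, an env hook that indexes into an empty trajectory would be logged and recorded in state instead of taking down the whole run. The guard inside the hook is still worth having, since the wrapper only contains the failure and does not fix it.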