Batch: don't just set 0 when elements have None entries #1088

MischaPanch · 2024-04-03T11:13:20Z

This is essentially a bug due to the current implementation of __setitem__:

Setting 0 when something is off is wrong. This can manifest itself in things like info of reset, where certain values might be unknown (like actions or env_num). Then None is erroneously replaced by 0. There are multiple tests covering this erroneous behavior, e.g.,
line 112 in

One should instead at least turn this into NaN entries.
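To illustrate why a 0 fill is lossy, here is a minimal sketch in plain NumPy. It is not Tianshou's `__setitem__` implementation; `stack_with_fill` is a hypothetical helper introduced only for this example, and the `env_nums` values are made up.

```python
import numpy as np


def stack_with_fill(values, fill):
    """Hypothetical helper: replace None by `fill` and build a float array."""
    return np.array([fill if v is None else v for v in values], dtype=np.float64)


# Assumed example data: the first entry is unknown (e.g. right after reset),
# the second entry is a genuine 0.
env_nums = [None, 0, 3]

zero_filled = stack_with_fill(env_nums, 0)
nan_filled = stack_with_fill(env_nums, np.nan)

# With a 0 fill, the unknown entry and the real 0 can no longer be told apart.
indistinguishable = bool(zero_filled[0] == zero_filled[1])

# With a NaN fill, the unknown entry can still be identified afterwards.
unknown_mask = np.isnan(nan_filled)
```

With the NaN fill, `unknown_mask` marks only the first entry, whereas the 0 fill leaves no way to recover which entries were originally None.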

maxhuettenrauch · 2024-04-10T09:54:29Z

As far as I can tell, we reach this point in two cases:

1. Adding an item to the buffer where the new item has a structure inconsistent with the already existing structure, and
2. updating an existing item with an item that has a structure inconsistent with the already existing structure.

I think the main question here is whether these operations should be allowed in the first place. In the RL context this probably only happens when the info dict changes its structure between steps (as happens in MoveToRightEnv in the tests). One could argue that the env is ill-defined in this case and that, instead of setting arbitrary default values, the env should be fixed.
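A check for this kind of structural inconsistency could look roughly like the following sketch. It works on plain dicts rather than on Batch objects; `check_info_keys` and the example info dicts are hypothetical and not part of Tianshou or Gymnasium.

```python
import warnings


def check_info_keys(reference: dict, new: dict) -> set:
    """Hypothetical helper: warn about keys of `reference` that `new` lacks."""
    missing = set(reference) - set(new)
    if missing:
        warnings.warn(f"info is missing keys: {sorted(missing)}", stacklevel=2)
    return missing


# Assumed example: the info dict loses a key between two steps.
info_step_1 = {"key": 1, "env_num": 0}
info_step_2 = {"key": 2}

missing = check_info_keys(info_step_1, info_step_2)
```

Whether such a mismatch should produce a warning, an error, or an explicit user-chosen default is exactly the design question raised here.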

maxhuettenrauch · 2024-04-10T13:56:13Z

Regarding your suggestion to set values to NaN instead, assigning np.nan will fail on arrays whose dtype is not float (as is the case in the above-mentioned test).
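The failure can be reproduced with plain NumPy on an integer array. The two alternatives shown afterwards (casting to float, or using a masked array) are only possible workarounds sketched for illustration, not something proposed in this thread.

```python
import numpy as np

ints = np.zeros(3, dtype=np.int64)

# Assigning NaN into an integer array raises, because NaN has no integer
# representation.
try:
    ints[0] = np.nan
    nan_assignment_failed = False
except ValueError:
    nan_assignment_failed = True

# Possible workaround 1: cast to float first (this changes the dtype).
floats = ints.astype(np.float64)
floats[0] = np.nan

# Possible workaround 2: keep the integer dtype and mark the entry as missing.
masked = np.ma.masked_array(ints, mask=[True, False, False])
```

The cast loses the original dtype, while the masked array keeps it at the cost of carrying a separate mask.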

MischaPanch · 2024-04-12T10:18:58Z

@maxhuettenrauch unfortunately, in the RL context this is bound to happen because some things might not be known at reset that are known at step, and thus will contain None entries. The prime example is the action, which is not available at reset, but some entries from the info might also be missing.

MischaPanch · 2024-04-12T10:19:22Z

This issue is strongly related to #1087

maxhuettenrauch · 2024-04-12T10:28:55Z

But when would you add obs directly after reset to the buffer (where a Batch object is created) and only after that retrieve an action, call step, and append the rest to this entry?

MischaPanch · 2024-04-12T11:19:42Z

I thought it might happen in the collectors, but maybe I'm mistaken.

For sure I've seen this happen with info objects somewhere in collector tests - though there it is a bad implementation of the env.

Unfortunately, Gymnasium doesn't enforce any interface on the info dicts, which are the main drivers of this problem. We can stop supporting such cases, or ask the user to specify explicitly what should happen then.

I am all for restricting the number of supported operations to decrease complexity and probability of errors :)

maxhuettenrauch · 2024-04-12T12:33:38Z

Yes, definitely happening in the collector tests, due to said bad env design. I'm gonna check some examples with standard mujoco envs.

maxhuettenrauch · 2024-04-15T09:50:53Z

Related: Farama-Foundation/Gymnasium#540

MischaPanch added the labels "bug" (Something isn't working) and "Batch and Buffer" (Improvements in internal data structures, temporary label) Apr 3, 2024

MischaPanch added this to To do in Overall Tianshou Status via automation Apr 3, 2024

maxhuettenrauch mentioned this issue Apr 11, 2024

Warn on batch.add when missing keys #1106

Closed

8 tasks

MischaPanch assigned maxhuettenrauch Apr 16, 2024

MischaPanch mentioned this issue Apr 16, 2024

Batch: don't just strip off empty entries when creating batches #1089

Open
