
WIP HITL multiprocessing speedup #1653

Draft · wants to merge 1 commit into base: eundersander/SIRo_hitl_Sep5_demo_refactored
Conversation

@eundersander (Contributor) commented Oct 25, 2023

Motivation and Context

MultiprocessDriverWrapper is a wrapper intended to be used with SandboxDriver (or any GuiAppDriver). It creates a separate "sim process" to run the driver, with the goal of overlapping (1) sim-updating and (2) rendering in ReplayGuiAppRenderer (used by GuiApplication). The ultimate goal is a runtime speedup.
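
Roughly, the pipelining would look something like the sketch below (a minimal sketch with hypothetical names like MultiprocessDriverWrapperSketch and a stub driver, not the actual classes in this PR): the main process ships GuiInput to the sim process over a queue and reads back the previous frame's post_sim_update_dict, so rendering frame N overlaps the sim step for frame N+1.

import multiprocessing as mp

def make_stub_driver():
    # Stub standing in for SandboxDriver; the real driver does the sim step.
    class _StubDriver:
        def sim_update(self, gui_input):
            return {"frame_for": gui_input}  # stands in for post_sim_update_dict
    return _StubDriver()

def _sim_process_main(gui_input_queue, post_update_queue):
    driver = make_stub_driver()
    while True:
        gui_input = gui_input_queue.get(block=True)
        if gui_input is None:  # shutdown sentinel
            break
        post_update_queue.put(driver.sim_update(gui_input))

class MultiprocessDriverWrapperSketch:
    def __init__(self):
        self._gui_input_queue = mp.Queue()
        self._post_update_queue = mp.Queue()
        self._pending = False  # is a sim step already in flight?
        self._proc = mp.Process(
            target=_sim_process_main,
            args=(self._gui_input_queue, self._post_update_queue),
            daemon=True,
        )
        self._proc.start()

    def sim_update(self, gui_input):
        # Send this frame's input, then return the previous frame's result
        # (one frame of latency). While the caller renders that result, the
        # sim process is already computing the next one.
        self._gui_input_queue.put(gui_input)
        result = self._post_update_queue.get(block=True) if self._pending else None
        self._pending = True
        return result

    def close(self):
        self._gui_input_queue.put(None)  # ask the sim process to exit
        self._proc.join()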

Using the Oct 18 snapshot, in scene 2, on a 2021 MacBook, I see the minimum FPS improve from 26 to 31. Note the minimum FPS usually occurs when Spot is busy, particularly when it's backing up.

This is a work in progress. Various todos are listed in the code. I'll try to summarize here:

  • Text-drawing and line-rendering are unsolved. In the old single-threaded code, we directly share these objects between the driver and GuiApplication. In this multiprocessing paradigm, we need to use an MP data structure like Queue to transmit the text-drawing and line-rendering requests from the sim process to the main process (see the sketch after this list).
  • GuiInput is partially broken; in particular, zooming with mouse-scroll doesn't work. I suspect I'm calling GuiInput.on_frame_end at the wrong time on the main process, causing some input events to be lost instead of sent to the sim process.
  • Better cleanup for the sim process on shutdown.
  • Decouple MultiprocessDriverWrapper and SandboxDriver: there's a comment in code about this.
  • More robust testing. We should make sure we haven't overlooked a way for sim_update to get called without a GuiInput having been provided via the queue; this would cause the app to hang, as the main process would be waiting for a post_sim_update_dict and the sim process would be waiting for a GuiInput.
  • Clean up Magnum pickling. See this Slack thread.
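
As a sketch of the first item above (hypothetical request types and placeholder draw callbacks, not the actual text-drawer/line-render API): the sim process would enqueue lightweight draw requests, and the main process would drain the queue each frame and replay them onto the real drawers.

import multiprocessing as mp
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class TextRequest:
    text: str
    pos: Tuple[float, float]

@dataclass
class LineRequest:
    points: List[Tuple[float, float, float]]
    color: Tuple[float, float, float, float]

# Sim-process side: enqueue requests instead of drawing directly.
def request_text(draw_queue, text, pos):
    draw_queue.put(TextRequest(text, pos))

# Main-process side: drain the queue once per frame and replay the requests
# onto the real text drawer / line renderer (draw_text and draw_lines are
# placeholder callables here).
def drain_draw_requests(draw_queue, draw_text, draw_lines):
    while not draw_queue.empty():
        req = draw_queue.get()
        if isinstance(req, TextRequest):
            draw_text(req.text, req.pos)
        elif isinstance(req, LineRequest):
            draw_lines(req.points, req.color)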

How Has This Been Tested

Local testing on a 2021 MacBook. Note I haven't tested with --remote-gui-mode, but I don't anticipate problems.

Types of changes

PR into a non-main branch.

Checklist

  • My code follows the code style of this project.
  • I have updated the documentation if required.
  • I have read the CONTRIBUTING document.
  • I have completed my CLA (see CONTRIBUTING)
  • I have added tests to cover my changes if required.

@eundersander added the "do not merge" label (Not ready to merge. This label should block merging.) on Oct 25, 2023
@facebook-github-bot added the "CLA Signed" label (Do not delete this pull request or issue due to inactivity.) on Oct 25, 2023
@eundersander (Contributor, Author) left a comment:

Some comments for reviewers

# fix for unpicklable types; run once at startup
copyreg.pickle(mn.Vector3, pickle_vector3)
copyreg.pickle(mn.Vector2i, pickle_vector2i)
@eundersander (Contributor, Author) commented on the code above:

Note this approach produces a very bloated serialization (every serialized Vector2i includes the unpickle_vector2i string among other junk!). @mosra is working on proper pickle support for Magnum types.
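
For context, the copyreg approach looks roughly like this (the helper bodies below are assumed, not copied from the PR). The pickled payload carries a by-name reference to the unpickle function alongside the two ints, which is where the bloat comes from.

import copyreg
import pickle
import magnum as mn

def unpickle_vector2i(x, y):
    return mn.Vector2i(x, y)

def pickle_vector2i(v):
    # copyreg expects (callable, args); pickle then stores a by-name reference
    # to the callable plus the args tuple for every instance
    return unpickle_vector2i, (v.x, v.y)

copyreg.pickle(mn.Vector2i, pickle_vector2i)

payload = pickle.dumps(mn.Vector2i(3, 4))
print(len(payload))  # dozens of bytes, mostly the function reference, not the two ints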

A contributor replied:

Not that the serialization introduced in mosra/magnum-bindings@f561337 would produce anything significantly better -- it still has to encode the actual type as a string for each such instance. But as long as the serialized data is an arbitrary object tree / a JSON-like structure instead of a small set of typed (multi-dimensional) arrays with many elements, there's no way around that.

# using OpenGL on 2+ threads in the same process. We probably just can't
# support using MultiprocessDriverWrapper with dummy multiprocessing (which
# is fine; dummy multiprocessing is more for debugging).
assert not multiprocessing_config.use_dummy
@eundersander (Contributor, Author) commented on the code above:

I guess what I'm saying is that this code should use multiprocessing.Process directly instead of multiprocessing_config. For context, I originally created multiprocessing_config to make it easy for our multithreaded networking/server code to switch between real and dummy multiprocessing.
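
For reference, the real-vs-dummy distinction in plain multiprocessing terms (a generic illustration, not the multiprocessing_config code): multiprocessing.dummy exposes the same Process API but backs it with threads in the current process, which is exactly what the assert above rules out, since the sim "process" would then share the main process (and its OpenGL context) with the renderer.

import multiprocessing
import multiprocessing.dummy

def work():
    print("hello from the worker")

if __name__ == "__main__":
    real = multiprocessing.Process(target=work)        # separate OS process
    fake = multiprocessing.dummy.Process(target=work)  # just a thread in this process
    real.start(); real.join()
    fake.start(); fake.join()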

block=True
)

sim_gui_input.shallow_copy_from(latest_gui_input)
@eundersander (Contributor, Author) commented on the code above:

This is a goofy paradigm for "syncing" an object across the process boundary (basically, sending the entire object across, then doing this shallow copy). Maybe we do something similar for TextDrawer and DebugLineRenderer. Also maybe there's a more pythonic way to do this.
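
To spell out the pattern (the field names below are made up, not the real GuiInput internals): the whole object is pickled across the queue each frame, and the receiver copies its fields into a long-lived local instance so that existing references to that instance stay valid.

import multiprocessing as mp

class GuiInputSketch:
    def __init__(self):
        self.keys_down = set()
        self.mouse_scroll_offset = 0.0

    def shallow_copy_from(self, other):
        # Copy field references from the freshly received object into this
        # long-lived one instead of rebinding every reference to a new object.
        self.keys_down = other.keys_down
        self.mouse_scroll_offset = other.mouse_scroll_offset

if __name__ == "__main__":
    queue = mp.Queue()
    sim_gui_input = GuiInputSketch()

    # The main process would put the latest input on the queue each frame...
    latest = GuiInputSketch()
    latest.mouse_scroll_offset = 1.5
    queue.put(latest)

    # ...and the sim process would block on the queue, then sync its local copy.
    sim_gui_input.shallow_copy_from(queue.get(block=True))
    print(sim_gui_input.mouse_scroll_offset)  # 1.5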
