Parallel (sort of) connected component labeling #547
Labels connected components in parallel, computing the input "mask" on the fly.
The goal is to make labeling connected components less of a bottleneck in PICO and elsewhere in PISM. The old implementation is purely serial and so does not scale. This version uses the same serial algorithm on each sub-domain and then combines results to get labels for the whole grid.
Here's the idea:

1. Identify connected components in each sub-domain, putting intermediate results in `output`.
2. "Update ghosts" of `output`, then iterate over sub-domain edges to identify connections between patches in sub-domains that make up connected components spanning multiple sub-domains. This defines a graph: each patch on a sub-domain is a node; two nodes are connected by an edge if and only if they "touch".
3. Gather the graph description on all sub-domains that have at least one patch.
4. Use breadth-first search to traverse this graph and compute final labels (see the sketch after this list).
5. Apply final labels.
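For step 4, here is a minimal self-contained sketch of the traversal (illustrative only, not the PISM code; the `Node`, `Edge` and `final_labels` names are made up): each patch is identified by the pair (rank, intermediate label), and a breadth-first search assigns one consecutive final label per connected component of the patch graph.

```c++
#include <map>
#include <queue>
#include <set>
#include <utility>
#include <vector>

// A node of the "patch graph": one patch (a connected component local to a
// sub-domain), identified by the owning rank and its intermediate label.
// Illustrative names, not the ones used in PISM.
using Node = std::pair<int, int>;    // (rank, intermediate label)
using Edge = std::pair<Node, Node>;  // two patches that "touch"

// Assign consecutive final labels (1, 2, 3, ...) to connected components of
// the patch graph using breadth-first search.
std::map<Node, int> final_labels(const std::vector<Node> &nodes,
                                 const std::vector<Edge> &edges) {
  // Adjacency list; isolated patches (no edges) get empty neighbor sets.
  std::map<Node, std::set<Node>> adjacency;
  for (const auto &n : nodes) {
    adjacency[n];
  }
  for (const auto &e : edges) {
    adjacency[e.first].insert(e.second);
    adjacency[e.second].insert(e.first);
  }

  std::map<Node, int> label;
  int next_label = 1;

  for (const auto &item : adjacency) {
    if (label.count(item.first) > 0) {
      continue;                  // already labeled by an earlier BFS
    }
    // BFS from this node; every node reached gets the same final label.
    std::queue<Node> queue;
    queue.push(item.first);
    label[item.first] = next_label;
    while (!queue.empty()) {
      Node v = queue.front();
      queue.pop();
      for (const auto &w : adjacency[v]) {
        if (label.count(w) == 0) {
          label[w] = next_label;
          queue.push(w);
        }
      }
    }
    next_label += 1;
  }
  return label;
}
```

Because the traversal order is deterministic, every rank running this on the same node and edge lists produces the same consecutive labeling.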
This method communicates ghosts (once), number of graph edges per sub-domain (once) and then all graph edges (once, only to sub-domains that have at least one patch).
Graph traversal is done "redundantly", i.e. each participating sub-domain[^1] traverses the whole graph even if it contains only one isolated node. This is needed to ensure that the resulting labels use consecutive numbers. (Consecutive labels are useful for indexing elsewhere in the code.)
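To illustrate why the redundancy is harmless: every participating rank ends up with the same map from patches to final labels and only needs the entries for its own rank when relabeling its local sub-domain. A hypothetical sketch (the flattened-array layout and names are assumptions):

```c++
#include <map>
#include <utility>
#include <vector>

using Node = std::pair<int, int>;  // (rank, intermediate label), as above

// Hypothetical helper: `mask` holds intermediate labels for the local
// sub-domain (0 marks background); `final_label` is the globally agreed map
// produced by the redundant traversal.
void apply_final_labels(int rank, std::vector<int> &mask,
                        const std::map<Node, int> &final_label) {
  for (auto &value : mask) {
    if (value != 0) {
      value = final_label.at({rank, value});
    }
  }
}
```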
We could gather graph information on one MPI processor, traverse the graph to compute final labels, then scatter final labels. It is not clear if this would be better, though.
The current implementation uses

- `MPI_Allgather()` to gather the number of graph edges per sub-domain (send 4 bytes, receive 4 bytes per sub-domain),
- `MPI_Allgatherv()` to gather edges to all participating sub-domains (send 8 bytes per local edge, receive 8-16 bytes per edge); see the sketch after this list.

An alternative implementation could use

- `MPI_Gather()` to gather the number of graph edges per sub-domain to one of the sub-domains (each sub-domain sends 4 bytes; one sub-domain receives 4 bytes per sub-domain),
- `MPI_Gatherv()` to gather edges from all participating sub-domains (all sub-domains send 8 bytes per local edge; one sub-domain receives 8 bytes per edge in the whole graph),
- `MPI_Bcast()` to scatter the mapping from old labels to new labels (8 bytes per local sub-domain).

It is not clear which way is better. We need to run benchmarks!
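For reference, a rough sketch of the current all-gather pattern, assuming each graph edge is packed into a fixed number of `int`s (the actual encoding and types in PISM may differ):

```c++
#include <mpi.h>
#include <vector>

// Sketch of the "gather edges everywhere" pattern. Here each edge is assumed
// to be encoded as two 32-bit integers (8 bytes per edge).
std::vector<int> gather_edges(MPI_Comm comm,
                              const std::vector<int> &local_edges) {
  int size = 0;
  MPI_Comm_size(comm, &size);

  // 1. Everyone learns how many ints each rank will contribute
  //    (4 bytes sent, 4 bytes received per rank).
  int local_count = (int)local_edges.size();
  std::vector<int> counts(size);
  MPI_Allgather(&local_count, 1, MPI_INT, counts.data(), 1, MPI_INT, comm);

  // 2. Compute displacements and gather all edges on all ranks.
  std::vector<int> displacements(size, 0);
  int total = 0;
  for (int r = 0; r < size; ++r) {
    displacements[r] = total;
    total += counts[r];
  }

  std::vector<int> all_edges(total);
  MPI_Allgatherv(local_edges.data(), local_count, MPI_INT,
                 all_edges.data(), counts.data(), displacements.data(),
                 MPI_INT, comm);
  return all_edges;
}
```

The `MPI_Gather()`/`MPI_Gatherv()`/`MPI_Bcast()` alternative would replace the two calls above with their rooted counterparts plus a broadcast of the old-to-new label mapping.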
This code works as it should, but we need to run a few simulations to compare it to the old implementation.
Checklist

- [ ] `CHANGES.rst`
Footnotes
[^1]: This implementation uses `MPI_Comm_split()` to create a sub-communicator containing "participating sub-domains", i.e. sub-domains that contain at least one foreground pixel. It is not clear if this is a good idea. We could traverse the graph on all sub-domains (including empty ones) instead. This may or may not be a good idea depending on the cost of the `MPI_Comm_split()` and `MPI_Allgatherv()` calls mentioned above.
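For illustration, a minimal sketch of the `MPI_Comm_split()` idea from this footnote (not the PISM code; names are made up): ranks that own at least one foreground pixel share a color and end up in the sub-communicator, while the rest pass `MPI_UNDEFINED` and get `MPI_COMM_NULL` back.

```c++
#include <mpi.h>

// Sketch: split off a communicator of "participating" ranks, i.e. ranks
// whose sub-domain contains at least one foreground pixel.
MPI_Comm participating_ranks(MPI_Comm comm, bool have_foreground) {
  int rank = 0;
  MPI_Comm_rank(comm, &rank);

  // Ranks with the same color end up in the same sub-communicator;
  // MPI_UNDEFINED excludes a rank (it receives MPI_COMM_NULL).
  int color = have_foreground ? 0 : MPI_UNDEFINED;

  MPI_Comm subcomm = MPI_COMM_NULL;
  MPI_Comm_split(comm, color, rank, &subcomm);
  return subcomm;
}
```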