retain adjacency order in nx-loopback copy of networkx graph #7432

dschult · 2024-04-26T02:12:43Z

This rewrites the copy functionality within nx-loopback to ensure that the neighbor order is the same in the copy. Instead of copying edges, this copies adjacencies (careful to ensure opposite representations of an edge point to the same datadict).

The result is that the incoming PR for fixing colliders and v-structures passes its tests.
Also, dfs_labeled_edges test passes. In fact all the topological sort functions that were excluded pass the tests as well, though I'm not sure whether we should still exclude them from the nx-loopback tests because I don't know what the issue was regarding the function changing the original graph. E.g. maybe we shouldn't be copying the graph at all since it needs to be changed.

~~This also seems to fix the long time for the nx-loopback tests. It is still about 8 secs (~10-15%) longer to run, but not multiple minutes. Almost all of that is due to using G._adj directly instead of add_edges_from and G.edges.~~ [Edit: error on my part caused mistaken speed results. With the change it is the same speed as it was.]

@eriknw can you look at my comments near the top of the diff:

should any of these functions marked as being skipped be included? (~~I think the topo sort maybe~~) [A: Only the dfs_labeled_edges is fixed by the order of edges in loopback. The rest stay in place.]
the conversion of an AntiGraph to a normal graph class is just G = nx.complement(A). But I'm not sure how that helps the code near the comment about AntiGraph. [A: Removing comment about AntiGraph]

dschult · 2024-04-26T02:19:39Z

If this copy code works for nx-loopback -- I think we should update the copy methods in the base classes to ensure that the neighbor/adjacency order is maintained when users do G.copy(). It didn't matter when the order of the neighbors was arbitrary based on storage/reporting within a Python dict. But now that dicts have ordered keys, users might expect the order to be the same after a copy.

Would maintaining the adjacency order during copy be considered a breaking change? We didn't tell people the current reordering was a feature -- indeed, we said the order could be machine dependent. But it is possible that some people are counting on a specific order after a copy. Still, my inclination is that implementing copy as maintaining neighbor order is not breaking any assurance. Your output may change from what it was, but now it has a defined pattern. Anyway, I'm open to a deprecation on that (still to be written) PR if/when we get to it.

eriknw

Very nice!

Using a similar approach for G.copy to maintain order makes sense to me--just be sure to add a remark to the next release notes.

This also seems to fix the long time for the nx-loopback tests. It is still about 8 secs (~10-15%) longer to run, but not multiple minutes.

Where do you see this? "dispatch" CI still takes several minutes. I have not experimented with this locally yet.

networkx/classes/tests/dispatch_interface.py

rossbar

Here's a naive question: does the loopback interface need to explicitly copy at all? In principle, the dispatching is handled by looking up the __networkx_backend__ attr on the instance, so can't that just be added to the input graph instance without any copying? In other words, the convert_from_nx function might look something like:

def convert_from_nx(graph, *, ...):
    graph.__networkx_backend__ = "nx-loopback"
    return graph

I guess the main issue is that this wouldn't probe the kwarg values of convert_from_nx... is that right? Are those probed now (i.e. taken from the @_dispatch decorator values)?

On the copy question: +1 from me for refactoring copy to preserve ordering - that makes a lot of sense as of Python 3.7!

dschult · 2024-04-26T19:34:34Z

The timing results I was getting are apparently a "feature" of my local testing environment. I would like some help in getting my environment set up better.

Running NETWORKX_TEST_BACKEND=nx-loopback pytest --doctest-modules --durations=10 --pyargs networkx from the base dir of my repo runs all the tests just fine without the loopback!! I get the same results as running it without the environment variable set. I can add an assert False to a test file in the repo and that test fails. So I am testing the correct files, just not with the loopback backend. I have put assert False into the loopback.convert_from_nx function and it is never reached.

NETWORKX_TEST_BACKEND=nx-loopback pytest --doctest-modules --durations=10 --pyargs networkx/classes (or any other specific file or dir within the networkx repo) runs with the nx-loopback process. All the files and loopback functions are called as they should be.

I do not have networkx installed in this mamba environment. I have used pip install -e . to create the egg directory in the repo's home directory. Without that, the testing command leads to an error about nx-loopback not being a backend. And that error happens even when pointing to specific files/dirs to run the tests for.

So... How do i run pytest on the whole repo using nx-loopback as my backend?

networkx/classes/tests/dispatch_interface.py

dschult · 2024-05-02T19:37:32Z

I've now found that I can get nx-loopback testing feature working while testing the whole library so long as I don't try to use the --doctest-modules option. :) Both of these work:
NETWORKX_TEST_BACKEND=nx-loopback pytest --pyargs networkx and
NETWORKX_TEST_BACKEND=nx-loopback pytest networkx

while adding --doctest-modules does not convert the graph:
NETWORKX_TEST_BACKEND=nx-loopback pytest --doctest-modules --pyargs networkx

If I specify a file/directory within networkx it works with or without --doctest-modules. I am trying to mimic the environment created by the github "dispatch" workflow -- but I'm on mac...

[Found the trouble. I had an unstaged python file that set backend information in the repo's directory. When using --docutils-modules that file was executed (which I did not expect... I figured it would just be searched for doc_strings.). Removing that file (starting with a clean repo and no staged files) resolved the issue.]

…vert_and_call_for_test

dschult · 2024-05-07T20:34:21Z

This change to keep edge order consistent during nx-loopback conversion is ready for review.
It should work to make #7398 pass the dispatch tests -- but I haven't tested that yet.

[Edit: Yes -- this allows the nx-loopback tests in #7398 pass. :) ]

rossbar

LGTM, thanks @dschult !

rossbar · 2024-05-21T05:39:48Z

networkx/classes/digraph.py

+        for prop in ["edges", "out_edges", "degree", "out_degree", "in_degree"]:
+            if prop in od:
+                del od[prop]


It looks like "adj" and "succ" could just be added to this list to save a few lines, but there's no reason to do so other than style. Just thought I'd mention!

dschult · 2024-05-22T11:29:29Z

Thanks for those comments and suggestions @rossbar.
I have made the suggested changes. And @eriknw has reviewed the PR with comments/suggestions above.
So I think it is ready to merge -- but I'd feel more confident of that after Erik has a chance to look again (and github approve it?).

eriknw

LGTM, this is pretty slick! It took me a little while to convince myself that inner/outer data dicts were handled correctly for multigraphs, but it's straightforward enough. The comments were helpful regarding ensuring that certain dicts be the same.

…x#7432) * retain adj-order in nx-loopback copy of networkx graph * rewrite copy logic * fix buggy assignment to _adj, rewrite if-structure for nx-loopbak convert_and_call_for_test * check if CI uses nx-loopback by halting tests if the graph isn't copied. * enable nx-loopback again * gather and unify property resetting

dschult added the type: Maintenance label Apr 26, 2024

eriknw reviewed Apr 26, 2024

View reviewed changes

rossbar reviewed Apr 26, 2024

View reviewed changes

rlratzel reviewed Apr 29, 2024

View reviewed changes

networkx/classes/tests/dispatch_interface.py Outdated Show resolved Hide resolved

dschult marked this pull request as draft April 29, 2024 12:39

dschult mentioned this pull request May 6, 2024

Understanding nx-loopback graph conversion #7428

Closed

dschult added 5 commits May 7, 2024 15:57

retain adj-order in nx-loopback copy of networkx graph

bd2f71a

rewrite copy logic

28e6d76

fix buggy assignment to _adj, rewrite if-structure for nx-loopbak con…

58e9bda

…vert_and_call_for_test

check if CI uses nx-loopback by halting tests if the graph isn't copied.

0d2006a

enable nx-loopback again

f0b9bf8

dschult force-pushed the chg_loopback_copy branch from 21ca167 to f0b9bf8 Compare May 7, 2024 19:57

dschult marked this pull request as ready for review May 7, 2024 19:57

dschult linked an issue May 21, 2024 that may be closed by this pull request

Understanding nx-loopback graph conversion #7428

Closed

rossbar approved these changes May 21, 2024

View reviewed changes

gather and unify property resetting

5f586cc

eriknw approved these changes May 22, 2024

View reviewed changes

rossbar merged commit fd27fb0 into networkx:main May 22, 2024
42 checks passed

jarrodmillman added this to the 3.4 milestone May 22, 2024

dschult deleted the chg_loopback_copy branch May 23, 2024 20:17

dschult mentioned this pull request May 23, 2024

[ENH, BUG]: added compute_colliders and fixed compute_v_structures #7398

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

retain adjacency order in nx-loopback copy of networkx graph #7432

retain adjacency order in nx-loopback copy of networkx graph #7432

dschult commented Apr 26, 2024 •

edited

dschult commented Apr 26, 2024

eriknw left a comment

rossbar left a comment

dschult commented Apr 26, 2024

dschult commented May 2, 2024 •

edited

dschult commented May 7, 2024 •

edited

rossbar left a comment

rossbar May 21, 2024

dschult commented May 22, 2024

eriknw left a comment

retain adjacency order in nx-loopback copy of networkx graph #7432

retain adjacency order in nx-loopback copy of networkx graph #7432

Conversation

dschult commented Apr 26, 2024 • edited

dschult commented Apr 26, 2024

eriknw left a comment

Choose a reason for hiding this comment

rossbar left a comment

Choose a reason for hiding this comment

dschult commented Apr 26, 2024

dschult commented May 2, 2024 • edited

dschult commented May 7, 2024 • edited

rossbar left a comment

Choose a reason for hiding this comment

rossbar May 21, 2024

Choose a reason for hiding this comment

dschult commented May 22, 2024

eriknw left a comment

Choose a reason for hiding this comment

dschult commented Apr 26, 2024 •

edited

dschult commented May 2, 2024 •

edited

dschult commented May 7, 2024 •

edited