ENH: linalg: support array API for standard extension functions #19260

lucascolley · 2023-09-19T13:10:22Z

For context, this work was started as part of my internship at Quansight Labs, which ran until the end of September 2023.

Reference issue

Towards gh-19068 and gh-18867.
Please see gh-19068 for context.

What does this implement/fix?

Support is added for the functions in the array API standard linalg extension. This allows users to input arrays from any compatible array library.
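As an illustration of the dispatch idea (a minimal sketch, not the code in this PR): look up the namespace of the input array and call its `linalg` extension. The helper names `get_namespace` and `solve_sketch` are hypothetical, and the fallback to NumPy for inputs without `__array_namespace__` is an assumption made so the example runs with plain NumPy.

```python
import numpy as np


def get_namespace(x):
    """Hypothetical helper: return the array API namespace of ``x``.

    Falls back to NumPy when the object does not expose
    ``__array_namespace__`` (an assumption for this sketch).
    """
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np


def solve_sketch(a, b):
    """Solve ``a @ x = b`` with the linalg extension of the input's namespace."""
    xp = get_namespace(a)
    return xp.linalg.solve(a, b)


a = np.asarray([[3.0, 2.0], [1.0, -1.0]])
b = np.asarray([5.0, 0.0])
x = solve_sketch(a, b)  # solution of 3x + 2y = 5, x - y = 0
```

The same function body would accept an array from another library, provided that library implements the standard's `linalg` extension.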

Tests are modified to allow testing with numpy.array_api, cupy and torch. Some new tests are added for exceptions for unsupported parameters.

Additional information

I was not able to convert every relevant test since lots of NumPy-specific things are used. We may want to find ways to convert some of these, or write new tests to serve the same purpose.

TestSVD is a little strange due to the way the lapack_driver parameter is tested. I have tried to apply a minimal refactor here, but a more substantial refactor may result in something more readable. It is a bit of a misnomer that all of our array API compatible tests are under TestSVD_GESDD, just because gesdd is the default value.

Lots of tests are failing for PyTorch CUDA, but hopefully we just need to wait for pytorch/pytorch#106773 to come through.

scipy/_lib/_array_api.py

scipy/linalg/tests/test_basic.py

scipy/linalg/tests/test_decomp.py

scipy/linalg/tests/test_decomp_cholesky.py

scipy/linalg/_basic.py

ilayn

The regular function modifications look OK to me, but I'm not sure we really need all the test changes. In places they don't reflect the goals of the tests, and I don't understand why we test non-NumPy xp namespaces in the tests. It should be pretty safe to test SciPy with default NumPy arrays without any xp_assert_close calls or modified tol parameters.

scipy/linalg/tests/test_basic.py

scipy/linalg/tests/test_decomp.py

rgommers

This looks pretty good to me overall. I'd like to see the diff shrink as much as possible though, both in the implementations and tests. That's in general a sign that the code is in good shape, and it makes things easier to review and understand later on.

> The regular function modifications look OK to me, but I'm not sure we really need all the test changes. In places they don't reflect the goals of the tests, and I don't understand why we test non-NumPy xp namespaces in the tests. It should be pretty safe to test SciPy with default NumPy arrays without any xp_assert_close calls or modified tol parameters.

I think it is quite useful to test non-numpy arrays; without testing it's almost certainly going to be broken. I think of these as testing optional dependencies - just like we have tests with mpmath, scikit-umfpack and a whole bunch of other optional runtime dependencies.

Sometimes this requires extra code, however it also tends to uncover bugs and non-standard code constructs that (when refactored) improve the code itself. Two examples here:

- The changes here from integer to floating point arrays for testing make sense. Functions like solve are inherently floating point-only. We also need to test that integers (and lists, and other array-likes) are still converted correctly and that we don't break backwards compatibility. However, that can be a single small test. That many tests use integers is a matter of previous authors taking a shortcut because it didn't matter much, rather than all those tests using integers by design.
- The stricter dtype checks led me to spot a bug quickly when Lucas asked me about result_type:

>>> import numpy as np
>>> from scipy import linalg
>>> a = np.array([[3, 2, 0], [1, -1, 0], [0, 5, 1]])
>>> b = np.array([2, 4, -1])
>>> x = linalg.solve(a, b)
>>> a.dtype, b.dtype
(dtype('int64'), dtype('int64'))
>>> x.dtype
dtype('float64')

>>> # so output dtype should be float64 for integer input, but:
>>> linalg.solve(a, np.empty((3,0), dtype=np.int64)).dtype
dtype('int64')

We've seen in many places in cluster and fft as well that our tests don't check for expected dtypes, and we often have inconsistent return dtypes as a result (arguably all bugs).
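A dtype-aware assertion along these lines can be sketched as follows. This is an illustration only: `assert_close_strict` is a hypothetical helper (not SciPy's `xp_assert_close`), and `numpy.linalg.solve` stands in for `scipy.linalg.solve` so that the snippet runs with NumPy alone.

```python
import numpy as np


def assert_close_strict(actual, desired, rtol=1e-7, atol=0.0):
    """Hypothetical helper: compare values and also require equal dtype and shape."""
    actual = np.asarray(actual)
    desired = np.asarray(desired)
    assert actual.dtype == desired.dtype, (
        f"dtype mismatch: {actual.dtype} != {desired.dtype}"
    )
    assert actual.shape == desired.shape, (
        f"shape mismatch: {actual.shape} != {desired.shape}"
    )
    np.testing.assert_allclose(actual, desired, rtol=rtol, atol=atol)


a = np.array([[3, 2, 0], [1, -1, 0], [0, 5, 1]])
b = np.array([2, 4, -1])
x = np.linalg.solve(a, b)  # integer input, floating point output

# Values and dtype both match: a @ x is float64, as is the converted b.
assert_close_strict(a @ x, b.astype(np.float64), atol=1e-12)

# A value-only comparison does not notice an integer/float mismatch ...
np.testing.assert_allclose(b, b.astype(np.float64))

# ... whereas the strict helper does.
try:
    assert_close_strict(b, b.astype(np.float64))
    dtype_mismatch_caught = False
except AssertionError:
    dtype_mismatch_caught = True
```

A check like the last one is what turns an inconsistent return dtype into a test failure instead of letting it pass silently.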

So there is extra value from testing with other libraries: finding bugs and improving test coverage.

scipy/linalg/tests/test_decomp_cholesky.py

scipy/linalg/_decomp.py

scipy/linalg/tests/test_basic.py

scipy/linalg/tests/test_decomp.py

ilayn · 2023-09-27T14:29:44Z

> I think it is quite useful to test non-numpy arrays; without testing it's almost certainly going to be broken. I think of these as testing optional dependencies - just like we have tests with mpmath, scikit-umfpack and a whole bunch of other optional runtime dependencies.

Testing is always nice indeed, but the question is what to do when it is broken. I think none of us want to go chasing around the PyTorch or CuPy repos to fix things that are not really ours to fix, just to get our tests out to the greenland.

rgommers · 2023-09-27T15:31:13Z

> Testing is always nice indeed, but the question is what to do when it is broken. I think none of us want to go chasing around the PyTorch or CuPy repos to fix things that are not really ours to fix, just to get our tests out to the greenland.

I think we do the same as when something breaks in NumPy, Cython, pytest, Sphinx or wherever else: we file an issue and skip the test or put a temporary upper bound in place. We're using pretty core/standard functions here, so I am not too worried about seeing many regressions once things work; that would be really surprising. And in terms of debugging or even contributing upstream, I'd much rather work with CuPy or PyTorch than with things like pytest/sphinx/mpmath.

Also, the CuPy and PyTorch teams (and Dask and JAX) have invested large amounts of effort in NumPy and SciPy compatibility, so I'm pretty sure they'd appreciate bug reports and be willing to address them.

scipy/linalg/_decomp.py

lucascolley · 2024-03-31T23:32:57Z

The tests still need a good bit of work. But dare I say this is getting pretty close.

scipy/linalg/_decomp_cholesky.py

scipy/linalg/tests/test_decomp.py

lucascolley · 2024-04-01T16:25:00Z

scipy/linalg/tests/test_decomp.py

+            xp_assert_close(u.T @ u, xp.eye(3), atol=1e-6)
+            xp_assert_close(vh.T @ vh, xp.eye(3), atol=1e-6)
+            sigma = xp.zeros((u.shape[0], vh.shape[0]), dtype=s.dtype)
+            for i in range(s.shape[0]):
                sigma[i, i] = s[i]
-            assert_array_almost_equal(u @ sigma @ vh, a)
+            xp_assert_close(u @ sigma @ vh, a, rtol=1e-6)


the tolerances around here could do with a look, I'm not sure why I wrote a mixture of atol and rtol.

ah I remember, it's because the default for np.testing.assert_array_almost_equal is roughly equivalent to rtol=0, atol=1.5e-6.
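A quick way to see that correspondence (a sketch using NumPy's own testing functions rather than `xp_assert_close`; the helper name `fails` is made up for this example):

```python
import numpy as np

desired = np.ones(3)
within = desired + 1e-6   # error below the 1.5e-6 threshold
outside = desired + 2e-6  # error above the 1.5e-6 threshold


def fails(check, *args, **kwargs):
    """Return True if the given assertion function raises AssertionError."""
    try:
        check(*args, **kwargs)
    except AssertionError:
        return True
    return False


# Default decimal=6 means: abs(desired - actual) < 1.5 * 10**-6
almost_equal_within = not fails(
    np.testing.assert_array_almost_equal, within, desired
)
almost_equal_outside = not fails(
    np.testing.assert_array_almost_equal, outside, desired
)

# Roughly the same check expressed with explicit tolerances.
allclose_within = not fails(
    np.testing.assert_allclose, within, desired, rtol=0, atol=1.5e-6
)
allclose_outside = not fails(
    np.testing.assert_allclose, outside, desired, rtol=0, atol=1.5e-6
)
```

Both assertions accept the `within` array and reject the `outside` one, so swapping one for the other with these tolerances should not change which tests pass.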

scipy/linalg/tests/test_basic.py

lucascolley · 2024-04-01T23:08:36Z

CI should be green. I think this is almost ready. A few questions remain:

- This PR involves a lot of general improvements to the tests (checking more dtypes, checking shapes, stricter tolerances), but clearly these improvements haven't been pushed as far as they could go. I think doing so would be a huge effort given the size of this diff, but I'm happy to work a little more if there are particular areas that could do with some TLC.
- I don't know if we want something of a policy about what to do with tolerances. I've basically just gone with the defaults of the assertions, but where identity matrices are used we get atol failures due to entries that should be 0 being slightly non-zero, so I have introduced atol bumps where needed (the default atol is 0).
- A lot of the tests which are currently skipped could be split up into parts which are compatible and parts which aren't (at least for this PR). I think it would be too much effort to split up all of them, but some are maybe worth it. I've marked a few with TODOs.
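The atol point can be reproduced with a small sketch. It uses `np.testing.assert_allclose` as a stand-in for `xp_assert_close` (an assumption; the two need not share defaults apart from atol being 0), and the `passes` helper and the 1e-12 perturbation are chosen purely for illustration.

```python
import numpy as np

identity = np.eye(3)
# Stand-in for something like u.T @ u: every entry is off by a tiny amount.
almost_identity = identity + 1e-12


def passes(actual, desired, **tols):
    """Return True if assert_allclose accepts the pair with the given tolerances."""
    try:
        np.testing.assert_allclose(actual, desired, **tols)
    except AssertionError:
        return False
    return True


# With atol=0 the allowed error is rtol * abs(desired), which is exactly 0
# wherever the desired value is 0, so the off-diagonal entries fail.
rtol_only = passes(almost_identity, identity, rtol=1e-7)

# A small absolute tolerance covers the entries whose desired value is 0.
with_atol = passes(almost_identity, identity, rtol=1e-7, atol=1e-10)
```

Here `rtol_only` is False and `with_atol` is True, which is why comparisons against identity (or any array containing zeros) need an explicit atol.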

EDIT: spoke too soon on CI, but it looks like just an atol=0 thing so far

EDIT 2: finally green :)

lucascolley commented Sep 19, 2023

lucascolley marked this pull request as ready for review September 19, 2023 13:17

lucascolley requested review from larsoner and ilayn as code owners September 19, 2023 13:17

lucascolley changed the title ~~WIP, ENH: linalg: support array API for standard extension functions~~ ENH: linalg: support array API for standard extension functions Sep 19, 2023

lucascolley force-pushed the linalg_array_api branch from 53641f7 to 0d9337e Compare September 19, 2023 13:40

j-bowhay reviewed Sep 19, 2023

scipy/linalg/_basic.py (outdated)

lucascolley mentioned this pull request Sep 19, 2023

MAINT: array API: rename arg_err_msg and move to _lib #19265

Merged

j-bowhay added the enhancement (A new feature or improvement), scipy.linalg, and array types (Items related to array API support and input array validation, see gh-18286) labels Sep 19, 2023

lucascolley added 5 commits September 21, 2023 12:24

ENH: linalg: use xp.linalg for array API standard functions

aa217e8

TST: linalg: modifications for array API standard functions

7e85473

TST: linalg: new tests for array API standard functions

debc41b

MAINT: linalg: update import for xp_unsupported_param_msg

7bdf1bc

TST: linalg: xfail test for string dtypes

25df2f8

[skip cirrus] [skip circle]

lucascolley force-pushed the linalg_array_api branch from 0d9337e to 25df2f8 Compare September 21, 2023 11:36

lucascolley added 3 commits September 21, 2023 13:13

MAINT: linalg: clean up check_finite parameters

aa0d71b

[skip cirrus] [skip circle]

BUG: linalg: revert removal of finite checks

c866515

[skip cirrus] [skip circle]

MAINT: linalg: add xp arg to finite checks to reduce overhead

ef9831d

[skip cirrus] [skip circle]

lucascolley force-pushed the linalg_array_api branch from 37239f2 to ef9831d Compare September 21, 2023 13:10

ilayn reviewed Sep 21, 2023

This was referenced Sep 21, 2023

MAINT: array types: make compliance_scipy more strict #19276

Closed

MAINT: fft: clean up test-skips #19262

Merged

lucascolley added 2 commits September 24, 2023 21:06

ENH/TST: linalg: add type promotion to floats for xp, tidy dtypes and…

d6e4709

… tols [skip cirrus] [skip circle]

MAINT: linalg: bump stringent test tols for CI

cb48f7c

[skip cirrus] [skip circle]

rgommers reviewed Sep 27, 2023

scipy/linalg/tests/test_decomp_cholesky.py (outdated)

scipy/linalg/_decomp.py (outdated)

scipy/linalg/tests/test_basic.py (outdated)

scipy/linalg/tests/test_decomp.py (outdated)

MAINT: linalg: refactor exceptions for xp unsupported args

5d4bb07

lucascolley marked this pull request as draft March 26, 2024 18:30

lucascolley added 10 commits March 31, 2024 22:17

Merge branch 'main' into linalg_array_api

e6d52ce

appease linter

54d2c2f

[skip cirrus] [skip circle]

changes to bring up to date

21a60a4

[skip cirrus] [skip circle]

update cholesky, solve tests

62fe1d9

[skip ci]

cholesky tests complete

6f5ac20

solve tests update

e34c195

inv tests update

69e1351

wip

e43be02

det tests update

891ff76

pinv tests update

416bb4a

[skip cirrus] [skip circle]

lucascolley commented Mar 31, 2024

scipy/linalg/_decomp.py (outdated)

lucascolley added 4 commits April 1, 2024 00:38

cholesky test tweak

880f6b4

[skip ci]

cholesky test tweak

d312b88

[skip ci]

cholesky tweak

eee0260

[skip ci]

Merge branch 'main' into linalg_array_api

7ad1050

[skip ci]

lucascolley commented Apr 1, 2024

lucascolley added 6 commits April 1, 2024 19:26

update QR tests and some others

87d441c

[skip ci]

update decomp tests

ac3b785

[skip ci]

update more tests

7be1755

[skip ci]

more test updates

5a3606b

[skip cirrus]

fix CI

0c3f095

[skip cirrus]

fix CI round 2

6169872

[skip cirrus]

lucascolley requested a review from ilayn April 1, 2024 23:09

lucascolley marked this pull request as ready for review April 1, 2024 23:11

lucascolley added 2 commits April 2, 2024 11:11

fix CI round 3

82ad122

[skip cirrus] [skip circle]

Merge branch 'main' into linalg_array_api

69053e1

[skip cirrus] [skip circle]

ilayn mentioned this pull request Apr 10, 2024

ENH: add one-hot special function or support broadcasting in signal.unit_impulse #20442

Closed
