[CPU] Bump `test_complex_2d` thresholds for LBFGS on `complex64` #126358

eqy · 2024-05-16T00:18:21Z

Is this supposed to be bitwise identical? Wasn't sure how to interpret the comment but it seems to be giving mismatches like:

Mismatched elements: 1 / 2 (50.0%)
Greatest absolute difference: 4.6372413635253906e-05 at index (1,) (up to 1e-05 allowed)
Greatest relative difference: 3.4600801882334054e-05 at index (1,) (up to 1.3e-06 allowed)

To execute this test, run the following from the base repo dir:
     python test/test_optim.py -k test_complex_2d_LBFGS_cpu_complex64

on Neoverse-N2 SBSA ARM CPUs.

cc @vincentqb @jbschlosser @albanD @janeyx99 @crcrpar @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @ezyang @anjali411 @dylanbespalko @mruberry @lezcano @nikitaved @amjames

pytorch-bot · 2024-05-16T00:18:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126358

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 3900c22 with merge base 636e799 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 2, 5, linux.g5.4xlarge.nvidia.gpu) (gh) (similar failure)
test_foreach.py::TestForeachCUDA::test_binary_op_list_error_cases__foreach_add_cuda_bfloat16

This comment was automatically generated by Dr. CI and updates every 15 minutes.

lezcano · 2024-05-16T09:16:07Z

I reckon that one vectorises and the other one doesn't, hence the error, but would be good to confirm it.

albanD · 2024-05-16T13:26:25Z

torch/testing/_internal/common_optimizers.py

+                            rtol=4.5e-5,
+                            atol=5e-5,


What is the default tolerance for complex64? Souds like it should be at that level already?

rtol of 1.3e-6 and atol of 1e-5

ref: https://pytorch.org/docs/stable/testing.html#torch.testing.assert_close

Ho right! Sorry I confused myself with halving the bitlength. This is expected to match fp32.
The general rule here is that for single op, we expect these to hold. If you test a bunch of ops chained one after the other, then we might have to increase the tolerance yes.

janeyx99 · 2024-05-16T14:19:42Z

https://github.com/pytorch/pytorch/blob/main/test/test_optim.py#L518-L527 for why it's not bitwise equal

eqy · 2024-05-22T03:58:45Z

@pytorchmergebot rebase

pytorchmergebot · 2024-05-22T04:00:16Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2024-05-22T04:00:19Z

Successfully rebased eqy-patch-3 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout eqy-patch-3 && git pull --rebase)

janeyx99 · 2024-05-22T14:34:00Z

@pytorchbot merge

pytorchmergebot · 2024-05-22T14:36:12Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-05-22T18:42:10Z

Merge failed

Reason: Not merging any PRs at the moment because there is a merge blocking https://github.com/pytorch/pytorch/labels/ci:%20sev issue open at:
#126896

Details for Dev Infra team

Raised by workflow job

eqy · 2024-05-23T00:13:36Z

@pytorchmergebot merge

pytorchmergebot · 2024-05-23T00:16:19Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

eqy requested a review from janeyx99 May 16, 2024 00:18

eqy added open source module: optimizer Related to torch.optim module: cpu CPU specific problem (e.g., perf, algorithm) module: complex Related to complex number support in PyTorch topic: not user facing topic category labels May 16, 2024

lezcano approved these changes May 16, 2024

View reviewed changes

albanD reviewed May 16, 2024

View reviewed changes

Update common_optimizers.py

3900c22

pytorchmergebot force-pushed the eqy-patch-3 branch from efdd003 to 3900c22 Compare May 22, 2024 04:00

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 22, 2024

janeyx99 approved these changes May 22, 2024

View reviewed changes

pytorchmergebot added the merging label May 22, 2024

pytorchmergebot removed the merging label May 22, 2024

pytorchmergebot added the merging label May 23, 2024

pytorchmergebot closed this in ebbd431 May 23, 2024

pytorchmergebot added Merged and removed merging labels May 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CPU] Bump `test_complex_2d` thresholds for LBFGS on `complex64` #126358

[CPU] Bump `test_complex_2d` thresholds for LBFGS on `complex64` #126358

eqy commented May 16, 2024 •

edited

pytorch-bot bot commented May 16, 2024 •

edited

lezcano commented May 16, 2024

albanD May 16, 2024

crcrpar May 16, 2024

albanD May 16, 2024

janeyx99 commented May 16, 2024

eqy commented May 22, 2024

pytorchmergebot commented May 22, 2024

pytorchmergebot commented May 22, 2024

janeyx99 commented May 22, 2024

pytorchmergebot commented May 22, 2024

pytorchmergebot commented May 22, 2024

eqy commented May 23, 2024

pytorchmergebot commented May 23, 2024

[CPU] Bump test_complex_2d thresholds for LBFGS on complex64 #126358

[CPU] Bump test_complex_2d thresholds for LBFGS on complex64 #126358

Conversation

eqy commented May 16, 2024 • edited

pytorch-bot bot commented May 16, 2024 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126358

✅ You can merge normally! (1 Unrelated Failure)

lezcano commented May 16, 2024

albanD May 16, 2024

Choose a reason for hiding this comment

crcrpar May 16, 2024

Choose a reason for hiding this comment

albanD May 16, 2024

Choose a reason for hiding this comment

janeyx99 commented May 16, 2024

eqy commented May 22, 2024

pytorchmergebot commented May 22, 2024

pytorchmergebot commented May 22, 2024

janeyx99 commented May 22, 2024

pytorchmergebot commented May 22, 2024

Merge started

pytorchmergebot commented May 22, 2024

Merge failed

eqy commented May 23, 2024

pytorchmergebot commented May 23, 2024

Merge started

[CPU] Bump `test_complex_2d` thresholds for LBFGS on `complex64` #126358

[CPU] Bump `test_complex_2d` thresholds for LBFGS on `complex64` #126358

eqy commented May 16, 2024 •

edited

pytorch-bot bot commented May 16, 2024 •

edited