[XPU] call empty_cache for dynamo tests #126377

Stonepia · 2024-05-16T04:22:29Z

When running a batch of models, lacking empty_cache() would result in OOM for subsequent models.

This PR unifies the empty_cache call for both CUDA and XPU.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang

pytorch-bot · 2024-05-16T04:22:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126377

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 56a7d2e with merge base d0dfcd2 ():

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Lint / lintrunner-noclang / linux-job (gh) (trunk failure)
>>> Lint for torch/onnx/_internal/onnx_proto_utils.py:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

benchmarks/dynamo/common.py

desertfire · 2024-05-16T22:47:45Z

benchmarks/dynamo/common.py

+    ), "The empty_gpu_cache needs to be called with a non empty device str"
+    if device == "cuda":
+        torch.cuda.empty_cache()
+    if device == "xpu":


Thanks for the review! Solved in 56a7d2e

Stonepia · 2024-05-17T06:03:32Z

@pytorchbot merge

pytorchmergebot · 2024-05-17T06:05:20Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: When running a batch of models, lacking `empty_cache()` would result in OOM for subsequent models. This PR unifies the `empty_cache` call for both CUDA and XPU. X-link: pytorch/pytorch#126377 Approved by: https://github.com/EikanWang, https://github.com/guangyey, https://github.com/desertfire Reviewed By: huydhn Differential Revision: D57518757 fbshipit-source-id: a42ae31e7fb81bb05217fd672a3427bd68478a50

When running a batch of models, lacking `empty_cache()` would result in OOM for subsequent models. This PR unifies the `empty_cache` call for both CUDA and XPU. Pull Request resolved: pytorch#126377 Approved by: https://github.com/EikanWang, https://github.com/guangyey, https://github.com/desertfire

[XPU] call empty_cache call for gpu dynamo tests

a9401f7

pytorch-bot bot added the module: dynamo label May 16, 2024

Stonepia changed the title ~~[XPU] call empty_cache call for gpu dynamo tests~~ [XPU] call empty_cache for gpu dynamo tests May 16, 2024

Stonepia changed the title ~~[XPU] call empty_cache for gpu dynamo tests~~ [XPU] call empty_cache for dynamo tests May 16, 2024

pytorchbot added the open source label May 16, 2024

Move empty_gpu_cache outside

65bacb7

EikanWang reviewed May 16, 2024

View reviewed changes

benchmarks/dynamo/common.py Outdated Show resolved Hide resolved

Stonepia marked this pull request as ready for review May 16, 2024 05:12

EikanWang approved these changes May 16, 2024

View reviewed changes

EikanWang requested a review from desertfire May 16, 2024 05:14

only call empty_cache for corresponding device

e25b6b6

Stonepia requested a review from EikanWang May 16, 2024 05:26

guangyey approved these changes May 16, 2024

View reviewed changes

Stonepia added 3 commits May 16, 2024 05:44

Pass device to the empty_gpu_cache

2bee364

Add an assertion

ec09a15

format code

e6e36d9

EikanWang approved these changes May 16, 2024

View reviewed changes

desertfire approved these changes May 16, 2024

View reviewed changes

desertfire reviewed May 16, 2024

View reviewed changes

refine code

56a7d2e

guangyey added ciflow/trunk Trigger trunk jobs on your pull request release notes: dynamo release notes: benchmark release notes category and removed release notes: dynamo labels May 17, 2024

pytorchmergebot added the merging label May 17, 2024

pytorchmergebot closed this in 5756b53 May 17, 2024

pytorchmergebot added Merged and removed merging labels May 17, 2024

Stonepia deleted the xpu/empty_cache branch May 17, 2024 06:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XPU] call empty_cache for dynamo tests #126377

[XPU] call empty_cache for dynamo tests #126377

Stonepia commented May 16, 2024 •

edited by pytorch-bot bot

pytorch-bot bot commented May 16, 2024 •

edited

desertfire May 16, 2024

Stonepia May 17, 2024

Stonepia commented May 17, 2024

pytorchmergebot commented May 17, 2024

[XPU] call empty_cache for dynamo tests #126377

[XPU] call empty_cache for dynamo tests #126377

Conversation

Stonepia commented May 16, 2024 • edited by pytorch-bot bot

pytorch-bot bot commented May 16, 2024 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126377

✅ You can merge normally! (1 Unrelated Failure)

desertfire May 16, 2024

Choose a reason for hiding this comment

Stonepia May 17, 2024

Choose a reason for hiding this comment

Stonepia commented May 17, 2024

pytorchmergebot commented May 17, 2024

Merge started

Stonepia commented May 16, 2024 •

edited by pytorch-bot bot

pytorch-bot bot commented May 16, 2024 •

edited