Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Revives two previously broken MPASSI tests #2323

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

singhbalwinder
Copy link
Contributor

Revives two MPASSI tests that were previously broken on GPUs in hopes that more recent code changes might have fixed these. I have tested them on a CPU machine (Compy) to ensure that they work on a CPU machine. Due to the missing partition files for the default Compy configurations, I ran the following equivalent tests:

./cs.status.mpassi_tests3 -s
mpassi_tests3: 3 tests
  PASS ERP_Ln22_P64x1.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.compy_intel.atmlndactive-rtm_off
  PASS ERS_D_Ln22_P40x1.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.compy_intel.atmlndactive-rtm_off
  PASS PEM_Ln90_P128x1.ne30pg2_EC30to60E2r2.F2010-SCREAMv1-MPASSI.compy_intel

@singhbalwinder singhbalwinder added the AT: RETEST Force the autotester (AT) to retest the PR label May 9, 2023
@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - User Requested Retest - Label AT: RETEST will be reset after testing.

@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection Is Not Necessary for this Pull Request.

@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - Failure: Timed out waiting for job SCREAM_PullRequest_Autotester_Mappy to start: Total Wait = 1803

  • Other jobs have been previously started - We must stop them...

@E3SM-Autotester E3SM-Autotester removed the AT: RETEST Force the autotester (AT) to retest the PR label May 9, 2023
@singhbalwinder singhbalwinder added the AT: RETEST Force the autotester (AT) to retest the PR label May 9, 2023
@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - User Requested Retest - Label AT: RETEST will be reset after testing.

@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - Failure: Timed out waiting for job SCREAM_PullRequest_Autotester_Mappy to start: Total Wait = 1803

  • Other jobs have been previously started - We must stop them...

@E3SM-Autotester E3SM-Autotester removed the AT: RETEST Force the autotester (AT) to retest the PR label May 9, 2023
@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 3870
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 4444
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Blake

  • Build Num: 4498
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

Using Repos:

Repo: SCREAM (E3SM-Project/scream)
  • Branch: singhbalwinder/mpassi-tests
  • SHA: 4940e3b
  • Mode: TEST_REPO

Pull Request Author: singhbalwinder

@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED (click to expand)

Build Information

Test Name: SCREAM_PullRequest_Autotester_Mappy

  • Build Num: 3870
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Weaver

  • Build Num: 4444
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

Build Information

Test Name: SCREAM_PullRequest_Autotester_Blake

  • Build Num: 4498
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
PR_LABELS
PULLREQUESTNUM 2323
SCREAM_SOURCE_BRANCH singhbalwinder/mpassi-tests
SCREAM_SOURCE_REPO https://github.com/E3SM-Project/scream
SCREAM_SOURCE_SHA 4940e3b
SCREAM_TARGET_BRANCH master
SCREAM_TARGET_REPO https://github.com/E3SM-Project/scream
SCREAM_TARGET_SHA 06a178c
TEST_REPO_ALIAS SCREAM

@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pre-Merge Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
WARNING: NO REVIEWERS HAVE BEEN REQUESTED FOR THIS PULL REQUEST!

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However Inspection must be performed before merge can occur...

1 similar comment
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However Inspection must be performed before merge can occur...

# "ERS_Ln22.ne30pg2_EC30to60E2r2.F2010-SCREAMv1-MPASSI.atmlndactive-rtm_off",
# "ERP_D_Ln9.ne4_oQU240.F2010-SCREAMv1-MPASSI.atmlndactive-rtm_off",
# "SMS_D_Ln9.ne4_oQU240.F2010-SCREAMv1-MPASSI-noAero",
"ERP_Ln22.ne4pg2_oQU480.F2010-SCREAMv1-MPASSI.atmlndactive-rtm_off",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@singhbalwinder have you confirmed that these tests pass on pm-gpu now?

@AaronDonahue AaronDonahue added testing BFB Bit for bit AT: AUTOMERGE Inform the autotester (AT) that it can merge this PR if reviewers approved, and tests pass AT: Skip Stand-Alone Testing Do not run SCREAM internal unit tests for AT runs labels May 10, 2023
@E3SM-Autotester
Copy link
Collaborator

Status Flag 'Pre-Merge Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
THE LAST COMMIT TO THIS PULL REQUEST HAS BEEN REVIEWED, BUT NOT ACCEPTED OR REQUIRES CHANGES

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However Inspection must be performed before merge can occur...

1 similar comment
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However Inspection must be performed before merge can occur...

@E3SM-Autotester E3SM-Autotester added the AT: STALE Label set by the autotester (AT). PR is stale, and should not be tested. Override with "AT: RETEST" label May 17, 2023
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

1 similar comment
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@PeterCaldwell
Copy link
Contributor

@singhbalwinder and @AaronDonahue - my understanding was that Balwinder was going to just submit this PR to the AT and if it passed, we'd integrate it... we would have integrated it a long time ago except that it wasn't passing these tests :-). Though maybe I'm not understanding the situation correctly.

@AaronDonahue
Copy link
Contributor

@PeterCaldwell , I'm happy to merge it. I had asked @singhbalwinder to check that this was going to work on pm-gpu which I think was the last step. He was going to report back once he did the check.

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

1 similar comment
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

4 similar comments
@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

@PeterCaldwell PeterCaldwell added the AT: WIP Inform the autotester (AT) that the PR is a work in progress, and should not be tested label May 25, 2023
@PeterCaldwell
Copy link
Contributor

Followed Michael Jackson's advice and WIPped this PR since the AT keeps running but I know that Balwinder is tracking down errors on Perlmutter.

@AaronDonahue
Copy link
Contributor

@singhbalwinder , do you have a status update on this PR?

@singhbalwinder
Copy link
Contributor Author

I thought getting the fully coupled compset working is a more urgent task, so I spent time making it work. It works now! I will come to these tests as part of my next steps.

@PeterCaldwell
Copy link
Contributor

Great that the coupled compset works! I thought that getting these MPASSI tests working was needed to get the coupled compset working since the coupled compset needs to use MPASSI. But I guess so long as you're not running on the machine that's having the MPASSI fails it isn't...

@singhbalwinder
Copy link
Contributor Author

Yes, you are right. All tests were working on Compy, so I went ahead and started building the fully coupled compset on Compy. I am now looking into the setup to ensure that the simulation is set up correctly before moving to address the failing tests.

@E3SM-Autotester
Copy link
Collaborator

All Jobs Finished; status = PASSED, However PR is now STALE, and must be retested. Set the AT: RETEST Label to force retest....

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
AT: AUTOMERGE Inform the autotester (AT) that it can merge this PR if reviewers approved, and tests pass AT: Skip Stand-Alone Testing Do not run SCREAM internal unit tests for AT runs AT: STALE Label set by the autotester (AT). PR is stale, and should not be tested. Override with "AT: RETEST" AT: WIP Inform the autotester (AT) that the PR is a work in progress, and should not be tested BFB Bit for bit testing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants