Reproducibility of primary alignments with bwa-mem2 #228

gh-jphan · 2023-04-13T13:48:47Z

I've encountered an issue with reproducibility of bwa-mem2 that is likely due to a multi-threading bug. This may be related to the following issue: #215.

Observations while documenting this issue:

When running bwa-mem2 multiple times using the same input, there is a small but random probability that the alignment output is different. E.g., out of 1000 runs with the same dataset, most outputs are identical, but a small single digit percentage of outputs are different.
These differences are always related to alignments that have multiple alternate alignment positions (i.e., alignments that produce XA tags). I understand that when there are multiple equally good alignment positions, bwa-mem2 will randomly choose a position. However, that "random" choice should be the same from run to run, given that the input data is identical (including read order).
A smaller -K parameter (for batch size) increases the probability that a different output occurs. A -K large enough that loads the entire dataset results in 0 different outputs (fixes the problem).
Disabling multi-threaded IO (-1 parameter) results in 0 different outputs (fixes the problem).
AVX512 optimized binary has a higher probability of different outputs compared to SSE, AVX, AVX2, etc. This probably affects reproducibility because of the speed/timing changes when executing certain code blocks or steps.
The original bwa mem does not have this issue, outputs are always identical.

I believe I have a simple fix, but wanted to document it as an issue. I will open a PR shortly.

Thanks!
-John

robertzeibich · 2023-05-28T10:20:48Z

Hi John, Do you think your finding could also fix my problem (#233)? Should I turn off multi-threading? I think my problem also aligns with what was posted here: #227. Any recommendation for me would be much appreciated.

gh-jphan · 2023-05-30T14:12:42Z

Hi Robert, I'm not sure if it's a related problem, but it could be. My comparisons were within runs of bwa-mem2, and I didn't compare output between bwa-mem2 and bwa. You mentioned that there is an input difference for the fasta files. Ideally they should be the same if you're expecting the same output SAM/BAM files (unless I'm mis-understanding). Also, it's not clear to me what are the differences between the outputs. Maybe if you could post some example SAM outputs showing the differences.

chappj1 · 2023-06-02T17:11:52Z

@gh-jphan Your point 2 above:

"These differences are always related to alignments that have multiple alternate alignment positions (i.e., alignments that produce XA tags). I understand that when there are multiple equally good alignment positions, bwa-mem2 will randomly choose a position. However, that "random" choice should be the same from run to run, given that the input data is identical (including read order)."

How do you know "bwa-mem2 will randomly choose a position"? I do not see that information in the documentation anywhere. Also, is there any way to set this option to something different, e.g. create an alignment record for all positions, rather than randomly selecting one?

Thanks,
James

gh-jphan · 2023-06-02T17:42:57Z

@chappj1, I think that question has been asked before and unfortunately, the docs don't clearly describe the behavior (at least in the docs I could find). It's supposed to mimic bwa behavior, so some of the older bwa docs describe behavior for samse, etc, but not for mem. For example: https://www.biostars.org/p/304614/. And in the reply to this post is a description of the exepected behavior that I've observed: https://davetang.org/muse/2011/10/11/bwa-and-multi-mapping-reads/.

There is an option "-a" to output all alignments, but there may be a very large number of alternative alignments for some reads.

k1sauce · 2023-12-06T19:50:57Z

@yuk12 Is there any plan for #229, also is bwa-mem2 still being maintained?

vasimuddin · 2023-12-07T04:38:39Z

@k1sauce yes, it is being maintained. We are in the middle of fixing the issue, and will do a release this month.

shanebrubaker · 2024-05-08T18:14:31Z

HI is there any update on this? We would really like to use this fix and have it merged in. Thanks!

gh-jphan · 2024-05-08T18:24:41Z

I second that, and just resolved a conflict since the PR has been open for a while.

gh-jphan · 2024-05-08T18:32:47Z

I second that, and just resolved a conflict since the PR has been open for a while.

Unfortunately I do not have write access, I think it is up to: @yuk12

yuk12 · 2024-05-08T18:50:20Z

merged. Sorry for the delay. Appreciate the fix.

yuk12 · 2024-05-08T18:50:51Z

Will make a release after a few tests in a day or two.

serge2016 · 2024-05-08T19:09:42Z

Thank you!!!

shanebrubaker · 2024-05-09T15:51:49Z

Thank you everyone!!!! From: Sergey Mitrofanov ***@***.***> Date: Wednesday, May 8, 2024 at 12:10 PM To: bwa-mem2/bwa-mem2 ***@***.***> Cc: Shane Brubaker ***@***.***>, Comment ***@***.***> Subject: Re: [bwa-mem2/bwa-mem2] Reproducibility of primary alignments with bwa-mem2 (Issue #228) Myriad Security Notice: This is an external email. Do not click links or open attachments unless you recognize the sender and know the content is safe. Thank you!!! — Reply to this email directly, view it on GitHub<https://protect.checkpoint.com/v2/___https:/github.com/bwa-mem2/bwa-mem2/issues/228%23issuecomment-2101251962___.YzJ1Om15cmlhZGdlbmV0aWNzOmM6bzo5ZDYxMWM5Mzk0NjYyODJhNmE0NzZmZjVhMmIwMzc0Yzo2OmE2OGE6NjYxYjE1MGJmMmEzMDdlYjY1YjA0OTY3NTIwYTQ4MGZiODA1ZDFmODRhOGNlY2QyNmViNDc3ODMzOTczYzIyOTpoOlQ>, or unsubscribe<https://protect.checkpoint.com/v2/___https:/github.com/notifications/unsubscribe-auth/ASEMQ275N5LVNHD2NFWP5EDZBJ2AZAVCNFSM6AAAAAAW5DULMSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMBRGI2TCOJWGI___.YzJ1Om15cmlhZGdlbmV0aWNzOmM6bzo5ZDYxMWM5Mzk0NjYyODJhNmE0NzZmZjVhMmIwMzc0Yzo2OjViOGQ6MmZlM2YyMzExYjU2NTI0OWRjYWI3N2QwMDFkOTA2ZWE3ZjMxNmE3NGU5NWQ1ZmQ2YmIzNTNhODQ0NjQ0YTk0YjpoOlQ>. You are receiving this because you commented.Message ID: ***@***.***>

gh-jphan mentioned this issue Apr 13, 2023

Fix threading reproducibility issue by moving n_processed update line from step 2 to step 1 #229

Merged

quito418 mentioned this issue Jan 31, 2024

Why are bwa and BWA-MEME results inconsistent? kaist-ina/BWA-MEME#27

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproducibility of primary alignments with bwa-mem2 #228

Reproducibility of primary alignments with bwa-mem2 #228

gh-jphan commented Apr 13, 2023 •

edited

robertzeibich commented May 28, 2023

gh-jphan commented May 30, 2023

chappj1 commented Jun 2, 2023

gh-jphan commented Jun 2, 2023

k1sauce commented Dec 6, 2023

vasimuddin commented Dec 7, 2023

shanebrubaker commented May 8, 2024

gh-jphan commented May 8, 2024

gh-jphan commented May 8, 2024

yuk12 commented May 8, 2024

yuk12 commented May 8, 2024

serge2016 commented May 8, 2024

shanebrubaker commented May 9, 2024 via email

Reproducibility of primary alignments with bwa-mem2 #228

Reproducibility of primary alignments with bwa-mem2 #228

Comments

gh-jphan commented Apr 13, 2023 • edited

robertzeibich commented May 28, 2023

gh-jphan commented May 30, 2023

chappj1 commented Jun 2, 2023

gh-jphan commented Jun 2, 2023

k1sauce commented Dec 6, 2023

vasimuddin commented Dec 7, 2023

shanebrubaker commented May 8, 2024

gh-jphan commented May 8, 2024

gh-jphan commented May 8, 2024

yuk12 commented May 8, 2024

yuk12 commented May 8, 2024

serge2016 commented May 8, 2024

shanebrubaker commented May 9, 2024 via email

gh-jphan commented Apr 13, 2023 •

edited