Adds option to write sample names in variantCounts #8656

odcambc · 2024-01-15T22:08:04Z

In some cases it is useful to know which reads give rise to which variants. I have hit several such cases while debugging strange results, and we would also like to implement a QC workflow for which this information is essential. It is unlikely to be generally useful, however.

This PR adds a flag, --write-qnames, which writes, for each variant, the qnames of the BAM reads that give rise to that variant as a comma-separated list in the final column.

This PR also makes synonymous variants (those with no protein-level consequence) write an empty value rather than nothing, so that the column order is preserved.
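
For anyone consuming the new output downstream, here is a minimal Python sketch of reading a row. It assumes the file is tab-delimited and uses made-up rows. The only layout facts taken from this PR are that the qnames are a comma-separated list in the final column and that a synonymous variant carries an empty value in place of the missing protein-level field. The function name and the example columns are hypothetical.

```python
from typing import List, Tuple


def split_variant_line(line: str) -> Tuple[List[str], List[str]]:
    """Split one row into (leading columns, qnames from the final column).

    Assumes tab-delimited rows written with --write-qnames.
    """
    # Strip only the newline: a plain strip() would also eat trailing tabs
    # and silently drop empty trailing fields.
    fields = line.rstrip("\n").split("\t")
    *columns, qname_field = fields
    qnames = [q for q in qname_field.split(",") if q]
    return columns, qnames


# Hypothetical rows; the real variantCounts column layout may differ.
coding = "12\tA10G\tK4R\tread1,read7,read9\n"
synonymous = "5\tA12G\t\tread3\n"  # empty protein-level field

cols, names = split_variant_line(coding)
syn_cols, syn_names = split_variant_line(synonymous)
```

Because the synonymous row still has an (empty) protein-level field, both rows split into the same number of columns, so the qnames can always be taken from the last position.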

This seems to work with single-end (SE) reads, but has not been tested much with paired-end (PE) reads.

The tool should probably also parse read names only when --write-qnames is set, rather than by default.
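
One way to gate the read-name handling on the flag is sketched below in Python. This is purely illustrative and not the code in this PR: the class, its methods, and the two-column row layout are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Optional


class VariantTally:
    """Hypothetical per-variant counter that keeps qnames only on request."""

    def __init__(self, write_qnames: bool = False) -> None:
        self.write_qnames = write_qnames
        self.counts: Dict[str, int] = defaultdict(int)
        self.qnames: Dict[str, List[str]] = defaultdict(list)

    def add(self, variant: str, qname: Optional[str] = None) -> None:
        self.counts[variant] += 1
        # Touch read names only when the flag is set, so the default
        # path pays no extra memory or parsing cost.
        if self.write_qnames and qname is not None:
            self.qnames[variant].append(qname)

    def row(self, variant: str) -> str:
        fields = [variant, str(self.counts[variant])]
        if self.write_qnames:
            # qnames go in the final column as a comma-separated list.
            fields.append(",".join(self.qnames[variant]))
        return "\t".join(fields)
```

With the flag off, the output row is unchanged from the default format and no read names are stored.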

odcambc added 6 commits October 20, 2023 16:33

Add option for keeping disjoint mates in ASM

065012a

Better name and fixing reports

a54d606

Finish fixing report

c4a9ed9

Fix report name

f60c975

Adds support for writing qnames

30f41a0

Merge branch 'broadinstitute:master' into name

6f10f00
