Adds option to write sample names in variantCounts #8656
In some cases it is useful to know which reads give rise to which specific variants. I have run into several cases while debugging strange results where this would have helped, and there is also a QC workflow we would like to implement for which this information is essential. It is unlikely to be generally useful, however.
This PR adds a flag, `--write-qnames`, which will, for each variant, write the list of qnames in the BAM that give rise to that variant as a comma-separated list in the final column. This PR also makes synonymous variants (with no protein-level consequence) write an empty value rather than nothing, in order to keep column order.
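As a rough illustration of the intended output shape (not the code in this PR), the sketch below groups read names by variant and formats one row per variant. The function names, the variant labels, and the three-column layout before the qnames column are all hypothetical; the real variantCounts columns may differ. Read names are de-duplicated per variant on the assumption that both mates of a pair share one qname.

```python
from collections import defaultdict


def group_qnames_by_variant(observations):
    """Map each variant to the read names (qnames) supporting it, in first-seen order.

    `observations` is an iterable of (qname, variant) pairs. This input shape
    is an assumption made for illustration only.
    """
    qnames_by_variant = defaultdict(list)
    for qname, variant in observations:
        # Mates of a pair share a qname, so record each name once per variant.
        if qname not in qnames_by_variant[variant]:
            qnames_by_variant[variant].append(qname)
    return dict(qnames_by_variant)


def format_row(variant, protein_change, qnames, write_qnames=False):
    """Build one tab-separated row in a hypothetical column layout."""
    # Synonymous variants get an explicit empty field so later columns stay aligned.
    fields = [variant, str(len(qnames)), protein_change or ""]
    if write_qnames:
        # The qnames go in the final column as a comma-separated list.
        fields.append(",".join(qnames))
    return "\t".join(fields)


if __name__ == "__main__":
    observations = [("read1", "A10T"), ("read2", "A10T"), ("read3", "C20C")]
    grouped = group_qnames_by_variant(observations)
    print(format_row("A10T", "K4N", grouped["A10T"], write_qnames=True))
    print(format_row("C20C", None, grouped["C20C"], write_qnames=True))
```

With the empty field in place, the qnames column stays at the same index for synonymous and non-synonymous variants, so downstream parsers can split on tabs without special-casing.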
This seems to work with single-end (SE) reads, but hasn't been tested much with paired-end (PE) reads.
Read names should probably also not be parsed by default, but only when `--write-qnames` is set.