Skip to content
This repository has been archived by the owner on Mar 10, 2023. It is now read-only.

Add option to make per-cell bam files #96

Open
olgabot opened this issue Aug 17, 2020 · 2 comments
Open

Add option to make per-cell bam files #96

olgabot opened this issue Aug 17, 2020 · 2 comments

Comments

@olgabot
Copy link
Collaborator

olgabot commented Aug 17, 2020

Currently, if one wants to count reads with differential hashes, in genes, one needs to grep/search the ENTIRE 22-gigabyte channel bam file for one single cell (out of ~700,000), which is extremely inefficient. So let's do this work up fron.
After filtering for the good barcodes, then add the option to create per-cell bam files which are useful for nf-predictorthologs.

script:
barcode_pattern = "CB:Z:${cell_barcode}-1|XC:Z:${cell_barcode}" 
"""
samtools view ${channel_bam} \\
  | rg --threads ${task.cpus}  '${barcode_pattern}' - \\
  | cat ${header_sam} - \\
  | samtools view -Sb > ${cell_barcode_bam}
"""

@lekhakaranam may be a good feature to add after the template merge (#93 )

@pranathivemuri
Copy link
Contributor

looks like this PR was opened but closed a while ago - #97

@olgabot
Copy link
Collaborator Author

olgabot commented Oct 21, 2020

Oh yeah I think there were some merge/rebase issues

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants