bamtools random reads are highly skewed towards small chromosomes #230

Open
benbfly opened this issue Nov 12, 2023 · 0 comments

benbfly commented Nov 12, 2023

I believe bamtools random works by first picking a reference sequence uniformly at random, and then picking a random position within that reference.

For instance, human GRCh38 contains lots of tiny reference sequences, like chrUn_**** and chr1_****_random. Bamtools random will pick a large fraction of its reads from these sequences, even though they account for only a tiny fraction of all reads. This is undesirable behavior, especially since these sequences are atypical and consist mostly of high-copy, low-complexity repeats.
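
To illustrate the suspected skew, here is a minimal Python simulation. It is a sketch of the behavior as I understand it, not bamtools code: the reference names and lengths are made up (two large chromosomes plus 100 small contigs), and `pick_uniform_reference`, `pick_length_weighted`, and `small_fraction` are hypothetical helpers. Weighting by reference length is only one possible alternative, and it assumes roughly even coverage; weighting by per-reference read counts would match the data more closely.

```python
import random


def make_references(n_small=100):
    """Build a toy reference set. Lengths are illustrative, not real GRCh38 values."""
    refs = {"chrA": 200_000_000, "chrB": 150_000_000}
    for i in range(n_small):
        refs[f"contig_{i}"] = 50_000
    return refs


def pick_uniform_reference(refs, rng):
    """Pick a reference uniformly, then a position in it (the behavior suspected above)."""
    name = rng.choice(list(refs))
    return name, rng.randrange(refs[name])


def pick_length_weighted(refs, rng):
    """Pick a reference with probability proportional to its length, then a position."""
    names = list(refs)
    weights = [refs[name] for name in names]
    name = rng.choices(names, weights=weights, k=1)[0]
    return name, rng.randrange(refs[name])


def small_fraction(picker, refs, n_draws=10_000, seed=0):
    """Return the fraction of draws that land on one of the small contigs."""
    rng = random.Random(seed)
    hits = sum(
        picker(refs, rng)[0].startswith("contig_") for _ in range(n_draws)
    )
    return hits / n_draws


if __name__ == "__main__":
    refs = make_references()
    print("uniform over references:", small_fraction(pick_uniform_reference, refs))
    print("weighted by length:     ", small_fraction(pick_length_weighted, refs))
```

With these toy numbers, picking a reference uniformly sends about 98% of draws to the small contigs in expectation (100 of 102 references), while length weighting sends about 1.4% there (5 Mbp out of 355 Mbp).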
