Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Polish only INDELs or only SNPs, and polishing thresholds #95

Open
mmontonerin opened this issue Jun 13, 2022 · 1 comment
Open

Polish only INDELs or only SNPs, and polishing thresholds #95

mmontonerin opened this issue Jun 13, 2022 · 1 comment

Comments

@mmontonerin
Copy link

Hi, I have tried NextPolish, and oveall I am happy with it, but I miss a bit pore possibilities to select what to polish in order to trust what is doing to the de novo genome assemblies I am working with.

One functionality that I feel I miss in NextPolish is the possibility to fix either only INDELs or only SNPs, depending on the type of data that is being used. For example, I have a set of short reads that I would want to use to only correct INDELs, as many SNPs could be just normal heterozygous sites, in different proportions in different datasets.

I also miss the possibility to be a bit more conservative in polishing, and be able to select a certain depth or quality threshold for a position to be polished.

Do you plan to implement any of these functionalities in the future?

@moold
Copy link
Member

moold commented Jun 14, 2022

Hi, first, thank you for your good suggestions. However, SNP and INDEL are hard to distinguish for NextPolish, because NextPolish correct error-bases using kmers, so NextPolish does not distinguish between SNP and INDEL. For heterozygous kmer, NextPolish selects the kmer with the most counts as the corrected kmer.

BTW, I will consider your suggestion and maybe add some extra functions/parameters in the future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants