per base sequence quality fix: https://github.com/smithlabcode/falco/… #27

@Shelestova-Anastasia commented Feb 25, 2022

#25

When per-base sequence quality is calculated for a group of positions (for example 10-14), we need to compute the percentiles for each base position in the group, sum them, and then divide the sum by the number of base positions in the group.
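A minimal sketch of that averaging, not the code from this PR. The histogram layout and the names `QualityHistogram`, `position_percentile` and `group_percentile` are hypothetical, and the rule "smallest quality whose cumulative count reaches the threshold" is an assumption made for illustration:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical layout: hist[q] = number of reads with quality q at one position.
using QualityHistogram = std::vector<std::uint64_t>;

// Percentile for a single base position: the smallest quality value whose
// cumulative count reaches fraction * total (assumed rule, see lead-in).
double position_percentile(const QualityHistogram &hist, double fraction) {
  assert(!hist.empty());
  std::uint64_t total = 0;
  for (std::uint64_t c : hist) total += c;
  const double thresh = fraction * static_cast<double>(total);
  std::uint64_t cumulative = 0;
  for (std::size_t q = 0; q < hist.size(); ++q) {
    cumulative += hist[q];
    if (static_cast<double>(cumulative) >= thresh) return static_cast<double>(q);
  }
  return static_cast<double>(hist.size() - 1);
}

// Percentile for a group of positions: sum the per-position percentiles,
// then divide by the number of positions in the group.
double group_percentile(const std::vector<QualityHistogram> &group,
                        double fraction) {
  assert(!group.empty());
  double sum = 0.0;
  for (const QualityHistogram &hist : group)
    sum += position_percentile(hist, fraction);
  return sum / static_cast<double>(group.size());
}
```

For a group of two positions where every read has quality 2 at the first and quality 4 at the second, `group_percentile(group, 0.5)` gives 3. Pooling both histograms into one and applying the same rule would give 2, so the two approaches can disagree.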

With this change the results almost match FastQC. The only remaining difference is the percentile_thresh calculation, and your thresholds seem to be more accurate.

FastQC calculates the threshold as a long: long percentile_thresh = totalCounts * percentile / 100;
Falco calculates it as a double, for example: ldecile_thresh = 0.1 * bases_in_group;
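To illustrate where the two expressions quoted above diverge, here is a small sketch. The function names `thresh_integer` and `thresh_floating` are hypothetical, and neither function is taken from FastQC or falco:

```cpp
#include <cassert>
#include <cstdint>

// Integer threshold, mirroring the `long` expression quoted above.
// Integer division truncates any fractional part.
std::int64_t thresh_integer(std::int64_t total_counts, std::int64_t percentile) {
  assert(percentile >= 0 && percentile <= 100);
  return total_counts * percentile / 100;
}

// Floating-point threshold, mirroring the `double` expression quoted above.
// The fractional part is kept.
double thresh_floating(std::int64_t bases_in_group, double fraction) {
  assert(fraction >= 0.0 && fraction <= 1.0);
  return fraction * static_cast<double>(bases_in_group);
}
```

With 15 counts and the 10th percentile, the integer version gives 1 while the floating-point version gives a value between 1 and 2. If the percentile is taken as the first quality whose cumulative count reaches the threshold (an assumption here), a cumulative count of 1 meets the integer threshold but not the floating-point one, so the two tools can report different quality values when the product is not a whole number.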

@guilhermesena1 guilhermesena1 merged commit d9f18f3 into smithlabcode:master Mar 1, 2022