Apologies if I've missed this in the documentation. I wanted to clarify how KrakenUniq handles multiple databases when run as:
krakenuniq --db HOST --db PROK --db EUK_DRAFT
Am I correct in assuming that only k-mers that do not match the HOST DB are subsequently searched in the PROK DB?
Would this generally be a more conservative way to remove host DNA than including the host genome in a single DB?
Given a single taxonomy, is it possible to have the same genome in multiple DBs, or does this cause problems? Is it important to ensure the DBs do not overlap?
Hmm, maybe some of the others will answer, but all I can say is what I do: I never run KrakenUniq with multiple DBs. Instead, I run it with one DB and then use KrakenTools to extract all the unmapped (unclassified) reads. I then take those reads and, if I have a second DB, run them against it.
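For anyone who wants to see what the "extract the unmapped reads" step amounts to, here is a minimal Python sketch. It assumes the standard Kraken-style per-read output (tab-separated, first column `C` or `U`, second column the read ID) and single-end FASTQ input. The helper names `unclassified_ids` and `filter_fastq` are hypothetical and are not part of KrakenUniq or KrakenTools; in practice KrakenTools does this job for you.

```python
import io

def unclassified_ids(kraken_output):
    """Collect read IDs flagged 'U' (unclassified) in Kraken-style output."""
    ids = set()
    for line in kraken_output:
        fields = line.rstrip("\n").split("\t")
        if len(fields) >= 2 and fields[0] == "U":
            ids.add(fields[1])
    return ids

def filter_fastq(fastq, keep_ids):
    """Yield 4-line FASTQ records whose read ID is in keep_ids."""
    while True:
        header = fastq.readline()
        if not header:
            break
        record = [header] + [fastq.readline() for _ in range(3)]
        parts = header[1:].split()
        read_id = parts[0] if parts else ""
        if read_id in keep_ids:
            yield "".join(record)

# Toy data standing in for the first-pass output and the original reads.
kraken_lines = io.StringIO(
    "C\tread1\t9606\t100\t9606:70\n"
    "U\tread2\t0\t100\t0:70\n"
)
reads = io.StringIO(
    "@read1\nACGT\n+\nIIII\n"
    "@read2 extra\nTTTT\n+\nIIII\n"
)
leftover = list(filter_fastq(reads, unclassified_ids(kraken_lines)))
print("".join(leftover), end="")
```

The records in `leftover` are what you would write to a new FASTQ file and pass to the second DB. Paired-end data would need the same filter applied to both mate files.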
To remove host DNA, I usually align the reads against the human genome with bowtie2 (human is the only host I've filtered out), take the unmapped reads, and run those through KrakenUniq.
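A rough sketch of that host-filtering workflow, written as Python that only builds and prints the two commands. All paths, the index prefix, and the DB name are hypothetical placeholders, and the sketch assumes single-end reads. The bowtie2 options used are `-x`, `-U`, `--un`, `-S` and `-p`; the KrakenUniq options are `--db`, `--threads`, `--report-file` and `--output`. Check both tools' help output before relying on this.

```python
import shutil
import subprocess

def host_filter_cmd(index_prefix, reads, unaligned_out, threads=4):
    """bowtie2 command that writes reads NOT aligning to the host to unaligned_out."""
    return ["bowtie2", "-p", str(threads), "-x", index_prefix,
            "-U", reads, "--un", unaligned_out, "-S", "/dev/null"]

def classify_cmd(db, reads, report, output, threads=4):
    """KrakenUniq command that classifies the host-depleted reads."""
    return ["krakenuniq", "--db", db, "--threads", str(threads),
            "--report-file", report, "--output", output, reads]

def run_all(cmds):
    """Run the commands in order, but only if every tool is on PATH."""
    missing = [c[0] for c in cmds if shutil.which(c[0]) is None]
    if missing:
        print("not running, missing tools:", ", ".join(missing))
        return False
    for cmd in cmds:
        subprocess.run(cmd, check=True)
    return True

# Hypothetical file names, for illustration only.
cmds = [
    host_filter_cmd("human_index", "sample.fastq", "sample.nohost.fastq"),
    classify_cmd("PROK", "sample.nohost.fastq", "report.tsv", "reads.tsv"),
]
for cmd in cmds:
    print(" ".join(cmd))
```

The point of the design is that the aligner decides what counts as host, so KrakenUniq never sees those reads. For paired-end data, bowtie2 would take `-1`/`-2` with `--un-conc` and KrakenUniq would take `--paired` instead.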
It does not cause a problem to have the same genome in multiple DBs. However, some k-mers might get assigned to different taxonomic IDs if you do that.
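To illustrate why the same k-mer can end up with different taxonomic IDs, here is a toy Python sketch. It relies on the fact that a Kraken-style DB stores, for each k-mer, the lowest common ancestor (LCA) of the genomes in that DB that contain it. The taxonomy, sequences, and function names are made up for the example, taxon names stand in for numeric IDs, and reverse complements are ignored for simplicity.

```python
# Hypothetical toy taxonomy: child -> parent.
PARENT = {
    "E. coli K-12": "Escherichia coli",
    "E. coli O157": "Escherichia coli",
    "Escherichia coli": "Escherichia",
    "Escherichia": "root",
}

def lineage(taxon):
    """Path from a taxon up to the root, taxon first."""
    path = [taxon]
    while path[-1] != "root":
        path.append(PARENT[path[-1]])
    return path

def lca(taxa):
    """Lowest common ancestor of one or more taxa."""
    taxa = list(taxa)
    common = lineage(taxa[0])
    for t in taxa[1:]:
        other = set(lineage(t))
        common = [x for x in common if x in other]
    return common[0]

def build_db(genomes, k):
    """Map each k-mer to the LCA of the genomes in this DB that contain it."""
    owners = {}
    for taxon, seq in genomes.items():
        for i in range(len(seq) - k + 1):
            owners.setdefault(seq[i:i + k], set()).add(taxon)
    return {kmer: lca(sorted(t)) for kmer, t in owners.items()}

# The same K-12 genome appears in both DBs, but DB B also holds a relative.
db_a = build_db({"E. coli K-12": "ACGTAC"}, k=4)
db_b = build_db({"E. coli K-12": "ACGTAC", "E. coli O157": "ACGTTT"}, k=4)

print(db_a["ACGT"])  # E. coli K-12
print(db_b["ACGT"])  # Escherichia coli
```

In DB A the k-mer `ACGT` is specific to the strain, while in DB B it is shared with a second strain and moves up to the species. Which label a read gets then depends on which DB answers for that k-mer.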