Is it possible to get ASVs with zero counts in all samples after DADA2 run? #1958
I don't see how that would be possible if following the dada2 tutorial workflow. ASVs aren't added to the table unless they exist at >0 abundance, and steps like chimera removal remove the ASV column entirely; they don't just delete all the counts and leave a zero-study-wise-abundance ASV behind. Perhaps check for these zero-count ASVs at intermediate steps along the workflow, to isolate where they may be cropping up?
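That intermediate-step check is a one-liner on the sequence table. A minimal sketch, assuming `seqtab` is the standard samples-by-ASVs integer matrix (as produced by `makeSequenceTable` or `removeBimeraDenovo`) at whichever step you want to inspect:

```r
# Count ASVs whose total abundance across all samples is zero.
# Run this after each workflow step (denoising, merging, chimera
# removal, decontam, sample subsetting) to find where they appear.
zero_asvs <- colSums(seqtab) == 0
sum(zero_asvs)  # number of zero-count ASVs at this step
```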
Thank you very much @benjjneb, and sorry, yes, you're correct. Zero-count ASVs appear in the table only after I run decontam and then, finally, remove the blank samples from the table. One last related question: why would singletons appear in the counts table if, in the standard workflow, singleton detection is set to FALSE by default? Thank you again.
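For anyone hitting the same thing: once blank samples are dropped, columns whose counts lived only in the blanks sum to zero and can be removed directly. A sketch with hypothetical object names, assuming `seqtab.nochim` is the samples-by-ASVs matrix and `blanks` is a character vector of blank sample names:

```r
# Drop the blank samples, then remove ASVs whose total count
# became zero once the blanks were gone.
seqtab.final <- seqtab.nochim[!rownames(seqtab.nochim) %in% blanks, ]
seqtab.final <- seqtab.final[, colSums(seqtab.final) > 0]
```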
Singletons can appear due to merging. That is, a unique pair of forward and reverse ASVs, neither of which was a singleton itself, can produce a merged singleton if only one read pair had that combination.
Many thanks again for your kind clarification and your time, @benjjneb.
Hi
I'm a frequent user of your wonderful package. In the NovaSeq 6000 dataset (COI marker gene) that I'm currently analyzing, I noticed an odd thing in the ASV count output table for the first time: a couple hundred ASVs have zero read counts (out of a total of 59K+ ASVs) across all samples after completing the standard DADA2 workflow. Is this possible, or is it an artifact of using NovaSeq data? Would it be OK to proceed by simply removing these ASVs, or should I enforce monotonicity and rerun the pipeline, in the hope that such zero-count ASVs no longer appear?
I was initially hoping to just filter out the rare (and possibly erroneous) ASVs (those found in <2 sample libraries and represented by <10 reads), as was done in some published papers with such NovaSeq data, rather than spend a lot of time testing monotonicity-enforcing parameters, which could take a while given the size of my dataset. Rare ASVs aren't actually that important as far as my current project objectives go. However, I now realize that going down this rare-ASV elimination route would result in the loss of >94% of the reads across all of my samples.
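For concreteness, the prevalence/abundance filter described above (keep ASVs found in at least 2 samples and with at least 10 total reads) can be sketched in a few lines; `seqtab` and the thresholds here are illustrative, not from any published pipeline:

```r
# Prevalence = number of samples each ASV occurs in;
# abundance = total reads per ASV across all samples.
prevalence <- colSums(seqtab > 0)
abundance  <- colSums(seqtab)
keep <- prevalence >= 2 & abundance >= 10
seqtab.filt <- seqtab[, keep]

# Fraction of total reads the filter would discard.
1 - sum(seqtab.filt) / sum(seqtab)
```

Checking that last number before committing to the filter is what revealed the >94% read loss mentioned above.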
Thank you very much in advance for any valuable suggestions here.