Missing reads when transitioning from Dada2 to phyloseq #1933
Comments
I'm not sure what is defining the readcount function. If you are trying to get the total abundances (numbers of reads) in each sample from a phyloseq object, the appropriate function to use is sample_sums.
I tried sample_sums to check the reads, but it gave me the same result.
The problematic step is between seqtab.nochim and the physeq object. I assigned my taxa with the tryRC=TRUE argument in the assignTaxonomy function. Do you think that this could have an effect on the number of reads? Here is the code I used to assign the taxonomy:
I would suggest trying to construct the phyloseq object directly from the original sequence table and tax table from DADA2 (i.e. that contain the full DNA sequences as IDs), and checking if that solves the change in read counts issue. It is simpler and less error prone to then switch to "ASV1" style IDs in the phyloseq object. Code for this is included in the DADA2 tutorial "Handoff to phyloseq" section:
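A minimal sketch of that handoff, adapted from the DADA2 tutorial, could look like the following. The toy matrices, the sample names S1 and S2, and the samdf data frame are hypothetical stand-ins for the real DADA2 outputs. The read-count check is an added illustration and not part of the tutorial.

```r
library(phyloseq)

# Toy stand-ins for the DADA2 outputs (hypothetical data, for illustration only).
# seqtab.nochim: samples in rows, full ASV sequences as column names.
seqs <- c("ACGTACGTAC", "TTGCATTGCA", "GGCCAAGGCC")
seqtab.nochim <- matrix(c(10L, 0L, 5L,
                          3L, 7L, 0L),
                        nrow = 2, byrow = TRUE,
                        dimnames = list(c("S1", "S2"), seqs))

# taxa: assignTaxonomy() returns a matrix with the same sequences as row names.
taxa <- matrix(c("Bacteria", "Bacteria", "Bacteria",
                 "PhylumA", "PhylumB", NA),
               nrow = 3,
               dimnames = list(seqs, c("Kingdom", "Phylum")))

# Sample metadata; row names must match the row names of seqtab.nochim.
samdf <- data.frame(Site = c("a", "b"), row.names = c("S1", "S2"))

# Build the phyloseq object directly from the sequence-named tables.
ps <- phyloseq(otu_table(seqtab.nochim, taxa_are_rows = FALSE),
               sample_data(samdf),
               tax_table(taxa))

# Per-sample read totals should be unchanged by the handoff.
stopifnot(all(sample_sums(ps) == rowSums(seqtab.nochim)[sample_names(ps)]))

# Only now switch to short "ASV1"-style IDs, keeping the sequences in the refseq slot.
dna <- Biostrings::DNAStringSet(taxa_names(ps))
names(dna) <- taxa_names(ps)
ps <- merge_phyloseq(ps, dna)
taxa_names(ps) <- paste0("ASV", seq(ntaxa(ps)))
```

Because the renaming happens inside the phyloseq object, the OTU table and the taxonomy table cannot end up with mismatched IDs. Running sample_sums(ps) before and after the renaming step is a quick way to confirm that no reads were dropped.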
Hello,
I'm encountering a similar issue related to my sample reads. After completing the DADA2 pipeline according to the tutorial, I created a phyloseq object, but in doing so I lose reads in almost all my samples. Some of my samples lose over 50%.
Here is an excerpt from my track_reads file, with an added column for the read counts after I create my phyloseq object:
sample | input | filtered | dadaF | dadaR | merged | nonchim | finalPercReadsKept % | reads physeq obj
ArcB1 | 82650 | 47038 | 45576 | 46265 | 43855 | 42770 | 51.7 | 42770
AuB1 | 90559 | 56066 | 54249 | 55233 | 52257 | 50278 | 55.5 | 32479
AuB2 | 100718 | 62615 | 59976 | 60933 | 57438 | 55009 | 54.6 | 53549
AuB3 | 137090 | 81540 | 80356 | 80983 | 79213 | 76150 | 55.5 | 27221
There is a large discrepancy between the seqtab.nochim reads and the phyloseq object.
This is the code I use to create my seqtab.nochim file in the pipeline:
And here is the code I use to create my phyloseq object:
Any idea what could cause the reads to disappear? Or am I using the readcount function wrong? Thanks in advance!