Taxonomic profiles #32

Open
cmorganl opened this issue Jul 18, 2019 · 1 comment
Assignees
Labels
feature request A request for a new feature unlike one that already exists
Milestone

Comments

@cmorganl
Collaborator

It would be beneficial to output a table with the relative and absolute abundances of taxa based on individual reference packages. This should be in long-table (tidy) format for simple integration with the tidyverse and other new visualization tools.

An example table could be:

| Taxon | Lineage | Rank | RefPkg | Count | Relative_abundance |
| --- | --- | --- | --- | --- | --- |
| Euryarchaeota | Root; Archaea; Euryarchaeota | Phylum | McrA | 28 | 0.03 |
| Archaea | Root; Archaea | Kingdom | McrA | 56 | 0.06 |
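A table like the example above could be produced roughly as follows. This is only a sketch, not TreeSAPP code: the function names, the `(refpkg, lineage)` input shape, the rank-by-depth mapping, and the choice to count each sequence toward every taxon along its lineage and normalize by the total per reference package are all assumptions made for illustration.

```python
import csv
from collections import Counter

# Assumed rank names by lineage depth; "Root" (depth 0) gets no row.
RANKS = ["Kingdom", "Phylum", "Class", "Order", "Family", "Genus", "Species"]
FIELDS = ["Taxon", "Lineage", "Rank", "RefPkg", "Count", "Relative_abundance"]


def taxonomic_profile(assignments):
    """Build long-format rows from (refpkg, lineage) pairs, one pair per classified sequence."""
    counts = Counter()   # (refpkg, lineage prefix) -> number of sequences
    totals = Counter()   # refpkg -> number of classified sequences
    for refpkg, lineage in assignments:
        taxa = [t.strip() for t in lineage.split(";")]
        totals[refpkg] += 1
        # Count the sequence toward every taxon along its lineage (cumulative counts).
        for depth in range(1, len(taxa)):
            counts[(refpkg, "; ".join(taxa[:depth + 1]))] += 1

    rows = []
    for (refpkg, lineage), count in sorted(counts.items()):
        taxa = lineage.split("; ")
        rank_index = len(taxa) - 2
        rows.append({
            "Taxon": taxa[-1],
            "Lineage": lineage,
            "Rank": RANKS[rank_index] if rank_index < len(RANKS) else "Unranked",
            "RefPkg": refpkg,
            "Count": count,
            "Relative_abundance": count / totals[refpkg],
        })
    return rows


def write_tsv(rows, handle):
    """Write the rows as a tab-separated long table to an open text handle."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS, delimiter="\t")
    writer.writeheader()
    writer.writerows(rows)
```

Because each row is one taxon for one reference package, the output can be read directly with `readr::read_tsv()` and filtered or faceted by `RefPkg` and `Rank` without reshaping.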

Eventually, multiple marker genes, such as universal single-copy markers, should be used together to produce better estimates of abundance.
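How the markers would be combined is not decided in this issue. One possible approach, shown here purely as an assumption, is to take the median count per lineage across a set of single-copy markers, treating a marker with no hits for a lineage as zero. The function name, the row shape (dicts with `Lineage`, `RefPkg`, and `Count` keys), and the marker names are hypothetical.

```python
from collections import defaultdict
from statistics import median


def combine_markers(rows, refpkgs):
    """Return {lineage: median count across refpkgs}; missing markers count as 0."""
    per_lineage = defaultdict(dict)
    for row in rows:
        per_lineage[row["Lineage"]][row["RefPkg"]] = row["Count"]
    return {
        lineage: median(by_pkg.get(pkg, 0) for pkg in refpkgs)
        for lineage, by_pkg in per_lineage.items()
    }


# Hypothetical per-marker rows for a single lineage.
example_rows = [
    {"Lineage": "Root; Archaea", "RefPkg": "MarkerA", "Count": 10},
    {"Lineage": "Root; Archaea", "RefPkg": "MarkerB", "Count": 12},
]
combined = combine_markers(example_rows, ["MarkerA", "MarkerB", "MarkerC"])
```

A median is less sensitive than a mean to a single marker that is over- or under-recruited, but other estimators would be equally reasonable here.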

@cmorganl cmorganl added the enhancement Highlight something that could be improved. Please be specific, TreeSAPP isn't perfect. label Jul 18, 2019
@cmorganl cmorganl self-assigned this Jul 18, 2019
@cmorganl cmorganl modified the milestones: 0.6.0, 1.0.0 Jul 18, 2019
@cmorganl cmorganl added feature request A request for a new feature unlike one that already exists and removed enhancement Highlight something that could be improved. Please be specific, TreeSAPP isn't perfect. labels Jan 15, 2021
@cmorganl
Collaborator Author

This feature should be written to report summary data at multiple scales, from SAGs and MAGs all the way to metagenomes.

When profiling SAGs and MAGs, this should essentially function with CheckM. The only obstacle is reaching a critical number of reference packages: we currently have far too few for universal single-copy marker genes.

Another requirement is defining the set of reference packages relevant to different clades.
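One way to picture the clade-to-reference-package mapping is a lookup keyed by lineage prefix, falling back to the nearest ancestor that has a marker set defined. The sketch below is an assumption for discussion only: the mapping contents, marker names, and both function names are hypothetical, and the completeness score is a plain fraction of expected markers observed, not CheckM's algorithm.

```python
# Hypothetical mapping from clade lineage to the reference packages expected in it.
CLADE_MARKERS = {
    "Root": {"MarkerA", "MarkerB"},
    "Root; Archaea": {"MarkerA", "MarkerB", "MarkerC"},
}


def markers_for(lineage):
    """Return the marker set of the most specific clade defined for this lineage."""
    taxa = [t.strip() for t in lineage.split(";")]
    # Walk from the full lineage back toward the root.
    for depth in range(len(taxa), 0, -1):
        key = "; ".join(taxa[:depth])
        if key in CLADE_MARKERS:
            return CLADE_MARKERS[key]
    return set()


def marker_completeness(lineage, observed):
    """Fraction of expected markers observed in a genome, or None if none are defined."""
    expected = markers_for(lineage)
    if not expected:
        return None
    return len(expected & set(observed)) / len(expected)
```

For a MAG placed in `Root; Archaea; Euryarchaeota`, the lookup would fall back to the `Root; Archaea` set until a more specific entry exists.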
