Add the taxonomic category "Kingdom" for eukaryotes #747

Open
glenjasper opened this issue Nov 8, 2023 · 4 comments
@glenjasper

It would be fantastic if a future update added the taxonomic rank "kingdom". This rank applies not only to eukaryotes (kingdoms: Fungi, Animalia, Viridiplantae, Metazoa, etc.) but also to viruses. Of course, it doesn't apply to bacteria and archaea, which would have the value NA. The kingdom rank is well defined in the NCBI Taxonomy database.

Best,
Glen,

@fpusan
Collaborator

fpusan commented Nov 8, 2023

This is in principle possible.

Judging by the LCA_tax/parents.txt file in the SqueezeMeta database folder, kingdom is indeed a well-defined field.

See e.g. Dangeardiella macrospora superkingdom:Eukaryota;clade:Opisthokonta;kingdom:Fungi;subkingdom:Dikarya;phylum:Ascomycota;clade:saccharomyceta;subphylum:Pezizomycotina;clade:leotiomyceta;clade:dothideomyceta;class:Dothideomycetes;no rank:Dothideomycetes incertae sedis;genus:Dangeardiella;species:Dangeardiella macrospora 100009
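As a rough illustration of how the kingdom could be pulled out of such a lineage string, here is a minimal Python sketch. It only assumes the semicolon-separated rank:taxon layout visible in the entry above; the function name parse_lineage is hypothetical, the lineage is a shortened copy of the example, and the exact column layout of parents.txt is not shown in this thread, so it is not handled here. SqueezeMeta itself is not written this way.

```python
def parse_lineage(lineage):
    """Map each rank to its taxon; repeated ranks (e.g. clade) keep the first value."""
    ranks = {}
    for field in lineage.split(";"):
        field = field.strip()
        if not field:
            continue
        # Split on the first colon only, so taxon names containing ':' survive.
        rank, _, taxon = field.partition(":")
        ranks.setdefault(rank, taxon)
    return ranks


# Shortened version of the Dangeardiella macrospora lineage quoted above.
lineage = (
    "superkingdom:Eukaryota;clade:Opisthokonta;kingdom:Fungi;"
    "subkingdom:Dikarya;phylum:Ascomycota;genus:Dangeardiella;"
    "species:Dangeardiella macrospora"
)
ranks = parse_lineage(lineage)
print(ranks.get("kingdom"))  # Fungi
```

Lineages without a kingdom field simply return None from ranks.get("kingdom"), which corresponds to the NA case discussed in this issue.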

We could maybe make Bacteria and Archaea have the same value as in the superkingdom, instead of assigning NA. On the one hand this is not technically correct; on the other hand it would make our life easier (e.g. doing plotTaxonomy at the kingdom level would break if we have NAs around).
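The two options can be compared with a small Python sketch. It is only an illustration under stated assumptions: the function kingdom_label, its fallback_to_superkingdom flag, and the rank dictionaries are hypothetical and are not part of SqueezeMeta or SQMtools.

```python
PROKARYOTES = {"Bacteria", "Archaea"}


def kingdom_label(ranks, fallback_to_superkingdom=True):
    """Return the kingdom, optionally reusing the superkingdom for prokaryotes."""
    kingdom = ranks.get("kingdom")
    if kingdom is not None:
        return kingdom
    superkingdom = ranks.get("superkingdom")
    # Only Bacteria and Archaea reuse their superkingdom; other taxa
    # without a kingdom stay missing (None plays the role of NA here).
    if fallback_to_superkingdom and superkingdom in PROKARYOTES:
        return superkingdom
    return None


bacterium = {"superkingdom": "Bacteria", "phylum": "Cyanobacteria"}
fungus = {"superkingdom": "Eukaryota", "kingdom": "Fungi"}

print(kingdom_label(bacterium))                                  # Bacteria
print(kingdom_label(bacterium, fallback_to_superkingdom=False))  # None
print(kingdom_label(fungus))                                     # Fungi
```

With the fallback enabled, every prokaryotic taxon gets a non-missing kingdom label, at the cost of reporting a rank that does not exist for it.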

This would involve changes in the database creation step, several SqueezeMeta scripts, and SQMtools. I don't expect the individual changes to be very big, but added up this is a somewhat large undertaking. @jtamames what do you think?

@jtamames
Owner

jtamames commented Nov 8, 2023

Hello
It would be technically possible, but it would break several things. We would have to redefine the accepted taxonomy levels and tweak several scripts to accept the change.

I would keep Bacteria and Archaea as NA for that rank rather than creating a non-existent kingdom rank for them. You are probably already dealing with NAs somehow, @fpusan, since some taxa don't have some intermediate ranks (I am thinking of cyanos here).

I will think about this for future versions.

Best,
J

@fpusan
Collaborator

fpusan commented Nov 8, 2023

Ah yes, I actually do. I add a "no rank in NCBI" tag to those cases.
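A minimal Python sketch of that kind of placeholder tagging follows. The list of levels, the function name fill_missing_ranks, the exact placeholder wording, and the example lineage are all assumptions made for illustration; the real SQMtools code and labels may differ.

```python
# Assumed set of reported levels; a "kingdom" entry could be added here.
LEVELS = ("superkingdom", "phylum", "class", "order", "family", "genus", "species")


def fill_missing_ranks(ranks, levels=LEVELS):
    """Return one label per level, tagging absent ranks instead of leaving NA."""
    return {level: ranks.get(level, f"no {level} in NCBI") for level in levels}


# Illustrative lineage with missing intermediate ranks (not copied from NCBI).
partial = {"superkingdom": "Bacteria", "phylum": "Cyanobacteria", "genus": "Synechococcus"}
filled = fill_missing_ranks(partial)
print(filled["class"])  # no class in NCBI
print(filled["genus"])  # Synechococcus
```

The same approach would give Bacteria and Archaea an explicit placeholder at the kingdom level, so downstream tables contain no missing values.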

@glenjasper
Author

Excellent. Perhaps after further analysis you will decide to implement it.

Best,
Glen,

fpusan added the "enhancement" and "Taxonomic annotation" labels on Jan 16, 2024