Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why my metaeuk search is very slow? #9

Open
liuxianghui opened this issue Jun 12, 2020 · 3 comments
Open

Why my metaeuk search is very slow? #9

liuxianghui opened this issue Jun 12, 2020 · 3 comments

Comments

@liuxianghui
Copy link

Dear Sir,
Thanks for developing the tool for prediction of euk proteins from contigs.
I downloaded the database for gut microbiome gene catalog amino acid sequences
fasta: humanGut_IGC.11M.pep.fa
from cngb database https://db.cngb.org/microbiome/genecatalog/genecatalog/?gene_name=Human%20Gut%20(11M))
and made it into a mmseq2 database twins_IGC.
11M seems big but shall be big but yours MMETSP_zenodo_3247846_uniclust90_2018_08_seed_valid_taxids seems to be much bigger with ~88 million proteins and I have no problem to use my contigs dataset allDBswabs to query for it (only take several days).

metaeuk easy-predict allDBswabs twins_IGC predsResultProteinsswabs_twins_IGC tmp
metaeuk taxtocontig allDBswabs predsResultProteinsswabs_twins_IGC.fas predsResultProteinsswabs_twins_IGC.fas.headersMap.tsv twins_IGC taxResultswabs_twins_IGC tmp --majority 0.5 --tax-lineage --lca-mode 2
However, the job has been running for weeks and I have no idea about it because there is no output.
Job id Name User Time Use S Queue


100780.pbs01 predictIGC xianghui 1888:55: R std
I hope you could kindly suggest,
My query allDBswabs contains 102934 sequences.
Does it help if i try to specify more memory for it or like mmseq2, try to run mpi process? How to run mpi for metaeuk.
Xianghui

Expected Behavior

Current Behavior

Steps to Reproduce (for bugs)

Please make sure to execute the reproduction steps with newly recreated and empty tmp folders.

MetaEuk Output (for bugs)

Please make sure to also post the complete output of MetaEuk. You can use gist.github.com for large output.

Context

Providing context helps us come up with a solution and improve our documentation for the future.

Your Environment

Include as many relevant details about the environment you experienced the bug in.

  • Git commit used (The string after "MetaEuk Version:" when you execute MetaEuk without any parameters):
  • Which MetaEuk version was used (Statically-compiled, self-compiled, Homebrew, etc.):
  • For self-compiled and Homebrew: Compiler and Cmake versions used and their invocation:
  • Server specifications (especially CPU support for AVX2/SSE and amount of system memory):
  • Operating system and version:
@milot-mirdita
Copy link
Member

What kind of machine are you running metaeuk on? There is no output at all?

@liuxianghui
Copy link
Author

I am running this on a linux cluster. My job was submitted with 64 cores. It finally came to an end. I am very surprised that it took a lot time. Could the problem caused by any indexing?

@elileka
Copy link
Member

elileka commented Jun 20, 2020

Could you please send us the log files produced during the run?
Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants