Translate gene #205

Open
wants to merge 17 commits into
base: dev

Conversation

jpjarnoux
Member

@jpjarnoux jpjarnoux commented Mar 28, 2024

Two things have been done here:

  1. Added the ability to write the translated sequences of all genes, using MMseqs2, with the --proteins option, which works like the other options of the ppanggolin fasta command.
  2. In MSA, PPanGGOLiN prints a warning message if a gene sequence length is not divisible by 3. To be more flexible, the last one or two nucleotides are now removed when the sequence length is not a multiple of 3, and a debug message reports the partial genes.
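
For reviewers, the trimming described in point 2 can be sketched roughly as follows. This is only an illustration, not the code in this PR: the function name `trim_to_codon_length` and its arguments are hypothetical, and the real implementation may log or handle partial genes differently.

```python
import logging

logger = logging.getLogger(__name__)


def trim_to_codon_length(gene_id: str, dna: str) -> str:
    """Hypothetical helper: drop the trailing 1 or 2 nucleotides so that
    the sequence length becomes a multiple of 3."""
    remainder = len(dna) % 3
    if remainder:
        # Report the partial gene at debug level instead of failing.
        logger.debug(
            "Gene %s has length %d (not a multiple of 3); trimming %d nucleotide(s)",
            gene_id, len(dna), remainder,
        )
        return dna[:-remainder]
    # Length is already a multiple of 3: return the sequence unchanged.
    return dna


if __name__ == "__main__":
    print(trim_to_codon_length("gene_1", "ATGAAAT"))
```

The guard on `remainder` matters: slicing with `dna[:-0]` would return an empty string, so sequences that are already complete must be returned as they are.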

@axbazin axbazin self-requested a review April 9, 2024 07:16
@jpjarnoux
Member Author

MMseqs2 createdb must be forced to mode 1, so translate_genes has to set the mode explicitly.

@JeanMainguy JeanMainguy self-requested a review June 5, 2024 14:23
@@ -100,6 +100,7 @@ jobs:
ppanggolin fasta -p stepbystep/pangenome.h5 --output stepbystep -f --prot_families module_0
ppanggolin fasta -p stepbystep/pangenome.h5 --output stepbystep -f --prot_families core
ppanggolin fasta -p stepbystep/pangenome.h5 --output stepbystep -f --gene_families module_0 --genes module_0
ppanggolin fasta -p stepbystep/pangenome.h5 --output stepbystep -f --proteins cloud --threads 1 --keep_tmp
Member

You may want to use the NUM_CPU variable here.

3 participants