
Understanding of --proteins flag #169

Answered by oschwengers
Rridley7 asked this question in Q&A

Hi and thanks for this question!
The --proteins option is not used to filter CDS but to improve their annotation. If you have a trusted set of proteins with decent annotations that are either missing from the standard database or better (more descriptive, more specific) than what the database provides, then the --proteins option offers a simple mechanism to feed these into the normal functional annotation workflow of Bakta.

So, it does not interfere with gene prediction, regional annotation or filtering; it only affects the functional annotation of predicted protein-coding genes (CDS). Via the following two simple FASTA header schemas, users can provide a lot of information.

Via the short sch…
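
As a rough illustration of where the option fits on the command line, here is a minimal Python sketch that assembles a Bakta call with and without --proteins. The helper name `build_bakta_command` and the file names are hypothetical, chosen only for this example; the sketch assumes the usual `bakta --db <db> [--proteins <fasta>] <genome>` form and only builds the argument list, it does not run Bakta.

```python
import shlex


def build_bakta_command(genome, db, proteins=None):
    """Return the argument list for a Bakta run (hypothetical helper).

    genome, db and proteins are paths given as strings. When proteins is
    given, the trusted protein FASTA is passed via --proteins so that it
    is used during functional annotation of the predicted CDS.
    """
    cmd = ["bakta", "--db", str(db)]
    if proteins is not None:
        # Affects functional annotation only, not gene prediction or filtering.
        cmd += ["--proteins", str(proteins)]
    cmd.append(str(genome))
    return cmd


if __name__ == "__main__":
    # File names below are placeholders, not taken from the discussion.
    cmd = build_bakta_command("genome.fasta", "db", "trusted_proteins.faa")
    print(shlex.join(cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` on a machine where Bakta and its database are installed.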

Answer selected by oschwengers