-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Invalid alleles and invalid genes #189
Comments
Hello @lskatz! Thank you for your interest in chewBBACA and for sharing the steps to adapt the schema. Kind regards, Rafael |
Then, are you saying that these two loci are never going to be called when using ChewBBACA even though they are in the curated scheme? How do I identify which two these are? And which alleles they are? |
Does this also mean that the chewbbaca.online wgMLST scheme excludes these two cgMLST loci? |
The loci seem to have alternative start sites. Is it possible to specify alternative start sites when running PrepExternalSchema? I translated them and then show the first allele for each below.
|
Yes, those two loci will not be called. They are excluded during schema adaptation, meaning that the adapted schema will not contain any FASTA files or information about those loci. You can define the genetic code/table used by Prodigal (chewie <3.3.0)/Pyrodigal (chewie >= 3.3.0) and for translation, but I doubt that will help with those two loci. The default is genetic code 11 (The Bacterial, Archaeal and Plant Plastid Code). I have just checked, and as expected, those two loci are not in the wgMLST schema available on Chewie-NS. The PrepExternalSchema creates three output files, which are the following:
These files are created in the parent directory of the output directory you pass to the Also, I noticed that you are using Chewie v3.1.0. The latest version, v3.3.1, includes new features and several fixes, which you might find important. You can check the list of changes in the Changelog. Rafael |
Thank you! These insights are helpful! One more question is, what if I want to force these loci into the scheme? Is there a method to bring in one locus at a time? |
Not easily Lee, we have no option to do that. But even if we had the problem then becomes how and if would prodigal (or pyrodigal in the latest chewBBACA versions) recognize the loci in the query genomes. If these are not valid ORFs for pyrodigal, they will not be passed onto chewBBACA and will not be called. We do not seem to have a straightforward and simple solution to forcefully include the loci that would guarantee they would be called even if included. |
Thank you for helping me understand it so well! I understand where you are coming from. |
Hi, I am using ChewBBACA v3 and imported the EnteroBase Salmonella cgMLST scheme and got invalid genes. These are the steps I took.
And then I prepped the scheme with
Why do I see invalid genes and alleles? How can I fix it? Or is there anything to fix?
The text was updated successfully, but these errors were encountered: