Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug Funcotator: wrong canonical #8714

Open
zhanyinx opened this issue Mar 5, 2024 · 1 comment
Open

Bug Funcotator: wrong canonical #8714

zhanyinx opened this issue Mar 5, 2024 · 1 comment

Comments

@zhanyinx
Copy link

zhanyinx commented Mar 5, 2024

Bug Report

Affected tool(s) or class(es)

Funcotator

Affected version(s)

4.5.0.0

Description

Funcotator does not select the correct transcript. It does not select the Canonical trascript

Steps to reproduce

Command line used:

gatk Funcotator -L null -R path2/hg19.fasta -V file.vcf.gz -O GERS014SDZ-1N.maf --annotation-default Matched_Norm_Sample_Barcode:GERS014SDZ-1N --remove-filtered-variants true --output-file-format MAF --data-sources-path path2/hg19/funcotator_dataSources.v1.8.2024g --ref-version hg19 --splice-site-window-size 5 --interval-padding 2

Expected behavior

It should annotate the variant using the canonical transcript

Actual behavior

It uses a transcript which is not the Ensembl_canonical (MANE_select)

More details

I am using the gencode v43 from the funcotator last datasource.
In the gencode.v43lift37.annotation.REORDERED.gtf, I see that the canonical transcript from the gene MUTYH is ENST00000456914.7_10. However, in the output maf file, I got ENST00000531105.5_4 transcript.

I attach the input vcf and the output maf file

file.vcf.gz
GERS014SDZ-1N.maf.zip

Thanks for your help
Best
Zhan

@gokalpcelik
Copy link
Contributor

Hi @zhanyinx
Funcotator's canonical transcript designation is different than what is presented by other tools therefore if you wish to force annotate a variant with a certain transcript you need to provide the names of those transcripts with the parameter below

--transcript-list <String>    File to use as a list of transcripts (one transcript ID per line, version numbers are
                              ignored) OR A set of transcript IDs to use for annotation to override selected transcript.
                              This argument may be specified 0 or more times. Default value: null. 

Funcotator's method for selecting transcripts is summarized below

BEST_EFFECT

Select a transcript to be reported with details with priority on effect according to the folowing list of selection criteria:

Choose the transcript that is on the custom list specified by the user. If no list was specified, treat as if no transcripts were on the list (tie).
In case of tie, choose the transcript that yields the variant classification highest on the variant classification rank list (see below).
If still a tie, choose the transcript with highest level of curation. Note that this means lower number is better for level (see below).
If still a tie, choose the transcript with the best appris annotation (see below).
If still a tie, choose the transcript with the longest transcript sequence length.
If still a tie, choose the first transcript, alphabetically.

CANONICAL

Select a transcript to be reported with details with priority on canonical order according to the folowing list of selection criteria:

Choose the transcript that is on the custom list specified by the user. If no list was specified, treat as if all transcripts were on the list (tie).
In case of tie, choose the transcript with highest level of curation. Note that this means lower number is better for level (see below).
If still a tie, choose the transcript that yields the variant classification highest on the variant classification rank list (see below).
If still a tie, choose the transcript with the best appris annotation (see below).
If still a tie, choose the transcript with the longest transcript sequence length.
If still a tie, choose the first transcript, alphabetically.

ALL

Same as CANONICAL, but indicates that no transcripts should be dropped. Render all overlapping transcripts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants