-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segmentation fault (core dumped) #45
Comments
Could you provide more details about your environment? |
Hi @Griffan, I'm also getting this error but in my case it's when I'm trying to create new resource files for vbid2. My commandline is
I get the following error:
|
Hi @yfarjoun , I drafted a PR to print out the crash site under branch "develop_branch_with_backward_cpp" and its PR is:#65 Would you mind to checkout this branch and post the backtrace info as a first step? Thanks for reporting this issue! |
here's the stacktrace:
|
Thanks, @yfarjoun! Could you also post a few lines of the Vcf File? it seems to be related to the GT or PL feilds. |
I'm working with the 1000genomes file as input, so there's very little "secret" data here.... I added some fprintf lines and I have the format field that seems to throw it off. I printed the position at each vcf iteration and the sample index (and ID) in the begining of the sample iteration. so I ran
I think that both GT and PL look fine.... not sure what seems to be the problem. |
here's the vcf up to the problematic line: |
This is the problematic sample:"./.:.:42,0:42:.:.:.:.". I will try to fix it in the Debugging mode PR. |
Having missing genotypes may result in inaccurate PCA estimates. I would advise remove variants with many missing genotypes, and fill-in the remaining missing genotypes with best-guess genotypes (or dosages) before calculating PCs, which should be the common practice. |
I have updated the PR to apply QC filters on each VCF record. #65 |
Thanks. when I looked for the errant sample using @hyunminkang thanks for the reminder. I'm looking for a way to drastically increase the sensitivity of vbid2 and for that I need to use specific SNPs. it's odd that the 1KG snps have many missing genotypes... if it's only a few samples that are problematic and have many missing genotypes, I'll filter out these samples. If not, In lieu of removing the sites, I could also impute the missing genotypes, does that make sense? |
not my issue to close, but my part in this issue is resolved. |
The original issue should be different from the "—RefVCF" one. But the procedure to locate and report the specific crash scene is the same. Should anyone in future also encounter this error, please refer to the "Debugging Mode" section on README. |
get results ,but return unsuccessful code.
LOG:
Estimation from OptimizeHeter:
Contaminating Sample PC1:-0.025539 PC2:-0.0565987
Intended Sample PC1:-0.0166733 PC2:-0.0297511
FREEMIX(Alpha):0.000110694
NOTICE - Success!
run.sh: line 7: 3722258 Segmentation fault (core dumped) VerifyBamID2 --Reference GRCh38_full_analysis_set_plus_decoy_hla.fa --BamFile test.bam --Output out_prefiex --NumThread 4 --SVDPrefix 1000g.phase3.100k.b38.vcf.gz.dat
MUGQICexitStatus:139
The text was updated successfully, but these errors were encountered: