Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only one pair of homologous chromosomes were not phased #21

Open
apple-orange-banana opened this issue Aug 26, 2023 · 5 comments
Open

Comments

@apple-orange-banana
Copy link

Hi~
SubPhaser is a great piece of software, I have suffered some problems when I use this software to phase my diploid genome.

After the previous hic scaffolding, I got 22 superscaffolds, then I want to divide these scaffolds into 2 parts(2n=2x=22).
3c28a7d68e52191977cb542b72062b2
so I used this SubPhaser(-k 17 -q 50 -f 1.5), then 20 superscaffolds were phased and only 1 pair of homologous scaffolds(scaffold_9 and scaffold_10) were not phased.
0d8d5c8a695e265d13d28060e73c722
k17_q50_f1.5.kmer_freq.pdf
k17_q50_f1.5.kmer_pca.pdf
k17_q50_f1.5.ltr.insert.density.pdf

How can I solve this problem?
Looking forward to your reply!
Yang

@zhangrengang
Copy link
Owner

The evidence is quite weak and I do not think the results are reliable. The two parents may be too closely related to be well distinguished.

If you want to force the phasing, you may adjust the parameters by varying -k and -q. If it do not work well, you may adjust the phasing manually by editing the subgenome assignments (edit and rename the *chrom-subgenome.tsv file) and then feeding it to SubPhaser via the -sg_assigned option. This means to assign scaffold_9 and scaffold_10 by yourself.

@apple-orange-banana
Copy link
Author

Thank you for your reply!
Our genome with about 1.2% heterozygosity is sequenced by PacBio HiFi and Hi-C, I found there is less specific kmer from 2 subgenomes by using SubPhaser, Maybe 2 parents are too close, In this situation, how can I phase it into 2 parts? or maybe I should just assemble one genome?

@zhangrengang
Copy link
Owner

I prefer to not to phase it unless the data of two parents are available. But assembling two haplotypes is okay.

@apple-orange-banana
Copy link
Author

At first I don't know which chromosomes belong to one subgenome, so I use this SubPhaser. I will try to finish the switch error analysis to test my assambly. And proceed with downstream analysis and get back to you if there is further questions.

Thanks!
yang

@zhangrengang
Copy link
Owner

I prefer to not to phase it unless the data of two parents are available. But assembling two haplotypes is okay.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants