Add ability to reconstruct genomes #2

Open
apriha opened this issue Nov 19, 2017 · 3 comments
Comments

@apriha
Owner

apriha commented Nov 19, 2017

Combine the techniques identified by Whit Athey in "Phasing the Chromosomes of a Family Group When One Parent is Missing" with the results of find_shared_dna to reconstruct the genomes of maternal and/or paternal ancestors.

This can be approached as a constraint satisfaction problem. For example, the algorithm could be given several individuals along with their known maternal and/or paternal relationships (e.g., siblings = [ind1, ind2]; mother = [ind3]; paternal_relation = [ind4]). find_shared_dna could then discover the shared DNA between all combinations of individuals. This information (whether a given combination of individuals shares one chromosome, both chromosomes, or no chromosomes at a given SNP position) would serve as the constraints for reconstructing the ancestral genomes.
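
To make the constraint satisfaction framing concrete, here is a minimal brute-force sketch for a single SNP. It is only an illustration under stated assumptions: the function name `solve_snp`, the input dictionaries, and the encoding of the sharing state as 0, 1, or 2 shared chromosomes are all hypothetical, and the sketch does not call find_shared_dna or depend on its actual output format. It also assumes both parents are the same for every child and ignores genotyping errors and recombination within the position.

```python
from itertools import combinations, product

ALLELES = "ACGT"


def solve_snp(child_genotypes, shared):
    """Enumerate parental genotypes consistent with the children at one SNP.

    child_genotypes: dict of child name -> unordered genotype, e.g. "CA"
    shared: dict of frozenset({a, b}) -> chromosomes shared by a and b (0, 1, 2)
            (hypothetical encoding; not the find_shared_dna output format)
    Returns a set of (mother, father) genotype strings with sorted alleles.
    """
    names = sorted(child_genotypes)
    solutions = set()
    # Try every assignment of alleles to the four parental chromosomes.
    for m1, m2, f1, f2 in product(ALLELES, repeat=4):
        mother, father = (m1, m2), (f1, f2)
        # Each child inherits one maternal and one paternal chromosome (index 0 or 1).
        for picks in product(product((0, 1), repeat=2), repeat=len(names)):
            inherited = dict(zip(names, picks))
            # Constraint 1: inherited alleles must match the observed genotype.
            if any(
                sorted(mother[i] + father[j]) != sorted(child_genotypes[n])
                for n, (i, j) in inherited.items()
            ):
                continue
            # Constraint 2: each pair must share the stated number of chromosomes.
            if any(
                (inherited[a][0] == inherited[b][0])
                + (inherited[a][1] == inherited[b][1])
                != shared[frozenset((a, b))]
                for a, b in combinations(names, 2)
                if frozenset((a, b)) in shared
            ):
                continue
            solutions.add(("".join(sorted(mother)), "".join(sorted(father))))
    return solutions


solutions = solve_snp(
    {"ind1": "CA", "ind2": "AG"},
    {frozenset(("ind1", "ind2")): 1},
)
```

Each extra child or relative adds genotype and sharing constraints, which is how further comparisons would shrink the solution set. A real implementation would need to work over segments rather than independent SNPs and would likely use a proper constraint solver instead of exhaustive enumeration.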

As a simple example, say two siblings have genotypes CA and AG at a given SNP. If they are known to share one chromosome at that location, the shared allele must be A, so AN could be attributed to one parent (where N is any allele) and CG to the other. Additional comparisons between other individuals could further narrow the solution space for the ancestral genomes.
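
The sibling example can be written as a small function. This is a sketch, not part of the project: the name `candidate_parent_genotypes` is hypothetical, genotypes are assumed to be two-character strings, and the siblings are assumed to share exactly one chromosome at the SNP.

```python
def candidate_parent_genotypes(sib1, sib2):
    """Return candidate (sharing_parent, other_parent) genotype pairs.

    Assumes the two siblings share exactly one chromosome at this SNP.
    "N" stands for an unknown allele on the parent's untransmitted chromosome.
    """
    candidates = []
    # The shared chromosome must carry an allele present in both genotypes.
    for shared in sorted(set(sib1) & set(sib2)):
        # What remains in each sibling came from the other parent, who passed
        # a different chromosome to each sibling.
        rest1 = sib1.replace(shared, "", 1)
        rest2 = sib2.replace(shared, "", 1)
        candidates.append((shared + "N", "".join(sorted(rest1 + rest2))))
    return candidates


print(candidate_parent_genotypes("CA", "AG"))  # [('AN', 'CG')]
```

When both siblings are heterozygous for the same two alleles (e.g., AC and AC), the function returns two candidates, which is the kind of ambiguity that comparisons with other relatives would need to resolve.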

@apriha
Owner Author

apriha commented Mar 3, 2018

Consider integrating https://github.com/poruloh/Eagle

@ebacherdom

> Consider integrating https://github.com/poruloh/Eagle

This only seems useful if no familial DNA is available; IBD (identity by descent) gives a much more conclusive result for phasing than statistical methods.

@apriha
Owner Author

apriha commented Jun 15, 2019

@ebacherdom, I agree. As discussed above, I think using the results of find_shared_dna would help with this, especially when more comparisons between individuals in a family group are available. Formally, I think this is a constraint satisfaction problem.

@apriha changed the title from "Add phasing capability" to "Add ability to reconstruct genomes" on Feb 21, 2022