Skip to content
/ magenpy Public

Modeling and Analysis of (Statistical) Genetics data in python

License

Notifications You must be signed in to change notification settings

shz9/magenpy

Repository files navigation

magenpy: Modeling and Analysis of (Statistical) Genetics data in python

PyPI pyversions PyPI version fury.io License: MIT

Linux CI MacOS CI Windows CI Docs Build Binary wheels

Downloads Downloads

magenpy is a Python package for modeling and analyzing statistical genetics data. The package provides tools for:

  • Reading and processing genotype data in plink BED format.
  • Efficient LD matrix construction and storage in Zarr array format.
  • Data structures for harmonizing various GWAS data sources.
    • Includes parsers for commonly used GWAS summary statistics formats.
  • Simulating polygenic traits (continuous and binary) using complex genetic architectures.
    • Multi-cohort simulation scenarios (beta)
    • Simulations incorporating functional annotations in the genetic architecture (beta)
  • Interfaces for performing association testing on simulated and real phenotypes.
  • Preliminary support for processing and integrating genomic annotations with other data sources.

Helpful links