Skip to content

Aavache/awesome-genetic-ai-papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 

Repository files navigation

Awesome Genomic AI Papers 🦠 + 🤖 + 📄

A curated list of popular scientific papers in relation to omics. Some of topics could be genome, spatial transcriptomics and more, where the main approaches are based on machine learning (AI) techniques.

Papers

2024

  • Innovative super-resolution in spatial transcriptomics: a transformer model exploiting histology images and spatial gene expression. Chongyue Zhao, Zhongli Xu, Xinjun Wang, Shiyue Tao, William A MacDonald, Kun He, Amanda C Poholek, Kong Chen, Heng Huang, Wei Chen. Briefing in Bioinformatics. [Link].

  • CellPLM: Pre-training of Cell Language Model Beyond Single Cells. Hongzhi Wen, Wenzhuo Tang, Xinnan Dai, Jiayuan Ding, Wei Jin, Yuying Xie, Jiliang Tang. ICLR. [Link]

  • DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome. Brendan F. Miller, Feiyang Huang, Lyla Atta, Arpan Sahoo & Jean Fan. ICLR. [Link]

  • Reference-free cell type deconvo- lution of multi-cellular pixel-resolution spatially resolved tran- scriptomics data Zhihan Zhou, Yanrong Ji, Weijian Li, Pratik Dutta, Ramana Davuluri, Han Liu. Nature communications. [Link]

  • Genegpt: Augmenting large language models with domain tools for improved access to biomedical information Q Jin, Y Yang, Q Chen, Z Lu . Bioinformatics. [Link]

  • THItoGene: a deep learning method for predicting spatial transcriptomics from histological images Yuran Jia, Junliang Liu, Li Chen, Tianyi Zhao, Yadong Wang. Briefing in Bioinformatics. [Link]

  • Spatially resolved gene expression prediction from h&e histology images via bi-modal contrastive learning. Ronald Xie, Kuan Pang, Sai W. Chung, Catia T. Perciani, Sonya A. MacParland, Bo Wang, Gary D. Bader. Advances in Neural Information Processing Systems. [Link]

  • Cell clustering for spatial transcriptomics data with graph neural network Jiachen Li, Siheng Chen, Xiaoyong Pan, Ye Yuan & Hong-Bin Shen . Nature communications. [Link]

  • Inferring super-resolution tissue architecture by integrating spatial transcriptomics with histology Daiwei Zhang, Amelia Schroeder, Hanying Yan, Haochen Yang, Jian Hu, Michelle Y. Y. Lee, Kyung S. Cho, Katalin Susztak, George X. Xu, Michael D. Feldman, Edward B. Lee, Emma E. Furth, Linghua Wang & Mingyao Li . Nature Biotechnology. [Link]

  • Transformer with convolution and graph-node co-embedding: an accurate and interpretable vision backbone for predicting gene expressions from local histopatho- logical image Xiao Xiao, Yan Kong, Ronghan Li, Zuoheng Wang, Hui Lu. Medical Image Analysis. [Link]

2023

  • HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution. Eric Nguyen, Michael Poli, Marjan Faizi, Armin Thomas, Callum Birch-Sykes, Michael Wornow, Aman Patel, Clayton Rabideau, Stefano Massaroli, Yoshua Bengio, Stefano Ermon, Stephen A. Baccus, Chris Ré. NeurIPS (Spotlight). [Link]

  • Deeptrasynergy: drug com- binations using multimodal deep learning with transformers. Fatemeh Rafiei, Hojjat Zeraati, Karim Abbasi, Jahan B Ghasemi, Mahboubeh Parsaeian, Ali Masoudi-Nejad. Bioinformatics. [Link]

  • xTrimoABFold: Improving Antibody Structure Prediction without Multiple Sequence Alignments Yining Wang, Xumeng Gong, Shaochuan Li, Bing Yang, YiWu Sun, Chuan Shi, Hui Li, Yangang Wang, Cheng Yang, Le Song. ICLR. [Link]

  • Robust mapping of spatiotemporal trajectories and cell–cell interactions in healthy and diseased tissues Duy Pham, Xiao Tan, Brad Balderson, Jun Xu, Laura F. Grice, Sohye Yoon, Emily F. Willis, Minh Tran, Pui Yeng Lam, Arti Raghubar, Priyakshi Kalita-de Croft, Sunil Lakhani, Jana Vukovic, Marc J. Ruitenberg & Quan H. Nguyen . Nature communications. [Link]

  • Transfer learning enables predictions in network biology Christina V. Theodoris, Ling Xiao, Anant Chopra, Mark D. Chaffin, Zeina R. Al Sayed, Matthew C. Hill, Helene Mantineo, Elizabeth M. Brydon, Zexian Zeng, X. Shirley Liu & Patrick T. Ellinor . Nature. [Link]

  • Generation of synthetic whole-slide image tiles of tumours from RNA-sequencing data via cascaded diffusion models Francisco Carrillo-Perez, Marija Pizurica, Yuanning Zheng, Tarak Nath Nandi, Ravi Madduri, Jeanne Shen & Olivier Gevaert Nature Biomedical Engineering. [Link]

  • The Nucleotide Transformer: Building and evaluating robust foundation models for human genomics. Hugo Dalla-Torre, Liam Gonzalez, Javier Mendoza Revilla, Nicolas Lopez Carranza, Adam Henryk Grzywaczewski, Francesco Oteri, Christian Dallago, Evan Trop, Hassan Sirelkhatim, Guillaume Richard, Marcin Skwark, Karim Beguir, Marie Lopez, Thomas Pierrot. BioRxiv. [Link]

2022

  • Spatially aware dimension reduction for spatial transcriptomics. Chang Xu, Xiyun Jin, Songren Wei, Pingping Wang, Meng Luo, Zhaochun Xu, Wenyi Yang, Yideng Cai, Lixing Xiao, Xiaoyu Lin, Hongxin Liu, Rui Cheng, Fenglan Pang, Rui Chen, Xi Su, Ying Hu, Guohua Wang, Qinghua Jiang. Nucleic Acids Res. [Link]

  • Deepst: identifying spatial domains in spatial transcriptomics by deep learning Shang L, Zhou X. Nature Communications. [Link]

  • Genomics enters the deep learning era Etienne Routhier1 and Julien Mozziconacci. PeerJ. [Link]

  • Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto- encoder Dong K, Zhang S. Nature Communications. [Link]

  • Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions Jiayang Chen, Zhihang Hu, Siqi Sun, Qingxiong Tan, Yixuan Wang, Qinze Yu, Licheng Zong, Liang Hong, Jin Xiao, Tao Shen, Irwin King, Yu Li. BioRxiv. [Link]

  • Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression Prediction Yan Yang, Md Zakir Hossain, Eric A Stone, Shafin Rahman. WACV. [Link]

2021

  • DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome. Yanrong Ji, Zhihan Zhou, Han Liu, Ramana V Davuluri. Bioinformatics. [Link]

  • SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Marc Elosua-Bayes, Paula Nieto, Elisabetta Mereu, Ivo Gut, Holger Heyn. Nuclei Acid Res. [Link]

  • Spatial deconvolution of HER2-positive breast cancer delineates tumor-associated cell type interactions Alma Andersson, Ludvig Larsson, Linnea Stenbeck, Fredrik Salmén, Anna Ehinger, Sunny Z Wu, Ghamdan Al-Eryani, Daniel Roden, Alex Swarbrick, Åke Borg, Jonas Frisén, Camilla Engblom, Joakim Lundeberg . Nature communications. [Link]

  • Spagcn: integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Jian Hu, Xiangjie Li, Kyle Coleman, Amelia Schroeder, Nan Ma, David J Irwin, Edward B Lee, Russell T Shinohara, Mingyao Li. Nature. [Link]

  • Spatial transcriptomics at subspot resolution with bayesspace. Edward Zhao, Matthew R. Stone, Xing Ren, Jamie Guenthoer, Kimberly S. Smythe, Thomas Pulliam, Stephen R. Williams, Cedric R. Uytingco, Sarah E. B. Taylor, Paul Nghiem, Jason H. Bielas & Raphael Gottardo. Nature Biotechnology. [Link]

  • Leveraging information in spatial transcriptomics to predict super-resolution gene expression from histology images in tumors Minxing Pang, Kenong Su, Mingyao Li. bioRxiv. [Link]

  • Highly accurate protein structure prediction with AlphaFold John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David et al. Nature. [Link]

2020 or older

  • Integrating spatial gene expression and breast tumour morphology via deep learning.. Bryan He, Ludvig Bergenstråhle, Linnea Stenbeck, Abubakar Abid, Alma Andersson, Åke Borg, Jonas Maaskola, Joakim Lundeberg, James Zou. Nature Biomedical Engineering. [Link]

  • Deep learning: new computational modelling techniques for genomics Gökcen Eraslan, Žiga Avsec, Julien Gagneur & Fabian J. Theis . Naure. [Link]

  • A primer on deep learning in genomics James Zou, Mikael Huss, Abubakar Abid, Pejman Mohammadi, Ali Torkamani & Amalio Telenti. Nature genetics. [Link]

  • Approximate Bayesian computation with deep learning supports a third archaic introgression in Asia and Oceania Mayukh Mondal, Jaume Bertranpetit & Oscar Lao. Nature communications. [Link]

  • Machine learning applications in genetics and genomics Maxwell W. Libbrecht & William Stafford Noble . Nature genetics. [Link]

Contributing

Please feel free to pull requests to add papers with reasonable impact in the domain.

License

CC0

To the extent possible under law, Aavache has waived all copyright and related or neighboring rights to this work.