Publications
Science (New York, N.Y.)May 2025 |
eads7373
DOI:
10.1126/science.ads7373

Predicting expression-altering promoter mutations with deep learning

Jaganathan, Kishore; Ersaro, Nicole; Novakovsky, Gherman; Wang, Yuchuan; James, Terena; Schwartzentruber, Jeremy; Fiziev, Petko; Kassam, Irfahan; Cao, Fan; Hawe, Johann; Cavanagh, Henry; Lim, Ashley; Png, Grace; McRae, Jeremy; Banerjee, Abhimanyu; Kumar, Arvind; Ulirsch, Jacob; Zhang, Yan; Aguet, Francois; Wainschtein, Pierrick; Sundaram, Laksshman; Salcedo, Adriana; Kyriazopoulou Panagiotopoulou, Sofia; Aghamirzaie, Delasa; Padhi, Evin; Weng, Ziming; Dong, Shan; Smedley, Damian; Caulfield, Mark; O'Donnell-Luria, Anne; Rehm, Heidi L; Sanders, Stephan J; Kundaje, Anshul; Montgomery, Stephen B; Ross, Mark T; Farh, Kyle Kai-How
Product Used
Variant Libraries
Abstract
Only a minority of patients with rare genetic diseases are currently diagnosed by exome sequencing, suggesting that additional unrecognized pathogenic variants may reside in non-coding sequence. Here, we describe PromoterAI, a deep neural network that accurately identifies non-coding promoter variants which dysregulate gene expression. We show that promoter variants with predicted expression-altering consequences produce outlier expression at both RNA and protein levels in thousands of individuals, and that these variants experience strong negative selection in human populations. We observe that clinically relevant genes in rare disease patients are enriched for such variants and validate their functional impact through reporter assays. Our estimates suggest that promoter variation accounts for 6% of the genetic burden associated with rare diseases.
Product Used
Variant Libraries

Related Publications