Mount Sinai researchers use AI to identify 17 key genes in heart disease

Using an advanced artificial intelligence tool, researchers at the Icahn School of Medicine at Mount Sinai have identified rare coding variants in 17 genes that shed light on the molecular basis of coronary artery disease (CAD), the leading cause of morbidity and mortality worldwide.

The discoveries, detailed in the June 11 online issue of Nature Genetics [DOI: 10.1038/s41588-024-01791-x], reveal genetic factors impacting heart disease that open new avenues for targeted treatments and personalized approaches to cardiovascular care.

The investigators used an in silico, or computer-derived, score for coronary artery disease (ISCAD) that holistically represents CAD, as described in a previous paper by the team in The Lancet. The ISCAD score incorporates hundreds of different clinical features from the electronic health record, including vital signs, laboratory test results, medications, symptoms, and diagnoses. To build the score, they trained machine learning models on the electronic health records of 604,914 individuals across the UK Biobank, All of Us Research Program, and BioMe Biobank in this comprehensive meta-analysis.

The score was then tested for association with rare and ultra-rare coding variants found in the exome sequences of these individuals. In addition, the research team conducted further investigation into the discovered genes to study their roles in causal CAD risk factors, clinical manifestations of CAD, and their connections with CAD status in traditional large-scale genome-wide association studies, among other factors.

Our findings help us understand how these 17 genes are involved in coronary artery disease. Some of these genes are already known to influence heart disease development, while others have never been linked to it before. Our study shows how machine learning tools can uncover genetic insights that traditional methods might miss when comparing cases and controls. This could lead to new ways to identify biological mechanisms of heart disease or gene targets for treatment."

Ron Do, PhD, senior study author and the Charles Bronfman Professor in Personalized Medicine at Icahn Mount Sinai

Because they occur in only a small percentage of individuals, rare coding variants may have a significant impact on disease risk or susceptibility when present. Therefore, studying these variants is essential to understanding the genetic basis of diseases and can inform therapeutic targets.

The study was driven by the challenges faced, over the last decade, in identifying rare coding variants associated with CAD using traditional methods relying on diagnosed cases and controls. Diagnostic codes' limitations in capturing the complexity of CAD prompted the researchers to explore new avenues of investigation.

"Our previous Lancet paper showed that a machine learning model trained with electronic health records can generate an in silico score for coronary artery disease, capturing disease across its spectrum," says lead author Ben Omega Petrazzini, BS, Associate Bioinformatician in Dr. Do's lab at Icahn Mount Sinai. "Based on these findings, we hypothesized that the in-silico score for CAD could reveal novel rare coding variants related to CAD by offering a more holistic view of the disease."

Next, the investigators plan to further investigate the role of the identified genes in CAD biology and explore potential applications of machine learning in the genetic study of other complex diseases, as part of their ongoing efforts to advance understanding of disease mechanisms, discover new treatments, and improve patient outcomes.

The paper is titled "Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease."

The remaining authors of the paper, all with Icahn Mount Sinai except where indicated, are:

Iain S. Forrest, PhD (MD candidate); Ghislain Rocheleau, PhD; Ha My T. Vy, PhD; Carla Márquez-Luna, PhD; Áine Duffy, MS (PhD candidate); Robert Chen, MS (MD/PhD candidate); Joshua K. Park, BS (MD/PhD candidate); Kyle Gibson, BA (MD candidate); Sascha N. Goonewardena, MD (University of Michigan); Waqas A. Malick, MD; Robert S. Rosenson, MD; and Daniel M. Jordan, PhD.

The study was made possible by funding from the National Institutes of Health: National Institute of General Medical Sciences (R35-GM124836); National Institute of Aging (R01 AG061186-0); the National Heart, Lung, and Blood Institute (R01HL157439, R01-HL139865, R01-HL155915); and by VA MERIT grant 1I01CX002560.

Source:
Journal reference:

Petrazzini, B. O., et al. (2024). Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease. Nature Genetics. doi.org/10.1038/s41588-024-01791-x.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Excessive sedentary time linked to higher heart disease risk