Cornell scientists develop new computational method to identify functional human genome

Striving to unravel and comprehend DNA's biological significance, Cornell University scientists have created a new computational method that can identify positions in the human genome that play a role in the proper functioning of cells, according to a report published Jan. 19 in the journal Nature Genetics.

The human genome is vast, totaling some three billion base pairs of nucleotides, the subunits of DNA. But only about 1.25 percent of those billions of base pairs account for genes that encode all the proteins we use. A fraction of the rest of that genetic material regulates genes and turns them on and off, but these have yet to be fully identified.

"This paper tackles the deep question of how to identify functional non-coding human genomic material controlling human traits and disease," said Brad Gulko, the paper's first author and a graduate student in the field of computer science. Gulko's adviser, Adam Siepel, Cornell associate professor of biological statistics and computational biology and professor of computer science at Cold Spring Harbor Laboratory, is a co-author.

"What makes our approach unique is the straightforward combination of DNA biochemistry with recent evolutionary pressures," said Gulko. "Our method allows other scientists not only to use the results, but to readily understand them."

Insight into the human genome gained from this new computation method could be applied to personalized medicine and it may be a big step toward developing treatments for diseases like AIDS, malaria, muscular sclerosis, ALS and Alzheimer's.

Geneticists identify biologically significant DNA by looking for signals of selective pressure in DNA, genes and genetic material that give individuals in a population advantages and greater "fitness," or reproductive success.

The new method combines two previously used techniques to identify selective pressure. One technique looks for divergence, or differences between humans and chimpanzee genomes accumulated over millions of years; a less commonly used method looks for mutations in DNA (polymorphisms) between individual humans.

The new computational method clusters functionally similar markers in the genome into groups, then estimates a probability of whether a group is contributing to the fitness of the species based on associated patterns of divergence and genomic polymorphisms.

In this way, the researchers receive a "fitness consequence" (fitCons) score that predicts which genetic material might be under selective pressure and therefore biologically significant.

Compared to conventional techniques, fitCons scores demonstrate a much greater power to predict which genetic material regulates the expression of genes.

In addition, fitCons scores indicate that 4.2 to 7.5 (but probably closer to 5) percent of nucleotides in the human genome have influenced fitness since humans diverged from chimpanzees.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Breakthrough research reveals how to target malignant DNA in aggressive cancers