Cornell scientists develop new computational method to identify functional regions of the human genome


Cornell University, Jan 22 2015

Striving to unravel and comprehend DNA's biological significance, Cornell University scientists have created a new computational method that can identify positions in the human genome that play a role in the proper functioning of cells, according to a report published Jan. 19 in the journal Nature Genetics.

The human genome is vast, totaling some three billion base pairs of nucleotides, the subunits of DNA. But only about 1.25 percent of those base pairs make up the genes that encode all the proteins we use. A fraction of the remaining genetic material regulates genes, turning them on and off, but these regulatory regions have yet to be fully identified.

"This paper tackles the deep question of how to identify functional non-coding human genomic material controlling human traits and disease," said Brad Gulko, the paper's first author and a graduate student in the field of computer science. Gulko's adviser, Adam Siepel, Cornell associate professor of biological statistics and computational biology and professor of computer science at Cold Spring Harbor Laboratory, is a co-author.

"What makes our approach unique is the straightforward combination of DNA biochemistry with recent evolutionary pressures," said Gulko. "Our method allows other scientists not only to use the results, but to readily understand them."

Insight into the human genome gained from this new computational method could be applied to personalized medicine, and it may be a big step toward developing treatments for diseases like AIDS, malaria, multiple sclerosis, ALS and Alzheimer's.

Geneticists identify biologically significant DNA by looking for signals of selective pressure: evidence that genes and other genetic material give individuals in a population an advantage and greater "fitness," or reproductive success.

The new method combines two previously used techniques to identify selective pressure. One technique looks for divergence, the differences between the human and chimpanzee genomes that have accumulated over millions of years; a less commonly used technique looks for polymorphisms, the DNA differences among individual humans.
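To make the two signals concrete, here is a minimal Python sketch that counts divergent and polymorphic positions in a tiny, made-up alignment. The sequences and the function names `count_divergent_sites` and `count_polymorphic_sites` are hypothetical, chosen only for illustration; real analyses work on whole-genome alignments and variant catalogs, and this is not the authors' code.

```python
def count_divergent_sites(human_ref, chimp_ref):
    """Count aligned positions where the human and chimpanzee bases differ."""
    if len(human_ref) != len(chimp_ref):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for h, c in zip(human_ref, chimp_ref) if h != c)


def count_polymorphic_sites(human_samples):
    """Count positions where the sampled humans do not all carry the same base."""
    if len({len(s) for s in human_samples}) != 1:
        raise ValueError("need at least one sample, all of the same length")
    return sum(1 for column in zip(*human_samples) if len(set(column)) > 1)


# Made-up toy data (an assumption for illustration, not real genomic sequence).
human_samples = ["ACGTACGT", "ACGTACGA", "ACGTACGT"]
chimp = "ACGAACGT"

divergent = count_divergent_sites(human_samples[0], chimp)
polymorphic = count_polymorphic_sites(human_samples)
print(f"divergent sites: {divergent}, polymorphic sites: {polymorphic}")
```

Divergence reflects changes accumulated over the long period since the two species split, while polymorphism reflects variation still present among living humans.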

The new computational method clusters functionally similar markers in the genome into groups, then estimates the probability that each group contributes to the fitness of the species, based on the associated patterns of divergence and genomic polymorphism.

In this way, the researchers obtain a "fitness consequence" (fitCons) score that predicts which genetic material might be under selective pressure and therefore biologically significant.
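As a rough illustration of the group-then-score idea, the Python sketch below pools sites that share a functional label and assigns each group a crude score: one minus the ratio of the group's observed variation rate to an assumed neutral rate. This is a deliberately simplified stand-in and not the authors' statistical model; the function `toy_group_scores`, the group labels, the neutral rate and the site data are all hypothetical.

```python
from collections import defaultdict


def toy_group_scores(sites, neutral_rate):
    """Score each group of sites by how depleted it is in variation.

    sites: iterable of (group_label, is_variable) pairs, where is_variable
           marks a site that is divergent or polymorphic.
    neutral_rate: assumed fraction of variable sites expected without selection.
    Returns a dict mapping group_label to a score in [0, 1]; higher means
    less variation than the neutral expectation.
    """
    if neutral_rate <= 0:
        raise ValueError("neutral_rate must be positive")
    totals = defaultdict(lambda: [0, 0])  # label -> [variable sites, all sites]
    for label, is_variable in sites:
        totals[label][0] += int(is_variable)
        totals[label][1] += 1
    scores = {}
    for label, (variable, total) in totals.items():
        observed_rate = variable / total
        # Clamp to [0, 1] so groups with excess variation score zero.
        scores[label] = min(1.0, max(0.0, 1.0 - observed_rate / neutral_rate))
    return scores


# Hypothetical data: 10 sites per group, with 1 and 4 variable sites.
sites = (
    [("active_regulatory", i < 1) for i in range(10)]
    + [("no_signal", i < 4) for i in range(10)]
)
scores = toy_group_scores(sites, neutral_rate=0.4)
for label, score in scores.items():
    print(f"{label}: {score:.2f}")
```

In this toy setup, the group with fewer variable sites than the neutral expectation gets the higher score, which mirrors the intuition that sites under selective pressure tolerate fewer changes.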

Compared to conventional techniques, fitCons scores demonstrate a much greater power to predict which genetic material regulates the expression of genes.

In addition, fitCons scores indicate that 4.2 to 7.5 (but probably closer to 5) percent of nucleotides in the human genome have influenced fitness since humans diverged from chimpanzees.
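For a sense of scale, the short Python sketch below converts the percentages quoted in this article into approximate base-pair counts. It assumes the round figure of three billion base pairs given above; the helper name `base_pairs_from_percent` is hypothetical.

```python
from decimal import Decimal

GENOME_BP = 3_000_000_000  # "some three billion base pairs", as stated in the article


def base_pairs_from_percent(percent, genome_bp=GENOME_BP):
    """Convert a percentage (given as a string to avoid float rounding) to base pairs."""
    return int(genome_bp * Decimal(percent) / 100)


coding_bp = base_pairs_from_percent("1.25")     # protein-coding share
fitness_low_bp = base_pairs_from_percent("4.2")  # lower bound of the fitCons estimate
fitness_mid_bp = base_pairs_from_percent("5")    # "probably closer to 5" percent
fitness_high_bp = base_pairs_from_percent("7.5")  # upper bound of the fitCons estimate

print(f"protein-coding: {coding_bp:,} bp")
print(f"influenced fitness: {fitness_low_bp:,} to {fitness_high_bp:,} bp "
      f"(around {fitness_mid_bp:,} bp)")
```

Under that assumption, the protein-coding share comes to about 37.5 million base pairs, while the fitCons estimate corresponds to roughly 126 million to 225 million base pairs, several times the protein-coding portion.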

Source:

Cornell University
