Scientists develop high-throughput strategy to build de novo genomes

Scientists from the Icahn School of Medicine at Mount Sinai have developed a new approach to build nearly complete genomes by combining high-throughput DNA sequencing with genome mapping. The methodology enabled researchers to detect complex forms of genomic variation, critically important for their association with human disease, but previously difficult to detect. The study was published today in Nature Methods, and is a collaboration with scientists at European Molecular Biology Lab, Weill Cornell Medical College, Cold Spring Harbor Laboratory, Rockefeller University, University of California, San Francisco, Pacific Biosciences, and BioNano Genomics.

Conventional next-generation sequencing (NGS) techniques are able to accurately detect certain types of variation, such as single nucleotide variants and small insertions or deletions, but miss many large or complex forms of genomic variation that are associated with human disease. Further, these previous approaches are poorly suited for completely de novo analysis of genomes and for phasing the maternal and paternal haplotypes of an individual.

"We created a high-throughput strategy that builds highly contiguous de novo genomes without the need for complex jumping libraries or targeted approaches. This strategy, in some cases, automatically resolved complete arms of chromosomes," said Ali Bashir, PhD, Assistant Professor of Genetics and Genomics at the Icahn School of Medicine and senior author of the study. "While we focused this study on a human genome, the method can be applied to any new genome, including those with high genomic complexity, such as plants, that have been extremely challenging to study."

To overcome limitations with existing NGS methods, the study authors combined two single molecule approaches: long read sequencing from Pacific Biosciences and Nanochannel Array technology from BioNano Genomics. Pacific Biosciences sequencing enables reads exceeding 10kb in length, which can directly resolve and phase complex forms of variation. The NanoChannel Array from BioNano confines and linearizes DNA molecules up to megabases in length to provide high-resolution sequence motif physical maps, termed 'genome maps'.

The researchers studied the NA12878 diploid genome, a well-sequenced sample that is part of the 1000 Genomes project and often used for benchmarking new techniques. The study authors mapped variation and built assemblies with both technologies, then combined the two to create a "hybrid" assembly that dramatically improved the contiguity of each. The resulting hybrid assembly N50s, the length such that 50% of all base pairs are contained in scaffolds of the given length or longer, approach 30Mb - on par with the best assemblies to date at a fraction of the cost and labor.

"The study revealed an unprecedented view of genomic complexity, in many cases identifying regions overlooked by conventional sequencing or further refining previously known genetic variant classes," said study co-author Jan Korbel, PhD, Group Leader at the European Molecular Biology Laboratory. "We had notable success in challenging regions such as inversions and tandem repeats," added co-author Robert Sebra, PhD, Assistant Professor of Genetics and Genomic Sciences at the Icahn School of Medicine. "For example, a systematic underrepresentation of tandem repeat sizes was observed in the human reference genomes. Such expansions, as we observed within the LPA gene which has been associated with plasmid lipid levels, are increasingly being identified as important markers for disease."

"By using a powerful combination of new technologies, we can finally begin to circumvent biases induced by overreliance on a single reference genome" said co-author Eric Schadt, PhD, Founding Director of the Icahn Institute, and Professor of Genomics at the Icahn School of Medicine. "Fully de novo approaches will increasingly become standard practice to enable direct and comprehensive characterization of genome variation. This will accelerate our understanding of the links to human diseases that such variations induce."

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
NCCN leaders collaborate to enhance cancer care in Vietnam