Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Accurate detection of complex structural variations using single-molecule sequencing

Accurate detection of complex structural variations using single-molecule sequencing Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Nature Methods Springer Journals

Accurate detection of complex structural variations using single-molecule sequencing

Loading next page...
 
/lp/springer_journal/accurate-detection-of-complex-structural-variations-using-single-4V8SSC632U

References (54)

Publisher
Springer Journals
Copyright
Copyright © 2018 by The Author(s)
Subject
Life Sciences; Life Sciences, general; Biological Techniques; Biological Microscopy; Biomedical Engineering/Biotechnology; Bioinformatics; Proteomics
ISSN
1548-7091
eISSN
1548-7105
DOI
10.1038/s41592-018-0001-7
Publisher site
See Article on Publisher Site

Abstract

Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings.

Journal

Nature MethodsSpringer Journals

Published: Apr 30, 2018

There are no references for this article.