Meta-IDBA: a de Novo assembler for metagenomic data

Yu Peng; Henry C. M. Leung; S. M. Yiu; Francis Y. L. Chin

doi:10.1093/bioinformatics/btr216

Publisher: Oxford University Press
Copyright: © The Author(s) 2011. Published by Oxford University Press.
ISSN: 1367-4803
eISSN: 1460-2059
DOI: 10.1093/bioinformatics/btr216
pmid: 21685107

Abstract

Motivation: Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling a set of mixed reads from different species into contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performance of these single-genome assemblers on metagenomic data is far from satisfactory, because common regions in the genomes of subspecies and species make the assembly problem much more complicated.

Results: We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species using a consensus sequence. A comparison of Meta-IDBA with existing assemblers such as Velvet and ABySS on different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy.

Availability: The Meta-IDBA toolkit is available at our website, http://www.cs.hku.hk/~alse/metaidba.

Contact: chin@cs.hku.hk
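The two steps named in the Results (splitting a de Bruijn graph into isolated components, then summarizing aligned variants with a consensus sequence) can be illustrated with a toy Python sketch. This is not the Meta-IDBA implementation: the abstract does not state the observation used for partitioning, so the sketch simply takes weakly connected components of the graph, ignores reverse complements and sequencing errors, and assumes the variant sequences are already aligned to equal length. The function names `build_de_bruijn`, `connected_components` and `consensus` are hypothetical.

```python
from collections import Counter, defaultdict


def build_de_bruijn(reads, k):
    """Build a de Bruijn graph: nodes are (k-1)-mers, and every k-mer
    in a read adds an edge from its prefix to its suffix."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
            graph[kmer[1:]]  # touch the suffix so it exists as a node
    return graph


def connected_components(graph):
    """Return the weakly connected components of the graph as sets of nodes.
    Assumption: plain connectivity stands in for the paper's partitioning rule."""
    undirected = defaultdict(set)
    for u, successors in graph.items():
        undirected[u]  # keep nodes that have no edges
        for v in successors:
            undirected[u].add(v)
            undirected[v].add(u)

    seen = set()
    components = []
    for start in list(undirected):
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        component = set()
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbour in undirected[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        components.append(component)
    return components


def consensus(aligned_sequences):
    """Column-wise majority vote over equal-length, pre-aligned sequences."""
    return "".join(
        Counter(column).most_common(1)[0][0]
        for column in zip(*aligned_sequences)
    )


# Toy reads from two made-up genomes that share no 3-mers.
reads = ["ACGTTG", "GTTGCA", "GGATCC", "ATCCAT"]
components = connected_components(build_de_bruijn(reads, k=4))
print(len(components))

# Three pre-aligned variants of one region, differing at a single position.
print(consensus(["ACGTA", "ACGTA", "ACCTA"]))
```

In this toy input the reads separate cleanly because the two made-up genomes share no (k-1)-mers. Real metagenomic data contain shared regions between species, which is exactly the complication the abstract describes.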

Journal

Bioinformatics – Oxford University Press

Published: Jun 14, 2011
