Bookmark

Distribution of short paired duplications in mammalian genomes

Preview Only

Distribution of short paired duplications in mammalian genomes

Abstract

Mammalian genomes are densely populated with long duplicated sequences. In this paper, we demonstrate the existence of doublets, short duplications between 25 and 100 bp, distinct from previously described repeats. Each doublet is a pair of exact matches, separated by some distance. The distribution of these intermatch distances is strikingly nonrandom. An unexpectedly high number of doublets have matches either within 100 bp (adjacent) or at distances tightly concentrated ≈1,000 bp apart (nearby). We focus our study on these proximate doublets. First, they tend to have both matches on the same strand. By comparing nearby doublets shared in human and chimpanzee, we can also see that these doublets seem to arise by an insertion event that produces a copy without markedly affecting the surrounding sequence. Most doublets in humans are shared with chimpanzee, but many new pairs arose after the divergence of the species. Doublets found in human but not chimpanzee are most often composed of almost tandem matches, whereas older doublets (found in both species) are more likely to have matches spaced by ≈1 kb, indicating that the nearly tandem doublets may be more dynamic. The spacing of doublets is highly conserved. So far, we have found clearly recognizable doublets in the following genomes: Homo sapiens, Mus musculus, Arabidopsis thaliana, and Caenorhabditis elegans, indicating that the mechanism generating these doublets is widespread. A mechanism that generates short local duplications while conserving polarity could have a profound impact on the evolution of regulatory and proteincoding sequences.
Loading next page...
1 Page

Preview Only. This article cannot be rented because we do not currently have permission from the publisher.

 
/lp/pnas/distribution-of-short-paired-duplications-in-mammalian-genomes-zuaYGNb610
Title
Distribution of short paired duplications in mammalian genomes
Author(s)
Thomas, Elizabeth E.; Srebro, Nathan; Sebat, Jonathan; Navin, Nicholas; Healy, John; Mishra, Bud; Wigler, Michael
Journal
Proceedings of the National Academy of Sciences , Volume 101 (28): 10349 PNAS – Jul 13, 2004
Publisher
National Acad Sciences
Copyright
Copyright ©2009 by the National Academy of Sciences
ISSN
0027-8424
eISSN
1091-6490
Publisher site
Get PDF