Lyons, Eric; Pedersen, Brent; Kane, Josh; Freeling, Michael
doi: 10.1007/s12042-008-9017-y
We find great value in the genomes from the nonmodel organisms papaya and grape. These genomes help us understand the chromosomal history of the super-order rosids. Essential to this process are new, online genomics tools that allow researchers to easily perform their own experiments, such as identifying and evaluating syntenic regions and estimating the degree of post-tetraploidy gene fractionation (diploidization); this process is exemplified here using the online comparative genomics toolset CoGe. Using case studies, we show that two of the three genomes within the rosid paleohexaploid are more fractionated with respect to one another than to a third genome. This indicates a shared history derived from a [tetraploid]-then-[wide cross to generate a triploid]-then-[whole genome duplication to generate the hexaploid,] or similar scenario involving unreduced gametes. Two alternative hypotheses are presented that differ in terms of the mechanism and timing of fractionation.
Nagarajan, Niranjan; Navajas-Pérez, Rafael; Pop, Mihai; Alam, Maqsudul; Ming, Ray; Paterson, Andrew; Salzberg, Steven
doi: 10.1007/s12042-008-9015-0
Papaya (Carica papaya L.) is an important fruit crop cultivated in tropical and subtropical regions worldwide. A first draft of its genome sequence has been recently released. Together with Arabidopsis, rice, poplar, grapevine and other genomes in the pipeline, it represents a good opportunity to gain insight into the organization of plant genomes. Here we report a detailed analysis of repetitive elements in the papaya genome, including transposable elements (TEs), tandemly-arrayed sequences, and high copy number genes. These repetitive sequences account for ∼56% of the papaya genome with TEs being the most abundant at 52%, tandem repeats at 1.3% and high copy number genes at 3%. Most common types of TEs are represented in the papaya genome with retrotransposons being the dominant class, accounting for 40% of the genome. The most prevalent retrotransposons are Ty3-gypsy (27.8%) and Ty1-copia (5.5%). Among the tandem repeats, microsatellites are the most abundant in number, but represent only 0.19% of the genome. Minisatellites and satellites are less abundant, but represent 0.68% and 0.43% of the genome, respectively, due to greater repeat length. Despite an overall smaller gene repertoire in papaya than many other angiosperms, a significant fraction of genes (>2%) are present in large gene families with copy number greater than 20. This repeat database clarified a major part of the papaya genome organization and partly explained the lower gene repertoire in papaya than in Arabidopsis.
Shakirov, E.; Salzberg, S.; Alam, M.; Shippen, D.
doi: 10.1007/s12042-008-9018-xpmid: 20664721
Telomeres are terminal regions of linear eukaryotic chromosomes that are critical for genome stability and continued cell proliferation. The draft assembly of the papaya genome provides an opportunity to analyze and compare the evolution of telomeric DNA sequence composition and telomere maintenance machinery in this and other organisms of the Brassicales Order, which includes Arabidopsis. Here we investigate telomere size and sequence variation at papaya chromosome ends. As with most other plant species, papaya telomeres consist of TTTAGGG repeats. However, in contrast to members of the closely related Brassicaceae family, telomeres in papaya are ~10-fold longer. Sequence analysis reveals that many centromere-proximal telomere repeats in papaya harbor nucleotide substitutions and insertions of Gs and Ts. In contrast, we found very few N-to-C substitutions, and even fewer instances of nucleotide deletion, suggesting that a six-nucleotide telomere repeat is not well tolerated. The papaya genome encodes single-copy sequence homologues of several genes involved in telomere maintenance and chromosome end protection, including the Telomerase Reverse Transcriptase (TERT) and Protection Of Telomeres (POT1). Notably, unlike Arabidopsis, which encodes six Telomere Repeat binding Factor-like (TRFL) proteins that bind double-stranded telomere DNA, papaya appears to encode only two such proteins. Thus, the more streamlined genome of papaya will provide an excellent resource for comparative and functional analysis of telomeres in plants.
Nelson, David; Ming, Ray; Alam, Maqsudul; Schuler, Mary
doi: 10.1007/s12042-008-9022-1
Plants depend on cytochrome P450 (CYP) enzymes for nearly every aspect of their biology. In several sequenced angiosperms, CYP genes constitute up to 1% of the protein coding genes. The angiosperm sequence diversity is encapsulated by 59 CYP families, of which 52 families form a widely distributed core set. In the 20 years since the first plant P450 was sequenced, 3,387 P450 sequences have been identified and annotated in plant databases. As no new angiosperm CYP families have been discovered since 2004, it is now apparent that the sampling of CYP diversity is beginning to plateau. This review presents a comparison of 1,415 cytochrome P450 sequences from the six sequenced genomes of Vitis vinifera (grape), Carica papaya (papaya), Populus trichocarpa (poplar), Oryza sativa (rice), Arabidopsis thaliana (Arabidopsis or mouse ear’s cress) and Physcomitrella patens (moss). An evolutionary analysis is presented that tracks land plant P450 innovation over time from the most ancient and conserved sequences to the newest dicot-specific families. The earliest or oldest P450 families are devoted to the essential biochemistries of sterol and carotenoid synthesis. The next evolutionary radiation of P450 families appears to mediate crucial adaptations to a land environment. And, the newest CYP families appear to have driven the diversity of angiosperms in mediating the synthesis of pigments, odorants, flavors and order-/genus-specific secondary metabolites. Family-by-family comparisons allow the visualization of plant genome plasticity by whole genome duplications and massive gene family expansions via tandem duplications. Molecular evidence of human domestication is quite apparent in the repeated P450 gene duplications occurring in the grape genome.
Zdepski, Anna; Wang, Wenqin; Priest, Henry; Ali, Faraz; Alam, Maqsudul; Mockler, Todd; Michael, Todd
doi: 10.1007/s12042-008-9020-3pmid: 20671772
Most organisms have internal circadian clocks that mediate responses to daily environmental changes in order to synchronize biological functions to the correct times of the day. Previous studies have focused on plants found in temperate and sub-tropical climates, and little is known about the circadian transcriptional networks of plants that typically grow under conditions with relatively constant day lengths and temperatures over the year. In this study we conducted a genomic and computational analysis of the circadian biology of Carica papaya, a tropical tree. We found that predicted papaya circadian clock genes cycle with the same phase as Arabidopsis genes. The patterns of time-of-day overrepresentation of circadian-associated promoter elements were nearly identical across papaya, Arabidopsis, rice, and poplar. Evolution of promoter structure predicts the observed morning- and evening-specific expression profiles of the papaya PRR5 paralogs. The strong conservation of previously identified circadian transcriptional networks in papaya, despite its tropical habitat and distinct life-style, suggest that circadian timing has played a major role in the evolution of plant genomes, consistent with the selective pressure of anticipating daily environmental changes. Further studies could exploit this conservation to elucidate general design principles that will facilitate engineering plant growth pathways for specific environments.
Paull, Robert; Irikura, Beth; Wu, Pingfang; Turano, Helen; Chen, Nancy; Blas, Andrea; Fellman, John; Gschwend, Andrea; Wai, Ching; Yu, Qingyi; Presting, Gernot; Alam, Maqsudul; Ming, Ray
doi: 10.1007/s12042-008-9021-2
Papaya (Carica papaya L.) is the first fleshy fruit with a climacteric ripening pattern to be sequenced. As a member of the Rosids superorder in the order Brassicales, papaya apparently lacks the genome duplication that occurred twice in Arabidopsis. The predicted papaya genes that are homologous to those potentially involved in fruit growth, development, and ripening were investigated. Genes homologous to those involved in tomato fruit size and shape were found. Fewer predicted papaya expansin genes were found and no Expansin Like-B genes were predicted. Compared to Arabidopsis and tomato, fewer genes that may impact sugar accumulation in papaya, ethylene synthesis and response, respiration, chlorophyll degradation and carotenoid synthesis were predicted. Similar or fewer genes were found in papaya for the enzymes leading to volatile production than so far determined for tomato. The presence of fewer papaya genes in most fruit development and ripening categories suggests less subfunctionalization of gene action. The lack of whole genome duplication and reductions in most gene families and biosynthetic pathways make papaya a valuable and unique tool to study the evolution of fruit ripening and the complex regulatory networks active in fruit ripening.
Wang, Jianping; Chen, Cuixia; Na, Jong-Kuk; Yu, Qingyi; Hou, Shaobin; Paull, Robert; Moore, Paul; Alam, Maqsudul; Ming, Ray
doi: 10.1007/s12042-008-9024-z
Microsatellites, or simple sequence repeats (SSRs), are highly polymorphic and universally distributed in eukaryotes. SSRs have been used extensively as sequence tagged markers in genetic studies. Recently, the functional and evolutionary importance of SSRs has received considerable attention. Here we report the mining and characterization of the SSRs in papaya genome. We analyzed SSRs from 277.4 Mb of whole genome shotgun (WGS) sequences, 51.2 Mb bacterial artificial chromosome (BAC) end sequences (BES), and 13.4 Mb expressed sequence tag (EST) sequences. The papaya SSR density was one SSR per 0.7 kb of DNA sequence in the WGS, which was higher than that in BES and EST sequences. SSR abundance was dramatically reduced as the repeat length increased. According to SSR motif length, dinucleotide repeats were the most common motif in class I, whereas hexanucleotides were the most copious in class II SSRs. The tri- and hexanucleotide repeats of both classes were greater in EST sequences compared to genomic sequences. In class I SSR, AT and AAT were the most frequent motifs in BES and WGS sequences. By contrast, AG and AAG were the most abundant in EST sequences. For SSR marker development, 9,860 primer pairs were surveyed for amplification and polymorphism. Successful amplification and polymorphic rates were 66.6% and 17.6%, respectively. The highest polymorphic rates were achieved by AT, AG, and ATG motifs. The genome wide analysis of microsatellites revealed their frequency and distribution in papaya genome, which varies among plant genomes. This complete set of SSRs markers throughout the genome will assist diverse genetic studies in papaya and related species.
Showing 1 to 8 of 8 Articles