Multiplex sequencing of paired-end ditags (MS-PET): a strategy for the ultra-high-throughput analysis of transcriptomes and genomesNg, Patrick; Tan, Jack J.S.; Ooi, Hong Sain; Lee, Yen Ling; Chiu, Kuo Ping; Fullwood, Melissa J.; Srinivasan, Kandhadayar G.; Perbost, Clotilde; Du, Lei; Sung, Wing-Kin; Wei, Chia-Lin; Ruan, Yijun
doi: 10.1093/nar/gkl444pmid: 16840528
The paired-end ditagging (PET) technique has been shown to be efficient and accurate for large-scale transcriptome and genome analysis. However, as with other DNA tag-based sequencing strategies, it is constrained by the current efficiency of Sanger technology. A recently developed multiplex sequencing method (454-sequencing™) using picolitre-scale reactions has achieved a remarkable advance in efficiency, but suffers from short-read lengths, and a lack of paired-end information. To further enhance the efficiency of PET analysis and at the same time overcome the drawbacks of the new sequencing method, we coupled multiplex sequencing with paired-end ditagging (MS-PET) using modified PET procedures to simultaneously sequence 200 000 to 300 000 dimerized PET (diPET) templates, with an output of nearly half-a-million PET sequences in a single 4 h machine run. We demonstrate the utility and robustness of MS-PET by analyzing the transcriptome of human breast carcinoma cells, and by mapping p53 binding sites in the genome of human colorectal carcinoma cells. This combined sequencing strategy achieved an approximate 100-fold efficiency increase over the current standard for PET analysis, and furthermore enables the short-read-length multiplex sequencing procedure to acquire paired-end information from large DNA fragments.
A novel and simple method for construction of recombinant adenovirusesTan, Rong; Li, Chunhua; Jiang, Sijing; Ma, Lixin
doi: 10.1093/nar/gkl449pmid: 16855284
Recombinant adenoviruses have been widely used for various applications, including protein expression and gene therapy. We herein report a new and simple cloning approach to an efficient and robust construction of recombinant adenoviral genomes based on the mating-assisted genetically integrated cloning (MAGIC) strategy. The production of recombinant adenovirus serotype 5-based vectors was greatly facilitated by the use of the MAGIC procedure and the development of the Adeasy™ adenoviral vector system. The recombinant adenoviral plasmid can be generated by a direct and seamless substitution, which replaces the stuff fragment in a full-length adenoviral genome with the gene of interest in a small plasmid in Escherichia coli. Recombinant adenoviral plasmids can be rapidly constructed in vivo by using the new method, without manipulations of the large adenoviral genome. In contrast to other traditional systems, it reduces the need for multiple in vitro manipulations, such as endonuclease cleavage, ligation and transformation, thus achieving a higher efficiency with negligible background. This strategy has been proven to be suitable for constructing an adenoviral cDNA expression library. In summary, the new method is highly efficient, technically less demanding and less labor-intensive for constructing recombinant adenoviruses, which will be beneficial for functional genomic and proteomic researches in mammalian cells.
Genomic DNA functions as a universal external standard in quantitative real-time PCRYun, James J.; Heisler, Lawrence E.; Hwang, Irene I. L.; Wilkins, Olivia; Lau, Suzanne K.; Hyrcza, Martin; Jayabalasingham, Bamini; Jin, Jing; McLaurin, JoAnne; Tsao, Ming-Sound; Der, Sandy D.
doi: 10.1093/nar/gkl400pmid: 16840529
Real-time quantitative PCR (qPCR) is a powerful tool for quantifying specific DNA target sequences. Although determination of relative quantity is widely accepted as a reliable means of measuring differences between samples, there are advantages to being able to determine the absolute copy numbers of a given target. One approach to absolute quantification relies on construction of an accurate standard curve using appropriate external standards of known concentration. We have validated the use of tissue genomic DNA as a universal external standard to facilitate quantification of any target sequence contained in the genome of a given species, addressing several key technical issues regarding its use. This approach was applied to validate mRNA expression of gene candidates identified from microarray data and to determine gene copies in transgenic mice. A simple method that can assist achieving absolute quantification of gene expression would broadly enhance the uses of real-time qPCR and in particular, augment the evaluation of global gene expression studies.
In vitro selection using a dual RNA library that allows primerless selectionJarosch, Florian; Buchner, Klaus; Klussmann, Sven
doi: 10.1093/nar/gkl463pmid: 16855281
High affinity target-binding aptamers are identified from random oligonucleotide libraries by an in vitro selection process called Systematic Evolution of Ligands by EXponential enrichment (SELEX). Since the SELEX process includes a PCR amplification step the randomized region of the oligonucleotide libraries need to be flanked by two fixed primer binding sequences. These primer binding sites are often difficult to truncate because they may be necessary to maintain the structure of the aptamer or may even be part of the target binding motif. We designed a novel type of RNA library that carries fixed sequences which constrain the oligonucleotides into a partly double-stranded structure, thereby minimizing the risk that the primer binding sequences become part of the target-binding motif. Moreover, the specific design of the library including the use of tandem RNA Polymerase promoters allows the selection of oligonucleotides without any primer binding sequences. The library was used to select aptamers to the mirror-image peptide of ghrelin. Ghrelin is a potent stimulator of growth-hormone release and food intake. After selection, the identified aptamer sequences were directly synthesized in their mirror-image configuration. The final 44 nt-Spiegelmer, named NOX-B11-3, blocks ghrelin action in a cell culture assay displaying an IC50 of 4.5 nM at 37°C.
Chromosome-wide identification of novel imprinted genes using microarrays and uniparental disomiesSchulz, Reiner; Menheniott, Trevelyan R.; Woodfine, Kathryn; Wood, Andrew J.; Choi, Jonathan D.; Oakey, Rebecca J.
doi: 10.1093/nar/gkl461pmid: 16855283
Genomic imprinting refers to a specialized form of epigenetic gene regulation whereby the expression of a given allele is dictated by parental origin. Defining the extent and distribution of imprinting across genomes will be crucial for understanding the roles played by imprinting in normal mammalian growth and development. Using mice carrying uniparental disomies or duplications, microarray screening and stringent bioinformatics, we have developed the first large-scale tissue-specific screen for imprinted gene detection. We quantify the stringency of our methodology and relate it to previous non-tissue-specific large-scale studies. We report the identification in mouse of four brain-specific novel paternally expressed transcripts and an additional three genes that show maternal expression in the placenta. The regions of conserved linkage in the human genome are associated with the Prader–Willi Syndrome (PWS) and Beckwith–Wiedemann Syndrome (BWS) where imprinting is known to be a contributing factor. We conclude that large-scale systematic analyses of this genre are necessary for the full impact of genomic imprinting on mammalian gene expression and phenotype to be elucidated.
Sequence biases in large scale gene expression profiling dataSiddiqui, Asim S.; Delaney, Allen D.; Schnerch, Angelique; Griffith, Obi L.; Jones, Steven J. M.; Marra, Marco A.
doi: 10.1093/nar/gkl404pmid: 16840527
We present the results of a simple, statistical assay that measures the G+C content sensitivity bias of gene expression experiments without the requirement of a duplicate experiment. We analyse five gene expression profiling methods: Affymetrix GeneChip, Long Serial Analysis of Gene Expression (LongSAGE), LongSAGELite, ‘Classic’ Massively Parallel Signature Sequencing (MPSS) and ‘Signature’ MPSS. We demonstrate the methods have systematic and random errors leading to a different G+C content sensitivity. The relationship between this experimental error and the G+C content of the probe set or tag that identifies each gene influences whether the gene is detected and, if detected, the level of gene expression measured. LongSAGE has the least bias, while Signature MPSS shows a strong bias to G+C rich tags and Affymetrix data show different bias depending on the data processing method (MAS 5.0, RMA or GC-RMA). The bias in the Affymetrix data primarily impacts genes expressed at lower levels. Despite the larger sampling of the MPSS library, SAGE identifies significantly more genes (60% more RefSeq genes in a single comparison).
Lac repressor hinge flexibility and DNA looping: single molecule kinetics by tethered particle motionVanzi, Francesco; Broggio, Chiara; Sacconi, Leonardo; Pavone, Francesco Saverio
doi: 10.1093/nar/gkl393pmid: 16835309
The tethered particle motion (TPM) allows the direct detection of activity of a variety of biomolecules at the single molecule level. First pioneered for RNA polymerase, it has recently been applied also to other enzymes. In this work we employ TPM for a systematic investigation of the kinetics of DNA looping by wild-type Lac repressor (wt-LacI) and by hinge mutants Q60G and Q60 + 1. We implement a novel method for TPM data analysis to reliably measure the kinetics of loop formation and disruption and to quantify the effects of the protein hinge flexibility and of DNA loop strain on such kinetics. We demonstrate that the flexibility of the protein hinge has a profound effect on the lifetime of the looped state. Our measurements also show that the DNA bending energy plays a minor role on loop disruption kinetics, while a strong effect is seen on the kinetics of loop formation. These observations substantiate the growing number of theoretical studies aimed at characterizing the effects of DNA flexibility, tension and torsion on the kinetics of protein binding and dissociation, strengthening the idea that these mechanical factors in vivo may play an important role in the modulation of gene expression regulation.
AUG sequences are required to sustain nonsense-codon-mediated suppression of splicingKamhi, Eyal; Yahalom, Galit; Kass, Gideon; Hacham, Yael; Sperling, Ruth; Sperling, Joseph
doi: 10.1093/nar/gkl390pmid: 16855285
More than 90% of human genes are rich in intronic latent 5′ splice sites whose utilization in pre-mRNA splicing would introduce in-frame stop codons into the resultant mRNAs. We have therefore hypothesized that suppression of splicing (SOS) at latent 5′ splice sites regulates alternative 5′ splice site selection in a way that prevents the production of toxic nonsense mRNAs and verified this idea by showing that the removal of such in-frame stop codons is sufficient to activate latent splicing. Splicing control by SOS requires recognition of the mRNA reading frame, presumably recognizing the start codon sequence. Here we show that AUG sequences are indeed essential for SOS. Although protein translation does not seem to be required for SOS, the first AUG is shown here to be necessary but not sufficient. We further show that latent splicing can be elicited upon treatment with pactamycin—a drug known to block translation by its ability to recognize an RNA fold—but not by treatment with other drugs that inhibit translation through other mechanisms. The effect of pactamycin on SOS is dependent neither on steady-state translation nor on the pioneer round of translation. This effect is found for both transfected and endogenous genes, indicating that SOS is a natural mechanism.
Bacterial regulatory networks are extremely flexible in evolutionLozada-Chávez, Irma; Janga, Sarath Chandra; Collado-Vides, Julio
doi: 10.1093/nar/gkl423pmid: 16840530
Over millions of years the structure and complexity of the transcriptional regulatory network (TRN) in bacteria has changed, reorganized and enabled them to adapt to almost every environmental niche on earth. In order to understand the plasticity of TRNs in bacteria, we studied the conservation of currently known TRNs of the two model organisms Escherichia coli K12 and Bacillus subtilis across complete genomes including Bacteria, Archaea and Eukarya at three different levels: individual components of the TRN, pairs of interactions and regulons. We found that transcription factors (TFs) evolve much faster than the target genes (TGs) across phyla. We show that global regulators are poorly conserved across the phylogenetic spectrum and hence TFs could be the major players responsible for the plasticity and evolvability of the TRNs. We also found that there is only a small fraction of significantly conserved transcriptional regulatory interactions among different phyla of bacteria and that there is no constraint on the elements of the interaction to co-evolve. Finally our results suggest that majority of the regulons in bacteria are rapidly lost implying a high-order flexibility in the TRNs. We hypothesize that during the divergence of bacteria certain essential cellular processes like the synthesis of arginine, biotine and ribose, transport of amino acids and iron, availability of phosphate, replication process and the SOS response are well conserved in evolution. From our comparative analysis, it is possible to infer that transcriptional regulation is more flexible than the genetic component of the organisms and its complexity and structure plays an important role in the phenotypic adaptation.