Access the full text.
Sign up today, get DeepDyve free for 14 days.
Yoseph Barash, G. Elidan, N. Friedman, Tommy Kaplan (2003)
Modeling dependencies in protein-DNA binding sites
J. Hughes, P. Estep, Saeed Tavazoie, G. Church (2000)
Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae.Journal of molecular biology, 296 5
M. Burset, I. Seledtsov, V. Solovyev (2001)
SpliceDB: database of canonical and non-canonical mammalian splice sitesNucleic acids research, 29 1
(1998)
Mechanical devices of the spliceosome: motors, clocks
S. Kirkpatrick, C. Gelatt, Mario Vecchi (1983)
Optimization by Simulated AnnealingScience, 220
S. Salzberg, A. Delcher, S. Kasif, O. White (1998)
Microbial gene identification using interpolated Markov models.Nucleic acids research, 26 2
G. Potamianos, F. Jelinek (1998)
A study of n-gram and decision tree letter language modeling methodsSpeech Commun., 24
L. Pachter, M. Alexandersson, S. Cawley (2001)
Applications of generalized pair hidden Markov models to alignment and gene finding problemsJournal of computational biology : a journal of computational molecular cell biology, 9 2
G. Schwarz (1978)
Estimating the Dimension of a ModelAnnals of Statistics, 6
M. Alexandersson, S. Cawley, L. Pachter (2003)
SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model.Genome research, 13 3
P. Bühlmann (2000)
Model Selection for Variable Length Markov Chains and Tuning the Context AlgorithmAnnals of the Institute of Statistical Mathematics, 52
(2000)
Statistical models for DNA sequencing and analysis
Deyou Cai, A. Delcher, B. Kao, S. Kasif (2000)
Modeling splice sites with Bayes networksBioinformatics, 16 2
T. Schneider, R. Stephens (1990)
Sequence logos: a new way to display consensus sequences.Nucleic acids research, 18 20
G. Stuart, K. Moffett, Steve Baker (2002)
Integrated gene and species phylogenies from unaligned whole genome protein sequencesBioinformatics, 18 1
K. Ellrott, Chuhu Yang, F. Sladek, Tao Jiang (2002)
Identifying transcription factor binding sites through Markov chain optimizationBioinformatics, 18 Suppl 2
C. Lawrence, S. Altschul, M. Boguski, Jun Liu, A. Neuwald, J. Wootton (1993)
Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.Science, 262 5131
J. Rissanen (1986)
Complexity of strings in the class of Markov sourcesIEEE Trans. Inf. Theory, 32
A. Barron, J. Rissanen, Bin Yu (1998)
The Minimum Description Length Principle in Coding and ModelingIEEE Trans. Inf. Theory, 44
H. Akaike (1974)
A new look at the statistical model identificationIEEE Transactions on Automatic Control, 19
C. Burge, S. Karlin (1997)
Prediction of complete gene structures in human genomic DNA.Journal of molecular biology, 268 1
M. Reese, D. Kulp, H. Tammana, D. Haussler (2000)
Genie--gene finding in Drosophila melanogaster.Genome research, 10 4
Gene Yeo, C. Burge (2003)
Maximum entropy modeling of short sequence motifs with applications to RNA splicing signalsJournal of computational biology : a journal of computational molecular cell biology, 11 2-3
R. Staden (1984)
Computer methods to locate signals in nucleic acid sequencesNucleic acids research, 12 1 Pt 2
S. Batzoglou, L. Pachter, J. Mesirov, B. Berger, E. Lander (2000)
Human and mouse gene structure: comparative analysis and application to exon predictionGenome research, 10 7
P. Green (1995)
Reversible jump Markov chain Monte Carlo computation and Bayesian model determinationBiometrika, 82
Michael Zhang, T. Marr (1993)
A weight array method for splicing signal analysisComputer applications in the biosciences : CABIOS, 9 5
H. Chipman, E. George, R. McCulloch (1998)
Bayesian CART Model SearchJournal of the American Statistical Association, 93
W. Fairbrother, L. Chasin (2000)
Human Genomic Sequences That Inhibit SplicingMolecular and Cellular Biology, 20
J. Staley, C. Guthrie (1998)
Mechanical Devices of the Spliceosome: Motors, Clocks, Springs, and ThingsCell, 92
Richard Seifert, K. Nauhaus, T. Treude, Volker Thiel, M. Blumenberg, K. Knittel, A. Gieseke, K. Peterknecht, Thomas Pape, A. Boetius, Rudolf Amann, Bo Jørgensen, Friedrich Widdel, Jörn Peckmann, Nikolai Pimenov
Predictive Identification of Exonic Splicing Enhancers in Human Genes
E. Wingender, Xin Chen, R. Hehl, H. Karas, I. Liebich, V. Matys, T. Meinhardt, M. Prüß, I. Reuter, F. Schacherer (2000)
TRANSFAC: an integrated system for gene expression regulationNucleic acids research, 28 1
Haiyan Huang, M. Kao, X. Zhou, Jun Liu, W. Wong (2004)
Determination of Local Statistical Significance of Patterns in Markov Sequences with Application to Promoter Element IdentificationJournal of computational biology : a journal of computational molecular cell biology, 11 1
Finding Short DNA Motifs Using Permuted Markov Models Xiaoyue Zhao Haiyan Huang , Terence P. Speed ¡ , {xiaoyue, hhuang, terry}@stat.berkeley.edu ABSTRACT Many short DNA motifs such as transcription factor binding sites (TFBS) and splice sites exhibit strong local as well as non-local dependence. We introduce permuted variable length Markov models (PVLMM) which could capture the potentially important dependencies among positions, and apply them to the problem of detecting splice and TFB sites. They have been satisfactory from the viewpoint of prediction performance, and also give ready biological interpretations of the sequence dependence observed. The issue of model selection is also studied. 1. INTRODUCTION It is an important and also challenging task to identify short biological motifs such as transcription factor binding sites (TFBS) and splice sites, where the gene expression machinery interacts with nucleic acids. The binding of transcription factors to speci c DNA sequences near the transcription start site, is one of the rst steps determining whether a gene is turned on or o . Splice sites are the boundaries between exons and introns, consisting of 5 donor sites at the exon/intron junctions, and 3 acceptor sites at the intron/exon junctions. The variability
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.