TY - JOUR AU - Schneider, Thomas D. AB - Abstract In vitro experiments that characterize DNA-protein interactions by artificial selection, such as SELEX, are often performed with the assumption that the experimental conditions are equivalent to natural ones. To test whether SELEX gives natural results, we compared sequence logos composed from naturally occurring leucine-responsive regulatory protein (Lrp) binding sites with those composed from SELEX-generated binding sites. The sequence logos were significantly different, indicating that the binding conditions are disparate. A likely explanation is that the SELEX experiment selected for a dimeric or trimeric Lrp complex bound to DNA. In contrast, natural sites appear to be bound by a monomer. This discrepancy suggests that in vitro selections do not necessarily give binding site sets comparable with the natural binding sites. Introduction Genetic control is exerted when proteins bind to specific nucleic acid sequences. Traditionally, these sequences have been collected from the naturally evolved sites. More recently, protein-binding motifs have been characterized by using in vitro selection procedures. It is often assumed that in vitro results accurately reflect natural binding sites, but a quantitative comparison of the two approaches has usually been lacking. In this paper we make this comparison for the leucine-responsive regulatory protein (Lrp). Lrp is a pleiotropic DNA-binding protein in Escherichia coli and Salmonella typhimurium that consists of two 18.8 kDa subunits (1), and that forms a homodimer in solution (2). Lrp binds to multiple sites in a number of operons, including dad, fanABC, papBA and ilvIH (3–5). Leucine can invoke either positive or negative transcriptional control by Lrp (1,6). Cui et al. investigated Lrp by using the SELEX (systematic evolution of ligands by exponential enrichment) procedure (7), an in vitro method that is used to identify binding motifs. In the SELEX procedure, a specific protein is used to select binding sequences from random synthetic sequences (8). Since its introduction, the SELEX technique has been used to study a variety of systems (9,10). Since Lrp has many natural binding sites, a reasonably accurate model for in vivo binding sequences can be created and compared with sites produced by SELEX. Based on Claude Shannon's information theory (11,12), molecular information theory (13,14) is a mathematical approach to explaining molecular interactions. Using information theory, we constructed two separate models of Lrp binding sequences for comparison. These quantitative models, called sequence logos (15), graphically represent Lrp binding in both the natural and synthetic environments. Comparison of the models allowed us to test whether the sites selected in vitro had evolved to simulate natural binding sites. Materials and Methods Twenty-seven Lrp binding sites were aligned for analysis of Lrp binding patterns (Fig. 1). Only sites with supporting experimental data (footprint, mutational analysis, deletions) were used. Seventeen of the 27 sites are on the E.coli chromosome and the remaining 10 are found on four different plasmids. The lrp, gltBDF, leuABCD, oppA, pnt, sdaA, glyA, livJ, glnALG, fimB, fanABC, serA and ompF sites, discussed in Fraenkel et al. (16), were not used because no experimental data supported binding there. The delila, alist, encode, rseq, dalvec and makelogo programs were used as described previously to create both natural (Fig. 2) and SELEX (Fig. 5) sequence logos (15,17). The malign program was used to adjust and check the alignment of the Lrp sites, and to maximize the information content (18). The rsim program was used to determine the standard deviation of Rsequence (13,19). The information content of individual genetic sequences (Ri) can be determined to identify potential binding sites (20–25). The programs ri, scan, search, live and lister were used to identify and map the Lrp sites relative to the footprint data, and sites were displayed by the sequence walker method (20,21). First, we used ri to create an Riw (b, l) weight matrix from the aligned set of sites. Then we scanned each sequence with the natural Lrp weight matrix using scan. Next, we used the search program to identify and mark the footprinted regions on the map. The live program was used to create a spectrum color strip to indicate protein binding site orientation on B-form DNA (Figs 3 and 4). We then used the lister program to place the walkers and other features on the DNA map. Figure 1 View largeDownload slide Aligned listing of 27 E.coli Lrp binding sites. Columns from left to right indicate gene or operon name; Lrp activation (A), repression (R) or whether its effect is unknown (?); GenBank accession number; zero coordinate in GenBank entry; the orientation of the sequence relative to the GenBank entry; sequence number; the binding sequence and Ri of each site in bits for the range −1 to +12. Twenty-five footprinted sites [ilvIH (5,27), trxB (36), micF and ompC (37), gcv (38), gltBDF (33), lysU (39), papBA (4,40,41), faeA (42), daaAB and sfaBA (43)], and two mutated sites that affected binding [dad (3) and tdh (44)] are shown. The alignment is derived from Fraenkel (16). These sites are summarized as sequence logos in Figure 2. Figure 1 View largeDownload slide Aligned listing of 27 E.coli Lrp binding sites. Columns from left to right indicate gene or operon name; Lrp activation (A), repression (R) or whether its effect is unknown (?); GenBank accession number; zero coordinate in GenBank entry; the orientation of the sequence relative to the GenBank entry; sequence number; the binding sequence and Ri of each site in bits for the range −1 to +12. Twenty-five footprinted sites [ilvIH (5,27), trxB (36), micF and ompC (37), gcv (38), gltBDF (33), lysU (39), papBA (4,40,41), faeA (42), daaAB and sfaBA (43)], and two mutated sites that affected binding [dad (3) and tdh (44)] are shown. The alignment is derived from Fraenkel (16). These sites are summarized as sequence logos in Figure 2. To compare binding energy with individual information (Fig. 6), we plotted the relative binding strength from Cui et al. (7) against the strongest Ri found by scanning the SELEX sites with a natural weight matrix. The reported binding site strength was also compared with the Ri predicted from the SELEX sequences themselves. The xyplo program was used to generate the graph. Further information on programs is available at http://www-lecb.ncifcrf.gov/~toms/ Results Natural Lrp sites The Lrp sequence logo (Fig. 2A) shows well conserved bases at positions −1 to +3 and low conservation up to position +12. Positions −1 to +3 have heights >1 bit, suggesting major groove binding in that region or minor groove binding with distortion of the helix (26). The cosine wave represents B-form DNA. The sequence logo follows the wave fairly well, a common attribute of many logos in which the peaks correspond to major groove binding (17). However, methylation protection is observed outside of positions −1 to +3, which is inconsistent with this suggested major groove binding. DMS modifies the N7 position of guanine, but these are rare in the region −1 to +3 and none were in the experimental DNA (27), so the DMS protection experiments did not address the question of whether contacts are made in this region. It is possible that Lrp binds to the major groove in the protected regions −5, −4 and +4 to +7 and that the sequence conservation in −1 to +3 is through the minor groove. To account for the sequence conservation exceeding 1 bit in the minor groove, the DNA would have to be heavily distorted, as with TATA binding sites (28–30). The sites contained a total information content of Rsequence = 10.8 ± 0.9 bits, for the range −1 to +12. Lrp is known to both activate and repress transcription (6), so sequence logos for both Lrp activation and repression sites were made (Fig. 2B and C). There are no major differences observed between the sequence characteristics of activation and repression sites, except for more strongly conserved bases at the −10, −9, −2, −1 and +6 positions in the activation logo and a more strongly conserved A at the +2 position and T at the −4 and +12 positions in the repression logo. Activation sites occur about twice as frequently as repression sites. To see if activation sites can be distinguished from repression sites, activation and repression Riw (b, l) weight matrices were assembled for individual informational analysis of all footprinted Lrp sites. Activation sites were given higher Ri evaluation by the activation model and the repression sites were favored by the repression model. To test the predictive capability of these matrices, we repeated this analysis but excluded each site from its own matrix. We found that we could not predict repression versus activation. The failure of this bootstrap test, for all sites, suggests that either the activation and repression sites are essentially identical or that more examples are needed to distinguish between them. Figure 2 View largeDownload slide Sequence logos of natural Lrp sites. (A) Sequence logo of the 27 wild-type Lrp binding sites shown in Figure 1. Sequence conservation, measured in bits of information, is depicted by the height of a stack of letters for each position in the binding sites. The relative heights of the letters within a stack are proportional to their frequencies. Circles were placed below guanines protected from DMS attacks by Lrp (27). Open circles (○) are guanines protected on the top strand, and filled circles (•) are guanines protected on the bottom strand. Triangles (⊲) denote DNase I hypersensitive sites (43). (B) Sequence logo of 17 Lrp-activation binding sites (A in Fig. 1). (C) sequence logo of nine Lrp-repression binding sites (R in Fig. 1). The cosine wave represents the 10.6 base twist of B-form DNA (17,26). Figure 2 View largeDownload slide Sequence logos of natural Lrp sites. (A) Sequence logo of the 27 wild-type Lrp binding sites shown in Figure 1. Sequence conservation, measured in bits of information, is depicted by the height of a stack of letters for each position in the binding sites. The relative heights of the letters within a stack are proportional to their frequencies. Circles were placed below guanines protected from DMS attacks by Lrp (27). Open circles (○) are guanines protected on the top strand, and filled circles (•) are guanines protected on the bottom strand. Triangles (⊲) denote DNase I hypersensitive sites (43). (B) Sequence logo of 17 Lrp-activation binding sites (A in Fig. 1). (C) sequence logo of nine Lrp-repression binding sites (R in Fig. 1). The cosine wave represents the 10.6 base twist of B-form DNA (17,26). Evidence of model accuracy To test our model's accuracy, we scanned the complete 27 site individual information weight matrix across the six in vivo footprinted ilvIH sites (5) and displayed the results using sequence walkers (Fig. 3). The walkers coincided with the six protected areas, confirming the model. The Lrp Riw (b, l) weight matrix was scanned across the other sites of Figure 1 and similar results were seen (data not shown). Interestingly, the sites all have the same asymmetric orientation and fall on two opposite faces of the DNA. Informational predictions of possible sites To test our Lrp binding site model, we excluded the fimA regulatory region from our data set (Fig. 1). (OP)2Cu2+ footprinting data along with two-stage methidiumpropyl-EDTA footprinting analysis indicates that Lrp binds to over 60 bp in fimA (31,32), and based on a pre-existing consensus sequence, two Lrp binding sites had been predicted at the ends of the protected region (31). As shown by the map of Figure 4, both sites appear to be correctly placed since there is a 10.1 bit walker beneath the first and a 3.3 bit walker partially beneath the second. In addition, the map suggests multiple Lrp binding sites (22) in the fimA regulatory region, all in the same asymmetric orientation and on approximately two opposite faces of the DNA. Figure 3 View largeDownload slide Lrp sequence walkers compared to in vivo DMS footprinting data. Six in vivo DMS footprinted ilvIH sites are marked by dashed lines (5). Beneath these sites are sequence walkers along with the Ri of each site given in bits. The height of each letter in a walker is the sequence conservation that that base contributes to the average sequence conservation shown in the sequence logo (Fig. 2A). The green rectangles mark the zero coordinate of each walker and provide a scale from −3 to +2 bits. All letters of the walkers are rotated 90° counter-clockwise, indicating that all Lrp ilvIH sites have the same orientation. Walker location was determined by the scan program for Ri > 4 bits, which includes all known natural sites in Figure 1. The asterisks and numbers above the sequence indicate the position on the E.coli genome, GenBank entry U00096 (45). The color strip above the sequence has a 10.6 base cycle, representing the helical structure of B-form DNA. Sites 1, 2 and 4 are on the same face because their zero coordinates fall under the same color. Sites 3, 5 and 6 are on the opposite face. Figure 3 View largeDownload slide Lrp sequence walkers compared to in vivo DMS footprinting data. Six in vivo DMS footprinted ilvIH sites are marked by dashed lines (5). Beneath these sites are sequence walkers along with the Ri of each site given in bits. The height of each letter in a walker is the sequence conservation that that base contributes to the average sequence conservation shown in the sequence logo (Fig. 2A). The green rectangles mark the zero coordinate of each walker and provide a scale from −3 to +2 bits. All letters of the walkers are rotated 90° counter-clockwise, indicating that all Lrp ilvIH sites have the same orientation. Walker location was determined by the scan program for Ri > 4 bits, which includes all known natural sites in Figure 1. The asterisks and numbers above the sequence indicate the position on the E.coli genome, GenBank entry U00096 (45). The color strip above the sequence has a 10.6 base cycle, representing the helical structure of B-form DNA. Sites 1, 2 and 4 are on the same face because their zero coordinates fall under the same color. Sites 3, 5 and 6 are on the opposite face. SELEX-generated sites Cui et al. used the SELEX procedure (8) to obtain sequences that bind Lrp (7). Two SELEX experiments that had been performed in the absence of leucine are represented by sequence logos in Figure 5A and B and combined in Figure 5C. A third experiment that had been performed in the presence of leucine has a similar logo (Fig. 5D). Cui et al. noticed that the sequences from all three experiments were similar, so they combined all of the sites to generate their consensus sequence. We therefore also made a sequence logo that combined all of the SELEX sequences (Fig. 5E). These sites contained an Rsequence of 13.9 ± 0.6 bits, for the range −7 to +8. Various sized symmetric and asymmetric SELEX models were used to predict binding in the natural sequences (Fig. 1) and, unlike the predictions by the natural model, the sites were not consistently identified (data not shown). Finally, we plotted each reported relative binding strength against the corresponding natural Ri values (Fig. 6A), and we plotted the binding strength against Ri values from a weight matrix created from all 67 SELEX sequences (Fig. 6B). Figure 4 View largeDownload slide Predicted Lrp sites in the fimA regulatory region. Two Lrp binding sites previously predicted (31) are marked with dashes, while (OP)2Cu2+ and two-stage methidiumpropyl-EDTA footprinting data shows extended protection (31,32), that corresponds to a series of Lrp walkers. To include the 3.3 bit site predicted by Gally et al. (31), all sequence walkers with Ri > 3 bits are shown. The black rectangle indicates a base not observed in the original data set (Fig. 1). If this site were included in the model, it would become 7.2 bits (20). All predicted Lrp sites have the same orientation and the four sites having 10.1, 5.1, 4.8 and 5.5 bits would be on the same face of the DNA since their zero positions are at nearly the same color on the spectrum, while the 4.0, 4.6 and 7.5 bit sites are approximately on the opposite face. The sequence is from GenBank accession U00096 (45). Figure 4 View largeDownload slide Predicted Lrp sites in the fimA regulatory region. Two Lrp binding sites previously predicted (31) are marked with dashes, while (OP)2Cu2+ and two-stage methidiumpropyl-EDTA footprinting data shows extended protection (31,32), that corresponds to a series of Lrp walkers. To include the 3.3 bit site predicted by Gally et al. (31), all sequence walkers with Ri > 3 bits are shown. The black rectangle indicates a base not observed in the original data set (Fig. 1). If this site were included in the model, it would become 7.2 bits (20). All predicted Lrp sites have the same orientation and the four sites having 10.1, 5.1, 4.8 and 5.5 bits would be on the same face of the DNA since their zero positions are at nearly the same color on the spectrum, while the 4.0, 4.6 and 7.5 bit sites are approximately on the opposite face. The sequence is from GenBank accession U00096 (45). Discussion Experimentally characterized natural Lrp binding sites (Fig. 1) have typical sequence logos that show moderate variation between activation and repression sites (Fig. 2). However, we were unable to use separated activation and repression models to predict activation and repression. Because they are asymmetric, all natural sequence logos are consistent with a monomer of Lrp binding the DNA, even though experimental evidence suggests homodimer formation in solution (2). While the sinusoidal shape of the logo and sequence conservation in excess of 1 bit suggest major groove binding (17), protection of major groove N7 moieties is outside the region of conservation, suggesting the alternative possibility of minor groove binding in conjunction with large distortions of the DNA helix (26,28,29). In any case, individual information analysis shows that Lrp sites are easily identified in footprinted regions (Figs 3 and 4). As with Fis binding sites (22), predicted Lrp binding sites appear to have specific patterns of binding to opposite faces of the DNA helix. In addition, the sites are all oriented in the same direction. The physiological implications of these structures are unknown. Surprisingly, the sequence logos for natural Lrp binding sites determined by footprints or mutations (Fig. 2A) do not closely resemble the sequence logos obtained by SELEX (Fig. 5E). The SELEX information content is 13.9 ± 0.6 bits whereas the natural is 10.8 ± 0.9 bits, and these differ significantly (P < 0.005 by Student's t-test for 92 degrees of freedom). Although the central regions from −1 to +3 resemble each other, the SELEX logo has two additional regions, −7 to −5 and +5 to +7, that are complementary to each other (7) and which account for 54% of the SELEX information. A residue of this is visible in the natural logo but only comprises 13% of the natural information. Finally, a lack of correlation between the measured relative binding strength and the Ri found using the natural Riw (b, l) weight matrix (Fig. 6A), demonstrates the incongruity of the two models. Unexpectedly, there was also a poor correlation between the binding strength and an Riw (b, l) weight matrix created from the SELEX sequences themselves (Fig. 6B). To explain these major discrepancies between the natural and the SELEX sites, we suggest that three proteins are binding in the SELEX experiment, since there are three bulges of sequence conservation that rise above the 1 bit mark in the SELEX logo (Fig. 5E). Two of these are the complementary regions separated by 10 bp at −7 to −5 and +5 to +7, which Cui et al. suggested could be bound by a homodimer (7). The correlation between these conserved regions and the 10.6 base cosine wave (Fig. 5E) (17,26), suggests that the homodimer binds in two major grooves on one DNA face. The sequence in between these two conserved regions is asymmetric and vaguely resembles the natural logo (Fig. 2) because of the prominent Ts, suggesting a monomer binding in the major groove of the opposite DNA face. A possible explanation for differences between the in vivo and in vitro results is that Lrp naturally forms a trimer with only the central molecule specifically binding to the DNA. A homodimer flanking a central monomer would not only be consistent with sequence conservation in the center of both natural and SELEX logos (−1 to +3 in Figs 2A and 5E) but also explains the methylation protection observed outside the center region in the natural sites and the outer conservation observed in the SELEX sites. Others have suggested that Lrp forms a dimer but that only one of the monomers binds to the DNA (33). Figure 5 View largeDownload slide Sequence logos of SELEX-generated Lrp binding sites. (A) Sequence logo of 30 sites that made up SELEX experiment 1 of Cui et al. (7), in which there was no leucine added. (B) Sequence logo of 25 sites that made up experiment 2, without leucine. (C) Sequence logo for experiments 1 and 2 combined, without leucine. (D) Sequence logo of 12 sites that made up experiment 3, with leucine. (E) Sequence logo for experiments 1, 2 and 3 combined. (E) has two cosine waves to show possible major groove binding on two different faces. The SELEX and natural coordinate systems were chosen to facilitate comparison with each other. Figure 5 View largeDownload slide Sequence logos of SELEX-generated Lrp binding sites. (A) Sequence logo of 30 sites that made up SELEX experiment 1 of Cui et al. (7), in which there was no leucine added. (B) Sequence logo of 25 sites that made up experiment 2, without leucine. (C) Sequence logo for experiments 1 and 2 combined, without leucine. (D) Sequence logo of 12 sites that made up experiment 3, with leucine. (E) Sequence logo for experiments 1, 2 and 3 combined. (E) has two cosine waves to show possible major groove binding on two different faces. The SELEX and natural coordinate systems were chosen to facilitate comparison with each other. When in vitro selections for OxyR and TrpR were analyzed by information theory, they were also found to give results that differ from those obtained from naturally selected sites (17,34). The differences between the in vivo and in vitro Lrp logos might be attributed to unnatural experimental conditions. Inappropriate salt levels or temperatures, the absence of spermidine (35), or selection of band-shifted DNA with the highest molecular weight (i.e., a triplet complex), among many other possibilities, might be reasons for these results. Sequence logos could be used to quantitatively investigate the effects of such varying conditions (17). In vitro selection procedures do not always mimic natural evolution (10). The strongest sites, such as those found by SELEX, are not ‘optimal’ when viewed on an information theory scale (20). Instead, natural sites are observed to have a Gaussian distribution that peters out at the high end. From this viewpoint, the strongest possible sites are seen as abnormal. When SELEX is pushed to obtain the strongest possible binding sites, the resulting sequence logo should show more sequence conservation than the natural sites, and as shown in this paper may be radically different from the natural logo. When the in vitro selections are more mild, the logos may resemble each other if the conditions are comparable. If one's goal is to obtain the strongest binder, as has been the emphasis for most of the work with SELEX (10), then strong selection is appropriate and sequence logos can be used to characterize the strong sites. If, instead, the goal is to learn more about the binding pattern of natural ligands, then weaker selection under various conditions could be guided by using sequence logos. Figure 6 View largeDownload slide Comparing the relative binding strength of Cui et al. with Ri. (A) Using the Riw (b, l) weight matrix from the natural sites (Fig. 1), we scanned the SELEX-generated sequences in Cui et al. (7). The highest Ri for each SELEX-generated site was chosen for the 62 sites reported; no correlation was observed between binding strength and Ri (r = 0.15). Also, the sum of all positive Ri values in each sequence was compared with the reported binding energy, but no correlation was found by this second approach (r = 0.06). (B) An Riw (b, l) matrix was made from the SELEX sequences and used to evaluate the same sequences. The single outlier, referred to as 7 in Figure 3 of Cui et al. (7), contains a T at +2 that is not observed in any other SELEX sequence and is therefore rated with a low value (20) (r = 0.43 without the outlier). Figure 6 View largeDownload slide Comparing the relative binding strength of Cui et al. with Ri. (A) Using the Riw (b, l) weight matrix from the natural sites (Fig. 1), we scanned the SELEX-generated sequences in Cui et al. (7). The highest Ri for each SELEX-generated site was chosen for the 62 sites reported; no correlation was observed between binding strength and Ri (r = 0.15). Also, the sum of all positive Ri values in each sequence was compared with the reported binding energy, but no correlation was found by this second approach (r = 0.06). (B) An Riw (b, l) matrix was made from the SELEX sequences and used to evaluate the same sequences. The single outlier, referred to as 7 in Figure 3 of Cui et al. (7), contains a T at +2 that is not observed in any other SELEX sequence and is therefore rated with a low value (20) (r = 0.43 without the outlier). Acknowledgements We thank Paul Shultzaberger for providing R.K.S. with a car, Debby Shultzaberger for re-typing sequences for comparison, Kay Kennedy and Emily Moler for running the Werner H. Kirsten Student Intern Program at NCI, Brenda Deener for scientific guidance and instruction, Mike Miller for bringing attention to grammatical errors, Elaine Bucheimer, Karen Lewis, Paul N. Hengen and Peter K. Rogan for their critique. We thank an anonymous referee for suggesting Figure 6B. R.K.S. would also like to thank the National Institutes of Health for funding his research. References 1 Newman E.B., Lin R.T., D'ari R.. Neidhardt F.C.. , Escherichia coli and Salmonella: Cellular and Molecular Biology , 1996 Washington, DC ASM Press(pg. 1513- 1525) 2 Willins D.A., Ryan C.W., Plakto J.V., Calvo J.M.. , J. Biol. Chem. , 1991, vol. 266 (pg. 10768- 10774) PubMed 3 Mathew E., Zhi J., Freundlich M.. , J. Bacteriol. , 1996, vol. 178 (pg. 7234- 7240) CrossRef Search ADS PubMed 4 Braaten B.A., Plakto J.V., van der Woude M.W., Simons B.H., de Graff F.K., Calvo J.M., Low D.A.. , Proc. Natl Acad. Sci. USA , 1992, vol. 89 (pg. 4250- 4254) CrossRef Search ADS 5 Wang Q., Calvo J.M.. , J. Mol. Biol. , 1993, vol. 229 (pg. 306- 318) CrossRef Search ADS PubMed 6 Calvo J.M., Matthews R.G.. , Microbiol. Rev. , 1994, vol. 58 (pg. 446- 490) 7 Cui Y., Wang Q., Stormo G.D., Calvo J.M.. , J. Bacteriol. , 1995, vol. 177 (pg. 4872- 4880) CrossRef Search ADS PubMed 8 Tuerk C., Gold L.. , Science , 1990, vol. 249 (pg. 505- 510) CrossRef Search ADS PubMed 9 Klug S.J., Flamulok M.. , Mol. Biol. Rep. , 1994, vol. 20 (pg. 97- 107) CrossRef Search ADS PubMed 10 Gold L., Polisky B., Uhlenbeck O., Yarus M.. , Annu. Rev. Biochem. , 1995, vol. 64 (pg. 763- 797) CrossRef Search ADS PubMed 11 Shannon C.E.. , Bell System Tech. J. , 1948, vol. 27 (pg. 379- 423) CrossRef Search ADS 12 Shannon C.E.. , Bell System Tech. J. , 1948, vol. 27 (pg. 623- 656) CrossRef Search ADS 13 Schneider T.D., Stormo G.D., Gold L., Ehrenfeucht A.. , J. Mol. Biol. , 1986, vol. 188 (pg. 415- 431) CrossRef Search ADS PubMed 14 Schneider T.D.. , Nanotechnology , 1994, vol. 5 (pg. 1- 18) http://www-lecb.ncifcrf.gov/~toms/paper/nano2/.Schneider CrossRef Search ADS 15 Schneider T.D., Stephens R.M.. , Nucleic Acids Res. , 1990, vol. 18 (pg. 6097- 6100) CrossRef Search ADS PubMed 16 Fraenkel Y.M., Mandel Y., Friedberg D., Margalit H.. , CABIOS , 1995, vol. 11 (pg. 379- 387) PubMed 17 Schneider T.D.. , Methods Enzymol. , 1996, vol. 274 (pg. 445- 455) http://www-lecb.ncifcrf.gov/~toms/paper/oxyr PubMed 18 Schneider T.D., Mastronarde D.. , Discrete Applied Mathematics , 1996, vol. 71 (pg. 259- 268) http://www-lecb.ncifcrf.gov/~toms/paper/malign CrossRef Search ADS PubMed 19 Stephens R.M., Schneider T.D.. , J. Mol. Biol. , 1992, vol. 228 (pg. 1124- 1136) CrossRef Search ADS PubMed 20 Schneider T.D.. , J. Theor. Biol. , 1997, vol. 189 (pg. 427- 441) http://www-lecb.ncifcrf.gov/~toms/paper/ri/ CrossRef Search ADS PubMed 21 Schneider T.D.. , Nucleic Acids Res. , 1997, vol. 25 (pg. 4408- 4415) Erratum (1998) Nucleic Acids Res., 26, 1135. http://www-lecb.ncifcrf.gov/~toms/paper/walker/ CrossRef Search ADS PubMed 22 Hengen P.N., Bartram S.L., Stewart L.E., Schneider T.D.. , Nucleic Acids Res. , 1997, vol. 25 (pg. 4994- 5002) http://www-lecb.ncifcrf.gov/~toms/paper/fisinfo/ CrossRef Search ADS PubMed 23 Rogan P.K., Faux B.M., Schneider T.D.. , Human Mutat. , 1998, vol. 12 (pg. 153- 171) CrossRef Search ADS 24 Allikmets R., Wasserman W.W., Hutchinson A., Smallwood P., Nathans J., Rogan P.K., Schneider T.D., Dean M.. , Gene , 1998, vol. 215 (pg. 111- 122) CrossRef Search ADS PubMed 25 Kahn S.G., Levy H.L., Legerski R., Quackenbush E., Reardon J.T., Emmert S., Sancar A., Li L., Schneider T.D., Cleaver J.E., Kraemer K.H.. , J. Invest. Dermatol. , 1998, vol. 111 (pg. 791- 796) CrossRef Search ADS PubMed 26 Papp P.P., Chattoraj D.K., Schneider T.D.. , J. Mol. Biol. , 1993, vol. 233 (pg. 219- 230) CrossRef Search ADS PubMed 27 Marasco R., Varcamonti M., Cara F.L., Ricca E., Felice M.D., Sacco M.. , J. Bacteriol. , 1994, vol. 176 (pg. 5197- 5201) CrossRef Search ADS PubMed 28 Kim Y., Geiger J.H., Hahn S., Sigler P.B.. , Nature , 1993, vol. 365 (pg. 512- 520) CrossRef Search ADS PubMed 29 Kim J.L., Nikolov D.B., Burley S.K.. , Nature , 1993, vol. 365 (pg. 520- 527) CrossRef Search ADS PubMed 30 Penotti F.E.. , J. Mol. Biol. , 1990, vol. 213 (pg. 37- 52) CrossRef Search ADS PubMed 31 Gally D.L., Rucker T.J., Blomfield I.C.. , J. Bacteriol. , 1994, vol. 176 (pg. 5665- 5672) CrossRef Search ADS PubMed 32 Roesch P.L., Blomfield I.C.. , Mol. Microbiol. , 1998, vol. 27 (pg. 751- 761) CrossRef Search ADS PubMed 33 Wiese D.E., Ernsting B.R., Blumenthal R.M., Mathews R.G.. , J. Mol. Biol. , 1997, vol. 270 (pg. 152- 168) CrossRef Search ADS PubMed 34 Haran T.E.. , J. Biomol. Struct. Dyn. , 1998, vol. 15 (pg. 689- 701) CrossRef Search ADS PubMed 35 Pingoud A.. , Eur. J. Biochem. , 1985, vol. 147 (pg. 105- 109) CrossRef Search ADS PubMed 36 Wang Q., Wu J., Friedberg D., Plakto J., Calvo J.M.. , J. Bacteriol. , 1994, vol. 176 (pg. 1831- 1839) CrossRef Search ADS PubMed 37 Ferrario M., Ernsting B.R., Borst D.W., Wiese D.E., Blumenthal R.M., Matthews R.G.. , J. Bacteriol. , 1995, vol. 177 (pg. 103- 113) CrossRef Search ADS PubMed 38 Stauffer L.T., Stauffer G.V.. , J. Bacteriol. , 1994, vol. 176 (pg. 6159- 6164) CrossRef Search ADS PubMed 39 Lin R., Ernsting B., Hirshfield I.N., Mathews R.G., Neidhardt F.C., Clark R.L., Newman E.B.. , J. Bacteriol. , 1992, vol. 174 (pg. 2779- 2784) CrossRef Search ADS PubMed 40 Nou X., Braaten B., Kaltenbach L., Low D.A.. , EMBO J. , 1995, vol. 14 (pg. 5785- 5797) PubMed 41 van der Woude M.W., Braaten B.A., Low D.A.. , Mol. Microbiol. , 1992, vol. 6 (pg. 2429- 2435) CrossRef Search ADS PubMed 42 Huisman T.T., de Graff F.K.. , Mol. Microbiol. , 1995, vol. 16 (pg. 943- 953) CrossRef Search ADS PubMed 43 van der Woude M.W., Low D.A.. , Mol. Microbiol. , 1994, vol. 11 (pg. 605- 618) CrossRef Search ADS PubMed 44 Rex J.H., Aronson B.D., Somerville R.L.. , J. Bacteriol. , 1991, vol. 173 (pg. 5944- 5953) CrossRef Search ADS PubMed 45 Blattner F.R., Plunkett G.III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.. , Science , 1997, vol. 277 (pg. 1453- 1474) CrossRef Search ADS PubMed © 1999 Oxford University Press TI - Using sequence logos and information analysis of Lrp DNA binding sites to investigate discrepancies between natural selection and SELEX JF - Nucleic Acids Research DO - 10.1093/nar/27.3.882 DA - 1999-02-01 UR - https://www.deepdyve.com/lp/oxford-university-press/using-sequence-logos-and-information-analysis-of-lrp-dna-binding-sites-SP80tlKyP9 SP - 882 EP - 887 VL - 27 IS - 3 DP - DeepDyve ER -