TY - JOUR AU - Leroy, Jean-Louis AB - ABSTRACT The self-associative properties of cytidine-rich oligonucleotides into symmetrical i-motif tetramers give to these oligonucleotides the capacity of forming supramolecular structures (sms) that have potential applications in the nanotechnology domain. In order to facilitate sms formation, oligonucleotides containing two cytidine stretches of unequal length (CnXCm) separated by a non-cytidine spacer were synthesized. They were designed to associate into a tetramer including an i-motif core built by intercalation of the C·C+ pairs of the longer C stretch with the two dangling non-intercalated strands of the shorter C stretch at each end. Gel filtration chromatography shows that the non-intercalated C-rich ends give to this structure the capacity of forming extremely stable sms. Using C7GC4 as a model, we find that the sms formation rate varies as the oligonucleotide concentration and increases at high temperature. Competitively with the tetramer involved in sms elongation, CnXCm oligonucleotides form i-motif dimers that compete with sms elongation. The dimer stability is strongly reduced when the pH is moved away from the cytidine pK. This results in an equilibrium shift towards the tetramer and in the acceleration of the sms formation rate. The chromatograms of the sms formed by C7GC4 indicate a broad distribution. In a 1.5 mM solution incubated at 37°C, the equilibrium distribution is centered on a molecular weight corresponding to the assembly of nine tetramers and the upper limit corresponds to 80 tetramers. The lifetime of this structure is about 4 days at 40°C, pH 4.6. INTRODUCTION At slightly acid pH, the hemiprotonated C·C+ pairs formed by neutral and protonated cytidines induce the association of cytidine-rich oligonucleotides into parallel duplexes. These duplexes are short-lived elusive species, but the tetramers built by head to tail intercalation of two duplexes are extremely stable. This structural arrangement, called i-motif is exceptional in that it involves systematic base-pair intercalation (1). The i-motif structure may include four identical C-rich strands, two hairpins each caring two cytidine stretches or a folded strand caring four cytidine stretches (2,3). The remarkable symmetry of i-motif tetramers is a consequence of the equivalence of the four strands associated in the structure. Face to face intercalation of thymidine in i-motif structures is generally unfavorable (4,5) and purine intercalation seems to be sterically hindered (6,7). A recent kinetics investigation shows that i-motif tetramers with partial or full intercalation topology are formed at comparable rates and that the predominance at equilibrium of the fully intercalated species is due to its longer lifetime (8). The self-associative properties of C-rich stretches give to i-motif tetramers the potentiality of forming supramolecular structures (sms) by mutual intercalation of the non-paired cytidines of tetramers built by partially intercalated duplexes. This property has been reported in a recent study revealing the association of C7 oligonucleotides into linear repetitive structures that could have biotechnological applications (9). In order to understand and control the formation process of i-motif sms, we examine in the present paper how temperature, strand concentration, ionic strength and pH influence the sms formation and dissociation rates. Oligonucleotides of the CnXCm series were designed to improve the assembly of i-motif tetramers into sms. They are composed of two cytidine stretches of unequal length separated by a non-cytidine residue i.e. by A, T, G or a diol-spacer. They were synthesized with the longer C stretch either at the 5′- or the 3′-end. The non-cytidine spacer that cannot intercalate into the i-motif was intended to prevent intercalation of both cytidine stretches into a single i-motif core. It was expected that the spacer would enforce the formation of a tetramer (Te in Scheme 1) including the i-motif core built by the C·C+ pairs of the longer cytidine stretch and at each end the non-intercalated cytidines of the shorter stretch. Scheme 1. View largeDownload slide Postulated association pathway of C7GC4 into i-motif sms. The monomer is in equilibrium with one or several hairpin dimer(s) (cf. Scheme 2) and Te, the tetramer formed by full intercalation of the C7 stretches. Association and mutual intercalation of the C4 stretches of two Te building blocks result in the formation of the Te2 species. The i-motif symmetry gives to the assembly of several building blocks the same elongation capacity than that of the building blocks themselves and allows the association of preformed Ten and Tem species into structures including (n + m) building blocks. Scheme 1. View largeDownload slide Postulated association pathway of C7GC4 into i-motif sms. The monomer is in equilibrium with one or several hairpin dimer(s) (cf. Scheme 2) and Te, the tetramer formed by full intercalation of the C7 stretches. Association and mutual intercalation of the C4 stretches of two Te building blocks result in the formation of the Te2 species. The i-motif symmetry gives to the assembly of several building blocks the same elongation capacity than that of the building blocks themselves and allows the association of preformed Ten and Tem species into structures including (n + m) building blocks. Gel filtration chromatography shows that these oligonucleotides associate into sms. The stability, the formation and dissociation rates of the sms formed by oligonucleotides differing by the C stretch length and the nature of the spacer were qualitatively compared. The relatively fast formation rate and the good stability of the sms formed by C7GC4 prompted us to select this sequence as a benchmark model to investigate the effect of pH, strand concentration, temperature and ionic strength on the sms formation and dissociation kinetics. The influence of the C stretch length and of the nature of non-cytidine spacer on the sms stability, formation rate and structure will be examined in separate publications. MATERIALS AND METHODS Oligonucleotide synthesis and sample preparation The oligonucleotides were synthesized on a 2 or 10 μM scale, purified by chromatography on an anionic DEAE column according to procedures already described and extensively dialyzed (5). After dialysis, the oligonucleotide solutions were lyophilized and dissolved in water to make 0.5–1 mM stock solutions. The C7GC4 concentration was determined using the A260 values, 89600 M–1 cm–1, computed according to a nearest neighbor model (10). Formation and dissociation rate measurements of i-motif sms Except otherwise stated, all the measurements were performed in 0.4 M NaCl, 10 mM Na acetate and 10 mM Na phosphate, a solution designed as the NAP buffer. During sms formation and dissociation, the samples, typically 50–200 μl, were incubated in a mastercycler Eppendhorf® PCR incubator whose lid temperature was set 5°C above that of the samples in order to avoid condensation on the cap of the Eppendhorf tubes. To study the sms formation kinetics, the samples were initially melted at 100°C and rapidly cooled at the temperature of the incubator. For sms dissociation time measurements, a solution containing only sms was prepared by pooling the species eluted on a GPC 100 column with a molecular weight larger than that of a tetramer. Afterward, this solution was diluted in the NAP buffer in such way as to allow complete dissociation of the sms at equilibrium. The measurements started right after dilution. Gel filtration chromatography During sms formation and dissociation, the sample composition was measured at room temperature by gel filtration chromatography (11) on GPC 100 and GPC 1000 columns (250 × 4.6 mm I.D.) provided by Eprogen®. Aliquots took off the incubated samples were injected in the column immediately after dilution to 25 μl (the volume of the injection loop) in the NAP buffer. They were eluted with the same buffer at a flow rate of 0.4 ml min–1. Systematic addition of thymidine at a final concentration of about 5 μM in the injected sample provided a reference marker on the chromatograms. The elution times ranged from 4 to 8.5 min. (Supplementary Data). The elution buffer pH was generally 4.6. However, when the evolution of the sample composition during elution on the column at pH 4.6 was not negligible, the elution buffer pH was adjusted to the value of the sample pH. The GPC-100 column whose exclusion limit correspond, according to the manufacturer to an oligonucleotide containing about 300 nt was calibrated using tRNA, thymidine, C-rich non-structured oligonucleotides, C-rich i-motif monomers and tetramers. The GPC-1000 column was calibrated with double-stranded plasmids containing from 800 to 16000 nucleosides, the i-motif tetramer of T2C8T2, tRNA and thymidine. The GPC-1000 calibration shows that oligonucleotides containing 50 and 104 residues are eluted, respectively, in the permeation and exclusion limits. The calibration curves displayed in Supplementary Data establishes the reasonable linearity of the measured elution times with the log of the molecular weight of reference oligonucleotides. The dispersion of the data points of the calibration curves suggests that the relative error on the estimated molecular weights is <0.5. The sizes of the sms were characterized by the mass center, Mc, of the sms distribution. Mc was computed according to Mc = ΣA(t)* N(t)/ Σ A(t), where A(t) is the absorbance of the solution and N(t) the number of tetrameric units in the sms eluted on the chromatogram at time t. N(t) was obtained from the calibration curve of the GPC 1000 column (Supplementary Data). The dimer and tetramer stoichiometries were also routinely measured by the slope of the log–log plot of the multimer versus monomer equilibrium concentrations. In pH and temperature conditions favorable to sms formation, an opalescence characteristic of the formation of extremely large structures was observed in concentrated samples (>1 mM) incubated during times much longer than the sms formation time constant. Correlatively with the development of the sample opalescence, the chromatogram of these samples showed a reduction (up to 50%) of the integrated area of the eluted species indicating that a fraction of the sample composed, presumably of the largest sms, was trapped in the column. In that case, the oligonucleotide fraction retained on the column was estimated by comparison with the chromatogram of an identical melted sample and included in the sms fraction. However, in order to have a direct control of all the species formed, we generally restrained the oligonucleotide concentrations and incubation times to a range of values allowing detection of all the species on the chromatograms. NMR method The NMR experiments were performed using a 500 MHz Varian Inova spectrometer with the jump and return sequence for water suppression (12). The spectra were accumulated with a repetition time longer than the relaxation times to avoid saturation. The spectral intensity was multiplied by a 1/sin function in order to correct the distortion introduced by the jump and return excitation (13). The spectra collected during the sms formation were normalized using the methyl proton peak of the acetate buffer. Multimer dissociation constants The dissociation constant of a multimer may be expressed as a function of αeq, the monomer equilibrium fraction of an oligonucleotide solution at concentration [M0] by: Kdis = [M0]s–1/(1– αeq) where s is the multimer stoichiometry. We will characterize the multimers stability by Fi, a parameter whose dimension is independent of the stoichiometry which is equal to the free monomer concentration for which αeq = 0.5. Fi is related to the multimer dissociation constant by: Fi = (Kdis/s)(1/(s–1)) and it will be designated as the reduced dissociation constant. RESULTS Chromatographic evidences for the formation of i-motif supramolecular structures Immediately after melting and fast cooling, a 3 μM C7GC4 sample at pH 6.2 injected into a GPC 100 column is mainly eluted as a monomer (Figure 1). The chromatograms recorded at different times during incubation at 20°C reveal that a dimer is formed with a time constant of about 1.5 h. At equilibrium, the dimer and monomer concentrations correspond to a reduced dissociation constant Fi = 2 μM. It has been shown that the formation rate of i-motif dimers increases with the nucleotide concentration and when the solution pH is shifted near 4.4, the value of the cytidine pKN3 (8). This explains that about 1 min after melting, an aliquot of a 0.15 mM C7GC4 solution, pH 4.6 is eluted as a dimer (Figure 1). The chromatograms of samples incubated at 50°C show the appearance of two peaks, one corresponding to a tetramer and the other to unresolved heavier species that will be designed as sms. Figure 1. View largeDownload slide Association of C7GC4 into i-motif multimers and supra molecular structures (sms) in the NAP buffer. The samples were initially melted. Left panel: GPC-100 chromatograms recorded after incubation during the times indicated. The elution times expected for a monomer (M), a dimer (D) and a tetramer (Te) are indicated. (A) the chromatogram of a 3 μM solution, pH 6.2 injected right after melting shows a 20/80 dimer/monomer mixture. The chromatograms recorded after incubation at 20°C show that the dimer is formed with a time constant of about 1.5 h. The monomer and dimer concentrations at equilibrium correspond to the reduced dissociation constant Fidimer = 2 μM. (B) The chromatograms of a 0.15 mM C7GC4 solution at pH 4.63 collected after melting and incubation at 50°C during the times indicated on the figure show that the dimer is slowly converted into tetramer and supramolecular structures. Right panel: evolution of the sms (red), tetramer (blue) and dimer (green) fractions as a function of the incubation time at 50°C. Figure 1. View largeDownload slide Association of C7GC4 into i-motif multimers and supra molecular structures (sms) in the NAP buffer. The samples were initially melted. Left panel: GPC-100 chromatograms recorded after incubation during the times indicated. The elution times expected for a monomer (M), a dimer (D) and a tetramer (Te) are indicated. (A) the chromatogram of a 3 μM solution, pH 6.2 injected right after melting shows a 20/80 dimer/monomer mixture. The chromatograms recorded after incubation at 20°C show that the dimer is formed with a time constant of about 1.5 h. The monomer and dimer concentrations at equilibrium correspond to the reduced dissociation constant Fidimer = 2 μM. (B) The chromatograms of a 0.15 mM C7GC4 solution at pH 4.63 collected after melting and incubation at 50°C during the times indicated on the figure show that the dimer is slowly converted into tetramer and supramolecular structures. Right panel: evolution of the sms (red), tetramer (blue) and dimer (green) fractions as a function of the incubation time at 50°C. The evolution versus time of the dimer, tetramer and sms fractions is displayed in Figure 1. After 0.5 h, the tetramer fraction reaches a maximum value corresponding to about 12% of the oligonucleotide concentration. The sms increase with a time constant of 1 h toward an equilibrium level corresponding to 65% of the oligonucleotide concentration. The dimer fraction decreases with a comparable time constant. The sms molecular weights were estimated using a GPC-1000 column. The chromatograms of aliquots taken from a 1.5 mM C7GC4 solution incubated at 37°C, pH 4.6 are displayed in Figure 2A. The dimer and the tetramer are eluted together in the permeation volume of the GPC-1000 column. The sms half formation time is 1 h and the oligonucleotide fraction associated in sms at equilibrium is about 75% (Figure 2D). The average size of the sms increases as a function of the incubation time (Figure 2B). After incubation over 4 days at 37°C, the sms exhibit an extremely broad distribution centered around 430 residues, a size corresponding to the association of nine tetramers. The elution time of the largest species, 4.5 min, is that expected for a structure including about 4000 residues, i.e. 80 tetrameric units (Figure 2A). Figure 2. View largeDownload slide Association of C7GC4 into sms as monitored by NMR and gel exclusion chromatography on GPC 1000 column. The oligonucleotide concentration was 1.5 mM. The NMR spectra and the chromatograms were recorded after melting and incubation in the NAP buffer, pH = 4.6 at 37°C during the times indicated. (A) The tetramer and the dimer are eluted together in the GPC 1000 chromatograms. The chromatograms collected as a function of the time show the formation of sms. The upper scale shows the elution times expected for sms including the indicated number of tetrameric repeats. Note that one tetramer contains 48 nucleotides. On each chromatogram, an arrow indicates the mass center of the sms distribution. (B) the number of tetramer repeats at the center of the sms distributions is plotted as a function of the incubation time. At equilibrium, the mass center of the sms distribution corresponds to the association of nine tetramers (i.e. 432 bases). The top 15% of the sms distribution includes structures that are five times larger. (C) exchangeable and aromatic proton regions of NMR spectra collected during sms formation. The line broadening (about 40 Hz) and the reduction of the signal intensity observed as a function of the time reflect the formation of large structures. (D) oligonucleotide fractions included in sms as determined from the chromatograms of panel A (open circles) and from the intensity of the NMR peaks of the spectra of panel C (black circles). The time constant derived from the chromatography measurements, 1 hour, concerns the formation of structures heavier than a tetramer, whereas that derived from the NMR experiments, 20 h, is related to structures too large to be detected on the NMR spectra. Figure 2. View largeDownload slide Association of C7GC4 into sms as monitored by NMR and gel exclusion chromatography on GPC 1000 column. The oligonucleotide concentration was 1.5 mM. The NMR spectra and the chromatograms were recorded after melting and incubation in the NAP buffer, pH = 4.6 at 37°C during the times indicated. (A) The tetramer and the dimer are eluted together in the GPC 1000 chromatograms. The chromatograms collected as a function of the time show the formation of sms. The upper scale shows the elution times expected for sms including the indicated number of tetrameric repeats. Note that one tetramer contains 48 nucleotides. On each chromatogram, an arrow indicates the mass center of the sms distribution. (B) the number of tetramer repeats at the center of the sms distributions is plotted as a function of the incubation time. At equilibrium, the mass center of the sms distribution corresponds to the association of nine tetramers (i.e. 432 bases). The top 15% of the sms distribution includes structures that are five times larger. (C) exchangeable and aromatic proton regions of NMR spectra collected during sms formation. The line broadening (about 40 Hz) and the reduction of the signal intensity observed as a function of the time reflect the formation of large structures. (D) oligonucleotide fractions included in sms as determined from the chromatograms of panel A (open circles) and from the intensity of the NMR peaks of the spectra of panel C (black circles). The time constant derived from the chromatography measurements, 1 hour, concerns the formation of structures heavier than a tetramer, whereas that derived from the NMR experiments, 20 h, is related to structures too large to be detected on the NMR spectra. sms formation monitored by NMR Right after melting and fast cooling at 0°C, the NMR spectrum of C7GC4 shows three broad clusters of exchangeable protons at the positions characteristic of the imino and amino protons of C·C+ pairs (14). The poor spectral resolution suggests the presence of multiple conformations (Supplementary Data). The 10.3 ppm chemical shift and the fast exchange rate, about 40 ms, of the G imino proton establish that the guanine is not H-bonded to the cytidine N3. The number of cytidine imino protons around 15.5 ppm estimated by reference to the guanosine imino proton peak is consistent with the number of C·C+ pairs (seven to nine C·C+ pairs for two guanosine imino protons) expected for the i-motif dimers displayed in Scheme 2. Scheme 2. View largeDownload slide Plausible structures of the C7XC4 dimer (X = A, T or G). The number of intercalated C•C+ pairs in each structure is optimal. The narrow (n) and wide (w) grooves are indicated in structure (A). One residue is enough to make a loop across the i-motif narrow groove (6) and three residues allow loop formation across the wide groove (5). For each loop topology, the hairpins may be in parallel (A, C) or in head to tail orientation (B, D). Many other topologies can be imagined by changing the intercalation order of the C•C+ pairs and the number of residues in the loops. The poor resolution of the NMR spectrum of C7GC4 suggests that several dimers coexist. Scheme 2. View largeDownload slide Plausible structures of the C7XC4 dimer (X = A, T or G). The number of intercalated C•C+ pairs in each structure is optimal. The narrow (n) and wide (w) grooves are indicated in structure (A). One residue is enough to make a loop across the i-motif narrow groove (6) and three residues allow loop formation across the wide groove (5). For each loop topology, the hairpins may be in parallel (A, C) or in head to tail orientation (B, D). Many other topologies can be imagined by changing the intercalation order of the C•C+ pairs and the number of residues in the loops. The poor resolution of the NMR spectrum of C7GC4 suggests that several dimers coexist. Sms formation was followed by NMR in the same experimental conditions ([C7GC4] = 1.5 mM, pH 4.6, T = 37°C) than the gel filtration experiments described just above. At 37°C, the guanosine imino proton peak is broadened out by exchange with water (Figure 2C). The NMR spectra recorded as a function of the time after melting show a broadening of about 40 Hz and a reduction of the intensity of all the NMR peaks indicating the formation of large structures with a slow tumbling rate. The initial intensity is fully restored after heating at 100°C. At equilibrium, the reduction of the spectral intensity (∼75%) is comparable to the oligonucleotide fraction found associated in sms by chromatography, but the intensity of the NMR peaks decreases with a time constant 20 times longer than that measured by chromatography (Figure 2D). The discrepancy of the time constants measured by chromatography and NMR is merely related to the difference of the species that are taken into account by each method. The time constants derived from the chromatographic investigation are related, according to the definition given to sms, to the formation of structures whose molecular weights are larger than that of a tetramer whereas the time constants derived from the NMR experiments correspond to high molecular weight structures whose proton spectra are broadened out. Competition between dimer and tetramer formation The concomitance of the evolution of the dimer and sms fractions (Figure 1) suggests that sms formation is controlled by dimer dissociation. In order to understand how the dimer and the tetramer of C7GC4 interfere with sms formation, we measured their reduced dissociation constant at different pH. The chromatograms of C7GC4 solutions recorded as a function of the time after melting and incubation at 20°C show the following systematic features: the first multimer formed is always a dimer. When the oligonucleotide concentration is sufficiently high, one observes that a tetramer is formed at a slower rate. Tetramer formation is always followed by the appearance of sms. The dimer reduced dissociation constant, Fidimer, measured from the dimer and monomer equilibrium concentrations is plotted vs. pH in Figure 3. Fidimer is close to 2 10–8 M between pH 3.5 and 5.5 and increases sharply when the solution pH is moved away from this range of values. The C7GC4 monomer concentration, which is controlled by the monomer–dimer equilibrium, is extremely small in the range of oligonucleotide concentrations allowing tetramer formation and for this reason the tetramer–reduced dissociation constant is experimentally inaccessible. Assuming that the tetramer of C7GC4 is formed by pairing and intercalation of the seven consecutive cytidines, we considered that its reduced dissociation constant should be comparable to that of any i-motif tetramer formed by intercalation of C7 stretches and we used C7T as a model to evaluate its reduced dissociation constant. The chromatograms of C7T show two peaks at the positions expected for a monomer and a tetramer. The reduced dissociation constant, Fitetra, of [C7T]4 derived from the tetramer and monomer equilibrium concentrations is displayed versus pH in Figure 3. Between pH 4 and 5.5, the tetramer reduced dissociation constant, Fitetra, is about 200 times larger than that of the C7GC4 dimer. Due to the lower sensitivity of Fitetra to pH, the dimer and tetramer reduced dissociation constants are comparable around pH 6.5. Figure 3. View largeDownload slide Effect of pH on dimer and tetramer stability at 20°C. (A) Reduced dissociation constants versus pH of the dimer(s) of C7GC4 (black circles) and of [C7T]4 (open circles). C7T that do not associate into dimer or sms was used as a model to evaluate the stability of the tetramer of C7GC4 (Te in Scheme 1), the structure assumed to be the building block of the sms elongation pathway. (B) Computed monomer (black), dimer (green) and tetramer (blue) fractions in a 0.3 mM C7GC4 solution assuming that the tetramer of [C7GC4]4 has the same reduced dissociation constant than [C7T]4. Due to the greater sensitivity of the dimer stability to pH, the tetramer fraction increases by two magnitude orders between pH 5.5 and pH 6. Figure 3. View largeDownload slide Effect of pH on dimer and tetramer stability at 20°C. (A) Reduced dissociation constants versus pH of the dimer(s) of C7GC4 (black circles) and of [C7T]4 (open circles). C7T that do not associate into dimer or sms was used as a model to evaluate the stability of the tetramer of C7GC4 (Te in Scheme 1), the structure assumed to be the building block of the sms elongation pathway. (B) Computed monomer (black), dimer (green) and tetramer (blue) fractions in a 0.3 mM C7GC4 solution assuming that the tetramer of [C7GC4]4 has the same reduced dissociation constant than [C7T]4. Due to the greater sensitivity of the dimer stability to pH, the tetramer fraction increases by two magnitude orders between pH 5.5 and pH 6. Effect of temperature on the sms formation time We measured by chromatography on GPC 100 column the sms formation times in 0.3 mM C7GC4 solutions, pH 4.6 versus temperature. Figure 4 shows that the sms half formation time varies from 0.15 h at 55°C to about 90 h. at 20°C with an activation energy of 143 ± 30 kJ/M. Figure 4. View largeDownload slide Sms half formation times vs. temperature of C7GC4 (black circles) and C4GC7 (open circles) 0.3 mM solutions in the NAP buffer at pH 4.6. The activation energies related to sms formation, 143 ± 40 and 110 ± 25 kJ/M are comparable. Figure 4. View largeDownload slide Sms half formation times vs. temperature of C7GC4 (black circles) and C4GC7 (open circles) 0.3 mM solutions in the NAP buffer at pH 4.6. The activation energies related to sms formation, 143 ± 40 and 110 ± 25 kJ/M are comparable. Effect of the oligonucleotide concentration on sms formation Figure 5 shows that the sms half formation time decreases as the inverse of the oligonucleotide concentration. The oligonucleotide fraction associated at equilibrium in sms increases from 13% in a 28 μM C7GC4 solution to about 80% in 0.6 mM solution. The oligonucleotide fraction in tetrameric structures, about 10%, depends weakly on the oligonucleotide concentration. The monomer concentration is always negligible in the concentration range explored. Figure 5. View largeDownload slide Effect of the oligonucleotide concentration on the sms half formation time at 42°C, pH 4.6, NAP buffer. (A) Half formation time of the sms of C7GC4 (black circles) and C4TC7 (open circles) versus the oligonucleotide concentration. The slope of the lines drawn through the data points is -1. (B) Dimer (green), tetramer (blue) and sms (red) equilibrium fractions versus the oligonucleotide concentration. The monomer fraction is negligible in this range of concentrations. Figure 5. View largeDownload slide Effect of the oligonucleotide concentration on the sms half formation time at 42°C, pH 4.6, NAP buffer. (A) Half formation time of the sms of C7GC4 (black circles) and C4TC7 (open circles) versus the oligonucleotide concentration. The slope of the lines drawn through the data points is -1. (B) Dimer (green), tetramer (blue) and sms (red) equilibrium fractions versus the oligonucleotide concentration. The monomer fraction is negligible in this range of concentrations. Effect of pH on the sms formation time I-motif formation involves association of neutral and protonated cytidines. For this reason, the i-motif half formation time is minimal when the product, [fC] [fC+], of the neutral and protonated cytidine fractions is maximal, i.e when the pH is equal to the cytidine pKN3 (8). On the contrary, the sms formation time is maximal when the pH is close to the cytidine pKN3 (Figure 6). It is quite puzzling to note that the effect of pH on the sms formation kinetics (shorter sms formation time when the pH is shifted away from 4.4) is opposite to that observed for the i-motif formation kinetics (longer i-motif formation time when the pH is shifted away from 4.4). At equilibrium, the oligonucleotide fraction incorporated in sms is maximal around pH = 4.4. The equilibrium sms fraction at pH 4.4 depends weakly on the temperature, as shown by the similarity of the sms proportions measured at 37 and 50°C. Figure 6. View largeDownload slide Effect of pH on sms formation in 0.3 mM C7GC4 solution at 37 (black circles) and 50°C (open circles) in the NAP buffer. (A) sms half formation time versus pH. The sms formation time is maximal when the pH is equal to the cytidine pKN3. (B) C7GC4 fraction incorporated at equilibrium into sms at 37 and 50°C versus pH. Figure 6. View largeDownload slide Effect of pH on sms formation in 0.3 mM C7GC4 solution at 37 (black circles) and 50°C (open circles) in the NAP buffer. (A) sms half formation time versus pH. The sms formation time is maximal when the pH is equal to the cytidine pKN3. (B) C7GC4 fraction incorporated at equilibrium into sms at 37 and 50°C versus pH. Effect of the NaCl concentration on sms formation Sms formation was examined in 0.3 mM C7GC4 solutions at 42°C, pH 4.6 as a function of the NaCl concentration in a buffer containing 10 mM sodium phosphate and 10 mM sodium acetate. The sms half formation time, 100 ± 25 h in the absence of added NaCl decreases rapidly to 3 ± 0.5 h in 0.4 M NaCl. It decreases more slowly at higher NaCl concentration and reaches about 1 h in 2 M NaCl. The C7GC4 fraction associated in sms structures increases from 10% in the absence of added NaCl to 70% in 0.4 M NaCl solution and decreases slowly to 50% in 2 M NaCl. sms dissociation kinetics Figure 7 shows the evolution versus time of the chromatograms recorded during sms dissociation. The sms fraction decreases exponentially with a time constant of 0.46 h at 40°C, pH 4.6. At equilibrium the residual sms fraction is negligible. The chromatograms recorded during the first 30 min of the experiment show the formation of a tetramer (Te), of a species (Te2) whose molecular weight is about twice that of a tetramer and of monomer. The monomer half formation time, 0.5 h., is close to the sms dissociation time. At last, the chromatograms show the formation of a dimer. The initial zero slope of the plot of the dimer fraction versus time suggests that the dimer is not directly generated by sms dissociation but results of monomer dimerization. The monomer and dimer equilibrium concentrations correspond to a reduced dissociation constant Fi = 2.5 10–6 M for the dimer at 40°C, pH 4.6. Figure 7. View largeDownload slide Dissociation of the sms of C7GC4 at 40°C, pH 4.63 in the NAP buffer. (A) The oligonucleotide concentration is 3 μM. The chromatograms recorded as a function of the time show that sms dissociation first results in the formation of monomer (M), tetramer (Te) and Te2, a species whose elution time corresponds to that expected for the assembly of two tetramers. At last the chromatograms show the formation of a dimer (D). Note that the dimer fraction increases with an initial zero slope. (B) Evolution of the Te (blue), D (green), M (black) and sms (red) fractions as the function of the time. Figure 7. View largeDownload slide Dissociation of the sms of C7GC4 at 40°C, pH 4.63 in the NAP buffer. (A) The oligonucleotide concentration is 3 μM. The chromatograms recorded as a function of the time show that sms dissociation first results in the formation of monomer (M), tetramer (Te) and Te2, a species whose elution time corresponds to that expected for the assembly of two tetramers. At last the chromatograms show the formation of a dimer (D). Note that the dimer fraction increases with an initial zero slope. (B) Evolution of the Te (blue), D (green), M (black) and sms (red) fractions as the function of the time. Effect of pH and temperature on the sms dissociation time Figure 8 shows that the sms lifetime is maximal around pH 4.4 and decreases at higher and lower pH. The activation energy related to sms dissociation, Eac = 255 ± 20 kJ/M, seems nearly independent of pH (Figure 8, lower panel). The sms lifetime around pH 4.4 is extraordinary long. The extrapolation of the values measured between 42 and 60°C indicates a lifetime of about 10 years at 20°C. Figure 8. View largeDownload slide Effect of pH and temperature on the sms lifetimes in the NAP buffer. (A) Half dissociation time of the sms of C7GC4 versus temperature at pH 4.6 (black diamonds), pH 4.3 (black circles), pH 3.58 (open circles), pH 5.3 (black squares), pH 3 (open squares), pH 6 (black triangles) and pH 6.6 (open diamonds). (B) Lifetimes at 40°C versus. pH of the sms of C7GC4 (open circles) and C7 (crosses). The sms lifetimes are maximal when the pH is equal to the cytidine pKN3 and decrease symmetrically at lower or higher pH. Figure 8. View largeDownload slide Effect of pH and temperature on the sms lifetimes in the NAP buffer. (A) Half dissociation time of the sms of C7GC4 versus temperature at pH 4.6 (black diamonds), pH 4.3 (black circles), pH 3.58 (open circles), pH 5.3 (black squares), pH 3 (open squares), pH 6 (black triangles) and pH 6.6 (open diamonds). (B) Lifetimes at 40°C versus. pH of the sms of C7GC4 (open circles) and C7 (crosses). The sms lifetimes are maximal when the pH is equal to the cytidine pKN3 and decrease symmetrically at lower or higher pH. Other oligonucleotides All the oligonucleotides of the C7XC4 family, where X is either A, T, G or an ethane-diol spacer, as well as C4GC7, C4TC7, C6TC3, C7TC3 and C5TC5 associate into sms. The sms molecular weights, formation and dissociation rates and the oligonucleotide fractions associated into sms at equilibrium depend on the length of the C stretches and on the nature of the X spacer. All the CnXCm oligonucleotides examined form preferentially a dimer rather than a tetramer. The dimer may be the thermodynamically stable multimer, as this seems to be the case for C7GC4 (Figure 3) or it may be kinetically trapped during a time exceeding experimental investigations. We also observed the assembly of C7 into sms as originally reported by Yamuna Krishnan and collaborators (9). C4GC7 The reduced dissociation constants of the dimer of C4GC7 is comparable to that measured for the reference C7GC4 oligonucleotide. The formation and dissociation rates of the sms of C4GC7 (Figure 4 and Supplementary Data) are very similar to the rates measured for C7GC4. C7TC4, C4TC7 and C5TC5 At pH 4.6, the reduced dissociation constants of the dimers formed by these oligonucleotides are at least 10 times smaller than that of the reference C7GC4 oligonucleotide. This shows that the nature of the non-cytidine spacer contributes to the dimer stability and therefore interferes with sms formation. The half association times of the sms of C7TC4 and C4TC7 are 20 and 5 times, respectively longer than that of C7GC4. At 42°C, pH 4.6 the formation rate of the sms of C5TC5 is about 100 times slower than that measured for the reference C7GC4 oligonucleotide. After incubation during 5 days, in condition close to equilibrium, the chromatogram of a 5 mM C5TC5 solution shows that the dimer, tetramer and sms fractions are 60, 20 and 20%, respectively. C7 By comparison with C7GC4, the association rate of C7 into sms is faster but the lifetimes of the sms of C7 measured at different pH is much shorter (Figure 8). DISCUSSION A good knowledge of the factors that influence the sms growth is essential to control the assembly of C-rich oligonucleotides into structures with specific applications (16). The detail of the sms formation pathway is beyond the scope of this article; nevertheless some indications on the sms growing process may be derived from the experiments presented above. In summary, we have shown that all the CnXCm oligonucleotides investigated associate into sms. The sms formation rates and equilibrium proportions depend on the C stretch length and on the nature of the spacer. It is interesting to note that the influence of temperature and pH on the sms formation kinetics is quite different from the effect of these factors on the i-motif formation kinetics (5). In contrast with the formation time of i-motif tetramers, which increases as the power of −2 of the monomer concentration (5), the sms formation time increases as the power of −1 (Figure 5). Moreover, we have observed that sms disruption releases short sms fragments, among which only Te2 may be resolved on the GPC-100 column, and tetrameric structures (Te) that in turn dissociate into monomer. The similarity of the time constant for sms dissociation and monomer formation together with the exponential character of the evolution of these species indicate that tetramer dissociation is not limiting. Competition between i-motif dimers and tetramers A constant property of CnXCm oligonucleotides is their capacity to form stable i-motif dimers. It must be noticed that these dimers are certainly not parallel hemiprotonated duplexes. There is no experimental indication in the literature in favor of the existence of stable C-rich hemiprotonated duplexes in oligo-C solutions and recent attempts to enforce their formation led to the conclusion that these species must be considered as short-lived precursors of i-motif tetramers in fast exchange with the monomer (7). Two families of i-motif dimer have been described (2,6,17). One is formed by the parallel arrangement of two hairpins whose loops are on the same side of the i-motif core, the other by two hairpins in head to tail orientations (Scheme 2). Many conformers may coexist in each family, depending on the loop position with respect to the narrow and wide grooves, on the intercalation topology of the C·C+ pairs and on the loop composition. If one considers the structures displayed in Scheme 2, it is hard to imagine that they could induce or contribute to sms formation. It is therefore tempting to regard the dimer as a dead-end way in the sms formation pathway. Several examples have been reported showing that formation of i-motif dimers is kinetically favored by comparison with tetramer formation. The thermodynamically stable multimer of T2C8T2 (14), 5mCCTCTCC (5) and 5mCCTCTCTCC (7) are tetrameric. Nevertheless, after melting and fast cooling these oligonucleotides are kinetically trapped into hairpin dimers and it is observed that the dimer/tetramer conversion is extremely slow. The formation time of i-motif multimers should increase in proportion with the number of the steps leading to the stable final structure. It is therefore reasonable to assume that this is the difference of the formation order of dimer and tetramer, 2 and 3, respectively (6,8), that accounts for the faster dimer formation rate. Dissociation of [CnXCm]2 dimers is the limiting step of sms elongation The study of C7TC4 and C4TC7 gives indirect arguments suggesting that the dimer stability and the sms formation rate are correlated. The slow formation rate of the sms of these sequences may be a consequence of the stability of the dimers formed by these oligonucleotides. It may be also noticed about the stability of the dimers with a T spacer that the loop topology of the structure of Scheme 2C allows the formation of a cross-loop T·T pair (5) that could account for the enhanced stability of the duplexes with a thymidine spacer. If one supposes that the competition between dimer and tetramer hinders sms growing, it is predictable that the pH and temperature conditions favorable to tetramer formation should accelerate the sms elongation. The simulations displayed in Figure 3b indicate that in response to the effect of pH on the stability of the dimer of C7GC4, the tetramer fraction increases between pH 5.3 and 6.3 by more than two magnitude orders and suggests that the acceleration of the sms formation rate observed in this pH range (Figure 6) results of dimer dissociation. Figure 4 shows that the sms formation time decreases when the temperature is raised (Eact = 143 kJ/M for the sms of C7GC4, Figure 4). By contrast, the i-motif tetramer formation times increase with temperature (Eact = −197 and −306 kJ/M for [TC3]4 and [TC5]4, respectively (8). The lengthening of the formation time of i-motif tetramers at high temperature has been ascribed to the reduction of the proportion of an intermediate dimer or trimer species (8). By analogy with the interpretation given above to the effect of pH on the sms formation rate, the acceleration of the sms formation rate at high temperature may be a consequence of the preferential dimer dissociation at high temperature. By contrast with the formation time of i-motif tetramers that varies as the power of −2 of the monomer concentration, the sms formation time varies as the power of −1 of the concentration (Figure 5). This indicates that the sms formation kinetics is not directly controlled by the formation rate of the i-motif building blocks (Te in Scheme 1). This argues also against an sms growing process involving successive addition of monomer units to a tetrameric nucleus whose rate should be strongly dependent on the oligonucleotide concentration. A model for sms formation The sms formation pathway displayed in Scheme 1 is in part inspired by that proposed to account for C7 association into i-wire structures (9). All the arguments developed above together with the similarity of the time constants for sms formation and dimer dissociation suggest that dimer dissociation is the limiting step of the sms formation pathway. According to Scheme 1, it is noteworthy that since dimer formation is faster than tetramers formation (i.e. kondimer > kontetra), the dimer concentration should decrease at a rate k = kofdimer × kontetra/(kondimer + kontetra) which is slower than the dimer dissociation rate, kofdimer. As shown in a recent study of the formation pathway of i-motif tetramers, structures with partial or full intercalation topologies are formed at comparable rates and the species with optimal intercalation topology prevails at equilibrium due to its longer lifetime. The X spacer was inserted in the CnXCm sequence to prevent intercalation of the m + n cytidines into a single i-motif core. The tetramer formed by intercalation of the longer C stretch (Te in Scheme 1) is expected to be the thermodynamically stable species and we postulate that it is the building block of the sms growing process. We assume that after the nucleation step, association by pairing and intercalation of the terminal C4 stretches of two Te building blocks initiates sms formation. The i-motif symmetry gives to the assembly of several building blocks the same overhanging C4 terminations than the building block itself. This allows sms elongation by association of preformed Ten and Tem assemblies into structures including (n + m) building blocks (Scheme 1) or by successive association of Te blocks. The lifetime of i-motif tetramers increases strongly with the number of intercalated C·C+ pairs (8). It is therefore predictable that the connecting i-motif blocks formed by the C4 stretches should be the weak sms links. However, it is interesting to note that the lifetime of the sms of C7GC4, about 100 h at 40°C, pH 4.5 (Figure 8), is about 600 times longer than that of the i-motif tetramer of TC4 (8). This suggests that the i-motif block formed by intercalation of the C4 stretch is stabilized in the sms by staking interactions with the outer C·C+ pairs of the i-motif sections built by the C7 moiety. The structure of supramolecular i-motif assemblies C·C+ intercalation into i-motif structures requires an extension of the helical rise at the C-C steps up to 6.3 Å, a value close to the maximum accessible without base-pair disruption (18). Due to this extreme helical stretch and compactness, the i-motif should be an unbendable structure and it is predictable that the i-motif sms should look like extended stiff structures. We have little information at this time about the structure of supramolecular i-motif assemblies. The formation pathway proposed just above, supposes implicitly that the sms formed by CnXCm oligonucleotides are linear structures. Preliminary AFM investigations indicate that the structures seem markedly influenced by the nature of the X spacer. AFM images of C7TC4 samples deposited on mica show bent structures as long as micrometer with unexpected 120° bifurcations, those of C4AC7 and C4GC7 form rigid rods whereas the structures formed by of C7GC4 look like aggregated shapeless pellets. It is clear that further investigations are required to give a reliable description of the supra molecular formed by CnXCm oligonucleotides. NMR is obviously not appropriated to structural studies of large sms. Nevertheless, the long sms lifetimes should allow purification of tetramer assemblies short enough to be accessible to structural investigations by NMR methods. It should be specially interesting to determine the arrangement of the X spacer in the structure (Te2, Scheme 1) formed by association of two building blocks. It is noteworthy that regardless of their structural differences, the G-quartet (19–21) and i-motif tetramers have similar formation pathways (8,15) and associate into supramolecular structures. While it is assumed that i-motif sms are generated by the association of incompletely intercalated preformed tetrameric building blocks (9), G-wire elongation is described as a process involving association of parallel duplexes to an out-of–register G-quadruplex matrix (22). The lifetimes of partially matched G-quadruplexes and i-motif multimers may be extremely long (6,8,23). For this reason, the formation of both tetramer families is conditioned by the evolution of species that are kinetically trapped (8,15). It is therefore predictable that as observed for the supramolecular assemblies of i-motif, the kinetic trapping of mismatched species is also a crucial limitation to the elongation of G-wire structures. The excellent yield of automated chemical DNA synthesis together with the availability of a large variety of artificial residues with functional specificities make DNA oligonucleotides an attractive building material for sms (24–28). The DNA supramolecular assemblies based on Watson-Crick pairs or on G-quartets must be heated at melting temperature to be dissociated. In contrast, the extreme i-motif sensitivity to pH allows dissociation of i-motif sms by a mild pH change from pH 6 to 7. This property may be extremely interesting in assemblies including standard DNA and i-motif sections by allowing a conformational change triggered by pH. ACKNOWLEDGEMENTS Yamuna Krishna is gratefully acknowledged for discussion and information on i-wire formation. Eric Lesniewska is acknowledged for communication of AFM images. FUNDING Funding for open access charge: Centre National de la Recherche Scientifique. Conflict of interest statement. None declared. REFERENCES 1. Gehring K , Leroy J.-L, Guéron M. A tetrameric DNA structure with protonated cytidine-cytidine base pairs. Nature  ( 1993) 363: 561– 565. Google Scholar CrossRef Search ADS PubMed  2. Nonin S , Phan AT, Leroy J.-L. Solution structure and base-pair opening kinetics of the i-motoif dimer of d(5mCCTTTACC): a non canonical structure with possible role in chromosome stability. Structure  ( 1997) 5: 1231– 1246. Google Scholar CrossRef Search ADS PubMed  3. Phan AT , Guéron M, Leroy J.-L. The solution structure and internal motions of a fragment of the cytidine-rich strand of the human telomere. J. Mol. Biol.  ( 2000) 299: 123– 144. Google Scholar CrossRef Search ADS PubMed  4. Nonin S , Leroy J.-L. Structure and conversion kinetics of a bi-stable DNA i-motif: broken symmetry in the [d(5mCCTCC)]4 tetramer. J. Mol. Biol  ( 1996) 261: 399– 414. Google Scholar CrossRef Search ADS PubMed  5. Leroy J.-L . T.T pair intercalation and duplex interconversion within i-motif tetramers. J. Mol. Biol.  ( 2004) 333: 25– 139. 6. Canalia M , Leroy J.-L. Structure, internal motions and association-dissociation kinetics of the i-motif dimer of d(5mCCTCACTCC). Nucleic Acids Res.  ( 2005) 33: 5471– 548. Google Scholar CrossRef Search ADS PubMed  7. Canalia M , Leroy J.-L. [5mCCTCTCTCC]4: an i-motif tetramer with intercalated T·T pairs. J. Am. Chem. Soc.  ( 2009) 131: 12870– 12871. Google Scholar CrossRef Search ADS PubMed  8. Leroy J.-L. The formation pathway of i-motif tetramers. Nucleic Acids Res.  ( 2009) 37: 4127– 4134. Google Scholar CrossRef Search ADS PubMed  9. Ghodke HB , Krishnan R, Vignesh K, Kumar P, Narayana C, Krishnan Y. The I-Tetraplex building block: Rational design and controlled fabrication of robust 1D DNA scaffolds through non-Watson-Crick interactions. Angew. Chem. Int. Ed.  ( 2007) 46: 2646– 2649. Google Scholar CrossRef Search ADS   10. Cantor CR , Warshaw MM. Oligonucleotide interactions. III. circular dichroism studies of the conformation of deoxyoligonucleotides. Biopolymers  ( 1970) 9: 1059– 1077. Google Scholar CrossRef Search ADS PubMed  11. Bhatia D , Mehtab S, Krishnan R, Indi S, Basu A, Krishnan Y. Icosahedral DNA Nanocapsules by Modular Assembly. Angew Chem.  ( 2009) 48: 4134– 4137. Google Scholar CrossRef Search ADS   12. Plateau P , Guéron M. Exchangeable proton NMR without base-line distortion, using strong pulse sequences. J. Amer. Chem. Soc.  ( 1982) 104: 7310– 7311. Google Scholar CrossRef Search ADS   13. Guéron M , Plateau P, Decorps M. Solvent signal suppression in NMR. Prog. NMR spectr.  ( 1991) 23: 135– 209. Google Scholar CrossRef Search ADS   14. Leroy J.-L , Gehring K, Kettani A, Guéron M. Acid multimers of oligo-cytidine strands: stoichiometry, base-pair characterization and proton exchange properties. Biochemistry  ( 1993) 32: 6019– 6031. Google Scholar CrossRef Search ADS PubMed  15. Bardin C , Leroy J.-L. The formation pathway of tetramolecular G-quadruplexes. Nucleic Acids Res.  ( 2008) 36: 477– 488. Google Scholar CrossRef Search ADS PubMed  16. Davis AV , Yeh RM, Raymond KN. Supramolecular assembly dynamics. Proc. Natl Acad. Sci. USA  ( 2002) 99: 4793– 4796. Google Scholar CrossRef Search ADS   17. Gallego J , Chou S.-H, Reid B. Centromeric pyrimidine strands fold into an intercalated motif by forming a double hairpin with a novel T:G:G:T tetrad: solution structure of the d(TCCCGTTTCCA) dimer. J. Mol. Biol.  ( 1997) 273: 840– 856. Google Scholar CrossRef Search ADS PubMed  18. Lebrun A , Lavery R. Modelling extreme stretching of DNA. Nucleic Acids Res.  ( 1996) 24: 2260– 2267. Google Scholar CrossRef Search ADS PubMed  19. Lane AN , Chaires JB, Gray RD, Trent JO. Stability and kinetics of G-quadruplex structures. Nucleic Acids Res.  ( 2008) 36: 5482– 5515. Google Scholar CrossRef Search ADS PubMed  20. Burge S , Parkinson GN, Hazel P, Todd AK, Neidle S. Quadruplex DNA: sequence, topology and structure. Nucleic Acids Res.  ( 2006) 34: 5402– 5415. Google Scholar CrossRef Search ADS PubMed  21. Davis T , Spada GP. Supramolecular architectures generated by self-assembly of guanosine derivatives. Chem. Soc. Rev.  ( 2007) 36: 296– 313. Google Scholar CrossRef Search ADS PubMed  22. Marsh TC , Vesenka J, Henderson E. A new DNA nanostructure, the G-wire, imaged by scanning probe microscopy. Nucleic Acids Res.  ( 1995) 23: 696– 700. Google Scholar CrossRef Search ADS PubMed  23. Mergny JL , De Cian A, Ghelab A, Saccà B, Lacroix L. Kinetics of tetramolecular quadruplexes. Nucleic Acids Res.  ( 2005) 33: 81– 94. Google Scholar CrossRef Search ADS PubMed  24. Krishnan-Ghosh Y , Dongsheng LD, Balasubramanian S. Formation of an interlocked quadruplex dimer by d(GGGT). J. Am. Chem. Soc.  ( 2004) 126: 11009– 11016. Google Scholar CrossRef Search ADS PubMed  25. Miyoshi D , Karimata H, Wang ZM, Koumoto K, Sugimoto N. Artificial G-wire switch with 2,2′-bipyridine units responsive to divalent metal ions. J. Am. Chem. Soc.  ( 2007) 129: 5919– 5925. Google Scholar CrossRef Search ADS PubMed  26. Niemeyer CM . Self-assembled nanostructures based on DNA: towards the development of nanobiotechnology. Curr. Opin. Chem. Biol.  ( 2000) 4: 609– 618. Google Scholar CrossRef Search ADS PubMed  27. Shen Z , Yan H, Wang T, Seeman NC. Paranemic crossover DNA: a generalized Holliday structure with applications in nanotechnology. J. Am. Chem. Soc.  ( 2004) 126: 1666– 1674. Google Scholar CrossRef Search ADS PubMed  28. Pitchiaya S , Krishnan Y. First blueprint, now bricks: DNA as construction material on the nanoscale. Chem. Soc. Rev.  ( 2006) 35: 1111– 1121. Google Scholar CrossRef Search ADS PubMed  © The Author(s) 2010. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. TI - [C7GC4]4 Association into supra molecular i-motif structures JF - Nucleic Acids Research DO - 10.1093/nar/gkq102 DA - 2010-02-25 UR - https://www.deepdyve.com/lp/oxford-university-press/c7gc4-4-association-into-supra-molecular-i-motif-structures-HBSSubch0n SP - 3817 EP - 3826 VL - 38 IS - 11 DP - DeepDyve ER -