Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 7-Day Trial for You or Your Team.

Learn More →

<i>Coot</i>: model-building tools for molecular graphics

Coot: model-building tools for molecular graphics research papers Acta Crystallographica Section D Biological Crystallography ISSN 0907-4449 Received 26 February 2004 Paul Emsley* and Kevin Cowtan CCP4mg is a project that aims to provide a general-purpose Accepted 4 August 2004 tool for structural biologists, providing tools for X-ray structure solution, structure comparison and analysis, and York Structural Biology Laboratory, University of publication-quality graphics. The map-®tting tools are avail- York, Heslington, York YO10 5YW, England able as a stand-alone package, distributed as `Coot'. Correspondence e-mail: [email protected] 1. Introduction Molecular graphics still plays an important role in the deter- mination of protein structures using X-ray crystallographic data, despite on-going efforts to automate model building. Functions such as side-chain placement, loop, ligand and fragment ®tting, structure comparison, analysis and validation are routinely performed using molecular graphics. Lower resolution (d worse than 2.5 A) data in particular need min interactive ®tting. The introduction of FRODO (Jones, 1978) and then O (Jones et al., 1991) to the ®eld of protein crystallography was in each case revolutionary, each in their time breaking new ground in demonstrating what was possible with the current hardware. These tools allowed protein crystallographers to enjoy what is widely held to be the most thrilling part of their work: giving birth (as it were) to a new protein structure. The CCP4 program suite (Collaborative Computational Project, Number 4, 1994) is an integrated collection of software for macromolecular crystallography, with a scope ranging from data processing to structure re®nement and validation. Until recently, molecular graphics had not been part of the suite. With the recent computational and graphical performance of relatively cheap hardware, the time had arrived for CCP4to provide graphical functionality for knowledge-based (semi-)- automatic building using powerful modern languages in a ¯exible extendible package. CCP4mg (Potterton et al., 2004) is an initiative by CCP4 to provide libraries and a molecular graphics application that is a popular system for represent- ation, modelling, structure determination, analysis and validation. The aim is to provide a system that is easy to use and a platform for developers who wish to integrate macro- molecular computation with a molecular-graphics interface. There are several modules to such graphical functionality; the protein model-building/map-®tting tools described here are only a part. These tools are available as a stand-alone software package, Coot. A map-®tting program has to provide certain functionality, which is not required by a molecular-display program. These functions include symmetry coordinates, electron-density map contouring and the ability to move the coordinates in various ways, such as model idealization or according to side-chain # 2004 International Union of Crystallography Printed in Denmark ± all rights reserved rotamer probabilities. 2126 doi:10.1107/S0907444904019158 Acta Cryst. (2004). D60, 2126±2132 research papers The map-®tting and model-building functions described 2.3. Interface to REFMAC here have a functionality broadly similar to that of programs On reading an MTZ ®le one can optionally assign para- such as O, Xtalview from X®t (McRee, 1999) or QUANTA meters for running REFMAC (Murshudov et al., 1997). (Accelrys, San Diego, CA, USA). However, in the spirit of the REFMAC is a program of the CCP4 suite for maximum- CCP4 program suite, it is possible for others to read and likelihood-based macromolecular re®nement. After a period modify the program. of interactive model building, the user can choose to use Coot attempts (generally speaking) to provide more trans- REFMAC to re®ne the current coordinates (in combination parency, ease of use, better extendability, (semi-)automated with MTZ parameters). Coot blocks until REFMAC has model-building methods and convenient integration with terminated and then automatically reads the newly generated programs of the CCP4 suite. (re®ned) coordinates and MTZ ®le, from which a map is generated (and displayed). 2. Program functions 2.4. Rigid-body refinement Coot has been substantially built around two major libraries: Clipper library functions provide easy access to the map mmdb (Krissinel et al., 2004), a library for the handling of gradients. For a selected coordinate set, the map gradients at macromolecular coordinates, and Clipper (Cowtan, 2002, the atom centres are averaged. A shift is applied (to all the 2003), a library for crystallographic objects and computation selected atoms) that is some simple fraction of the average thereof. The various functions of Coot are split into `stand- gradient. The rotational component of the rigid-body re®ne- alone' classes in the sense that an attempt has been made to ment is generated in the following manner: the rotations to be minimize the dependence of the classes on anything other calculated ( , and ) are (small) rotations around the x y z than the above libraries. With portability in mind, special coordinate axes, the centre of rotation (V) being the centre of effort was made not to introduce GUI dependences into the the rotating atoms. Let V be the projection on to the XY interface to Coot's library of tools. plane of the vector between the position of atom i and V, the c c Coot is event-driven; functions are only run as a result of unit vector being V . The dot product of the gradient with V p p i i user action (typically moving or clicking the mouse). provides d . The required angle is arctan…d =jVj†. These angles are available for each atom and they are averaged to obtain three 2.1. Symmetry perpendicular rotations: , and . These angle transfor- x y z mations are applied to the coordinates. The application of Coordinate symmetry is recomputed and redisplayed at transformations continues until the average shift length is less every recentre event. For each molecule for which the user than 0.0005 A. wishes to display symmetry, symmetry atoms are displayed This is a reasonable approach for much of a protein's within a particular distance criterion of the display centre. By structure, but could behave badly where there is a combina- using a set of pre-computed guide points that mark the extents tion of relatively heavy and light atoms, such as sulfates or of the molecule and applying the symmetry operators and cell methionines. This problem could be countered by weighting shifts to these guide points, a set of operator indexes and cell the atom-density score by the atomic weight. shifts are generated that may contain symmetry-related atoms close to the screen centre (where `close' is de®ned by a user- settable parameter). For each of these sets, all atoms in the 2.5. Rotamers molecule are transformed and a check is made for each to see The rotamer library used in Coot is the backbone- if it is within the symmetry display radius of the position at the independent library of Dunbrack & Cohen (1997). It is formed centre of the screen. Thus, symmetry is kept current and from a reasonably large sample set (850 chains), is reasonably relevant to the current display centre. up to date (May 2002) and provides a more accurate estima- tion of the population of rare rotamers. The Coot function `Auto-®t Rotamer' takes a set of most 2.2. Electron density likely rotamers for a particular side chain and generates Because Coot is based on the Clipper libraries, it is easy to coordinates for each rotamer. Each test rotamer is then rigid- generate maps by reading a ®le of structure factors that body re®ned and the ®nal position is scored according to the contain phase information (typically an MTZ ®le). Density is ®t to the density (the residue's backbone atoms are included in not limited to any particular part of the unit cell; the relevant the set of re®ned atoms). The best ®t rotamer is chosen and symmetry-related density is generated (and then contoured) replaces the previous coordinates. automatically using Clipper functionality. The electron- density maps can be simply recontoured (provoked by script 2.6. Regularization and refinement or keyboard or mouse events) at a different level using a predetermined increment. Every map (displayed or un- Molecular-graphics model building requires the ability to displayed) is regenerated and contoured: this process is not regularize (`idealize') the coordinates of the model. In order to optimally fast but simpli®es the user interface. do so, the ideal values of the geometry of the macromolecule Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2127 research papers should be known. These ideal values can come in various position is determined from the mean position of the grid forms. The interface in Coot reads the mmCIF dictionaries of coordinates of the cluster. This position is then optimized by REFMAC, which de®ne idea values and estimated standard re®ning the position to the local maximum as determined by deviations for bond lengths, angles, torsions planes and chiral cubic interpolation of the map. A map sphericity test is then centres. Coot uses the Polak±Ribiere variant of the BFGS applied; the variance of the cubic interpolated electron density (Broyden±Fletcher±Goldfarb±Shanno) conjugate-gradient at points 0.3, 0.6 and 0.9 A from the local maximum in positive multi-variable function minimizer to optimize the coordinates. and negative offsets along the x, y and z axes are determined. The analytical gradient derivations are described in The variances are summed and must be lower than a user- 2 ÿ6 Appendix A. changeable cutoff (default 0.07 e A ). The successful posi- 2.6.1. Fitting to the map. As described above, the map tions are then compared with the coordinates of the protein's gradients are provided by a Clipper function. These map O and N atoms. If the distance is between user-changeable gradients (at the positions of the atom centres) are simply criteria (default 2.4±3.4 A) then the position is accepted as a multiplied by a (user-changeable) scaling factor and added to solvent O atom and (optionally) added to the protein model. the geometric terms to de®ne the target function (this is called `Re®nement' in Coot). 2.8. Add terminal residue Given the selection of a terminal residue (which also could merely be the start of a gap of unplaced residues), two residue- 2.7. Finding ligands type independent randomly selected '/ pairs are made from A map can be masked by a set of coordinates (typically Clipper's Ramachandran distribution of '/ pairs. These those of the currently determined atoms of the protein angles are used to generate positions of C, C , O and N main- model). This approach leaves a map that has positive density chain atoms for the neighbouring two residues using the at places where there are no atoms to represent that density peptide geometry. This set of atoms then undergo rigid-body (similar, in fact, to an F ÿ F map). This masked map is o c re®nement to optimize the ®t to the map. The score of the ®t searched for `clusters' of density above a particular level. The and the positions of the atoms are recorded. This procedure is clustering of the grid points of the asymmetric unit into then repeated a number of times (by default 100). The main- potential ligand sites is performed conveniently using a chain atoms of the neighbouring residue's best-®t atoms are recursive neighbour search of the map. The clusters are sorted then offered as a position of the neighbouring residue (the according to size and electron-density value. Eigenvalues and atoms of the next neighbouring residue are discarded). eigenvectors are calculated for each cluster of grid points. Similarly, the eigenvalues and eigenvectors of the search 2.9. Skeletonization and C building ligands (there can of course be just one search ligand) are computed (the parameters being the positions of the atom Coot uses a Clipper map to generate and store the skeleton. centres). The eigenvalues of the ligands are compared with the This approach is convenient because, like electron-density eigenvalues of each of the electron-density clusters and if they maps, the skeleton can be displayed `on the ¯y' anywhere in are suf®ciently similar the ligand is placed into the cluster by the crystal (i.e. it is not limited to a precalculated region). The matching the centre of the test ligand and the centre of the Clipper skeletonization algorithm is similar to that employed cluster. The ligand is oriented in each of the four different in DM from CCP4 (Cowtan, 1994). A skeleton `bond' (bone) orientations that provide coinciding eigenvectors and then is drawn between neighbouring map grid points if both parts rigid-body re®ned and scored. The score is simply the sum of are marked as skeleton points. the electron density at the atom centres. The score at each site The skeleton can be further trimmed by recursive tip for each different ligand is compared and the best ®t (highest removal (a tip being a grid point with one or zero neighbours). score with suf®cient fraction of atoms in positive density after This process removes side chains and, potentially, parts of the the rigid-body re®nement) is chosen. This last check ensures termini, but provides an easy means of identifying the fold and that oversized ligands are not ®tted into small clusters. non-crystallographic symmetry. 2.7.1. Flexible ligands. Instead of having a series of Like some validation (Kleywegt, 1997) and other attempts different ligand compounds, the search ligands can be gener- at automated model building (Morris et al., 2002; Old®eld & ated from a single ligand that has rotatable bonds. The ligand Hubbard, 1994; Old®eld, 2001), a likelihood distribution for dictionary provides a description of the geometry of the ligand the pseudo-torsion angle C (n)ÐC (n + 1)ÐC (n + 2)Ð including torsions. These torsions are randomly sampled for a C (n +3) versus the angle C (n + 1)ÐC (n + 2)ÐC (n +3) number of trials (by default 1000) to provide coordinates that has been generated from high-resolution structures in the can be checked against the potential ligand sites as described PDB (Berman et al., 2000) (Fig. 1). Once at least three C above. An enhancement would be to allow the determination atoms have been placed, this is used as prior knowledge in the of the number of trials to depend on the number of torsions. placement of the next C position in the following manner. 2.7.2. Finding water molecules. The electron density is Skeleton points between 2.4 and 3.4 A from the current C clustered as described for ligands. For clusters that have a position (which has an associated nearby skeleton point) are volume below a certain upper limit (4.2 A , which stops water selected. These skeleton points are tested for direct connec- molecules being placed in multi-atom ligand sites) a starting tivity to the current skeleton point. Skeleton points that are 2128 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132 research papers C coordinates are converted to main-chain coordinates in a manner similar to that previously described (Jones & Thirup, 1986; Esnouf, 1997). APPENDIX A Regularization and refinement derivatives The function that we are trying to minimize is S, where S ˆ S ‡ S ‡ S ‡ S : bond angle torsion plane Let us take these four parts in turn. A1. Bonds bonds S ˆ …b ÿ b † ; bond i 0 iˆ1 where b is the ideal length (from the dictionary) of the ith bond, b is the bond vector and b is its length. i i @S @S @b @b i i i i ˆ ˆ‰2…b ÿ b †Š ; i 0 @x @b @x @x m i m m Figure 1 C pseudo-torsion angle versus opening angle for proteins in the PDB used in the likelihood assignment of potential C positions. 2 2 2 1=2 b ˆ‰…x ÿ x † ‡…y ÿ y † ‡…z ÿ z † Š : i m k m k m k Therefore directly connected are assigned a score of 100; those that are unconnected are assigned a score of 0.1. For each selected @b 1 1 …x ÿ x † i m k ˆ 2…x ÿ x †ˆ m k skeleton point, a test point is then generated 3.8 A from the @x 2 b b m i i current C position in the direction of skeleton point. A C and pseudo-torsion angle and angle pair are generated from the position of the test point, the current C position and the two @S …x ÿ x † i m k ˆ 2‰b ÿ b Š : previously assigned C positions. This pseudo-torsion angle i 0 @x b m i and angle pair are used to generate a score by looking up the value in the internal representation of Fig. 1 using cubic interpolation. This value is combined with the skeleton-based A2. Angles score for this particular test point. This procedure is then repeated in a `look-ahead' manner, assuming that the test We are trying to minimize S , where (for simplicity, the angle point is a member of the set of four C positions generating weights have been omitted) the C pseudo-torsion/angle pair. The most likely solution for angles the look-ahead is combined with the score for the current test S ˆ … ÿ  † : angle i 0 point. The test points are then sorted by combined score and iˆ1 interactively offered as potential positions for the next C Angle  is contributed to by atoms k, l and m: atom, the positions with the best score being offered ®rst. Occasionally (usually as a result of a positional error in the cosˆ…a b†=…ab†; current C position), 3.8 A is the wrong distance to the next where a is the bond of atoms k and l [(x ÿ x ), (y ÿ y ), k l k l correct C position; thus the user is allowed to change the Ê (z ÿ z )] and b is the bond of atoms l and m [(x ÿ x ), length to something other than 3.8 A. k l m l (y ÿ y ), (z ÿ z )]. Note that the vectors point away from the The depth of the look-ahead in the current implementation m l m l middle atom l. is at level 1 but could trivially be extended (in tests, a level 2 Therefore, look-ahead was better but took too long to be considered pleasantly interactive). ˆ a cos…P†; …1† This algorithm has room for improvement: for example, by considering the value of the density at the test point and along where the C pseudo-bond, one-third and two-thirds of the way Pˆ…a b†=…ab†: along the bond (corresponding to positions that are close to the peptide C and N atoms; Old®eld, 2003). Using the chain rule, Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2129 research papers @ @ @P A4. Angles: an end atom (atoms k or m) ˆ : …2† @x @P@x k k This case is simpler because there are no cross-terms in @R=@x and @Q=@x . Given that we are only interested in  in the range 0 ! , k k @R …x ÿ x† @ 1 k l ˆ : ˆÿ : …3† @x ab @P sin Again using the chain rule, and @P @R @Q @Q ˆ Q ‡ R ; …4† ˆ…x ÿ x†; m l @x @x @x @x k k k where and so Q ˆ a b; …5† @ 1 …x ÿ x † x ÿ x l k m l ˆÿ cos‡ : …8† @x sin a ab R ˆ 1=…ab†: …6† A5. Torsion angles The torsion angle is the angle between a  b and b  c A3. Angles: the middle atom (Fig. 2) and this can be written as A middle atom is somewhat more tricky than an end atom ^ ^ ^ because the derivatives of ab and a b are not so trivial. Let us arctan …a b c†= ÿa c‡…a b†…b c† ; …9† change the indexing so that we are actually talking about the ^ ^ where b is a unit vector in the direction of b, b ˆ b=b. middle atom, l. This de®nition of the torsion angle is used rather than the Differentiating (6) gives more common de®nition, which uses three cross-products, @R 1 @a 1 @b because our version and its derivatives are faster to calculate. ˆÿ b ÿ a : …7† 2 2 @x @x @x …ab† …ab† Let us split the expression up into tractable portions; the l l l evaluation of  in the program will combine these expressions, @a=@x here is exactly the same as for bonds, starting at the end (the most simple). @a x ÿ x l k From the primatives, ˆ : @x a a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; x 2 1 x 3 2 x 4 3 x x x x x x Similarly, a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; y 2 1 y 3 2 y 4 3 @b x ÿ x y y y y y y l m ˆ : @x a a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; z 2 1 z 3 2 z 4 3 z z z z z z Therefore, substituting these equations into (7) gives @R x ÿ x x ÿ x  ˆ arctan…D†; l k l m ˆÿ ÿ : 3 3 @x a b ab where Turning to Q, recall (5); therefore a…b c†=b D ˆ : Q ˆ …x ÿ x†…x ÿ x†‡…y ÿ y†…y ÿ y† ÿa c‡…a b†…b c†=b k l m l k l m l ‡…z ÿ z†…z ÿ z† k l m l So and hence @Q ˆÿ…x ÿ x†ÿ…x ÿ x†: k l m l @x Substituting all the above into (4) gives @P x ÿ x x ÿ x ÿ…x ÿ x†ÿ…x ÿ x† l k l m k l m l ˆ…a b†ÿ ÿ ‡ : 3 3 @x a b ab ab Combining this expression and (3) into (2) we obtain @ 1 @P ˆ : @x sin @x l l Figure 2 Nomenclature used for torsion angles. 2130 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132 research papers @ @ @D x ÿ x 2…x ÿ x † @L 2 P P P P 3 2 3 2 ˆ ; …10† ˆÿ ˆÿ : 3 4 @x @D@x @x b b b P P 1 1 P The derivative with respect to x has the opposite sign. @ 1 @D 2 ˆ : …11† Notice that b involves only atoms P and P , so that the 2 3 @x 1‡ D @x P P 1 1 derivates of L with respect to the P and P coordinates are 1 4 zero. Let E ˆ a…b c†=b; A6. Torsion angles: @E/@x terms ÿ1 Recall that Fˆÿ‰Š a c‡…a b†…b c†=b ; E ˆ a…b c†=b: F ˆ 1=G; …12† Let Gˆÿa c‡…a b†…b c†=b ; M ˆ a…b c†; i.e. Hˆÿa c; E ˆ M=b: J ˆ a b; Differentiating gives @E M @b 1 @M K ˆ b c; ˆÿ ‡ ; @x b @x b@x P P P 3 3 3 L ˆ 1=b : where, as for bonds, Differentiating (12) gives @b x ÿ x P P 3 2 ˆ : @F 1 @G @x b ˆÿ : 3 @x G @x P P 1 1 However, note again that the derivative of b is zero for atoms Substituting for the derivative in (10), P and P , i.e. for atoms P and P 1 4 2 3 M…x ÿ x † @E 1 @M P P @ 1 @E @F 3 2 ˆÿ ‡ ; ˆ F ‡ E : @x b b@x @x 1‡ D @x @x P P 3 3 P P P 1 1 1 but for atoms P and P 1 4 We also have @E 1 @M G ˆ H ‡ JKL: ˆ ; @x b@x P P 1 1 Differentiating this gives M ˆ a …b c ÿ b c †‡ a …b c ÿ b c †‡ a …b c ÿ b c †: @G @H @K @J @L x y z z y y z x x z z x y y x ˆ ‡ JL ‡ KL ‡ JK : @x @x @x @x @x P P P P P 1 1 1 1 1 So here are the primitives of M ˆ a…b c† The H, J, K and L derivatives are @M ˆÿ…b c ÿ b c †; y z z y @x Hˆÿa cˆÿa c ÿ a b ÿ a c ; x x y y z z @M @H @H @H @H ˆ…b c ÿ b c †‡…a c ÿ a c †; y z z y y z z y ˆ c ; ˆÿc ; ˆ a ; ˆÿa ; x x x x @x @x @x @x @x 2 P P P P 1 2 3 4 @K @K @K @M ˆ 0; ˆÿc ; ˆ c ‡ b ; x x x ˆ…a c ÿ a c †‡…b a ÿ b a †; @x @x @x z y y z y z z y P P P 1 2 3 @x @K @J @J ˆ b ; ˆÿb ; ˆ b ÿ a ; x x x x @x @x @x @M P P P 4 1 2 ˆ…a b ÿ a b †; y z z y @J @J @x ˆ a ; ˆ 0: @x @x P P 3 4 @M ˆÿ…b c ÿ b c †; The @b=@x terms are just like the bond derivatives, z x x z @y @L @L @b ˆ ; @M @x @b @x P P 1 1 ˆ…b c ÿ b c †‡…a c ÿ a c †; z x x z z x x z @y i.e. Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2131 research papers @M Recall the equation of a plane: ax + by + cz + d = 0. Firstly, ˆÿ…a c ÿ a c †‡…b a ÿ b a †; z x x z z x x z @y the centres of the sets of atoms, x , y , z , are determined. P cen cen cen The plane is moved so that it crosses the origin and therefore @M d = 0 (it is moved back later). The problem then involves three ˆÿ…b a ÿ b a †; z x x z equations, three unknowns and an eigenvalue problem, with @y the smallest eigenvalue corresponding to the best-®t plane. The least-squares planes of the plane restraints are recal- @M ˆÿ…b c ÿ b c †; x y y x culated at every iteration. @z The authors thank Garib Murshudov, Eleanor J. Dodson, @M Jack Quine and the many Coot testers. KC is supported by The ˆ…b c ÿ b c †‡…a c ÿ a c †; x y y x x y y x @z Royal Society (grant No. 003R05674). PE is funded by BBSRC grant No. 87/B17320. @M ˆÿ…a c ÿ a c †‡…a b ÿ a b †; x y y x y x x y @z References Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., @M Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids ˆÿ…a b ÿ a b †: y x x y @z Res. 28, 235±242. Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760±766. Cowtan, K. (1994). Jnt CCP4/ESF±EACBM Newsl. Protein Crystal- logr. 31, 34±38. A7. Combining terms Cowtan, K. (2002). Jnt CCP4/ESF±EACBM Newsl. Protein Crystal- Combining, we obtain the following expression for the logr. 40. derivative of torsion angle  in terms of the primitive deri- Cowtan, K. (2003). Crystallogr. Rev. 9, 73±80. Dunbrack, R. L. Jr & Cohen, F. E. (1997). Protein Sci. 6, 1661±1681. vates, Esnouf, R. M. (1997). Acta Cryst. D53, 665±672. @ 1 @D Jones, T. A. (1978). J. Appl. Cryst. 11, 268±272. ˆ ; Jones, T. A., Cowan, S., Zou, J.-Y. & Kjeldgaard, M. (1991). Acta @x …1‡ tan †@x P P 1 1 Cryst. A47, 110±119. Jones, T. A. & Thirup, S. (1986). EMBO J. 5, 891±822. where Kleywegt, G. J. (1997). J. Mol. Biol. 273, 371±376. "# ! Krissinel, E. B, Winn, M. D., Ballard, C. C., Ashton, A. W., Patel, P., @D @E E @H @K @J @L ˆ F ÿ ‡ JL ‡ KL ‡ JK : Potterton, E. A., McNicholas, S. J., Cowtan, K. D. & Emsley, P. @x @x G @x @x @x @x P P P P P P 1 1 1 1 1 1 (2004). Acta Cryst. D60, 2250±2255. McRee, D. E. (1999). J. Struct. Biol. 125, 156±165. Morris, R. J., Perrakis, A. & Lamzin, V. S. (2002). Acta Cryst. D58, 968±975. A8. Planes Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Acta Cryst D53, 240±255. Old®eld, T. J. (2001). Acta Cryst. D57, 82±94. N Old®eld, T. J. (2003). Acta Cryst. D59, 483±491. planes atoms P P Old®eld, T. J. & Hubbard, R. E. (1994). Proteins Struct. Funct. Genet. S ˆ e ; plane ij iˆ1 jˆ1 18, 324±337. Potterton, L., McNicholas, S., Krissinel, E., Gruber, J., Cowtan, K., where e is the distance of the ith plane restraint's jth atom ij Emsley, P., Murshudov, G. N., Cohen, S., Perrakis, A. & Noble, M. from the ith plane restraint's least-squares plane. (2004). Acta Cryst. D60, 2288±2294. 2132 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132 http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Acta Crystallographica Section D Unpaywall

<i>Coot</i>: model-building tools for molecular graphics

Acta Crystallographica Section DNov 26, 2004

Loading next page...
 
/lp/unpaywall/i-coot-i-model-building-tools-for-molecular-graphics-SD3pk0AvMd

References (0)

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

Publisher
Unpaywall
ISSN
0907-4449
DOI
10.1107/s0907444904019158
Publisher site
See Article on Publisher Site

Abstract

research papers Acta Crystallographica Section D Biological Crystallography ISSN 0907-4449 Received 26 February 2004 Paul Emsley* and Kevin Cowtan CCP4mg is a project that aims to provide a general-purpose Accepted 4 August 2004 tool for structural biologists, providing tools for X-ray structure solution, structure comparison and analysis, and York Structural Biology Laboratory, University of publication-quality graphics. The map-®tting tools are avail- York, Heslington, York YO10 5YW, England able as a stand-alone package, distributed as `Coot'. Correspondence e-mail: [email protected] 1. Introduction Molecular graphics still plays an important role in the deter- mination of protein structures using X-ray crystallographic data, despite on-going efforts to automate model building. Functions such as side-chain placement, loop, ligand and fragment ®tting, structure comparison, analysis and validation are routinely performed using molecular graphics. Lower resolution (d worse than 2.5 A) data in particular need min interactive ®tting. The introduction of FRODO (Jones, 1978) and then O (Jones et al., 1991) to the ®eld of protein crystallography was in each case revolutionary, each in their time breaking new ground in demonstrating what was possible with the current hardware. These tools allowed protein crystallographers to enjoy what is widely held to be the most thrilling part of their work: giving birth (as it were) to a new protein structure. The CCP4 program suite (Collaborative Computational Project, Number 4, 1994) is an integrated collection of software for macromolecular crystallography, with a scope ranging from data processing to structure re®nement and validation. Until recently, molecular graphics had not been part of the suite. With the recent computational and graphical performance of relatively cheap hardware, the time had arrived for CCP4to provide graphical functionality for knowledge-based (semi-)- automatic building using powerful modern languages in a ¯exible extendible package. CCP4mg (Potterton et al., 2004) is an initiative by CCP4 to provide libraries and a molecular graphics application that is a popular system for represent- ation, modelling, structure determination, analysis and validation. The aim is to provide a system that is easy to use and a platform for developers who wish to integrate macro- molecular computation with a molecular-graphics interface. There are several modules to such graphical functionality; the protein model-building/map-®tting tools described here are only a part. These tools are available as a stand-alone software package, Coot. A map-®tting program has to provide certain functionality, which is not required by a molecular-display program. These functions include symmetry coordinates, electron-density map contouring and the ability to move the coordinates in various ways, such as model idealization or according to side-chain # 2004 International Union of Crystallography Printed in Denmark ± all rights reserved rotamer probabilities. 2126 doi:10.1107/S0907444904019158 Acta Cryst. (2004). D60, 2126±2132 research papers The map-®tting and model-building functions described 2.3. Interface to REFMAC here have a functionality broadly similar to that of programs On reading an MTZ ®le one can optionally assign para- such as O, Xtalview from X®t (McRee, 1999) or QUANTA meters for running REFMAC (Murshudov et al., 1997). (Accelrys, San Diego, CA, USA). However, in the spirit of the REFMAC is a program of the CCP4 suite for maximum- CCP4 program suite, it is possible for others to read and likelihood-based macromolecular re®nement. After a period modify the program. of interactive model building, the user can choose to use Coot attempts (generally speaking) to provide more trans- REFMAC to re®ne the current coordinates (in combination parency, ease of use, better extendability, (semi-)automated with MTZ parameters). Coot blocks until REFMAC has model-building methods and convenient integration with terminated and then automatically reads the newly generated programs of the CCP4 suite. (re®ned) coordinates and MTZ ®le, from which a map is generated (and displayed). 2. Program functions 2.4. Rigid-body refinement Coot has been substantially built around two major libraries: Clipper library functions provide easy access to the map mmdb (Krissinel et al., 2004), a library for the handling of gradients. For a selected coordinate set, the map gradients at macromolecular coordinates, and Clipper (Cowtan, 2002, the atom centres are averaged. A shift is applied (to all the 2003), a library for crystallographic objects and computation selected atoms) that is some simple fraction of the average thereof. The various functions of Coot are split into `stand- gradient. The rotational component of the rigid-body re®ne- alone' classes in the sense that an attempt has been made to ment is generated in the following manner: the rotations to be minimize the dependence of the classes on anything other calculated ( , and ) are (small) rotations around the x y z than the above libraries. With portability in mind, special coordinate axes, the centre of rotation (V) being the centre of effort was made not to introduce GUI dependences into the the rotating atoms. Let V be the projection on to the XY interface to Coot's library of tools. plane of the vector between the position of atom i and V, the c c Coot is event-driven; functions are only run as a result of unit vector being V . The dot product of the gradient with V p p i i user action (typically moving or clicking the mouse). provides d . The required angle is arctan…d =jVj†. These angles are available for each atom and they are averaged to obtain three 2.1. Symmetry perpendicular rotations: , and . These angle transfor- x y z mations are applied to the coordinates. The application of Coordinate symmetry is recomputed and redisplayed at transformations continues until the average shift length is less every recentre event. For each molecule for which the user than 0.0005 A. wishes to display symmetry, symmetry atoms are displayed This is a reasonable approach for much of a protein's within a particular distance criterion of the display centre. By structure, but could behave badly where there is a combina- using a set of pre-computed guide points that mark the extents tion of relatively heavy and light atoms, such as sulfates or of the molecule and applying the symmetry operators and cell methionines. This problem could be countered by weighting shifts to these guide points, a set of operator indexes and cell the atom-density score by the atomic weight. shifts are generated that may contain symmetry-related atoms close to the screen centre (where `close' is de®ned by a user- settable parameter). For each of these sets, all atoms in the 2.5. Rotamers molecule are transformed and a check is made for each to see The rotamer library used in Coot is the backbone- if it is within the symmetry display radius of the position at the independent library of Dunbrack & Cohen (1997). It is formed centre of the screen. Thus, symmetry is kept current and from a reasonably large sample set (850 chains), is reasonably relevant to the current display centre. up to date (May 2002) and provides a more accurate estima- tion of the population of rare rotamers. The Coot function `Auto-®t Rotamer' takes a set of most 2.2. Electron density likely rotamers for a particular side chain and generates Because Coot is based on the Clipper libraries, it is easy to coordinates for each rotamer. Each test rotamer is then rigid- generate maps by reading a ®le of structure factors that body re®ned and the ®nal position is scored according to the contain phase information (typically an MTZ ®le). Density is ®t to the density (the residue's backbone atoms are included in not limited to any particular part of the unit cell; the relevant the set of re®ned atoms). The best ®t rotamer is chosen and symmetry-related density is generated (and then contoured) replaces the previous coordinates. automatically using Clipper functionality. The electron- density maps can be simply recontoured (provoked by script 2.6. Regularization and refinement or keyboard or mouse events) at a different level using a predetermined increment. Every map (displayed or un- Molecular-graphics model building requires the ability to displayed) is regenerated and contoured: this process is not regularize (`idealize') the coordinates of the model. In order to optimally fast but simpli®es the user interface. do so, the ideal values of the geometry of the macromolecule Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2127 research papers should be known. These ideal values can come in various position is determined from the mean position of the grid forms. The interface in Coot reads the mmCIF dictionaries of coordinates of the cluster. This position is then optimized by REFMAC, which de®ne idea values and estimated standard re®ning the position to the local maximum as determined by deviations for bond lengths, angles, torsions planes and chiral cubic interpolation of the map. A map sphericity test is then centres. Coot uses the Polak±Ribiere variant of the BFGS applied; the variance of the cubic interpolated electron density (Broyden±Fletcher±Goldfarb±Shanno) conjugate-gradient at points 0.3, 0.6 and 0.9 A from the local maximum in positive multi-variable function minimizer to optimize the coordinates. and negative offsets along the x, y and z axes are determined. The analytical gradient derivations are described in The variances are summed and must be lower than a user- 2 ÿ6 Appendix A. changeable cutoff (default 0.07 e A ). The successful posi- 2.6.1. Fitting to the map. As described above, the map tions are then compared with the coordinates of the protein's gradients are provided by a Clipper function. These map O and N atoms. If the distance is between user-changeable gradients (at the positions of the atom centres) are simply criteria (default 2.4±3.4 A) then the position is accepted as a multiplied by a (user-changeable) scaling factor and added to solvent O atom and (optionally) added to the protein model. the geometric terms to de®ne the target function (this is called `Re®nement' in Coot). 2.8. Add terminal residue Given the selection of a terminal residue (which also could merely be the start of a gap of unplaced residues), two residue- 2.7. Finding ligands type independent randomly selected '/ pairs are made from A map can be masked by a set of coordinates (typically Clipper's Ramachandran distribution of '/ pairs. These those of the currently determined atoms of the protein angles are used to generate positions of C, C , O and N main- model). This approach leaves a map that has positive density chain atoms for the neighbouring two residues using the at places where there are no atoms to represent that density peptide geometry. This set of atoms then undergo rigid-body (similar, in fact, to an F ÿ F map). This masked map is o c re®nement to optimize the ®t to the map. The score of the ®t searched for `clusters' of density above a particular level. The and the positions of the atoms are recorded. This procedure is clustering of the grid points of the asymmetric unit into then repeated a number of times (by default 100). The main- potential ligand sites is performed conveniently using a chain atoms of the neighbouring residue's best-®t atoms are recursive neighbour search of the map. The clusters are sorted then offered as a position of the neighbouring residue (the according to size and electron-density value. Eigenvalues and atoms of the next neighbouring residue are discarded). eigenvectors are calculated for each cluster of grid points. Similarly, the eigenvalues and eigenvectors of the search 2.9. Skeletonization and C building ligands (there can of course be just one search ligand) are computed (the parameters being the positions of the atom Coot uses a Clipper map to generate and store the skeleton. centres). The eigenvalues of the ligands are compared with the This approach is convenient because, like electron-density eigenvalues of each of the electron-density clusters and if they maps, the skeleton can be displayed `on the ¯y' anywhere in are suf®ciently similar the ligand is placed into the cluster by the crystal (i.e. it is not limited to a precalculated region). The matching the centre of the test ligand and the centre of the Clipper skeletonization algorithm is similar to that employed cluster. The ligand is oriented in each of the four different in DM from CCP4 (Cowtan, 1994). A skeleton `bond' (bone) orientations that provide coinciding eigenvectors and then is drawn between neighbouring map grid points if both parts rigid-body re®ned and scored. The score is simply the sum of are marked as skeleton points. the electron density at the atom centres. The score at each site The skeleton can be further trimmed by recursive tip for each different ligand is compared and the best ®t (highest removal (a tip being a grid point with one or zero neighbours). score with suf®cient fraction of atoms in positive density after This process removes side chains and, potentially, parts of the the rigid-body re®nement) is chosen. This last check ensures termini, but provides an easy means of identifying the fold and that oversized ligands are not ®tted into small clusters. non-crystallographic symmetry. 2.7.1. Flexible ligands. Instead of having a series of Like some validation (Kleywegt, 1997) and other attempts different ligand compounds, the search ligands can be gener- at automated model building (Morris et al., 2002; Old®eld & ated from a single ligand that has rotatable bonds. The ligand Hubbard, 1994; Old®eld, 2001), a likelihood distribution for dictionary provides a description of the geometry of the ligand the pseudo-torsion angle C (n)ÐC (n + 1)ÐC (n + 2)Ð including torsions. These torsions are randomly sampled for a C (n +3) versus the angle C (n + 1)ÐC (n + 2)ÐC (n +3) number of trials (by default 1000) to provide coordinates that has been generated from high-resolution structures in the can be checked against the potential ligand sites as described PDB (Berman et al., 2000) (Fig. 1). Once at least three C above. An enhancement would be to allow the determination atoms have been placed, this is used as prior knowledge in the of the number of trials to depend on the number of torsions. placement of the next C position in the following manner. 2.7.2. Finding water molecules. The electron density is Skeleton points between 2.4 and 3.4 A from the current C clustered as described for ligands. For clusters that have a position (which has an associated nearby skeleton point) are volume below a certain upper limit (4.2 A , which stops water selected. These skeleton points are tested for direct connec- molecules being placed in multi-atom ligand sites) a starting tivity to the current skeleton point. Skeleton points that are 2128 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132 research papers C coordinates are converted to main-chain coordinates in a manner similar to that previously described (Jones & Thirup, 1986; Esnouf, 1997). APPENDIX A Regularization and refinement derivatives The function that we are trying to minimize is S, where S ˆ S ‡ S ‡ S ‡ S : bond angle torsion plane Let us take these four parts in turn. A1. Bonds bonds S ˆ …b ÿ b † ; bond i 0 iˆ1 where b is the ideal length (from the dictionary) of the ith bond, b is the bond vector and b is its length. i i @S @S @b @b i i i i ˆ ˆ‰2…b ÿ b †Š ; i 0 @x @b @x @x m i m m Figure 1 C pseudo-torsion angle versus opening angle for proteins in the PDB used in the likelihood assignment of potential C positions. 2 2 2 1=2 b ˆ‰…x ÿ x † ‡…y ÿ y † ‡…z ÿ z † Š : i m k m k m k Therefore directly connected are assigned a score of 100; those that are unconnected are assigned a score of 0.1. For each selected @b 1 1 …x ÿ x † i m k ˆ 2…x ÿ x †ˆ m k skeleton point, a test point is then generated 3.8 A from the @x 2 b b m i i current C position in the direction of skeleton point. A C and pseudo-torsion angle and angle pair are generated from the position of the test point, the current C position and the two @S …x ÿ x † i m k ˆ 2‰b ÿ b Š : previously assigned C positions. This pseudo-torsion angle i 0 @x b m i and angle pair are used to generate a score by looking up the value in the internal representation of Fig. 1 using cubic interpolation. This value is combined with the skeleton-based A2. Angles score for this particular test point. This procedure is then repeated in a `look-ahead' manner, assuming that the test We are trying to minimize S , where (for simplicity, the angle point is a member of the set of four C positions generating weights have been omitted) the C pseudo-torsion/angle pair. The most likely solution for angles the look-ahead is combined with the score for the current test S ˆ … ÿ  † : angle i 0 point. The test points are then sorted by combined score and iˆ1 interactively offered as potential positions for the next C Angle  is contributed to by atoms k, l and m: atom, the positions with the best score being offered ®rst. Occasionally (usually as a result of a positional error in the cosˆ…a b†=…ab†; current C position), 3.8 A is the wrong distance to the next where a is the bond of atoms k and l [(x ÿ x ), (y ÿ y ), k l k l correct C position; thus the user is allowed to change the Ê (z ÿ z )] and b is the bond of atoms l and m [(x ÿ x ), length to something other than 3.8 A. k l m l (y ÿ y ), (z ÿ z )]. Note that the vectors point away from the The depth of the look-ahead in the current implementation m l m l middle atom l. is at level 1 but could trivially be extended (in tests, a level 2 Therefore, look-ahead was better but took too long to be considered pleasantly interactive). ˆ a cos…P†; …1† This algorithm has room for improvement: for example, by considering the value of the density at the test point and along where the C pseudo-bond, one-third and two-thirds of the way Pˆ…a b†=…ab†: along the bond (corresponding to positions that are close to the peptide C and N atoms; Old®eld, 2003). Using the chain rule, Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2129 research papers @ @ @P A4. Angles: an end atom (atoms k or m) ˆ : …2† @x @P@x k k This case is simpler because there are no cross-terms in @R=@x and @Q=@x . Given that we are only interested in  in the range 0 ! , k k @R …x ÿ x† @ 1 k l ˆ : ˆÿ : …3† @x ab @P sin Again using the chain rule, and @P @R @Q @Q ˆ Q ‡ R ; …4† ˆ…x ÿ x†; m l @x @x @x @x k k k where and so Q ˆ a b; …5† @ 1 …x ÿ x † x ÿ x l k m l ˆÿ cos‡ : …8† @x sin a ab R ˆ 1=…ab†: …6† A5. Torsion angles The torsion angle is the angle between a  b and b  c A3. Angles: the middle atom (Fig. 2) and this can be written as A middle atom is somewhat more tricky than an end atom ^ ^ ^ because the derivatives of ab and a b are not so trivial. Let us arctan …a b c†= ÿa c‡…a b†…b c† ; …9† change the indexing so that we are actually talking about the ^ ^ where b is a unit vector in the direction of b, b ˆ b=b. middle atom, l. This de®nition of the torsion angle is used rather than the Differentiating (6) gives more common de®nition, which uses three cross-products, @R 1 @a 1 @b because our version and its derivatives are faster to calculate. ˆÿ b ÿ a : …7† 2 2 @x @x @x …ab† …ab† Let us split the expression up into tractable portions; the l l l evaluation of  in the program will combine these expressions, @a=@x here is exactly the same as for bonds, starting at the end (the most simple). @a x ÿ x l k From the primatives, ˆ : @x a a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; x 2 1 x 3 2 x 4 3 x x x x x x Similarly, a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; y 2 1 y 3 2 y 4 3 @b x ÿ x y y y y y y l m ˆ : @x a a ˆ P ÿ P ; b ˆ P ÿ P ; c ˆ P ÿ P ; z 2 1 z 3 2 z 4 3 z z z z z z Therefore, substituting these equations into (7) gives @R x ÿ x x ÿ x  ˆ arctan…D†; l k l m ˆÿ ÿ : 3 3 @x a b ab where Turning to Q, recall (5); therefore a…b c†=b D ˆ : Q ˆ …x ÿ x†…x ÿ x†‡…y ÿ y†…y ÿ y† ÿa c‡…a b†…b c†=b k l m l k l m l ‡…z ÿ z†…z ÿ z† k l m l So and hence @Q ˆÿ…x ÿ x†ÿ…x ÿ x†: k l m l @x Substituting all the above into (4) gives @P x ÿ x x ÿ x ÿ…x ÿ x†ÿ…x ÿ x† l k l m k l m l ˆ…a b†ÿ ÿ ‡ : 3 3 @x a b ab ab Combining this expression and (3) into (2) we obtain @ 1 @P ˆ : @x sin @x l l Figure 2 Nomenclature used for torsion angles. 2130 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132 research papers @ @ @D x ÿ x 2…x ÿ x † @L 2 P P P P 3 2 3 2 ˆ ; …10† ˆÿ ˆÿ : 3 4 @x @D@x @x b b b P P 1 1 P The derivative with respect to x has the opposite sign. @ 1 @D 2 ˆ : …11† Notice that b involves only atoms P and P , so that the 2 3 @x 1‡ D @x P P 1 1 derivates of L with respect to the P and P coordinates are 1 4 zero. Let E ˆ a…b c†=b; A6. Torsion angles: @E/@x terms ÿ1 Recall that Fˆÿ‰Š a c‡…a b†…b c†=b ; E ˆ a…b c†=b: F ˆ 1=G; …12† Let Gˆÿa c‡…a b†…b c†=b ; M ˆ a…b c†; i.e. Hˆÿa c; E ˆ M=b: J ˆ a b; Differentiating gives @E M @b 1 @M K ˆ b c; ˆÿ ‡ ; @x b @x b@x P P P 3 3 3 L ˆ 1=b : where, as for bonds, Differentiating (12) gives @b x ÿ x P P 3 2 ˆ : @F 1 @G @x b ˆÿ : 3 @x G @x P P 1 1 However, note again that the derivative of b is zero for atoms Substituting for the derivative in (10), P and P , i.e. for atoms P and P 1 4 2 3 M…x ÿ x † @E 1 @M P P @ 1 @E @F 3 2 ˆÿ ‡ ; ˆ F ‡ E : @x b b@x @x 1‡ D @x @x P P 3 3 P P P 1 1 1 but for atoms P and P 1 4 We also have @E 1 @M G ˆ H ‡ JKL: ˆ ; @x b@x P P 1 1 Differentiating this gives M ˆ a …b c ÿ b c †‡ a …b c ÿ b c †‡ a …b c ÿ b c †: @G @H @K @J @L x y z z y y z x x z z x y y x ˆ ‡ JL ‡ KL ‡ JK : @x @x @x @x @x P P P P P 1 1 1 1 1 So here are the primitives of M ˆ a…b c† The H, J, K and L derivatives are @M ˆÿ…b c ÿ b c †; y z z y @x Hˆÿa cˆÿa c ÿ a b ÿ a c ; x x y y z z @M @H @H @H @H ˆ…b c ÿ b c †‡…a c ÿ a c †; y z z y y z z y ˆ c ; ˆÿc ; ˆ a ; ˆÿa ; x x x x @x @x @x @x @x 2 P P P P 1 2 3 4 @K @K @K @M ˆ 0; ˆÿc ; ˆ c ‡ b ; x x x ˆ…a c ÿ a c †‡…b a ÿ b a †; @x @x @x z y y z y z z y P P P 1 2 3 @x @K @J @J ˆ b ; ˆÿb ; ˆ b ÿ a ; x x x x @x @x @x @M P P P 4 1 2 ˆ…a b ÿ a b †; y z z y @J @J @x ˆ a ; ˆ 0: @x @x P P 3 4 @M ˆÿ…b c ÿ b c †; The @b=@x terms are just like the bond derivatives, z x x z @y @L @L @b ˆ ; @M @x @b @x P P 1 1 ˆ…b c ÿ b c †‡…a c ÿ a c †; z x x z z x x z @y i.e. Acta Cryst. (2004). D60, 2126±2132 Emsley & Cowtan Model-building tools for molecular graphics 2131 research papers @M Recall the equation of a plane: ax + by + cz + d = 0. Firstly, ˆÿ…a c ÿ a c †‡…b a ÿ b a †; z x x z z x x z @y the centres of the sets of atoms, x , y , z , are determined. P cen cen cen The plane is moved so that it crosses the origin and therefore @M d = 0 (it is moved back later). The problem then involves three ˆÿ…b a ÿ b a †; z x x z equations, three unknowns and an eigenvalue problem, with @y the smallest eigenvalue corresponding to the best-®t plane. The least-squares planes of the plane restraints are recal- @M ˆÿ…b c ÿ b c †; x y y x culated at every iteration. @z The authors thank Garib Murshudov, Eleanor J. Dodson, @M Jack Quine and the many Coot testers. KC is supported by The ˆ…b c ÿ b c †‡…a c ÿ a c †; x y y x x y y x @z Royal Society (grant No. 003R05674). PE is funded by BBSRC grant No. 87/B17320. @M ˆÿ…a c ÿ a c †‡…a b ÿ a b †; x y y x y x x y @z References Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., @M Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids ˆÿ…a b ÿ a b †: y x x y @z Res. 28, 235±242. Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760±766. Cowtan, K. (1994). Jnt CCP4/ESF±EACBM Newsl. Protein Crystal- logr. 31, 34±38. A7. Combining terms Cowtan, K. (2002). Jnt CCP4/ESF±EACBM Newsl. Protein Crystal- Combining, we obtain the following expression for the logr. 40. derivative of torsion angle  in terms of the primitive deri- Cowtan, K. (2003). Crystallogr. Rev. 9, 73±80. Dunbrack, R. L. Jr & Cohen, F. E. (1997). Protein Sci. 6, 1661±1681. vates, Esnouf, R. M. (1997). Acta Cryst. D53, 665±672. @ 1 @D Jones, T. A. (1978). J. Appl. Cryst. 11, 268±272. ˆ ; Jones, T. A., Cowan, S., Zou, J.-Y. & Kjeldgaard, M. (1991). Acta @x …1‡ tan †@x P P 1 1 Cryst. A47, 110±119. Jones, T. A. & Thirup, S. (1986). EMBO J. 5, 891±822. where Kleywegt, G. J. (1997). J. Mol. Biol. 273, 371±376. "# ! Krissinel, E. B, Winn, M. D., Ballard, C. C., Ashton, A. W., Patel, P., @D @E E @H @K @J @L ˆ F ÿ ‡ JL ‡ KL ‡ JK : Potterton, E. A., McNicholas, S. J., Cowtan, K. D. & Emsley, P. @x @x G @x @x @x @x P P P P P P 1 1 1 1 1 1 (2004). Acta Cryst. D60, 2250±2255. McRee, D. E. (1999). J. Struct. Biol. 125, 156±165. Morris, R. J., Perrakis, A. & Lamzin, V. S. (2002). Acta Cryst. D58, 968±975. A8. Planes Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Acta Cryst D53, 240±255. Old®eld, T. J. (2001). Acta Cryst. D57, 82±94. N Old®eld, T. J. (2003). Acta Cryst. D59, 483±491. planes atoms P P Old®eld, T. J. & Hubbard, R. E. (1994). Proteins Struct. Funct. Genet. S ˆ e ; plane ij iˆ1 jˆ1 18, 324±337. Potterton, L., McNicholas, S., Krissinel, E., Gruber, J., Cowtan, K., where e is the distance of the ith plane restraint's jth atom ij Emsley, P., Murshudov, G. N., Cohen, S., Perrakis, A. & Noble, M. from the ith plane restraint's least-squares plane. (2004). Acta Cryst. D60, 2288±2294. 2132 Emsley & Cowtan Model-building tools for molecular graphics Acta Cryst. (2004). D60, 2126±2132

Journal

Acta Crystallographica Section DUnpaywall

Published: Nov 26, 2004

There are no references for this article.