TY - JOUR AU - Rousseau,, Frederic AB - Abstract Transition of soluble proteins into insoluble amyloid fibrils is driven by self-propagating short sequence stretches. However, accurate prediction of aggregation determinants remains challenging. Here, we describe WALTZ-DB 2.0, an updated and significantly expanded open-access database providing information on experimentally determined amyloid-forming hexapeptide sequences (http://waltzdb.switchlab.org/). We have updated WALTZ-DB 2.0 with new entries, including: (i) experimental validation of an in-house developed dataset of 229 hexapeptides, using electron microscopy and Thioflavin-T binding assays; (ii) manual curation of 98 amyloid-forming peptides isolated from literature. Furthermore, the content has been expanded by adding novel structural information for peptide entries, including sequences of the previous version. Using a computational methodology developed in the Switch lab, we have generated 3D-models of the putative amyloid fibril cores of WALTZ-DB 2.0 entries. Structural models, coupled with information on the energetic contributions and fibril core stabilities, can be accessed through individual peptide entries. Customized filtering options for subset selections and new modelling graphical features were added to upgrade online accessibility, providing a user-friendly interface for browsing, downloading and updating. WALTZ-DB 2.0 remains the largest open-access repository for amyloid fibril formation determinants and will continue to enhance the development of new approaches focused on accurate prediction of aggregation prone sequences. INTRODUCTION Protein folding is a crucial process during which polypeptide chains adopt a thermodynamically stable three-dimensional structure that is pivotal for most cellular functions. Proteins that misfold or fail to retain their native tertiary structure are prone to forming amyloid fibril aggregates (1). Amyloids are linked to a growing number of widespread debilitating diseases, including type II diabetes (T2D), atherosclerosis, systemic amyloidoses and capital neurodegenerative diseases, such as Alzheimer's and Parkinson's disease (2,3). On the other hand, recent studies also suggest that toxicity may precede the formation of large fibrous deposits (3). Phase separation has emerged as an alternative mechanism and has been proposed for several proteins associated to neurodegenerative diseases (4,5), suggesting that protein or peptide molecules with specific sequence properties may self-assemble into oligomeric granular modules with increased toxicity (6). At the same time, amyloid formation also serves as a natural scaffold for the formation of molecular superstructures with impressive functional, protective or structural properties, both in humans and other organisms (7). Amyloid aggregation propensity is encoded in the primary structure of protein molecules, hidden within harboured short sequence segments (8–10). These aggregation prone stretches mediate self-assembly of proteins into ordered perpetuating intermolecular β-sheet assemblies known as ‘cross-β’ spines, which protrude in parallel orientation to the amyloid fibre axis (11). This conformation comprises an extensive network of backbone hydrogen bonds and a set of laterally inter-fitted side chains excluding water molecules, yielding, thus, a tightly packed and energetically favourable amyloid fibril core (12). Aggregation prone regions are usually integral parts buried within the hydrophobic core of the protein native fold and consequently are often enriched with residues favouring β-strand formation, increased hydrophobicity and low charge content (13). Considering such sequence propensities, several computational tools have been developed over the years in an effort to accurately predict aggregation potential from polypeptide sequences (8,14–16). This increasing interest has manifested to a considerable growth in experimental data regarding protein self-assembly regions. Putative aggregation mechanisms based on the notion of aggregation hot spots have been proposed for proteins associated to the formation of both functional and disease-associated amyloid fibrils (17–19). Furthermore, synthetic peptide analogues have been developed as novel strategies for the production of antibacterial or anti-tumoural agents (20,21), for the development of transgenic plants with growth phenotypes (22), or as a new source for the development of potent nanomaterials with various applications (23,24). Following this demand, here we describe the fully updated and significantly expanded WALTZ-DB 2.0, the largest publicly available repository for experimentally determined amyloid-forming peptide sequences. EXPANDED CONTENT AND FEATURE IMPROVEMENTS New peptide entries and database statistics WALTZ-DB 2.0 is currently updated to store 1416 hexapeptide entries, divided into nine distinct subsets of origin. In total, 512 peptides have experimentally determined amyloid-forming properties, whereas 904 peptides self-assemble into aggregates with amorphous morphological characteristics. Two novel peptide subsets were added during this update, containing individual peptide mutation screens of known aggregation prone stretches derived from tau and apolipoprotein A-I, which are known amyloid-forming proteins associated to neurodegeneration (25) and atherosclerosis (26). The subsets, designated as tau mutant set and apoAI mutant set, are composed of 114 and 115 hexapeptides, respectively, and were systematically developed following a single mutation strategy along all available residue positions. Hexapeptide additions were classified into the database as amyloid or non-amyloid-forming sequences when corresponding morphologies were identified, using electron microscopy or by producing positive Thioflavin-T (Th-T) binding spectra. In detail, to characterize a peptide as amyloidogenic, we followed the general convention for amyloid-like morphology. Amyloid fibrils are typically long and unbranched with a diameter ranging between 8 and 10 nm and often tend to coalesce laterally forming superhelices or proto-fibrillar ribbons (27). Once bound to the surface of a β-rich amyloid fibril structure, the benzothiazole Th-T dye displays enhanced fluorescence intensity (28). Peptides producing spectra with increased fluorescence maxima at 480 nm were also considered as amyloid-forming sequences. Following the example of the previous version of the database, all experimental data are available online in order to allow users to independently conclude on the amyloid classification of every hexapeptide entry. Complementary to the above, WALTZ-DB 2.0 also includes an additional number of 98 hexapeptides with detailed annotated amyloid-forming properties which were mined from literature, manually curated and added to the previous Literature subset of the database. To summarize, this major update included the storage of 327 new peptide entries, out of which 268 hexapeptides were classified as amyloid-forming and 59 were judged as non-amyloidogenic sequences. Peptide data entries remain stored in a MySQL database available through a web server built with the Drupal content management system. This provides the required infrastructure to keep WALTZ-DB 2.0 regularly up to date, as well as to ensure fast and secure access to the stored data. Finally, a refreshed version of the methods is mentioned on the help page of the website. WALTZ-DB 2.0 novel features For clarity and in an effort to assist non-experienced users with the evaluation of scoring aggregation properties for peptide entries, predicted aggregation propensities in WALTZ-DB 2.0 are highlighted by specific colouring schemes (Figure 1). In detail, TANGO and WALTZ predictions are shown with a red-to-blue colouring gradient, using thresholds that have been previously reported to provide high specificity (8,15). For sequence hydrophobicity, secondary structure and parallel or antiparallel β-strand formation propensities, positive predictions are shown in blue and negatives in red, respectively. Detailed information on the corresponding threshold values for every individual field can be retrieved from the help page available online. Figure 1. Open in new tabDownload slide Example output of a peptide entry page on WALTZ-DB 2.0. An upper table contains information on the specific peptide sequence and corresponding predicted propensities. Negative predictions are shown in red, whereas positives are indicated in blue. Electron micrographs and Th-T binding spectra are available online for all new peptide entries. A mutation field indicating the position and transitional mutation for peptide entries has been added in the protein information table, along with a field highlighting the steric zipper class for the model prediction. A new table has been added including detailed energy contributions and the overall stability for the structural model. Finally, a JSmol JmolApplet is used to provide an integrated molecular graphics interface displaying the 3D-structural models of the corresponding hexapeptide entries. Figure 1. Open in new tabDownload slide Example output of a peptide entry page on WALTZ-DB 2.0. An upper table contains information on the specific peptide sequence and corresponding predicted propensities. Negative predictions are shown in red, whereas positives are indicated in blue. Electron micrographs and Th-T binding spectra are available online for all new peptide entries. A mutation field indicating the position and transitional mutation for peptide entries has been added in the protein information table, along with a field highlighting the steric zipper class for the model prediction. A new table has been added including detailed energy contributions and the overall stability for the structural model. Finally, a JSmol JmolApplet is used to provide an integrated molecular graphics interface displaying the 3D-structural models of the corresponding hexapeptide entries. Every peptide entry stored in the expanded WALTZ-DB 2.0 now contains detailed structural information regarding the putative amyloid fibril core. Utilizing a structural prediction methodology for amyloidogenic sequences developed in our lab, we have analysed all 1416 peptide entries and now provide a structural steric zipper prediction model. Users can download the corresponding models in a PDB format through links that are accessible in the peptide entry pages. Alternatively, WALTZ-DB 2.0 also provides a novel molecular graphics interface panel that is accessible online on every peptide page. This new feature allows users to actively manipulate and investigate peptide model structures on the spot, whilst browsing the online database. The JSmol plugin interface provides several options available for style effects (cartoon, ball and stick, ribbon etc), structural colouring (by secondary structure, residue, element etc) and surface or cavity representation, selection for hydrogen bond or disulphide bridge annotations, as well as modules for structural rotation or background colouring. Adding to this, following structural analysis of the stability of the models and manual curation, all peptide page entries contain information on the energy contributions for the steric zipper models. Energies of all major interactions, such as the contribution of electrostatics, hydrogen bond networking between backbone groups or side chains, solvation energies of residues with hydrophobic or polar properties, as well as Van der Waals packing interactions or potential clashes are attributed to each model entry. The WALTZ-DB 2.0 database page lists a paged updated table and new filtering steps aiming to make browsing a more user-friendly process. Building on the options of the previous version, the database now offers selections for filtering peptide entries based on availability of Th-T spectral data, as well as energetic stability of the structural models. Using this feature, users can now filter the database and isolate individual sequences by searching within a specific range of overall structural stability energies for predicted steric zippers. Notably, a major disadvantage of the previous version was that users could only download the entire database locally. WATLZ-DB 2.0 now allows users to create and access specific entry datasets by combining any of the provided filters and subsequently downloading the resulting list in a CSV, Excel or JSON format, using buttons that are available at the bottom of the listed table. MATERIALS AND METHODS Peptide synthesis Hexapeptides of the new subsets were synthesized using an in-house Intavis Multipep RSi solid phase peptide synthesis robot capable of parallel synthesis of 24–384 peptides. RP-HPLC purification protocols were used to ensure high levels of peptide purification (>90%). Peptide stock solutions were prepared by dissolving in milli-Q water to a final concentration of 1 mM. Dimethyl Sulfoxide (DMSO) traces (<5%) were used to assist with peptide solubility. The peptide solutions were incubated for 2 weeks at 25°C with shaking prior to analysis of amyloid-forming properties. Determination of amyloid fibril properties Transmission electron microscopy was performed to track the morphological properties of the peptide aggregates. Suspensions (5 μl) of peptide aliquots were adsorbed for 1 min to formvar film coated 400-mesh copper grids (Agar Scientific Ltd., England), following a short glow discharge step to improve adsorption. Grids were subsequently washed in 50 μl of milli-Q water and stained with uranyl acetate (2% w/v) for 60 s. Excess stain was removed by blotting with a filter paper. The grids were examined using a JEM-1400 120 kV transmission electron microscope (JEOL, Japan) operated at 80 keV. Amyloid formation was also monitored using Thioflavin-T binding assays. Thioflavin-T (Th-T) is a rotor dye that acts as an efficient reporter of amyloid fibril formation, since it increases its fluorescence when binding to cross-β rich aggregates (28). Th-T (Sigma) was added in low volume black 384-well microplates at a final concentration of 20 μM. Peptide concentration was set to 30 μM. Fluorescence intensity was measured in triplicates, through a ClarioStar plate reader (BMG Labtech, Germany), using an excitation filter at 440 nm and by recording an emission spectrum ranging between 468 and 600 nm. Emission spectra were corrected by subtracting Thioflavin-T - only spectra as background and binding was evaluated by measuring the intensity peak emitted at 480 nm. Structural models and energy calculations To provide a structural characterization for the database peptide entries, we have followed the structural topologies of steric zippers introduced by the Eisenberg lab (10,12,29). Representative 3D-model structures were generated utilizing a structural prediction methodology, developed by the Switch lab. Briefly, this pipeline comprises a large dataset of steric zipper hexapeptide fragment templates that have been extracted from the Protein Data Bank (30). Following implementation of the FoldX energy force field (31), hexapeptide sequences are threaded against all templates, stability energies are calculated and subsequently fed into a random forest classifier. This non-linear classifier then provides a probability estimation of aggregation propensity as a non-trivial function of the corresponding input energies. The threaded structure producing the optimal stability predicted is finally selected as a putative 3D-model representation of the amyloid fibril core. LINKS TO OTHER DATABASES Every WALTZ-DB 2.0 peptide entry is linked to a Uniprot ID (http://www.uniprot.org/) when information of the parental protein is available (32). For peptide entries directly mined from literature, a corresponding reference link connecting to the PUBMED literature portal (http://www.ncbi.nlm.nih.gov/pubmed/) is maintained. Finally, we also provide useful links to other related databases and web servers on protein aggregation (14,29). SUMMARY The former release of WALTZ-DB has served as the largest available repository for amyloid-forming short sequence stretches containing experimental annotation (33). It has been used extensively for the development or as a carefully annotated validation set of several high performing predictors of aggregation propensity (15,16,34–40) and has also been utilized as major source of information incorporated in related databases of amyloid aggregation, such as CPAD (41), AmyLoad (42) and AmyPro (43). Following the above, in the current release we have opted to significantly expand the content of the database by simultaneously doubling the coverage of amyloid-forming peptide sequences (512 amyloid sequences compared to 244 previously available), thus providing an improved and more balanced dataset of entries. Furthermore, structural data and novel online features have been added to promote online access to WALTZ-DB 2.0 as a more user-friendly experience and to provide a new layer of information to the users. Finally, we encourage users to help us keep the database up to date by submitting newly identified aggregation-prone hexapeptide sequences using the contact form available online (http://waltzdb.switchlab.org/contact). FUNDING European Research Council under the European Union's Horizon 2020 Framework Programme ERC Grant Agreement [647458 (MANGO) to J.S.]; Flanders institute for Biotechnology (VIB); University of Leuven; Fund for Scientific Research Flanders (FWO); Fund for Scientific Research Flanders Post-doctoral Fellowship (FWO) [12P0919N to N.L.]. Conflict of interest statement. None declared. REFERENCES 1. Dobson C.M. Protein folding and misfolding . Nature . 2003 ; 426 : 884 – 890 . Google Scholar Crossref Search ADS PubMed WorldCat 2. Benson M.D. , Buxbaum J.N. , Eisenberg D.S. , Merlini G. , Saraiva M.J.M. , Sekijima Y. , Sipe J.D. , Westermark P. Amyloid nomenclature 2018: recommendations by the International Society of Amyloidosis (ISA) nomenclature committee . Amyloid . 2018 ; 25 : 215 – 219 . Google Scholar Crossref Search ADS PubMed WorldCat 3. Chiti F. , Dobson C.M. Protein misfolding, functional amyloid, and human disease . Annu. Rev. Biochem. 2006 ; 75 : 333 – 366 . Google Scholar Crossref Search ADS PubMed WorldCat 4. Babinchak W.M. , Haider R. , Dumm B.K. , Sarkar P. , Surewicz K. , Choi J.K. , Surewicz W.K. The role of liquid-liquid phase separation in aggregation of the TDP-43 low-complexity domain . J. Biol. Chem. 2019 ; 294 : 6306 – 6317 . Google Scholar Crossref Search ADS PubMed WorldCat 5. Wegmann S. , Eftekharzadeh B. , Tepper K. , Zoltowska K.M. , Bennett R.E. , Dujardin S. , Laskowski P.R. , MacKenzie D. , Kamath T. , Commins C. et al. . Tau protein liquid-liquid phase separation can initiate tau aggregation . EMBO J. 2018 ; 37 : e98049 . Google Scholar Crossref Search ADS PubMed WorldCat 6. Elbaum-Garfinkle S. Matter over mind: Liquid phase separation and neurodegeneration . J. Biol. Chem. 2019 ; 294 : 7160 – 7168 . Google Scholar Crossref Search ADS PubMed WorldCat 7. Fowler D.M. , Koulov A.V. , Balch W.E. , Kelly J.W. Functional amyloid–from bacteria to humans . Trends Biochem. Sci. 2007 ; 32 : 217 – 224 . Google Scholar Crossref Search ADS PubMed WorldCat 8. Fernandez-Escamilla A.M. , Rousseau F. , Schymkowitz J. , Serrano L. Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins . Nat. Biotechnol. 2004 ; 22 : 1302 – 1306 . Google Scholar Crossref Search ADS PubMed WorldCat 9. Lopez de la Paz M. , Serrano L. Sequence determinants of amyloid fibril formation . PNAS . 2004 ; 101 : 87 – 92 . Google Scholar Crossref Search ADS PubMed WorldCat 10. Teng P.K. , Eisenberg D. Short protein segments can drive a non-fibrillizing protein into the amyloid state . Protein Eng. Des. Sel. 2009 ; 22 : 531 – 536 . Google Scholar Crossref Search ADS PubMed WorldCat 11. Nelson R. , Sawaya M.R. , Balbirnie M. , Madsen A.O. , Riekel C. , Grothe R. , Eisenberg D. Structure of the cross-beta spine of amyloid-like fibrils . Nature . 2005 ; 435 : 773 – 778 . Google Scholar Crossref Search ADS PubMed WorldCat 12. Sawaya M.R. , Sambashivan S. , Nelson R. , Ivanova M.I. , Sievers S.A. , Apostol M.I. , Thompson M.J. , Balbirnie M. , Wiltzius J.J. , McFarlane H.T. et al. . Atomic structures of amyloid cross-beta spines reveal varied steric zippers . Nature . 2007 ; 447 : 453 – 457 . Google Scholar Crossref Search ADS PubMed WorldCat 13. Buck P.M. , Kumar S. , Singh S.K. On the role of aggregation prone regions in protein evolution, stability, and enzymatic catalysis: insights from diverse analyses . PLoS Comput. Biol. 2013 ; 9 : e1003291 . Google Scholar Crossref Search ADS PubMed WorldCat 14. Conchillo-Sole O. , de Groot N.S. , Aviles F.X. , Vendrell J. , Daura X. , Ventura S. AGGRESCAN: a server for the prediction and evaluation of “hot spots” of aggregation in polypeptides . BMC Bioinformatics . 2007 ; 8 : 65 . Google Scholar Crossref Search ADS PubMed WorldCat 15. Maurer-Stroh S. , Debulpaep M. , Kuemmerer N. , Lopez de la Paz M. , Martins I.C. , Reumers J. , Morris K.L. , Copland A. , Serpell L. , Serrano L. et al. . Exploring the sequence determinants of amyloid structure using position-specific scoring matrices . Nat. Methods . 2010 ; 7 : 237 – 242 . Google Scholar Crossref Search ADS PubMed WorldCat 16. Walsh I. , Seno F. , Tosatto S.C. , Trovato A. PASTA 2.0: an improved server for protein aggregation prediction . Nucleic Acids Res. 2014 ; 42 : W301 – W307 . Google Scholar Crossref Search ADS PubMed WorldCat 17. Louros N.N. , Bolas G.M.P. , Tsiolaki P.L. , Hamodrakas S.J. , Iconomidou V.A. Intrinsic aggregation propensity of the CsgB nucleator protein is crucial for curli fiber formation . J. Struct. Biol. 2016 ; 195 : 179 – 189 . Google Scholar Crossref Search ADS PubMed WorldCat 18. Louros N.N. , Chrysina E.D. , Baltatzis G.E. , Patsouris E.S. , Hamodrakas S.J. , Iconomidou V.A. A common ‘aggregation-prone’ interface possibly participates in the self-assembly of human zona pellucida proteins . FEBS Lett. 2016 ; 590 : 619 – 630 . Google Scholar Crossref Search ADS PubMed WorldCat 19. Tsiolaki P.L. , Louros N.N. , Iconomidou V.A. Hexapeptide tandem repeats dictate the formation of silkmoth chorion, a natural protective amyloid . J. Mol. Biol. 2018 ; 430 : 3774 – 3783 . Google Scholar Crossref Search ADS PubMed WorldCat 20. Gallardo R. , Ramakers M. , De Smet F. , Claes F. , Khodaparast L. , Khodaparast L. , Couceiro J.R. , Langenberg T. , Siemons M. , Nystrom S. et al. . De novo design of a biologically active amyloid . Science . 2016 ; 354 : aah4949 . Google Scholar Crossref Search ADS PubMed WorldCat 21. Khodaparast L. , Khodaparast L. , Gallardo R. , Louros N.N. , Michiels E. , Ramakrishnan R. , Ramakers M. , Claes F. , Young L. , Shahrooei M. et al. . Aggregating sequences that occur in many proteins constitute weak spots of bacterial proteostasis . Nat. Commun. 2018 ; 9 : 866 . Google Scholar Crossref Search ADS PubMed WorldCat 22. Betti C. , Vanhoutte I. , Coutuer S. , De Rycke R. , Mishev K. , Vuylsteke M. , Aesaert S. , Rombaut D. , Gallardo R. , De Smet F. et al. . Sequence-specific protein aggregation generates defined protein knockdowns in plants . Plant Physiol. 2016 ; 171 : 773 – 787 . Google Scholar PubMed WorldCat 23. Mankar S. , Anoop A. , Sen S. , Maji S.K. Nanomaterials: amyloids reflect their brighter side . Nano Rev. 2011 ; 2 : doi:10.3402/nano.v2i0.6032 . WorldCat 24. Onur T. , Yuca E. , Olmez T.T. , Seker U.O.S. Self-assembly of bacterial amyloid protein nanomaterials on solid surfaces . J. Colloid Interface Sci. 2018 ; 520 : 145 – 154 . Google Scholar Crossref Search ADS PubMed WorldCat 25. Bloom G.S. Amyloid-beta and tau: the trigger and bullet in Alzheimer disease pathogenesis . JAMA Neurol. 2014 ; 71 : 505 – 508 . Google Scholar Crossref Search ADS PubMed WorldCat 26. Louros N.N. , Tsiolaki P.L. , Griffin M.D. , Howlett G.J. , Hamodrakas S.J. , Iconomidou V.A. Chameleon ‘aggregation-prone’ segments of apoA-I: A model of amyloid fibrils formed in apoA-I amyloidosis . Int. J. Biol. Macromol. 2015 ; 79 : 711 – 718 . Google Scholar Crossref Search ADS PubMed WorldCat 27. Kodali R. , Wetzel R. Polymorphism in the intermediates and products of amyloid assembly . Curr. Opin. Struct. Biol. 2007 ; 17 : 48 – 57 . Google Scholar Crossref Search ADS PubMed WorldCat 28. Biancalana M. , Makabe K. , Koide A. , Koide S. Molecular mechanism of thioflavin-T binding to the surface of beta-rich peptide self-assemblies . J. Mol. Biol. 2009 ; 385 : 1052 – 1063 . Google Scholar Crossref Search ADS PubMed WorldCat 29. Goldschmidt L. , Teng P.K. , Riek R. , Eisenberg D. Identifying the amylome, proteins capable of forming amyloid-like fibrils . PNAS . 2010 ; 107 : 3487 – 3492 . Google Scholar Crossref Search ADS PubMed WorldCat 30. Berman H.M. , Westbrook J. , Feng Z. , Gilliland G. , Bhat T.N. , Weissig H. , Shindyalov I.N. , Bourne P.E. The Protein Data Bank . Nucleic Acids Res. 2000 ; 28 : 235 – 242 . Google Scholar Crossref Search ADS PubMed WorldCat 31. Schymkowitz J. , Borg J. , Stricher F. , Nys R. , Rousseau F. , Serrano L. The FoldX web server: an online force field . Nucleic Acids Res. 2005 ; 33 : W382 – W388 . Google Scholar Crossref Search ADS PubMed WorldCat 32. UniProt C. UniProt: a worldwide hub of protein knowledge . Nucleic Acids Res. 2019 ; 47 : D506 – D515 . Google Scholar Crossref Search ADS PubMed WorldCat 33. Beerten J. , Van Durme J. , Gallardo R. , Capriotti E. , Serpell L. , Rousseau F. , Schymkowitz J. WALTZ-DB: a benchmark database of amyloidogenic hexapeptides . Bioinformatics . 2015 ; 31 : 1698 – 1700 . Google Scholar Crossref Search ADS PubMed WorldCat 34. Antonets K.S. , Nizhnikov A.A. Predicting amyloidogenic proteins in the proteomes of plants . Int. J. Mol. Sci. 2017 ; 18 : 2155 . Google Scholar Crossref Search ADS WorldCat 35. Burdukiewicz M. , Sobczyk P. , Rodiger S. , Duda-Madej A. , Mackiewicz P. , Kotulska M. Amyloidogenic motifs revealed by n-gram analysis . Sci. Rep. 2017 ; 7 : 12961 . Google Scholar Crossref Search ADS PubMed WorldCat 36. Chen M. , Schafer N.P. , Zheng W. , Wolynes P.G. The Associative memory, water mediated, structure and energy model (AWSEM)-amylometer: predicting amyloid propensity and fibril topology using an optimized folding landscape model . ACS Chem. Neurosci. 2018 ; 9 : 1027 – 1039 . Google Scholar Crossref Search ADS PubMed WorldCat 37. Niu M. , Li Y. , Wang C. , Han K. RFAmyloid: a web server for predicting amyloid proteins . Int. J. Mol. Sci. 2018 ; 19 : 2071 . Google Scholar Crossref Search ADS WorldCat 38. Prabakaran R. , Goel D. , Kumar S. , Gromiha M.M. Aggregation prone regions in human proteome: Insights from large-scale data analyses . Proteins . 2017 ; 85 : 1099 – 1118 . Google Scholar Crossref Search ADS PubMed WorldCat 39. Roche D.B. , Villain E. , Kajava A.V. Usage of a dataset of NMR resolved protein structures to test aggregation versus solubility prediction algorithms . Protein Sci. 2017 ; 26 : 1864 – 1869 . Google Scholar Crossref Search ADS PubMed WorldCat 40. Sankar K. , Krystek S.R. Jr , Carl S.M. , Day T. , Maier J.K.X. AggScore: Prediction of aggregation-prone regions in proteins based on the distribution of surface patches . Proteins . 2018 ; 86 : 1147 – 1156 . Google Scholar Crossref Search ADS PubMed WorldCat 41. Thangakani A.M. , Nagarajan R. , Kumar S. , Sakthivel R. , Velmurugan D. , Gromiha M.M. CPAD, curated protein aggregation database: a repository of manually curated experimental data on protein and peptide aggregation . PLoS One . 2016 ; 11 : e0152949 . Google Scholar Crossref Search ADS PubMed WorldCat 42. Wozniak P.P. , Kotulska M. AmyLoad: website dedicated to amyloidogenic protein fragments . Bioinformatics . 2015 ; 31 : 3395 – 3397 . Google Scholar Crossref Search ADS PubMed WorldCat 43. Varadi M. , De Baets G. , Vranken W.F. , Tompa P. , Pancsa R. AmyPro: a database of proteins with validated amyloidogenic regions . Nucleic Acids Res. 2018 ; 46 : D387 – D392 . Google Scholar Crossref Search ADS PubMed WorldCat © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. TI - WALTZ-DB 2.0: an updated database containing structural information of experimentally determined amyloid-forming peptides JO - Nucleic Acids Research DO - 10.1093/nar/gkz758 DA - 2020-01-08 UR - https://www.deepdyve.com/lp/oxford-university-press/waltz-db-2-0-an-updated-database-containing-structural-information-of-TRbyGL6hn2 SP - D389 VL - 48 IS - D1 DP - DeepDyve ER -