Dictionary of protein secondary structure: pattern recognition of hydrogen‐bonded and geometrical features

W Kabsch, C Sander - Biopolymers: Original Research on …, 1983 - Wiley Online Library
W Kabsch, C Sander
Biopolymers: Original Research on Biomolecules, 1983, Wiley Online Library
Abstract
For a successful analysis of the relation between amino acid sequence and protein structure, an unambiguous and physically meaningful definition of secondary structure is essential. We have developed a set of simple and physically motivated criteria for secondary structure, programmed as a pattern‐recognition process of hydrogen‐bonded and geometrical features extracted from x‐ray coordinates. Cooperative secondary structure is recognized as repeats of the elementary hydrogen‐bonding patterns “turn” and “bridge.” Repeating turns are “helices,” repeating bridges are “ladders,” connected ladders are “sheets.” Geometric structure is defined in terms of the concepts torsion and curvature of differential geometry. Local chain “chirality” is the torsional handedness of four consecutive Cα positions and is positive for right‐handed helices and negative for ideal twisted β‐sheets. Curved pieces are defined as “bends.” Solvent “exposure” is given as the number of water molecules in possible contact with a residue. The end result is a compilation of the primary structure, including SS bonds, secondary structure, and solvent exposure of 62 different globular proteins. The presentation is in linear form: strip graphs for an overall view and strip tables for the details of each of 10,925 residues. The dictionary is also available in computer‐readable form for protein structure prediction work.
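The chirality criterion in the abstract, the torsional handedness of four consecutive Cα positions, can be illustrated with a short sketch. This is not the authors' program. It computes a signed dihedral angle with the usual sign convention (positive for a right‐handed twist) and reports only the sign. The function names, the coordinate format (a list of (x, y, z) tuples in ångströms), and the choice to attach the sign at residue i to the window Cα(i−1), Cα(i), Cα(i+1), Cα(i+2) are assumptions made for illustration.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral_deg(p0, p1, p2, p3):
    """Signed dihedral angle in degrees, in the range (-180, 180].

    Positive values correspond to a right-handed twist about p1 -> p2.
    Assumes p1 and p2 are distinct points.
    """
    b0, b1, b2 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1 = _cross(b0, b1)  # normal of the plane through p0, p1, p2
    n2 = _cross(b1, b2)  # normal of the plane through p1, p2, p3
    b1_len = math.sqrt(_dot(b1, b1))
    y = _dot(b1, _cross(n1, n2)) / b1_len
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

def chirality_signs(ca_coords):
    """Return '+', '-' or ' ' per residue from consecutive C-alpha positions.

    Assumed window (illustrative): residues i-1, i, i+1, i+2.
    Residues without a full window get ' '.
    """
    n = len(ca_coords)
    signs = [' '] * n
    for i in range(1, n - 2):
        angle = dihedral_deg(*ca_coords[i - 1:i + 3])
        signs[i] = '+' if angle > 0 else '-'
    return signs

def ideal_helix(n, radius=2.3, turn_deg=100.0, rise=1.5):
    """Hypothetical test data: points on a helix; rise > 0 is right-handed."""
    pts = []
    for i in range(n):
        t = math.radians(turn_deg * i)
        pts.append((radius * math.cos(t), radius * math.sin(t), rise * i))
    return pts

right_handed = ideal_helix(8)
left_handed = [(x, y, -z) for (x, y, z) in right_handed]  # mirror image
print(''.join(chirality_signs(right_handed)))
print(''.join(chirality_signs(left_handed)))
```

The helix parameters are rough, α‐helix‐like values chosen only to generate test points. Mirroring the coordinates flips every sign, which is the behavior the abstract describes for right‐handed helices versus segments of opposite handedness.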