Protein Folding

A protein is a long polymer of amino acids, the sequence of which is determined by the genetic code of DNA (another long polymer). But unlike DNA, most proteins adopt a highly complex final structure, driven entirely by the interactions of amino acid side chains with other side chains and the surrounding solvent. Proteins fold very quickly, despite the enormous (try 10E90!) number of potential starting conformations. Hence it is expected that protein folding follows a directed pathway, reducing the effective number of conformations. The trick is to figure out what determines the pathway: what interactions of the side chains are necessary and sufficient for directing the polypeptide chain to the final native state?

Our laboratory has described the in vitro and in vivo folding pathways for the tailspike protein of bacteriophage P22. The native structure of tailspike, comprised of three polypeptide chains of 666 amino acids each, is dominated by its extended, repeating b-coil motif, and the overall resemblance of the C-terminal 5/6ths of the protein to a fish (see image above).

Want to learn more about protein folding/misfolding? Read this general-audience article .


Protein Misfolding

Under favorable conditions, most proteins have no problem quickly folding to their native structures. However, there are some proteins which appear unable to fold without the presence of other helper proteins, called chaperones. In the absence of chaperones, these proteins will fail to achieve their native state and instead may associate with other unfolded polypeptide chains to form large aggregate structures (or in vivo, this may result in inclusion body formation). A similar scenario can occur when a protein acquires a mutation, a genetic change resulting in the alteration of one of the amino acid sites on the chain. Misfolding can also occur when a protein is subjected to unfavorable conditions, such as extremes of heat or pH. Clearly, understanding the physical basis for why folding has failed in these cases can provide clues as to the interactions vital for successful protein folding.

The King lab has characterized over 70 genetically-isolated point mutations in the tailspike sequence, and demonstrated that, for the most part, these mutations have no effects on native protein stability. The mutations do, however, have wide-ranging effects on the ability of the protein to fold to its native structure, and hence are assumed to affect the stability or structure of a folding intermediate.

Want to learn more about protein folding/misfolding? Read this general-audience article .


Virus Assembly

Viruses have long been known as sources of human disease, but have now also become important tools of biological research used in DNA cloning, targeted gene therapy, and phage display of novel proteins. Although viruses are extremely diverse in their life cycles and infectious mechanisms, most share a common structure, consisting of an inner core of condensed nucleic acid enclosed within a spherical protein capsid. The structures of these capsids are based upon icosahedral symmetry.

In the case of herpesviruses and adenoviruses, as well as the double-stranded DNA bacteriophages such as P22, the initial product of the viral assembly pathway is not an infectious virion but a closed shell that does not contain DNA. These precursor shells, or procapsids, include proteins not found in the mature virion, but essential for their production. These proteins are termed "scaffolding proteins". Our laboratory has described the assembly pathway for P22, focussing on the role of the scaffolding protein in virus assembly.