An introduction to modeling structure from sequence.

The underlying premise behind all attempts to determine a large number of diverse protein structures is that the total number of protein domain folds is much smaller, by many orders of magnitude, than the total number of sequences; in other words, many sequences adopt essentially the same fold. If the fold of a protein could be recognized from sequence information alone, then a complete database of all possible folds would allow the structure corresponding to any sequence to be modeled. The growth of structure determination has turned most biochemists and biologists into consumers of structural information. As the demand for such information continues to outstrip the supply, all aspects of structure modeling assume increasing importance. This unit provides an introduction to modeling structure from its sequence and surveys the currently available methods described in the subsequent units of this chapter.