Computer analysis of bacterial haloacid dehalogenases defines a large superfamily of hydrolases with diverse specificity. Application of an iterative approach to database search.

Using an iterative approach to sequence database search that combines scanning with individual amino acid sequences and with alignment blocks, we show that bacterial haloacid dehalogenases (HADs) belong to a large superfamily of hydrolases with diverse substrate specificity. The superfamily also includes epoxide hydrolases, different types of phosphatases, and numerous uncharacterized proteins from eubacteria, eukaryotes, and Archaea. Nine putative proteins of the HAD superfamily with functions unknown, in addition to two known enzymes, were found in Escherichia coli alone, making it one of the largest groups of enzymes and indicating that a variety of hydrolytic enzyme activities remain to be described. Many of the proteins with known enzymatic activities in the HAD superfamily are involved in detoxification of xenobiotics or metabolic by-products. All the proteins in the superfamily contain three conserved sequence motifs. Along with the conservation of the predicted secondary structure, motifs I, II, and III include a conserved aspartic acid residue, a lysine, and a nucleophile, namely aspartic acid or serine, respectively. A specific role in the catalysis of the hydrolysis of carbon-halogen and other bonds is assigned to each of these residues.