Gene sequence tags from Plasmodium falciparum genomic DNA fragments prepared by the "genease" activity of mung bean nuclease.

A genes-first approach to genome sequencing is described which efficiently generates gene sequence tags from genomic DNA. Mung bean nuclease (EC 3.1.30.1) cleaves the genomic DNA of many organisms before and after genes and within some introns. Analysis of gene sequence tags prepared from mung bean nuclease-digested Plasmodium falciparum DNA demonstrates that this method has several advantages over the popular cDNA expressed sequence tag approach. To date, 673 sequence tags containing over 215 kb of sequence have been generated from 400 clones. Sixty clones (15%) have significant similarity to sequences in the protein and translated nucleic acid data bases. These represent 51 unique genes, of which only 5 encode previously known P. falciparum proteins. The identified proteins include those expressed in erythrocytic, exoerythrocytic, and gametocytic stages of the parasite. Thirty percent of clones identified appear to carry complete coding regions. The spacer DNA separating genes is rarely cloned. These gene sequence tags will form a useful data base from which to initiate projects to develop new therapeutics, vaccines, and strategies to control human malaria.