Analysis of the Virus Population Present in Equine Faeces Indicates the Presence of Hundreds of Uncharacterized Virus Genomes

Virus DNA was isolated from horse faeces and cloned in a sequence-independent fashion. 268 clones were sequenced and 178140 nucleotides of sequence obtained. Statistical analysis suggests the library contains 17560 distinct clones derived from up to 233 different virus genomes. TBLASTX analysis showed that 32% of the clones had significant identity to GenBank entries. Of these 63% were viral; 20% bacterial; 7% archaeal; 6% eukarya; and 5% were related to mobile genetic elements. Fifty-two percent of the virus identities were with Siphoviridae; 26% unclassified phages; 17% Myoviridae; 4% Podoviridae; and one clone (2%) was a vertebrate Orthopoxvirus. Genes coding for predicted virus structural proteins, proteases, glycosidases and nucleic acid-binding proteins were common.