Classifying the precancers: A metadata approach

BackgroundDuring carcinogenesis, precancers are the morphologically identifiable lesions that precede invasive cancers. In theory, the successful treatment of precancers would result in the eradication of most human cancers. Despite the importance of these lesions, there has been no effort to list and classify all of the precancers. The purpose of this study is to describe the first comprehensive taxonomy and classification of the precancers. As a novel approach to disease classification, terms and classes were annotated with metadata (data that describes the data) so that the classification could be used to link precancer terms to data elements in other biological databases.MethodsTerms in the UMLS (Unified Medical Language System) related to precancers were extracted. Extracted terms were reviewed and additional terms added. Each precancer was assigned one of six general classes. The entire classification was assembled as an XML (eXtensible Mark-up Language) file. A Perl script converted the XML file into a browser-viewable HTML (HyperText Mark-up Language) file.ResultsThe classification contained 4700 precancer terms, 568 distinct precancer concepts and six precancer classes: 1) Acquired microscopic precancers; 2) acquired large lesions with microscopic atypia; 3) Precursor lesions occurring with inherited hyperplastic syndromes that progress to cancer; 4) Acquired diffuse hyperplasias and diffuse metaplasias; 5) Currently unclassified entities; and 6) Superclass and modifiers.ConclusionThis work represents the first attempt to create a comprehensive listing of the precancers, the first attempt to classify precancers by their biological properties and the first attempt to create a pathologic classification of precancers using standard metadata (XML). The classification is placed in the public domain, and comment is invited by the authors, who are prepared to curate and modify the classification.

[1]  P. Trott,et al.  International Classification of Diseases for Oncology , 1977 .

[2]  M. Sporn,et al.  Treatment and Prevention of Intraepithelial Neoplasia: An Important Target for Accelerated New Agent Development: Recommendations of the American Association for Cancer Research Task Force on the Treatment and Prevention of Intraepithelial Neoplasia , 2002 .

[3]  D. Henson,et al.  The Pathology of Incipient Neoplasia , 1986 .

[4]  G. Moore,et al.  The role of cell death in the growth of preneoplastic lesions: a Monte Carlo simulation model , 1992, Cell proliferation.

[5]  M. Sporn,et al.  Treatment and prevention of intraepithelial neoplasia: an important target for accelerated new agent development. , 2002, Clinical cancer research : an official journal of the American Association for Cancer Research.

[6]  E. Mayr The Growth of Biological Thought: Diversity, Evolution, and Inheritance , 1983 .

[7]  Liam Quin,et al.  Mastering XML Premium Edition , 2001 .

[8]  J. Seidman,et al.  Premalignant nonepithelial lesions: a biological classification. , 1993, Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc.

[9]  Charles Weijer,et al.  Full House: The Spread of Excellence from Plato to Darwin. , 1997 .

[10]  Joshua Lubell,et al.  Professional Xml Meta Data , 2001 .

[11]  April Fritz,et al.  International Classification of Diseases for Oncology: ICD-0. , 2000 .

[12]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[13]  L. Franks Neoplastic development , 1976, Nature.