HEALTH GeoJunction: place-time-concept browsing of health publications

Background: The volume of health science publications is escalating rapidly. Thus, keeping up with developments is becoming harder, as is the task of finding important cross-domain connections. When geographic location is a relevant component of research reported in publications, these tasks are more difficult because standard search and indexing facilities have limited or no ability to identify geographic foci in documents. This paper introduces HEALTHGeoJunction, a web application that supports researchers in the task of quickly finding scientific publications that are relevant geographically and temporally as well as thematically.

Results: HEALTHGeoJunction is a geovisual analytics-enabled web application providing: (a) web services using computational reasoning methods to extract place-time-concept information from bibliographic data for documents and (b) visually-enabled place-time-concept query, filtering, and contextualizing tools that apply to both the documents and their extracted content. This paper focuses specifically on strategies for visually-enabled, iterative, facet-like, place-time-concept filtering that allows analysts to quickly drill down to scientific findings of interest in PubMed abstracts and to explore relations among abstracts and extracted concepts in place and time. The approach enables analysts to: find publications without knowing all relevant query parameters, recognize unanticipated geographic relations within and among documents in multiple health domains, identify the thematic emphasis of research targeting particular places, notice changes in concepts over time, and notice changes in places where concepts are emphasized.

Conclusions: PubMed is a database of over 19 million biomedical abstracts and citations maintained by the National Center for Biotechnology Information; achieving quick filtering is an important contribution due to the database size. Including geography in filters is important due to rapidly escalating attention to geographic factors in public health. The implementation of mechanisms for iterative place-time-concept filtering makes it possible to narrow searches efficiently and quickly from thousands of documents to a small subset that meets place-time-concept constraints. Support for a more-like-this query creates the potential to identify unexpected connections across diverse areas of research. Multi-view visualization methods support understanding of the place, time, and concept components of document collections and enable comparison of filtered query results to the full set of publications.
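The iterative, facet-like filtering described above can be illustrated with a minimal Python sketch. It is not the HEALTHGeoJunction implementation: the `Abstract` record, the `filter_abstracts` and `facet_counts` helpers, and the four sample documents are all hypothetical, and the sketch assumes that place, year, and concept values have already been extracted from each abstract.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Abstract:
    """Hypothetical record holding already-extracted facet values."""
    doc_id: str           # illustrative identifier, not a real PubMed ID
    places: frozenset     # place names mentioned in the abstract
    year: int             # publication year
    concepts: frozenset   # extracted concept terms


def filter_abstracts(docs, place=None, years=None, concept=None):
    """Keep the documents that satisfy every constraint that is supplied."""
    kept = []
    for d in docs:
        if place is not None and place not in d.places:
            continue
        if years is not None and not (years[0] <= d.year <= years[1]):
            continue
        if concept is not None and concept not in d.concepts:
            continue
        kept.append(d)
    return kept


def facet_counts(docs):
    """Count how often each place, year, and concept occurs in docs."""
    return {
        "place": Counter(p for d in docs for p in d.places),
        "year": Counter(d.year for d in docs),
        "concept": Counter(c for d in docs for c in d.concepts),
    }


# Invented sample data, for illustration only.
DOCS = [
    Abstract("a1", frozenset({"Kenya"}), 2005, frozenset({"malaria", "climate"})),
    Abstract("a2", frozenset({"Kenya", "Uganda"}), 2008, frozenset({"malaria"})),
    Abstract("a3", frozenset({"Brazil"}), 2008, frozenset({"dengue", "climate"})),
    Abstract("a4", frozenset({"Kenya"}), 2009, frozenset({"HIV"})),
]

# Each step filters the previous result, narrowing the set one facet at a time.
step1 = filter_abstracts(DOCS, place="Kenya")
step2 = filter_abstracts(step1, years=(2006, 2010))
step3 = filter_abstracts(step2, concept="malaria")

# Facet counts on an intermediate result suggest which filter to apply next.
remaining = facet_counts(step1)
```

Recomputing the facet counts after each step is what lets an analyst proceed without knowing all query parameters in advance: the counts for the remaining documents show which places, years, and concepts are still available to filter on.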
