CloSe: A Cloud SaaS for Semantic Document Composition

Nowadays a large number of companies world-wide has moved their applications in the cloud, exploiting a lot of types of functions, such as customer relationship management, human resources, accounting and document sharing. In this work we propose "'CloSe"', a cloud system for document composition, which offers editing/composing aiding services by exploiting semantic based technology. Close is the evolution, in cloud technology, of a monolithic system architecture for document processing, based on semantic methodologies, that we have developed in the past years. We intend to migrate our system procedures to the Internet Cloud technology that guarantees several advantages in terms of usability, scalability and fault tolerance. The proposed system will help the users in the process of writing documents, exploiting information and data contained in apposite document bases, collected from heterogeneous sources, in order to suggest proper fragments to be inserted into the document. We have produced a system prototype that realizes semantic retrieval functionalities, in order to assist the specialist and generic user in the document composition aiding activities. Moreover we have reported some experimental results that we have carried out for evaluating the impact of the system on enhancing user effort in composing documents by suggesting appropriate document segments, which encouraged us to extend the system in the cloud.

[1]  Antonino Mazzeo,et al.  E-Government, ICT Professionalism and Competences Service Science - IFIP 20th World Computer Congress, Industry-Oriented Conferences, September 7-10, 2008, Milano, Italy , 2008, E-Government, ICT Professionalism and Competences Service Science.

[2]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[3]  Flora Amato,et al.  Knowledge Representation and Management for E-Government Documents , 2008, E-Government, ICT Professionalism and Competences Service Science.

[4]  Wei-hua Yao,et al.  [Integrating the healthcare enterprise: an overview]. , 2003, Di 1 jun yi da xue xue bao = Academic journal of the first medical college of PLA.

[5]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[6]  Frank Anshen,et al.  Statistics for linguistics , 1978 .

[7]  A. Rector,et al.  A Terminology Server for Medical Language and Medical Information Systems , 1995, Methods of Information in Medicine.

[8]  Michael A. Cusumano,et al.  Cloud computing and SaaS as new computing platforms , 2010, CACM.

[9]  Kenneth Ward Church,et al.  Termight: Identifying and Translating Technical Terminology , 1994, ANLP.

[10]  Ilias Iakovidis,et al.  Towards personal health record: current situation, obstacles and trends in implementation of electronic healthcare record in Europe , 1998, Int. J. Medical Informatics.

[11]  Amnon Shabo,et al.  Model Formulation: HL7 Clinical Document Architecture, Release 2 , 2006, J. Am. Medical Informatics Assoc..

[12]  Sam Heard,et al.  Archetype Definitions and Principles , 2003 .

[13]  Antonietta Alonge,et al.  ItalWordNet: a Large Semantic Database for Italian , 2000, LREC.

[14]  Sergio Bolasco,et al.  Statistica testuale e text mining: alcuni paradigmi applicativi , 2005 .

[15]  Flora Amato,et al.  A system for semantic retrieval and long-term preservation of multimedia documents in the e-government domain , 2009, Int. J. Web Grid Serv..

[16]  Christopher G. Chute,et al.  Standardized problem list generation, utilizing the Mayo canonical vocabulary embedded within the Unified Medical Language System , 1997, AMIA.

[17]  F. Tamburini,et al.  ANNOTAZIONE GRAMMATICALE E LEMMATIZZAZIONE DI CORPORA IN ITALIANO , 2001 .

[18]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[19]  D. Kalra,et al.  EN 13606 Health informatics - Electronic health record communication - Part 1: Reference model , 2007 .