Intelligent Information Integration: Reclaiming the Intelligence

The authors present their work in the conceptualization, design, implementation, and application of “lean†information integration systems. They present a new data integration approach based on a schema-less data management and integration paradigm, which enables developing cost-effective large scale integration applications. They have designed and developed a highly scalable, information-on-demand system called NETMARK, which facilitates information access and integration based on a theory of articulation management and a context sensitive paradigm. NETMARK has been widely deployed for managing, storing, and searching unstructured or semi-structured arbitrary XML and HTML information at the National Aeronautics Space Administration (NASA). In this paper the authors describe the theory, design and implementation of our system, present experimental benchmark evaluations, and validate our approach through real-world applications in the NASA enterprise.

[1]  Xuan F. Zha,et al.  Artificial Intelligence and Integrated Intelligent Information Systems: Emerging Technologies and Applications , 2006 .

[2]  Cong Yu,et al.  TIMBER: A native XML database , 2002, The VLDB Journal.

[3]  Peter F. Patel-Schneider,et al.  "Reducing" CLASSIC to Practice: Knowledge Representation Theory Meets Reality , 1999, Artif. Intell..

[4]  Eugene J. Shekita,et al.  XTABLES: Bridging relational technology and XML , 2002, IBM Syst. J..

[5]  Vijayan Sugumaran Intelligent Information Technologies: Concepts, Methodologies, Tools and Applications , 2007 .

[6]  Carlos A. Iglesias,et al.  The Agent-Oriented Methodology MAS-CommonKADS , 2005 .

[7]  Jayant Madhavan,et al.  Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.

[8]  Vasilis Vassalos,et al.  Xpath on steroids: exploiting relational engines for xpath performance , 2007, SIGMOD '07.

[9]  Torsten Grust,et al.  Why off-the-shelf RDBMSs are better at XPath than you might expect , 2007, SIGMOD '07.

[10]  Nick Roussopoulos,et al.  Interoperability of multiple autonomous databases , 1990, CSUR.

[11]  Gio Wiederhold,et al.  Abstraction of Representation for Interoperation , 1997, ISMIS.

[12]  Jayavel Shanmugasundaram,et al.  Context-Sensitive Keyword Search and Ranking for XML , 2005, WebDB.

[13]  Cong Yu,et al.  Enabling Schema-Free XQuery with meaningful query focus , 2008, The VLDB Journal.

[14]  John McCarthy,et al.  Notes on Formalizing Context , 1993, IJCAI.

[15]  G. Marreiros,et al.  A Survey on the use of Emotions, Mood, and Personality in Ambient Intelligence and Smart Environments , 2011 .

[16]  D.A. Maluf,et al.  Managing Unstructured Data With Structured Legacy Systems , 2008, 2008 IEEE Aerospace Conference.

[17]  Fulvio Mastrogiovanni,et al.  Proactive Assistance in Ecologies of Physically Embedded Intelligent Systems: A Constraint-Based Approach , 2011 .

[18]  W.J. McDermott,et al.  Searching Across the International Space Station Databases , 2007, 2007 IEEE Aerospace Conference.

[19]  Alon Y. Halevy Data Integration: A Status Report , 2003, BTW.

[20]  Alon Y. Halevy,et al.  Enterprise information integration: successes, challenges and controversies , 2005, SIGMOD '05.

[21]  Alon Y. Halevy,et al.  An XML query engine for network-bound data , 2002, The VLDB Journal.

[22]  Robert M. MacGregor,et al.  Inside the LOOM description classifier , 1991, SGAR.

[23]  Vassilis J. Tsotras,et al.  Twig query processing over graph-structured XML data , 2004, WebDB '04.

[24]  Thomas R. Gruber,et al.  The Role of Common Ontology in Achieving Sharable, Reusable Knowledge Bases , 1991, KR.

[25]  V. Sugumaran The Inaugural Issue of the International Journal of Intelligent Information Technologies , 2005 .

[26]  T. Warren Liao,et al.  A New Efficient and Effective Fuzzy Modeling Method for Binary Classification , 2011, Int. J. Fuzzy Syst. Appl..

[27]  Joann J. Ordille,et al.  Data integration: the teenage years , 2006, VLDB.

[28]  Prasenjit Mitra,et al.  An Ontology-Composition Algebra , 2004, Handbook on Ontologies.

[29]  Ioana Manolescu,et al.  The XML benchmark project , 2001 .

[30]  David A. Maluf,et al.  Business Intelligence in Large Organizations: Integrating Which Data? , 2006, ISMIS.

[31]  James F. Brinkley,et al.  Issues in biomedical research data management and analysis: needs and barriers. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[32]  Yannis Papakonstantinou,et al.  Efficient LCA based keyword search in xml data , 2007, CIKM '07.

[33]  Alon Y. Halevy,et al.  The Nimble XML data integration system , 2001, Proceedings 17th International Conference on Data Engineering.

[34]  Timothy W. Finin,et al.  Enabling Technology for Knowledge Sharing , 1991, AI Mag..

[35]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[36]  David A. Maluf,et al.  Semi-structured data management in the enterprise: a nimble, high-throughput, and scalable approach , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).

[37]  Jayant Madhavan,et al.  Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.

[38]  Alejandro Pazos Sierra,et al.  Encyclopedia of Artificial Intelligence , 2008 .

[39]  David A. Maluf,et al.  NETMARK: A Schema-Less Extension for Relational Databases for Managing Semi-structured Data Dynamically , 2003, ISMIS.

[40]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[41]  Ramanathan V. Guha,et al.  The evolution of CycL, the Cyc representation language , 1991, SGAR.

[42]  Yan Wang Document-Driven Design for Distributed CAD Services , 2007 .

[43]  Christine Collet,et al.  Resource integration using a large knowledge base in Carnot , 1991, Computer.

[44]  Torsten Grust,et al.  MonetDB/XQuery: a fast XQuery processor powered by a relational engine , 2006, SIGMOD Conference.

[45]  David A. Maluf,et al.  Articulation management for intelligent integration of information , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[46]  Cong Yu,et al.  TIMBER: a native system for querying XML , 2003, SIGMOD '03.