Text-Driven Multi-structured Data Analytics for Enterprise Intelligence

Text data constitutes the bulk of all enterprise data. Text repositories are not only tacit store-houses of knowledge about its people, projects and processes but also contain invaluable information about its customers, competitors, suppliers, partners and all other stakeholders. Mining this data can provide interesting and valuable insights provided it is appropriately integrated with other enterprise data. In this paper we propose a framework for text-driven analysis of multi-structured data.

[1]  Lipika Dey,et al.  Opinion mining from noisy text data , 2008, AND '08.

[2]  Lipika Dey,et al.  A framework to integrate unstructured and structured data for enterprise analytics , 2013, Proceedings of the 16th International Conference on Information Fusion.

[3]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[4]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[5]  Lipika Dey,et al.  Multi-Perspective Linking of News Articles within a Repository , 2011, IJCAI.

[6]  Gerhard Weikum,et al.  From information to knowledge: harvesting entities and relationships from web sources , 2010, PODS '10.

[7]  Rahul Gupta,et al.  LIPTUS: associating structured and unstructured information in a banking environment , 2007, SIGMOD '07.

[8]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[9]  Lotfi A. Zadeh,et al.  The concept of a linguistic variable and its application to approximate reasoning-III , 1975, Inf. Sci..

[10]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[11]  Roger Levy,et al.  Deep Dependencies from Context-Free Statistical Parsers: Correcting the Surface Dependency Approximation , 2004, ACL.

[12]  Gautam Shroff,et al.  Catching the Long-Tail: Extracting Local News Events from Twitter , 2012, ICWSM.

[13]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[15]  Andrei Z. Broder,et al.  On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[16]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[17]  Andreas Thor,et al.  Dynamic Fusion of Web Data , 2007, XSym.

[18]  Diego Reforgiato Recupero,et al.  Sentiment Analysis: Adjectives and Adverbs are Better than Adjectives Alone , 2007, ICWSM.

[19]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[20]  Akshi Kumar,et al.  Sentiment Analysis on Twitter , 2012 .

[21]  Christopher D. Manning,et al.  Efficient, Feature-based, Conditional Random Field Parsing , 2008, ACL.

[22]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[23]  Gautam Shroff,et al.  Enterprise information fusion for real-time business intelligence , 2011, 14th International Conference on Information Fusion.

[24]  L. A. ZADEH,et al.  The concept of a linguistic variable and its application to approximate reasoning - I , 1975, Inf. Sci..

[25]  Mukesh K. Mohania,et al.  Enhanced Business Intelligence using EROCS , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[26]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[27]  LiGuo Huang,et al.  Text mining in supporting software systems risk assurance , 2010, ASE '10.

[28]  Mukesh K. Mohania,et al.  Efficiently linking text documents with relevant structured information , 2006, VLDB.

[29]  Gautam Shroff,et al.  Acquiring competitive intelligence from social media , 2011, MOCR_AND '11.