Information Extraction from Free-Text Business Documents

The objective of this chapter is an investigation of the applicability of information extraction techniques in real-world business applications dealing with textual data since business relevant data is mainly transmitted through free-text documents. In particular, we give an overview of the information extraction task, designing information extraction systems and some examples of existing information extraction systems applied in the financial, insurance and legal domains. Furthermore, we demonstrate the enormous indexing potential of lightweight linguistic text processing techniques applied in information extraction systems and other closely related fields of information technology which concern processing vast amounts of textual data.

[1]  Wojciech Skut,et al.  Intelligent Information Extraction , 2000 .

[2]  Mehdi Khosrowpour Success and pitfalls of information technology management , 1999 .

[3]  Mehdi Khosrow-Pour,et al.  Printed at: , 2011 .

[4]  Brian W. Hollocks Qualitative Research in IS: Issues and Trends , 2002, Eur. J. Inf. Syst..

[5]  Witold Abramowicz,et al.  Knowledge Discovery for Business Information Systems , 2001 .

[6]  Ralph Grishman,et al.  Message Understanding Conference- 6: A Brief History , 1996, COLING.

[7]  Wei Li,et al.  Information Extraction Supported Question Answering , 1999, TREC.

[8]  Philip J. Hayes,et al.  Automatic Extraction of Facts from Press Releases to Generate News Stories , 1992, ANLP.

[9]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Evaluation , 2000, TREC.

[10]  Witold Abramowicz,et al.  Workflow technology supporting information filtering from the internet , 2003 .

[11]  Amy B. Woszczynski,et al.  The Handbook of Information Systems Research , 2003 .

[12]  Yorick Wilks,et al.  University of Sheffield: description of the LaSIE system as used for MUC-6 , 1995, MUC.

[13]  Steven L. Lytinen,et al.  ATRANS Automatic Processing of Money Transfer Messages , 1986, AAAI.

[14]  Ellen Riloff,et al.  Extraction-based Text Categorization: Generating Domain-specific Role Relationships , 1999 .

[15]  Yorick Wilks,et al.  University of Sheffield: Description of the LaSIE System as Used for MUC-6 , 1995, MUC.

[16]  Douglas E. Appelt,et al.  FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text , 1997, ArXiv.

[17]  Mila Ramos-Santacruz,et al.  REES: A Large-Scale Relation and Event Extraction System , 2000, ANLP.

[18]  Ralph Grishman,et al.  Unsupervised Discovery of Scenario-Level Patterns for Information Extraction , 2000, ANLP.

[19]  Peter Jackson,et al.  Information extraction from case law and retrieval of prior cases by partial parsing and query generation , 1998, CIKM '98.

[20]  PhD Witold Abramowicz MSc,et al.  Filtering the Web to Feed Data Warehouses , 2002, Springer London.

[21]  Douglas E. Appelt,et al.  Introduction to Information Extraction , 1999, AI Commun..

[22]  Emmanuel Morin,et al.  Extracting Semantic Relationships between Terms: Supervised vs. Unsupervised Methods , 1999 .

[23]  Roberto Garigliano,et al.  Natural language processing and information extraction: qualitative analysis of financial news articles , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[24]  Romaric Besançon,et al.  Text Mining, knowledge extraction from unstructured textual data , 1998 .

[25]  A. Levine,et al.  New estimates of the storage permanence and ocean co-benefits of enhanced rock weathering , 2023, PNAS nexus.

[26]  Rada Mihalcea,et al.  Document Indexing using Named Entities , 2001 .

[27]  Lisa F. Rau,et al.  SCISOR: extracting information from on-line news , 1990, CACM.

[28]  Sanda M. Harabagiu,et al.  Experiments with Open-Domain Textual Question Answering , 2000, COLING.

[29]  David Fisher,et al.  MITA: An Information-Extraction Approach to the Analysis of Free-Form Text in Life Insurance Applications , 1998, AI Mag..

[30]  Tom Hampton,et al.  SRA: Description of the IE2 System Used for MUC-7 , 1998, MUC.