Detection of Outlier Information Using Linguistic Summarization

The main goal of automatic summarization of databases is usually to characterize a collection of data in terms of its dominant information. Complementing this task, the present paper shows how linguistic summarization can be used to characterize databases containing textual records by detecting the outlier information they contain. The method applies a fuzzy measure of similarity between sentences to the summarization result. A certain level of standardization of the textual records is assumed.
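The abstract does not specify the similarity measure or the decision rule, so the following Python code is only a minimal sketch of the general idea. It assumes a substring-based fuzzy similarity between words, aggregates it to sentence level, and flags as outliers the summary sentences that resemble no other sentence. The function names, the aggregation, and the 0.5 threshold are all hypothetical choices made for illustration; they are not taken from the paper.

```python
def word_similarity(a: str, b: str) -> float:
    """Assumed fuzzy similarity in [0, 1]: share of substrings of `a` found in `b`.

    Normalized by the longer word, so identical words score exactly 1.0.
    The measure is directed and not necessarily symmetric.
    """
    a, b = a.lower(), b.lower()
    n = max(len(a), len(b))
    if n == 0:
        return 1.0
    hits = sum(
        1
        for size in range(1, len(a) + 1)
        for start in range(len(a) - size + 1)
        if a[start:start + size] in b
    )
    return 2.0 * hits / (n * n + n)


def sentence_similarity(s1: str, s2: str) -> float:
    """Assumed aggregation: best word match for each word of s1, averaged."""
    w1, w2 = s1.split(), s2.split()
    if not w1 or not w2:
        return 0.0
    n = max(len(w1), len(w2))
    return sum(max(word_similarity(u, v) for v in w2) for u in w1) / n


def outlier_sentences(sentences, threshold=0.5):
    """Flag sentences whose best similarity to any other sentence is below threshold.

    The threshold value is an illustrative assumption.
    """
    flagged = []
    for i, s in enumerate(sentences):
        others = [t for j, t in enumerate(sentences) if j != i]
        best = max((sentence_similarity(s, t) for t in others), default=0.0)
        if best < threshold:
            flagged.append(s)
    return flagged


if __name__ == "__main__":
    summaries = [
        "most records report normal pressure",
        "most records report normal temperature",
        "few records mention valve failure",
    ]
    print(outlier_sentences(summaries))
```

Because the word measure relies on shared character substrings, it works best when records use a consistent vocabulary, which matches the standardization assumption stated in the abstract.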
