Analysis of Human Behavior by Mining Textual Data: Current Research Topics and Analytical Techniques

The goal of this study was to conduct a literature review of current approaches and techniques for identifying, understanding, and predicting human behaviors through mining a variety of sources of textual data with a focus on enabling classification of psychological behaviors regarding emotion, cognition, and social empathy. This review was performed using keyword searches in ISI Web of Science, Engineering Village Compendex, ProQuest Dissertations, and Google Scholar. Our findings show that, despite recent advancements in predicting human behaviors based on unstructured textual data, significant developments in data analytics systems for identification, determination of interrelationships, and prediction of human cognitive, emotional and social behaviors remain lacking.

[1]  Ravi Kumar Singh,et al.  Web Mining Overview, Techniques, Tools and Applications: A Survey , 2017 .

[2]  James W. Pennebaker,et al.  Mind mapping: Using everyday language to explore social & psychological processes , 2017 .

[3]  Fredrik Johansson,et al.  Harvesting and analysis of weak signals for detecting lone wolf terrorists , 2012, 2012 European Intelligence and Security Informatics Conference.

[4]  Burairah Hussin,et al.  Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization , 2013 .

[5]  Oscar Mayora-Ibarra,et al.  Mobile phones as medical devices in mental disorder treatment: an overview , 2014, Personal and Ubiquitous Computing.

[6]  Masud Ibn Afjal,et al.  Study of Abstractive Text Summarization Techniques , 2017 .

[7]  Meng Wang,et al.  Product Aspect Ranking and Its Applications , 2014, IEEE Transactions on Knowledge and Data Engineering.

[8]  Lisa Kaati,et al.  Detecting Linguistic Markers for Radical Violence in Social Media , 2014 .

[9]  Mario Graff,et al.  Opinion mining and emotion recognition applied to learning environments , 2020, Expert Syst. Appl..

[10]  Dirk Helbing,et al.  Saving Human Lives: What Complexity Science and Information Systems can Contribute , 2014, Journal of statistical physics.

[11]  Jimy Frank Oblitas Cruz,et al.  Study of Coronavirus Impact on Parisian Population from April to June using Twitter and Text Mining Approach , 2020, medRxiv.

[12]  Ramadan Moawad,et al.  Opinion Mining and Sentimental Analysis Approaches: A Survey , 2014 .

[13]  Ryan L. Boyd,et al.  Language-based personality: a new approach to personality in a digital world , 2017, Current Opinion in Behavioral Sciences.

[14]  Alessandro Polli,et al.  Emotional Text Mining: Customer profiling in brand management , 2020, Int. J. Inf. Manag..

[15]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[16]  A. SalloumSaid,et al.  A survey of text mining in social media facebook and twitter perspectives , 2017 .

[17]  Sule Gündüz Ögüdücü,et al.  Identifying topical influencers on twitter based on user behavior and network topology , 2018, Knowl. Based Syst..

[18]  I. R. Vanani Text Analytics of Customers on Twitter: Brand Sentiments in Customer Support , 2019 .

[19]  Rejikumar G,et al.  An Exploration about the Last Mile Logistic Efficiency in Indian E-Commerce Sector- A Text Mining Approach , 2020 .

[20]  Andrew McCallum,et al.  Information Extraction , 2005, ACM Queue.

[21]  J. Pennebaker,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES Words of Wisdom: Language Use Over the Life Span , 2003 .

[22]  Stefan Smolnik,et al.  When Emotions Rule Knowledge: A Text-Mining Study of Emotions in Knowledge Management Research , 2021, Int. J. Knowl. Manag..

[23]  Meenakshi Bansal,et al.  Hierarchical Sentiment Analysis Model for Automatic Review Classification for E-commerce Users , 2017 .

[24]  James W. Pennebaker,et al.  Automated computer-based feedback in expressive writing , 2012, Comput. Hum. Behav..

[25]  A. Cohen,et al.  Clarifying the Linguistic Signature: Measuring Personality From Natural Speech , 2008, Journal of personality assessment.

[26]  Andrea K. Wittenborn,et al.  #MyDepressionLooksLike: Examining Public Discourse About Depression on Twitter , 2017, JMIR mental health.

[27]  Josimar E. Chire-Saire Characterizing Twitter Interaction during COVID-19 pandemic using Complex Networks and Text Mining , 2020 .

[28]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[29]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA Statement , 2009, BMJ : British Medical Journal.

[30]  Gualtiero Fantoni,et al.  Estimating Industry 4.0 impact on job profiles and skills using text mining , 2020, Comput. Ind..

[31]  Jorge A. Balazs,et al.  Opinion Mining and Information Fusion: A survey , 2016, Inf. Fusion.

[32]  Yong Shi,et al.  The Role of Text Pre-processing in Sentiment Analysis , 2013, ITQM.

[33]  Cindy K. Chung,et al.  Revealing Dimensions of Thinking in Open-Ended Self-Descriptions: An Automated Meaning Extraction Method for Natural Language. , 2008, Journal of research in personality.

[34]  Chen Wenyu,et al.  Text‐based emotion detection: Advances, challenges, and opportunities , 2020, Engineering Reports.

[35]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[36]  Chetan Mate Product Aspect Ranking using Sentiment Analysis: A Survey , 2016 .

[37]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[38]  C. Schmidt,et al.  A time to think: Circadian rhythms in human cognition , 2007, Cognitive neuropsychology.

[39]  Donald E. Brown,et al.  Text Classification Algorithms: A Survey , 2019, Inf..

[40]  Debabrata Swain,et al.  A Suicide Prediction System Based on Twitter Tweets Using Sentiment Analysis and Machine Learning , 2021 .

[41]  J Brian Rutland,et al.  Development of a Scale to Measure Problem Use of Short Message Service: The SMS Problem Use Diagnostic Questionnaire , 2007, Cyberpsychology Behav. Soc. Netw..

[42]  Cristian Bucur Opinion Mining Platform for Intelligence in Business , 2014 .

[43]  Ole-Christoffer Granmo,et al.  The Tsetlin Machine - A Game Theoretic Bandit Driven Approach to Optimal Pattern Recognition with Propositional Logic , 2018, ArXiv.

[44]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[45]  Charu C. Aggarwal,et al.  An Introduction to Text Mining , 2022 .

[46]  J. Sterne,et al.  The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials , 2011, BMJ : British Medical Journal.

[47]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[48]  Lokesh Singh,et al.  Clustering Techniques: A Brief Survey of Different Clustering Algorithms , 2012 .

[49]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[50]  Waldemar Karwowski,et al.  Identification and Prediction of Human Behavior through Mining of Unstructured Textual Data , 2020, Symmetry.

[51]  Suad Alhojely,et al.  Sentiment Analysis and Opinion Mining: A Survey , 2016 .

[52]  G. Scepi,et al.  Using Opinion Mining as an educational analytic: An integrated strategy for the analysis of students’ feedback , 2021 .

[53]  David Diehl,et al.  Language and interaction: applying sociolinguistics to social network analysis , 2018, Quality & Quantity.

[54]  Waldemar Karwowski,et al.  Estimating intrinsic dimensionality using the multi-criteria decision weighted model and the average standard estimator , 2010, Inf. Sci..

[55]  Anand Mahendran,et al.  Opinion Mining For Text Classification , 2013 .