DeCAT 2015 - Workshop on Deep Content Analytics Techniques for Personalized and Intelligent Services

Personal Information Management (PIM) research is challenging primarily due to the inherent nature of PIM. Studies have shown that people often adopt their own schemes when organising their personal collections, possibly because PIM tool-support is still lacking. In this paper we investigate the problem of automatic organisation of personal information into task-clusters by transparently exploiting the user’s behaviour while performing some tasks. We conduct a controlled experiment, with 22 participants, using three different task-execution strategies to gather clean data for our evaluation. We use our PiMx (PIM analytix) framework to analyse this data and understand better the issues associated with this problem. Based on this analysis, we then present the incremental density-based clustering algorithm, iDeTaCt, that is able to transparently generate task-clusters by exploiting document switching and revisitation. We evaluate the algorithm’s performance using the collected datasets. The results obtained are very encouraging and merit further investigation.

[1]  Alejandro Bellogín,et al.  Ontology-Based Personalised and Context-Aware Recommendations of News Items , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[2]  Carolyn Penstein Rosé,et al.  Author Age Prediction from Text using Linear Regression , 2011, LaTeCH@ACL.

[3]  A. Braun,et al.  Neural Substrates of Spontaneous Musical Performance: An fMRI Study of Jazz Improvisation , 2008, PloS one.

[4]  Michele Marchesi,et al.  Bitcoin Spread Prediction Using Social and Web Search Media , 2015, UMAP Workshops.

[5]  R. Gunning The Technique of Clear Writing. , 1968 .

[6]  R. Flesch A new readability yardstick. , 1948, The Journal of applied psychology.

[7]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[8]  Pentti Kanerva,et al.  Sparse Distributed Memory , 1988 .

[9]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[10]  Maria E. Orlowska,et al.  Recency-based collaborative filtering , 2006, ADC.

[11]  Avar Pentel Employing Relation between Reading and Writing Skills on Age Based Categorization of Short Estonian Texts , 2015, UMAP Workshops.

[12]  Graeme Ritchie,et al.  Some Empirical Criteria for Attributing Creativity to a Computer Program , 2007, Minds and Machines.

[13]  Pythagoras Karampiperis,et al.  Creativity Profiling Server: Modelling the Principal Components of Human Creativity over Texts , 2015, UMAP Workshops.

[14]  Mike Thelwall,et al.  Sentiment in Twitter events , 2011, J. Assoc. Inf. Sci. Technol..

[15]  Z. Kövecses,et al.  A new look at metaphorical creativity in cognitive linguistics , 2010 .

[16]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[17]  Shyhtsun Felix Wu,et al.  Estimating the Size of Online Social Networks , 2010, 2010 IEEE Second International Conference on Social Computing.

[18]  Todd Lubart,et al.  How can computers be partners in the creative process: Classification and commentary on the Special Issue , 2005, Int. J. Hum. Comput. Stud..

[19]  Reuben Grinberg Bitcoin: An Innovative Alternative Digital Currency , 2011 .

[20]  Noah A. Smith,et al.  Age Prediction from Text using Linear Regression , 2011 .

[21]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[22]  Fergal Reid,et al.  An Analysis of Anonymity in the Bitcoin System , 2011, PASSAT 2011.

[23]  Shlomo Argamon,et al.  Automatically profiling the author of an anonymous text , 2009, CACM.

[24]  Xiaojin Zhu,et al.  How Creative is Your Writing? , 2009 .

[25]  Gediminas Adomavicius,et al.  Incorporating contextual information in recommender systems using a multidimensional approach , 2005, TOIS.

[26]  Pavlin Mavrodiev,et al.  The digital traces of bubbles: feedback cycles between socio-economic signals in the Bitcoin economy , 2014, Journal of The Royal Society Interface.

[27]  Xi Chen,et al.  Temporal Collaborative Filtering with Bayesian Probabilistic Tensor Factorization , 2010, SDM.

[28]  Fernando Díez,et al.  Simple time-biased KNN-based recommendations , 2010, CAMRa '10.

[29]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[30]  Kenneth O. Stanley,et al.  Beyond Open-endedness: Quantifying Impressiveness , 2012, ALIFE.

[31]  José Palazzo Moreira de Oliveira,et al.  Using Simple Content Features for the Author Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.

[32]  Zoⅇ Lawson,et al.  管理 Interact 数据源 , 2017 .

[33]  Pythagoras Karampiperis,et al.  From Computational Creativity Metrics to the Principal Components of Human Creativity , 2014, KICSS.

[34]  Hui Xiong,et al.  Enhancing recommender systems under volatile userinterest drifts , 2009, CIKM.

[35]  Annalina Caputo,et al.  Modeling Short-Term Preferences in Time-Aware Recommender Systems , 2015, UMAP Workshops.

[36]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[37]  Ramrao Wagh,et al.  Application of Social Media for Tracking Knowledge in Agile Software Projects , 2012 .

[38]  Pasquale Lops,et al.  Random Indexing and Negative User Preferences for Enhancing Content-Based Recommender Systems , 2011, EC-Web.

[39]  Michael J. Pazzani,et al.  User Modeling for Adaptive News Access , 2000, User Modeling and User-Adapted Interaction.

[40]  Walter Daelemans,et al.  Predicting age and gender in online social networks , 2011, SMUC '11.

[41]  Xue Li,et al.  Time weight collaborative filtering , 2005, CIKM '05.

[42]  David Maxwell Chickering,et al.  Using Temporal Data for Making Recommendations , 2001, UAI.

[43]  Marie-Francine Moens,et al.  Age and Gender Identification in Social Media , 2014, CLEF.

[44]  Philipp Koehn,et al.  A parallel corpus for statistical machine translation , 2005 .

[45]  E A Smith,et al.  Automated readability index. , 1967, AMRL-TR. Aerospace Medical Research Laboratories.

[46]  Iván Cantador,et al.  Time-aware recommender systems: a comprehensive survey and analysis of existing evaluation protocols , 2013, User Modeling and User-Adapted Interaction.

[47]  Licia Capra,et al.  Temporal collaborative filtering with adaptive neighbourhoods , 2009, SIGIR.

[48]  Peter A. Gloor,et al.  Nowcasting the Bitcoin Market with Twitter Signals , 2014, ArXiv.

[49]  Guy Shani,et al.  Evaluating Recommendation Systems , 2011, Recommender Systems Handbook.

[50]  Jimeng Sun,et al.  Temporal recommendation on graphs via long- and short-term preference fusion , 2010, KDD.

[51]  Siegfried Handschuh,et al.  Automatic Task-Cluster Generation based on Document Switching and Revisitation , 2015, UMAP Workshops.

[52]  Tony Veale,et al.  An analogy-oriented type hierarchy for linguistic creativity , 2006, Knowl. Based Syst..

[53]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[54]  Anshul Mittal,et al.  Stock Prediction Using Twitter Sentiment Analysis , 2011 .

[55]  John Burrows,et al.  All the Way Through: Testing for Authorship in Different Frequency Strata , 2007, Lit. Linguistic Comput..

[56]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[57]  A. Cardoso,et al.  Modeling Forms of Surprise in Artificial Agents: Empirical and Theoretical Study of Surprise Functions , 2004 .

[58]  Judith Munat Lexical Creativity, Texts and Contexts , 2007 .

[59]  T. Rao,et al.  Analyzing Stock Market Movements Using Twitter Sentiment Analysis , 2012, ASONAM 2012.

[60]  Simon Günter,et al.  Short Text Authorship Attribution via Sequence Kernels, Markov Chains and Author Unmasking: An Investigation , 2006, EMNLP.

[61]  G. Harry McLaughlin,et al.  SMOG Grading - A New Readability Formula. , 1969 .

[62]  ThelwallMike,et al.  Sentiment strength detection in short informal text , 2010 .

[63]  Adi Shamir,et al.  Quantitative Analysis of the Full Bitcoin Transaction Graph , 2013, Financial Cryptography.

[64]  Kenneth O. Stanley,et al.  Exploiting Open-Endedness to Solve Problems Through the Search for Novelty , 2008, ALIFE.

[65]  Katherine A. Brady,et al.  Computational Models of Surprise in Evaluating Creative Design , 2013 .

[66]  Costin-Gabriel Chiru Creativity Detection in Texts , 2013, ICIW 2013.

[67]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[68]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[69]  Annalina Caputo,et al.  Negation for Document Re-ranking in Ad-hoc Retrieval , 2011, ICTIR.