Knowledge discovery from evolving social media data

[1]  Naoki Shibata,et al.  Identification and Visualization of Emerging Trends from Blogosphere , 2007, ICWSM.

[2]  Hanan Samet,et al.  NewsStand: a new view on news , 2008, GIS '08.

[3]  C. Baldry Theories of The Information Society , 1988 .

[4]  Christos Faloutsos,et al.  Kronecker Graphs: An Approach to Modeling Networks , 2008, J. Mach. Learn. Res..

[5]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[6]  Silvio Lattanzi,et al.  On compressing social networks , 2009, KDD.

[7]  Jeffrey Pennington,et al.  Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions , 2011, EMNLP.

[8]  Cecilia Mascolo,et al.  The Call of the Crowd: Event Participation in Location-Based Social Services , 2014, ICWSM.

[9]  Christos Faloutsos,et al.  EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs , 2010, PAKDD.

[10]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[11]  Jeffrey D. Ullman,et al.  Enumerating subgraph instances using map-reduce , 2012, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[12]  Geoffrey G. Hazel,et al.  Multivariate Gaussian MRF for multispectral scene segmentation and anomaly detection , 2000, IEEE Trans. Geosci. Remote. Sens..

[13]  Nigel Shadbolt,et al.  Tag Meaning Disambiguation through Analysis of Tripartite Structure of Folksonomies , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[14]  Denzil Ferreira,et al.  HotCity: enhancing ubiquitous maps with social context heatmaps , 2013, MUM.

[15]  Deepayan Chakrabarti,et al.  Evolutionary clustering , 2006, KDD '06.

[16]  Bamshad Mobasher,et al.  Personalized recommendation in social tagging systems using hierarchical clustering , 2008, RecSys '08.

[17]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[18]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[19]  Lefteris Angelis,et al.  Requirements and architecture design principles for a smart city experiment with sensor and social networks integration , 2013, PCI '13.

[20]  Edwin Simpson,et al.  Clustering Tags in Enterprise and Web Folksonomies , 2021, ICWSM.

[21]  Matthew Hurst,et al.  BlogPulse: Automated Trend Discovery for Weblogs , 2003 .

[22]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[23]  Diane J. Cook,et al.  Graph-based anomaly detection , 2003, KDD '03.

[24]  Jie Li,et al.  Bridging the Gap between Desktop and the Cloud for eScience Applications , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[25]  Yiannis Kompatsiaris,et al.  ImproveMyCity: an open source platform for direct citizen-government communication , 2013, ACM Multimedia.

[26]  Rose Yu,et al.  GLAD: group anomaly detection in social media analysis , 2014, ACM Trans. Knowl. Discov. Data.

[27]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[28]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[29]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[30]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[31]  Prasenjit Mitra,et al.  Event Detection and Visualization for Social Text Streams , 2007, ICWSM.

[32]  Chao Wu,et al.  Analysis of tag within online social networks , 2009, GROUP.

[33]  Cecilia R. Aragon,et al.  Randomized search trees , 1989, 30th Annual Symposium on Foundations of Computer Science.

[34]  Jianwen Su,et al.  Maintaining Transitive Closure of Graphs in SQL , 1999 .

[35]  Vittorio Loreto,et al.  Emergent Community Structure in Social Tagging Systems , 2008, Adv. Complex Syst..

[36]  Zhiyong Lu,et al.  Automatic Extraction of Clusters from Hierarchical Clustering Representations , 2003, PAKDD.

[37]  Philip S. Yu,et al.  Incremental tensor analysis: Theory and applications , 2008, TKDD.

[38]  Kazufumi Watanabe,et al.  Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs , 2011, CIKM '11.

[39]  I. Jolliffe Discarding Variables in a Principal Component Analysis. Ii: Real Data , 1973 .

[40]  Athena Vakali,et al.  Harvesting Opinions and Emotions from Social Media Textual Resources , 2015, IEEE Internet Computing.

[41]  A. Arenas,et al.  Community analysis in social networks , 2004 .

[42]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[43]  Mia Hubert,et al.  ROBPCA: A New Approach to Robust Principal Component Analysis , 2005, Technometrics.

[44]  Raffaele Giancarlo,et al.  New results for finding common neighborhoods in massive graphs in the data stream model , 2008, Theor. Comput. Sci..

[45]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[46]  Kyumin Lee,et al.  Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter , 2011, ICWSM.

[47]  R. Badia,et al.  COMPSs in the VENUS-C Platform : enabling e-Science applications on the Cloud , 2011 .

[48]  Zhengding Lu,et al.  Community mining on dynamic weighted directed graphs , 2009, CIKM-CNIKM.

[49]  Mohamed F. Mokbel,et al.  Location-based and preference-aware recommendation using sparse geo-social networking data , 2012, SIGSPATIAL/GIS.

[50]  Gregory Buehrer,et al.  A scalable pattern mining approach to web graph compression with communities , 2008, WSDM '08.

[51]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[52]  Jimeng Sun,et al.  MetaFac: community discovery via relational hypergraph factorization , 2009, KDD.

[53]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[54]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[55]  Christos Faloutsos,et al.  Metric forensics: a multi-level approach for mining volatile graphs , 2010, KDD.

[56]  Silvio Lattanzi,et al.  Filtering: a method for solving graph problems in MapReduce , 2011, SPAA '11.

[57]  Sebastiano Vigna,et al.  Permuting Web Graphs , 2009, WAW.

[58]  Athena Vakali,et al.  Capturing Social Data Evolution Using Graph Clustering , 2013, IEEE Internet Computing.

[59]  Guofei Gu,et al.  Analyzing spammers' social networks for fun and profit: a case study of cyber criminal ecosystem on twitter , 2012, WWW.

[60]  Yizhou Sun,et al.  Graph Regularized Transductive Classification on Heterogeneous Information Networks , 2010, ECML/PKDD.

[61]  Mohammad Ali Abbasi,et al.  TweetTracker: An Analysis Tool for Humanitarian and Disaster Relief , 2011, ICWSM.

[62]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[63]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[64]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[65]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[66]  Jonathan Cohen,et al.  Graph Twiddling in a MapReduce World , 2009, Computing in Science & Engineering.

[67]  Ling Chen,et al.  Event detection from flickr data through wavelet-based spatial analysis , 2009, CIKM.

[68]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[69]  Charu C. Aggarwal,et al.  When will it happen?: relationship prediction in heterogeneous information networks , 2012, WSDM '12.

[70]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[71]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[72]  Myra Spiliopoulou,et al.  Evolution in Social Networks: A Survey , 2011, Social Network Data Analytics.

[73]  Athena Vakali,et al.  User communities evolution in microblogs: A public awareness barometer for real world events , 2015, World Wide Web.

[74]  Cécile Bothorel,et al.  An Algorithm for Detecting Communities in Folksonomy Hypergraphs , 2011, IICS.

[75]  Charu C. Aggarwal,et al.  An Introduction to Social Network Data Analytics , 2011, Social Network Data Analytics.

[76]  Bo Zhao,et al.  PET: a statistical model for popular events tracking in social communities , 2010, KDD.

[77]  David Jurgens,et al.  Friends , Enemies , and Lovers : Detecting Communities in Networks Where Relationships Matter , 2012 .

[78]  Yiannis Kompatsiaris,et al.  Community detection in Social Media , 2012, Data Mining and Knowledge Discovery.

[79]  Theresa A. Pardo,et al.  Conceptualizing smart city with dimensions of technology, people, and institutions , 2011, dg.o '11.

[80]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[81]  Gang Wang,et al.  Follow the green: growth and dynamics in twitter follower markets , 2013, Internet Measurement Conference.

[82]  Joan Feigenbaum,et al.  On graph problems in a semi-streaming model , 2005, Theor. Comput. Sci..

[83]  Hirotoshi Iwasaki,et al.  BEIRA: A Geo-semantic Clustering Method for Area Summary , 2007, WISE.

[84]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[85]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[86]  Barnabás Póczos,et al.  Group Anomaly Detection using Flexible Genre Models , 2011, NIPS.

[87]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[88]  H. W. B.,et al.  The Principles of Sociology , 1897, Nature.

[89]  Christopher H. Brooks,et al.  Improved annotation of the blogosphere via autotagging and hierarchical clustering , 2006, WWW '06.

[90]  J. Bindé Towards knowledge societies: UNESCO world report , 2005 .

[91]  Felix Jungermann,et al.  Stream-based Community Discovery via Relational Hypergraph Factorization on Evolving Networks , 2010, LWA.

[92]  M. Hubert,et al.  A Robust Measure of Skewness , 2004 .

[93]  Yihong Gong,et al.  Incremental spectral clustering by efficiently updating the eigen-system , 2010, Pattern Recognit..

[94]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[95]  Yun Chi,et al.  Blog Community Discovery and Evolution Based on Mutual Awareness Expansion , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[96]  Yiannis Kompatsiaris,et al.  Cluster-Based Landmark and Event Detection for Tagged Photo Collections , 2011, IEEE MultiMedia.

[97]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[98]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[99]  David A. Bader,et al.  Compact graph representations and parallel connectivity algorithms for massive dynamic network analysis , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[100]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[101]  Thomas Liebig,et al.  Using Data from Location Based Social Networks for Urban Activity Clustering , 2013, AGILE Conf..

[102]  David A. Bader,et al.  Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2 , 2006, 2006 International Conference on Parallel Processing (ICPP'06).

[103]  Andres Agostini THE FOURTH REVOLUTION: HOW THE INFOSPHERE IS RESHAPING HUMAN REALITY! , 2015 .

[104]  Athena Vakali,et al.  Analysis of Content Popularity in Social Bookmarking Systems , 2010 .

[105]  Mor Naaman,et al.  Unfolding the event landscape on twitter: classification and exploration of user categories , 2012, CSCW '12.

[106]  Jiawei Han,et al.  gSkeletonClu: Density-Based Network Clustering via Structure-Connected Tree Division or Agglomeration , 2010, 2010 IEEE International Conference on Data Mining.

[107]  Sebastiano Vigna,et al.  The webgraph framework I: compression techniques , 2004, WWW '04.

[108]  Santo Fortunato,et al.  Finding Statistically Significant Communities in Networks , 2010, PloS one.

[109]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[110]  Christos Faloutsos,et al.  CatchSync: catching synchronized behavior in large directed graphs , 2014, KDD.

[111]  Alex Pentland,et al.  Twitter: who gets caught? observed trends in social micro-blogging spam , 2014, WebSci '14.

[112]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.

[113]  Dawn Xiaodong Song,et al.  Suspended accounts in retrospect: an analysis of twitter spam , 2011, IMC '11.

[114]  Michael Brinkmeier,et al.  Communities in graphs and hypergraphs , 2007, CIKM '07.

[115]  Zhu Wang,et al.  A sentiment-enhanced personalized location recommendation system , 2013, HT.

[116]  David A. Shamma,et al.  Conversational Shadows: Describing Live Media Events Using Short Messages , 2010, ICWSM.

[117]  Leonidas G. Anthopoulos,et al.  Social networks in smart cities: Comparing evaluation models , 2015, 2015 IEEE First International Smart Cities Conference (ISC2).

[118]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[119]  Po-Ching Lin,et al.  A study of effective features for detecting long-surviving Twitter spam accounts , 2013, 2013 15th International Conference on Advanced Communications Technology (ICACT).

[120]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[121]  Sandeep Sripada,et al.  Modeling and Analysis of Real World Networks using Kronecker Graphs , 2010 .

[122]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[123]  R. Hollands Will the real smart city please stand up? , 2008, The Routledge Companion to Smart Cities.

[124]  L. Anthopoulos,et al.  Defining Smart City Architecture for Sustainability , 2015 .

[125]  Tore Opsahl,et al.  Clustering in weighted networks , 2009, Soc. Networks.

[126]  Christos Faloutsos,et al.  PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[127]  R. Garrett The chi-square plot: a tool for multivariate outlier recognition , 1989 .

[128]  H P Reckort,et al.  [Social change]. , 1973, Zahnarztliche Mitteilungen.

[129]  A. Moffat,et al.  Offline dictionary-based compression , 2000, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).

[130]  A. Faisal,et al.  Scaling-Laws of Human Broadcast Communication Enable Distinction between Human, Corporate and Robot Twitter Users , 2013, PloS one.

[131]  Marc Lemercier,et al.  SPOT 1.0: Scoring Suspicious Profiles on Twitter , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[132]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[133]  Philip K. Chan,et al.  Modeling multiple time series for anomaly detection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[134]  Athena Vakali,et al.  Leveraging Collective Intelligence through Community Detection in Tag Networks ∗ , 2009 .

[135]  Christos Faloutsos,et al.  LOCI: fast outlier detection using the local correlation integral , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[136]  Ruixuan Li,et al.  Incremental K-clique clustering in dynamic social networks , 2012, Artificial Intelligence Review.

[137]  Bin Wu,et al.  Overlapping Community Detection in Bipartite Networks , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[138]  Lefteris Angelis,et al.  Sensors talk and humans sense Towards a reciprocal collective awareness smart city framework , 2013, 2013 IEEE International Conference on Communications Workshops (ICC).

[139]  Yiannis Kompatsiaris,et al.  Benchmarking Graph Databases on the Problem of Community Detection , 2014, ADBIS.

[140]  Guy E. Blelloch,et al.  Ligra: a lightweight graph processing framework for shared memory , 2013, PPoPP '13.

[141]  Jimeng Sun,et al.  gbase: an efficient analysis platform for large graphs , 2012, The VLDB Journal.

[142]  Manfred Schroeder,et al.  Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise , 1992 .

[143]  A. Vakali,et al.  Chapter 2 Massive Graph Management for the Web and Web 2 . 0 , 2011 .

[144]  Luca Becchetti,et al.  Efficient semi-streaming algorithms for local triangle counting in massive graphs , 2008, KDD.

[145]  Srinivasan Parthasarathy,et al.  An event-based framework for characterizing the evolutionary behavior of interaction graphs , 2007, KDD '07.

[146]  James Caverlee,et al.  Transient crowd discovery on the real-time social web , 2011, WSDM '11.

[147]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[148]  Pietro Liò,et al.  Towards real-time community detection in large networks. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[149]  Andreas Hotho,et al.  Emergent Semantics in BibSonomy , 2006, GI Jahrestagung.

[150]  Ricardo Baeza-Yates,et al.  Who Are My Audiences? A Study of the Evolution of Target Audiences in Microblogs , 2014, SocInfo.

[151]  Sudipto Guha,et al.  Graph Sparsification in the Semi-streaming Model , 2009, ICALP.

[152]  M. Castells Rise of the Network Society: The Information Age: Economy, Society and Culture , 1996 .

[153]  Gonzalo Navarro,et al.  A Fast and Compact Web Graph Representation , 2007, SPIRE.

[154]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[155]  Christos Faloutsos,et al.  Spotting Suspicious Link Behavior with fBox: An Adversarial Perspective , 2014, 2014 IEEE International Conference on Data Mining.

[156]  Luc Van Gool,et al.  World-scale mining of objects and events from community photo collections , 2008, CIVR '08.

[157]  G. Strang Introduction to Linear Algebra , 1993 .

[158]  Mia Hubert,et al.  Computational Statistics and Data Analysis Robust Pca for Skewed Data and Its Outlier Map , 2022 .

[159]  Christos Faloutsos,et al.  Netprobe: a fast and scalable system for fraud detection in online auction networks , 2007, WWW '07.

[160]  Paul Erdös,et al.  On random graphs, I , 1959 .

[161]  Yun Chi,et al.  Evolutionary spectral clustering by incorporating temporal smoothness , 2007, KDD '07.

[162]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[163]  Jiawei Han,et al.  Progressive clustering of networks using Structure-Connected Order of Traversal , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[164]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[165]  Valentin Robu,et al.  The complex dynamics of collaborative tagging , 2007, WWW '07.

[166]  Dennis J. Turner,et al.  Symantec Internet Security Threat Report Trends for July 04-December 04 , 2005 .

[167]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[168]  Pradeep Dubey,et al.  Navigating the maze of graph analytics frameworks using massive graph datasets , 2014, SIGMOD Conference.