Building the Profile of Web Events Based on Website Measurement

Nowadays, Web makes it possible to study emergencies from web information due to its real-time, open, and dynamic features. After the emergence of a web event, there will be numerous websites publishing webpages to cover this web event. Measuring temporal features in evolution course of web events can help people timely know and understand which events are emergencies, so harms to the society caused by emergencies can be reduced. In this paper, website preference is formally defined and mined by three proposed strategies which are all explicitly or implicitly based on the three-level networks: website-level, webpage-level and keyword-level. An iterative algorithm is firstly introduced to calculate outbreak power of web events, and increased web pages of events, increased attributes of events, distribution of attributes in web pages and the relationships of attributes are embedded into this iterative algorithm as the variables. By means of prior knowledge, membership grade of web events belong to each type can be calculated, and then the type of web events can be discriminated. Experiments on real data set demonstrate the proposed algorithm is both efficient and effective, and it is capable of providing accurate results of discrimination.

[1]  Chih-Ping Wei,et al.  Discovering Event Evolution Graphs From News Corpora , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[2]  Dov Te'eni,et al.  Content versus structure in information environments: a longitudinal analysis of website preferences , 2000, ICIS.

[3]  Lizhe Wang,et al.  Incremental building association link network , 2011, Comput. Syst. Sci. Eng..

[4]  Yiming Yang,et al.  Topic Detection and Tracking Pilot Study Final Report , 1998 .

[5]  David M. Blei,et al.  Hierarchical relational models for document networks , 2009, 0909.4331.

[6]  Juha Makkonen,et al.  Investigations on Event Evolution on TDT , 2003, NAACL.

[7]  Xue Chen,et al.  Building Association Link Network for Semantic Link on Web Resources , 2011, IEEE Transactions on Automation Science and Engineering.

[8]  Lan Chen,et al.  Semantic Link Network-Based Model for Organizing Multimedia Big Data , 2014, IEEE Transactions on Emerging Topics in Computing.

[9]  Lan Chen,et al.  Knowle: A semantic link network based system for organizing large scale online news events , 2015, Future Gener. Comput. Syst..

[10]  Guangquan Zhang,et al.  Uncertainty Analysis for the Keyword System of Web Events , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[11]  Yunhuai Liu,et al.  Crowdsourcing based social media data analysis of urban emergency events , 2017, Multimedia Tools and Applications.

[12]  Haiyan Chen,et al.  The semantic analysis of knowledge map for the traffic violations from the surveillance video big data , 2015, Comput. Syst. Sci. Eng..

[13]  Ricardo A. Baeza-Yates,et al.  A content and structure website mining model , 2006, WWW '06.

[14]  Rob Law,et al.  A New Framework on Website Evaluation , 2010, 2010 International Conference on E-Business and E-Government.