Extracting the Dynamic Popularity of Concepts from a Corpus of Short-Sentence Documents

The decomposition of information into ever smaller units is a commonly observed process on the Web, with Twitter and RSS as prominent manifestations. As a consequence, the information world is shifting from one in which information comes in large chunks to one dominated by short-sentence documents. This shrinking of information chunks goes along with an explosion in their number, so information is often aggregated in corpora of documents consisting of many short sentences. Identifying the important concepts in such corpora is a difficult but necessary task for understanding the information as a whole, and understanding how the popularity of these concepts changes is necessary to capture the evolution of the corpus over time. This paper proposes a method to extract the important concepts from a corpus of short-sentence documents, a model of the popularity of concepts and its dynamics, and an algorithm to analyze the dynamics of important concepts. Finally, the proposed method is validated with an analysis of the titles of the articles published at eleven IFIP Working Conferences on Virtual Enterprises, from PRO-VE’99 to PRO-VE’10.
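To make the idea of concept popularity and its dynamics more concrete, the following Python sketch may help. It is not the method proposed in the paper, whose details are not given in this abstract. It assumes, purely for illustration, that a concept is a single non-stopword term, that its popularity in a period is the share of titles in that period containing the term, and that its dynamics are the differences in popularity between consecutive periods. The function names, the stopword list, and the sample titles are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption of this sketch).
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def tokenize(title):
    """Lowercase a title and return its alphabetic tokens, minus stopwords."""
    return [t for t in re.findall(r"[a-z]+", title.lower()) if t not in STOPWORDS]


def popularity_by_period(corpus):
    """Map each period to {term: share of that period's titles containing the term}.

    `corpus` maps a period label to a list of titles (short-sentence documents).
    """
    result = {}
    for period, titles in corpus.items():
        doc_freq = Counter()
        for title in titles:
            # Count each term at most once per title (document frequency).
            doc_freq.update(set(tokenize(title)))
        n = len(titles)
        result[period] = {term: count / n for term, count in doc_freq.items()} if n else {}
    return result


def popularity_change(popularity, term, periods):
    """Return the popularity differences of `term` between consecutive periods."""
    series = [popularity[p].get(term, 0.0) for p in periods]
    return [later - earlier for earlier, later in zip(series, series[1:])]


# Made-up titles for illustration only; they are not taken from any proceedings.
sample_corpus = {
    "period_1": ["virtual enterprise infrastructure", "virtual enterprise modelling"],
    "period_2": ["collaborative network governance",
                 "virtual enterprise and collaborative network"],
}
sample_popularity = popularity_by_period(sample_corpus)
sample_periods = ["period_1", "period_2"]
virtual_change = popularity_change(sample_popularity, "virtual", sample_periods)
collaborative_change = popularity_change(sample_popularity, "collaborative", sample_periods)
```

In this toy corpus the term "virtual" loses popularity between the two periods while "collaborative" gains it. A more realistic pipeline would probably normalize word forms (for example by stemming) and handle multi-word concepts, which this sketch leaves out.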
