Ontology-based recommender system using social network data

Online Social Networks (OSNs) are considered a key source of information for real-time decision making. However, several constraints reduce the amount of information a researcher can collect while increasing the time that social network mining takes. In this context, this paper proposes a new framework for sampling OSNs. Domain knowledge is used to define tailored strategies that can decrease the budget and time required for mining while increasing recall. An ontology supports the filtering layer in evaluating the relatedness of nodes. We further show that the same mechanism can be extended to provide recommendations to users. Our test cases and experimental results emphasize the importance of the strategy definition step in our social miner and of applying ontologies to the knowledge graph in the domain of recommendation analysis.
