Smart4Job: A Big Data Framework for Intelligent Job Offers Broadcasting Using Time Series Forecasting and Semantic Classification

Abstract Recently, dedicated web portals and social networks for the automatization of the recruitment processes, have emerged with the expansion of the Internet, leading to a wide use of optimized algorithms. To that aim, a lot of job boards websites have been created, for disseminating and sharing at the best the job offers. Choosing the relevant job board for the broadcasting of a given job offer can be sometimes difficult for the recruiters, since they always seek to attract the best candidates, in a short period of time. Moreover, some job boards have different business categories, which can make the selection very difficult. To deal with these problems, we propose in this paper, Smart4Job a new job boards recommendation system, which proposes the adequate job boards for the dissemination of a new job offer. Our system is based on a hybrid representation on a big data platform, which includes both a domain knowledge analysis and a temporal prediction model. The semantic classification of job boards requires a textual analysis using a controlled vocabulary. The time series analysis module is used to predict the best job board for a given offer, using the history of the clicks. The answers of these modules are combined during the decision making process. The proposed system has been evaluated on real data, and preliminary results seem very promising.

[1]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[2]  Peter Brusilovsky,et al.  Reinforcing Recommendation Using Implicit Negative Feedback , 2009, UMAP.

[3]  Qiang Du,et al.  Convergence of the Lloyd Algorithm for Computing Centroidal Voronoi Tessellations , 2006, SIAM J. Numer. Anal..

[4]  Julie Séguéla,et al.  Fouille de données textuelles et systèmes de recommandation appliqués aux offres d'emploi diffusées sur le web. (Text mining and recommender systems applied to job postings) , 2012 .

[5]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[6]  FuTak-chung A review on time series data mining , 2011 .

[7]  Roberto Navigli,et al.  Multilingual Word Sense Disambiguation and Entity Linking for Everybody , 2014, International Semantic Web Conference.

[8]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[9]  Sidahmed Benabderrahmane,et al.  Evaluating distance measures and times series clustering for temporal patterns retrieval , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[10]  Chris Chatfield,et al.  Holt‐Winters Forecasting: Some Practical Issues , 1988 .

[11]  Mounia Lalmas,et al.  A survey on the use of relevance feedback for information access systems , 2003, The Knowledge Engineering Review.

[12]  Joseph S. Kong,et al.  The Love-Hate Square Counting Method for Recommender Systems , 2012, KDD Cup.

[13]  Monnie McGee,et al.  Introduction to Time Series Analysis and Forecasting: With Applications of SAS and SPSS , 2000 .

[14]  Paul Compton,et al.  Combining Career Progression and Profile Matching in a Job Recommender System , 2014, PRICAI.

[15]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[16]  Ying-ting Zhu,et al.  K-medoids clustering based on MapReduce and optimal search of medoids , 2014, 2014 9th International Conference on Computer Science & Education.

[17]  Guo Tao,et al.  Parallel k-modes algorithm based on MapReduce , 2015, 2015 Third International Conference on Digital Information, Networking, and Wireless Communications (DINWC).

[18]  Chayaporn Kaensar Design of Personal Job Recommendation Framework on Smartphone Platform , 2015 .

[19]  Chuck Lam,et al.  Hadoop in Action , 2010 .

[20]  Huan Wang,et al.  A Job Recommender System Based on User Clustering , 2013, J. Comput..

[21]  Mustapha Lebbah,et al.  SOM Clustering Using Spark-MapReduce , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[22]  Qiang Du,et al.  Centroidal Voronoi Tessellations: Applications and Algorithms , 1999, SIAM Rev..

[23]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[24]  Concha Bielza,et al.  A comparison of clustering quality indices using outliers and noise , 2012, Intell. Data Anal..

[25]  L. Kelley,et al.  An automated approach for clustering an ensemble of NMR-derived protein structures into conformationally related subfamilies. , 1996, Protein engineering.

[26]  F. Trichet,et al.  CommOn: A framework for developing knowledge-based systems dedicated to competency-based management , 2006, 28th International Conference on Information Technology Interfaces, 2006..