Collaborative Company Profiling: Insights from an Employee's Perspective

Company profiling is an analytical process to build an indepth understanding of company's fundamental characteristics. It serves as an effective way to gain vital information of the target company and acquire business intelligence. Traditional approaches for company profiling rely heavily on the availability of rich finance information about the company, such as finance reports and SEC filings, which may not be readily available for many private companies. However, the rapid prevalence of online employment services enables a new paradigm - to obtain the variety of company's information from their employees' online ratings and comments. This, in turn, raises the challenge to develop company profiles from an employee's perspective. To this end, in this paper, we propose a method named Company Profiling based Collaborative Topic Regression (CPCTR), for learning the latent structural patterns of companies. By formulating a joint optimization framework, CPCTR has the ability in collaboratively modeling both textual (e.g., reviews) and numerical information (e.g., salaries and ratings). Indeed, with the identified patterns, including the positive/negative opinions and the latent variable that influences salary, we can effectively carry out opinion analysis and salary prediction. Extensive experiments were conducted on a real-world data set to validate the effectiveness of CPCTR. The results show that our method provides a comprehensive understanding of company characteristics and delivers a more effective prediction of salaries than other baselines.

[1]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[2]  John D. Lafferty,et al.  Correlated Topic Models , 2005, NIPS.

[3]  Enhong Chen,et al.  Tracking the Evolution of Social Emotions: A Time-Aware Topic Modeling Perspective , 2014, 2014 IEEE International Conference on Data Mining.

[4]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[5]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[6]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[7]  Hui Xiong,et al.  Recruitment Market Trend Analysis with Sequential Latent Variable Models , 2016, KDD.

[8]  John P. Rice,et al.  Profiling Enterprise Risks in Large Computer Companies Using the Leximancer Software Tool , 2007 .

[9]  H. T. Goranson,et al.  Assessing Enterprise Integration for Competitive Advantage - Workshop 2, Working Group 1 , 1997, ICEIMT.

[10]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[11]  K. Vivekanandan,et al.  Aspect-based Opinion Mining: A Survey , 2014 .

[12]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[13]  Enhong Chen,et al.  Matrix Factorization with Scale-Invariant Parameters , 2015, IJCAI.

[14]  Benjamin M. Marlin,et al.  Modeling User Rating Profiles For Collaborative Filtering , 2003, NIPS.

[15]  Florian Kerschbaum,et al.  Building a Privacy-Preserving Benchmarking Enterprise System , 2007, 11th IEEE International Enterprise Distributed Object Computing Conference (EDOC 2007).

[16]  Choon Seong Leem,et al.  Information technology maturity stages and enterprise benchmarking: an empirical study , 2008, Ind. Manag. Data Syst..

[17]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.

[18]  Thomas L. Griffiths,et al.  The Author-Topic Model for Authors and Documents , 2004, UAI.

[19]  Richard S. Zemel,et al.  The multiple multiplicative factor model for collaborative filtering , 2004, ICML.

[20]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[21]  Joachim Knuf,et al.  Benchmarking the Lean Enterprise: Organizational Learning at Work , 2000 .

[22]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[23]  K. Schittkowski,et al.  NONLINEAR PROGRAMMING , 2022 .

[24]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.