Rank Aggregation for Non-stationary Data Streams

We consider the problem of learning over non-stationary ranking streams. The rankings can be interpreted as the preferences of a population and the non-stationarity means that the distribution of preferences changes over time. Our goal is to learn, in an online manner, the current distribution of rankings. The bottleneck of this process is a rank aggregation problem. We propose a generalization of the Borda algorithm for non-stationary ranking streams. Moreover, we give bounds on the minimum number of samples required to output the ground truth with high probability. Besides, we show how the optimal parameters are set. Then, we generalize the whole family of weighted voting rules (the family to which Borda belongs) to situations in which some rankings are more \textit{reliable} than others and show that this generalization can solve the problem of rank aggregation over non-stationary data streams.

[1]  Masayuki Takeda,et al.  Online Rank Aggregation , 2012, ACML.

[2]  Devavrat Shah,et al.  Iterative Learning for Reliable Crowdsourcing Systems , 2011, NIPS.

[3]  Ya'akov Gal,et al.  A Study of Human Behavior in Online Voting , 2015, AAMAS.

[4]  R. Duncan Luce,et al.  Individual Choice Behavior , 1959 .

[5]  Atri Rudra,et al.  Ordering by weighted number of wins gives a good ranking for weighted tournaments , 2006, SODA '06.

[6]  Ariel D. Procaccia,et al.  Dynamic Social Choice with Evolving Preferences , 2013, AAAI.

[7]  J. A. Lozano,et al.  Sampling and learning the Mallows and Generalized Mallows models under the Cayley distance , 2014 .

[8]  Mark Braverman,et al.  Noisy sorting without resampling , 2007, SODA '08.

[9]  M. Fligner,et al.  Distance Based Ranking Models , 1986 .

[10]  Joseph S. Verducci,et al.  Probability models on rankings. , 1991 .

[11]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[12]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[13]  Eyke Hüllermeier,et al.  Open challenges for data stream mining research , 2014, SKDD.

[14]  Eyke Hüllermeier,et al.  Preference-Based Rank Elicitation using Statistical Models: The Case of Mallows , 2014, ICML.

[15]  J. A. Lozano,et al.  PerMallows: An R Package for Mallows and Generalized Mallows Models , 2016 .

[16]  Ariel D. Procaccia,et al.  When do noisy votes reveal the truth? , 2013, EC '13.

[17]  Craig Boutilier,et al.  Learning Mallows Models with Pairwise Preferences , 2011, ICML.

[18]  Kate Larson,et al.  Matching with Dynamic Ordinal Preferences , 2015, AAAI.

[19]  Valeria Vitelli,et al.  Probabilistic preference learning with the Mallows rank model , 2014, J. Mach. Learn. Res..

[20]  M. Fligner,et al.  Multistage Ranking Models , 1988 .