Adaptive matching for expert systems with uncertain task types

Online two-sided matching markets such as Q&A forums (e.g. StackOverflow, Quora) and online labour platforms (e.g. Upwork) critically rely on the ability to propose adequate matches based on imperfect knowledge of the two parties to be matched. This prompts the following question: Which matching recommendation algorithms can, in the presence of such uncertainty, lead to efficient platform operation? To answer this question, we develop a model of a task / server matching system. For this model, we give a necessary and sufficient condition for an incoming stream of tasks to be manageable by the system. We further identify a so-called back-pressure policy under which the throughput that the system can handle is optimized. We show that this policy achieves strictly larger throughput than a natural greedy policy. Finally, we validate our model and confirm our theoretical findings with experiments based on logs of Math. StackExchange, a StackOverflow forum dedicated to mathematics.

[1]  Leandros Tassiulas,et al.  Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks , 1992 .

[2]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[3]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 1985 .

[4]  Andreas Krause,et al.  Near Optimal Bayesian Active Learning for Decision Making , 2014, AISTATS.

[5]  Upendra Dave,et al.  Applied Probability and Queues , 1987 .

[6]  Morteza Zadimoghaddam,et al.  Online Stochastic Matching with Unequal Probabilities , 2014, SODA.

[7]  J. Harrison Heavy traffic analysis of a system with parallel servers: asymptotic optimality of discrete-review policies , 1998 .

[8]  R. Srikant,et al.  Optimal heavy-traffic queue length scaling in an incompletely saturated switch , 2016, Queueing Systems.

[9]  Ward Whitt,et al.  Scheduling Flexible Servers with Convex Delay Costs in Many-Server Service Systems , 2009, Manuf. Serv. Oper. Manag..

[10]  Ann Appl,et al.  On the Positive Harris Recurrence for Multiclass Queueing Networks: a Uniied Approach via Uid Limit Models , 1999 .

[11]  Shipra Agrawal,et al.  Analysis of Thompson Sampling for the Multi-armed Bandit Problem , 2011, COLT.

[12]  Jon M. Kleinberg,et al.  Using mixture models for collaborative filtering , 2004, STOC '04.

[13]  Aranyak Mehta,et al.  Online Matching with Stochastic Rewards , 2012, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.

[14]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[15]  Ronald J. Williams,et al.  Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy , 2001 .

[16]  Yashodhan Kanoria,et al.  Know Your Customer: Multi-armed Bandits with Capacity Constraints , 2016, ArXiv.

[17]  L. Massouli'e Structural properties of proportional fairness: Stability and insensitivity , 2007, 0707.4542.

[18]  J. Dai On Positive Harris Recurrence of Multiclass Queueing Networks: A Unified Approach Via Fluid Limit Models , 1995 .

[19]  Xue-Ming Yuan,et al.  Stability of Data Networks: Stationary and Bursty Models , 2005, Oper. Res..

[20]  Leandros Tassiulas,et al.  Resource Allocation and Cross-Layer Control in Wireless Networks , 2006, Found. Trends Netw..

[21]  J. Dai On Positive Harris Recurrence of Multiclass Queueing Networks: A Unified Approach Via Fluid Limit Models , 1995 .

[22]  Thore Graepel,et al.  WWW 2009 MADRID! Track: Data Mining / Session: Statistical Methods Matchbox: Large Scale Online Bayesian Recommendations , 2022 .

[23]  Jon M. Kleinberg,et al.  Convergent algorithms for collaborative filtering , 2003, EC '03.

[24]  Eytan Modiano,et al.  Dynamic power allocation and routing for time varying wireless networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[25]  Chao Gao,et al.  Exact Exponent in Optimal Rates for Crowdsourcing , 2016, ICML.

[26]  Gustavo de Veciana,et al.  A Stable Approach for Routing Queries in Unstructured P2P Networks , 2016, IEEE/ACM Transactions on Networking.

[27]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[28]  R. Tweedie Criteria for classifying general Markov chains , 1976, Advances in Applied Probability.

[29]  Xi Chen,et al.  Spectral Methods Meet EM: A Provably Optimal Algorithm for Crowdsourcing , 2014, J. Mach. Learn. Res..

[30]  Devavrat Shah,et al.  Budget-Optimal Task Allocation for Reliable Crowdsourcing Systems , 2011, Oper. Res..

[31]  Tolga Tezcan,et al.  Dynamic Control of N-Systems with Many Servers: Asymptotic Optimality of a Static Priority Policy in Heavy Traffic , 2010, Oper. Res..

[32]  Lei Ying,et al.  On Combining Shortest-Path and Back-Pressure Routing Over Multihop Wireless Networks , 2011, IEEE/ACM Transactions on Networking.

[33]  Andreas Krause,et al.  Near-Optimal Bayesian Active Learning with Noisy Observations , 2010, NIPS.

[34]  Mihalis G. Markakis,et al.  Learning and Hierarchies in Service Systems , 2019, Manag. Sci..

[35]  Chien-Ju Ho,et al.  Online Task Assignment in Crowdsourcing Markets , 2012, AAAI.

[36]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[37]  Alexander L. Stolyar,et al.  Scheduling Flexible Servers with Convex Delay Costs: Heavy-Traffic Optimality of the Generalized cµ-Rule , 2004, Oper. Res..

[38]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[39]  Yuval Peres,et al.  Approval Voting and Incentives in Crowdsourcing , 2015, ICML.

[40]  John Odentrantz,et al.  Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues , 2000, Technometrics.

[41]  Laurent Massoulié,et al.  On the capacity of information processing systems , 2016, COLT.

[42]  Bhaskar Krishnamachari,et al.  Backpressure with Adaptive Redundancy (BWAR) , 2012, 2012 Proceedings IEEE INFOCOM.

[43]  Yashodhan Kanoria,et al.  Matching while Learning , 2016, EC.

[44]  Hisashi Kashima,et al.  Budgeted stream-based active learning via adaptive submodular maximization , 2016, NIPS.

[45]  Luca de Alfaro,et al.  WorkerRank: Using Employer Implicit Judgements to Infer Worker Reputation , 2015, WSDM.

[46]  Ness B. Shroff,et al.  Delay-Based Back-Pressure Scheduling in Multihop Wireless Networks , 2011, IEEE/ACM Transactions on Networking.

[47]  Alexander L. Stolyar,et al.  A Novel Architecture for Reduction of Delay and Queueing Structure Complexity in the Back-Pressure Algorithm , 2011, IEEE/ACM Transactions on Networking.

[48]  Ying Cui,et al.  Enhancing the Delay Performance of Dynamic Backpressure Algorithms , 2016, IEEE/ACM Transactions on Networking.

[49]  Andreas Krause,et al.  Near-optimal Batch Mode Active Learning and Adaptive Submodular Optimization , 2013, ICML.