Tax Collections Optimization for New York State

The New York State Department of Taxation and Finance (NYS DTF) collects over $1 billion annually in assessed delinquent taxes. The mission of DTF's Collections and Civil Enforcement Division (CCED) is to increase collections, but to do so in a manner that respects the rights of citizens, by taking actions commensurate with each debtor's situation. CCED must accomplish this in an environment with limited resources. In a collaborative work, NYS DTF, IBM Research, and IBM Global Business Services developed a novel tax collection optimization solution to address this challenge. The operations research-based solution combines data analytics and optimization using the unifying framework of constrained Markov decision processes (C-MDP). The system optimizes the collection actions of agents with respect to maximizing long-term returns, while taking into account the complex dependencies among business needs, resources, and legal constraints. It generates a customized collections policy instead of broad-brush rules, thereby improving both the efficiency and adaptiveness of the collections process. It also enhances and improves the tax agency's ability to administer taxes equitably across the broad scope of individual taxpayers' situations. The system became operational in December 2009; from 2009 to 2010, New York State increased its collections from delinquent revenue by $83 million (8 percent) using the same set of resources. Given a typical annual increase of 2 to 4 percent, the system's expected benefit is approximately $120 to $150 million over a period of three years, far exceeding the initial target of $90 million.

[1]  Edwin P. D. Pednault,et al.  Segmented Regression Estimators for Massive Data Sets , 2002, SDM.

[2]  Edwin P. D. Pednault,et al.  Segmentation-based modeling for advanced targeted marketing , 2001, KDD '01.

[3]  L. C. Baird,et al.  Reinforcement learning in continuous time: advantage updating , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[4]  Naoki Abe,et al.  Optimizing debt collections using constrained reinforcement learning , 2010, KDD.

[5]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6]  Fusheng Wang,et al.  XBiT: An XML-Based Bitemporal Data Model , 2004, ER.

[7]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[8]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[9]  Naoki Abe,et al.  Cross channel optimized marketing by reinforcement learning , 2004, KDD.

[10]  Eitan Altman,et al.  Asymptotic properties of constrained Markov Decision Processes , 1993, ZOR Methods Model. Oper. Res..

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[13]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[14]  G. J. Hahn,et al.  Managing Consumer Credit Delinquency in the US Economy: A Multi-Billion Dollar Management Science Application , 1992 .