Population Synthesis Using Iterative Proportional Fitting (IPF): A Review and Future Research

Abstract Activity-Based travel demand modeling requires the detailed socioeconomic data of the study area population. Since the collection of such detailed data for the whole population is too expensive, if not infeasible, population synthesis has been proposed to predict the data and produce them synthetically based on a sample. This much cheaper alternative for forecasting population characteristics is based on iterative proportional fitting (IPF) and has been the focus of much research due to its many advantages. This paper seeks to critically review the state of the art of IPF, classify the peer-reviewed literature, investigate the major problems of IPF, and identify gaps for future research to enhance the usefulness of IPF for the synthesis. Our review shows that integer conversion and zero-cell are among the most important problems necessitating empirical investigation. Unbiased tabular (controlled) rounding methods should be developed to integerize the fractional numbers estimated by IPF for the frequency of household types. Zero-cell problem, although already dealt with, still lacks unbiased solutions. Simulation-based synthesis can help to avoid zero-cell problem while it has many other advantages.

[1]  Paul Williamson,et al.  An evaluation of the combinatorial optimisation approach to the creation of synthetic microdata , 2000 .

[2]  Wilfred W. Recker,et al.  A model of complex travel behavior: Part II—An operational model , 1986 .

[3]  Friedrich Pukelsheim,et al.  Biproportional scaling of matrices and the iterative proportional fitting procedure , 2014, Ann. Oper. Res..

[4]  W. Deming,et al.  On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known , 1940 .

[5]  Hillel Bar-Gera,et al.  ESTIMATING SURVEY WEIGHTS WITH MULTIPLE CONSTRAINTS USING ENTROPY OPTIMIZATION METHODS , 2009 .

[6]  Robin Lovelace,et al.  'Truncate, replicate, sample': A method for creating integer weights for spatial microsimulation , 2013, Comput. Environ. Urban Syst..

[7]  R. Kitamura An evaluation of activity-based travel analysis , 1988 .

[8]  S. Fienberg An Iterative Procedure for Estimation in Contingency Tables , 1970 .

[9]  Michel Bierlaire,et al.  Simulation based Population Synthesis , 2013 .

[10]  David A. Hensher,et al.  TRESIS: A transportation, land use and environmental strategy impact simulator for urban areas , 2002 .

[11]  W.W. Recker,et al.  A MODEL OF COMPLEX TRAVEL BEHAVIOR: PART I. THEORETICAL DEVELOPMENT , 1985 .

[12]  Amir Reza Mamdoohi,et al.  Population Synthesis in Activity-Based Models , 2015 .

[13]  Juan José Salazar González,et al.  Enhanced controlled tabular adjustment , 2014, Comput. Oper. Res..

[14]  Graham Clarke,et al.  SimBritain: a spatial microsimulation approach to population dynamics , 2005 .

[15]  Alison J. Heppenstall,et al.  Creating Realistic Synthetic Populations at Varying Spatial Scales: A Comparative Critique of Population Synthesis Techniques , 2012, J. Artif. Soc. Soc. Simul..

[16]  S. Srinivasan,et al.  Procedure for Forecasting Household Characteristics for Input to Travel-demand Models , 2008 .

[17]  David Voas,et al.  Evaluating Goodness-of-Fit Measures for Synthetic Microdata , 2001 .

[18]  Lu Ma,et al.  Synthetic Population Generation with Multilevel Controls: A Fitness‐Based Synthesis Approach and Validations , 2015, Comput. Aided Civ. Infrastructure Eng..

[19]  P. Waddell,et al.  Methodology to Match Distributions of Both Household and Person Attributes in Generation of Synthetic Populations , 2009 .

[20]  M. D. McKay,et al.  Creating synthetic baseline populations , 1996 .

[21]  Eric J. Miller,et al.  Advances in population synthesis: fitting many attributes per agent and fitting to household and person margins simultaneously , 2012 .

[22]  Frederick Mosteller,et al.  Association and Estimation in Contingency Tables , 1968 .

[23]  Ta Theo Arentze,et al.  Creating Synthetic Household Populations , 2007 .

[24]  Harry Timmermans,et al.  Progress in Activity-Based Analysis , 2005 .

[25]  Joshua Auld,et al.  Efficient Methodology for Generating Synthetic Populations with Multiple Control Levels , 2010 .

[26]  Joshua Auld,et al.  Population Synthesis with Region-Level Control Variable Aggregation , 2008 .

[27]  Chandra R. Bhat,et al.  Population Synthesis for Microsimulating Travel Behavior , 2007 .

[28]  Johan Barthelemy,et al.  Synthetic Population Generation Without a Sample , 2013, Transp. Sci..

[29]  S. Kullback,et al.  Contingency tables with given marginals. , 1968, Biometrika.

[30]  David W. S. Wong,et al.  The Reliability of Using the Iterative Proportional Fitting Procedure , 1992 .

[31]  A. Agresti Categorical data analysis , 1993 .