Paper information - Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States

The United States spends more than $1B each year on initiatives such as the American Community Survey (ACS), a labor-intensive door-to-door study that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although the ACS is a comprehensive source of data, the lag between demographic changes and their appearance in the survey can exceed half a decade. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may provide a cheaper and faster alternative. Here, we present a method that determines socioeconomic trends from 50 million images of street scenes, gathered in 200 American cities by Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22M automobiles in total (8% of all automobiles in the US), was used to accurately estimate income, race, education, and voting patterns, with single-precinct resolution. (The average US precinct contains approximately 1000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a 15-minute drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next Presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographic trends may effectively complement labor-intensive approaches, with the potential to detect trends with fine spatial resolution, in close to real time.
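The sedan-versus-pickup rule quoted in the abstract can be written as a one-line comparison. The following is a minimal sketch, not the authors' code: the `VehicleCounts` container and the `predict_party` function are hypothetical names introduced for illustration, and the counts in the usage lines are made up. The 88% and 82% figures are the chances reported in the abstract for each branch; the sketch does not compute them.

```python
from dataclasses import dataclass


@dataclass
class VehicleCounts:
    """Hypothetical tally of vehicles seen during one 15-minute drive."""
    sedans: int
    pickups: int


def predict_party(counts: VehicleCounts) -> str:
    """Apply the rule as stated in the abstract.

    Returns "Democrat" when sedans strictly outnumber pickup trucks,
    and "Republican" otherwise (ties fall into the "otherwise" branch).
    """
    return "Democrat" if counts.sedans > counts.pickups else "Republican"


if __name__ == "__main__":
    # Illustrative counts only; these are not data from the paper.
    print(predict_party(VehicleCounts(sedans=120, pickups=75)))
    print(predict_party(VehicleCounts(sedans=40, pickups=40)))
```

Treating a tie as Republican follows the abstract's wording, which assigns every case other than "sedans higher than pickups" to the second branch.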
