FaceLift: a transparent deep learning framework to beautify urban scenes

In the area of computer vision, deep learning techniques have recently been used to predict whether urban scenes are likely to be considered beautiful: it turns out that these techniques are able to make accurate predictions. Yet they fall short when it comes to generating actionable insights for urban design. To support urban interventions, one needs to go beyond predicting beauty, and tackle the challenge of recreating beauty. Unfortunately, deep learning techniques have not been designed with that challenge in mind. Given their ‘black-box nature’, these models cannot be directly used to explain why a particular urban scene is deemed to be beautiful. To partly fix that, we propose a deep learning framework (which we name FaceLift1) that is able to both beautify existing urban scenes (Google Street Views) and explain which urban elements make those transformed scenes beautiful. To quantitatively evaluate our framework, we cannot resort to any existing metric (as the research problem at hand has never been tackled before) and need to formulate new ones. These new metrics should ideally capture the presence (or absence) of elements that make urban spaces great. Upon a review of the urban planning literature, we identify five main metrics: walkability, green spaces, openness, landmarks and visual complexity. We find that, across all the five metrics, the beautified scenes meet the expectations set by the literature on what great spaces tend to be made of. This result is further confirmed by a 20-participant expert survey in which FaceLift has been found to be effective in promoting citizen participation. All this suggests that, in the future, as our framework’s components are further researched and become better and more sophisticated, it is not hard to imagine technologies that will be able to accurately and efficiently support architects and planners in the design of the spaces we intuitively love.

[1]  Nadine Eberhardt Measuring Urban Design Metrics For Livable Places , 2016 .

[2]  Alexei A. Efros,et al.  City Forensics: Using Visual Elements to Predict Non-Visual City Attributes , 2014, IEEE Transactions on Visualization and Computer Graphics.

[3]  R. Kaplan,et al.  Rated preference and complexity for natural and urban visual material , 1972 .

[4]  Tobias Preis,et al.  Using deep learning to quantify the beauty of outdoor places , 2017, Royal Society Open Science.

[5]  Brooks Paige,et al.  Take a Look Around , 2018, ACM Trans. Intell. Syst. Technol..

[6]  R. Kitchin,et al.  Thinking critically about and researching algorithms , 2014, The Social Power of Algorithms.

[7]  R. Kaplan,et al.  The Experience of Nature: A Psychological Perspective , 1989 .

[8]  Marcus Foth,et al.  Handbook of Research on Urban Informatics: The Practice and Promise of the Real-Time City , 2008 .

[9]  Constantino Arce,et al.  CLASSIFICATION OF LANDSCAPES USING QUANTITATIVE AND CATEGORICAL DATA, AND PREDICTION OF THEIR SCENIC BEAUTY IN NORTH-WESTERN SPAIN , 2000 .

[10]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Paul Dourish,et al.  Algorithms and their others: Algorithmic culture in context , 2016, Big Data Soc..

[12]  James Hays,et al.  SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Luc Van Gool,et al.  Disentangled Person Image Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Brandon K. Vaughn,et al.  Data analysis using regression and multilevel/hierarchical models, by Gelman, A., & Hill, J. , 2008 .

[15]  Thomas Hofmann,et al.  TrueSkill™: A Bayesian Skill Rating System , 2007 .

[16]  Ramesh Raskar,et al.  Computer vision uncovers predictors of physical urban change , 2017, Proceedings of the National Academy of Sciences.

[17]  Max Jacobson,et al.  A Pattern Language: Towns, Buildings, Construction , 1981 .

[18]  Rossano Schifanella,et al.  The Digital Life of Walkable Streets , 2015, WWW.

[19]  Morgan M. Larson,et al.  The Architecture of Happiness , 2018 .

[20]  A. Bauman,et al.  Perceived environmental aesthetics and convenience and company are associated with walking for exercise among Australian adults. , 2001, Preventive medicine.

[21]  Stewart Brand,et al.  How Buildings Learn: What Happens After They're Built , 1997 .

[22]  Ramesh Raskar,et al.  Deep Learning the City: Quantifying Urban Perception at a Global Scale , 2016, ECCV.

[23]  Lior Wolf,et al.  Unsupervised Cross-Domain Image Generation , 2016, ICLR.

[24]  Marcus Foth,et al.  Urban informatics , 2011, CSCW.

[25]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Thomas Brox,et al.  Synthesizing the preferred inputs for neurons in neural networks via deep generator networks , 2016, NIPS.

[27]  Andreas Geiger,et al.  Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes , 2017, International Journal of Computer Vision.

[28]  César A. Hidalgo,et al.  The Collaborative Image of The City: Mapping the Inequality of Urban Perception , 2013, PloS one.

[29]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[30]  S. Raudenbush,et al.  Seeing Disorder: Neighborhood Stigma and the Social Construction of “Broken Windows” , 2004 .

[31]  Nicu Sebe,et al.  Are Safer Looking Neighborhoods More Lively?: A Multimodal Investigation into Urban Life , 2016, ACM Multimedia.

[32]  R. Ulrich Aesthetic and Affective Response to Natural Environment , 1983 .

[33]  Bolei Zhou,et al.  Places: An Image Database for Deep Scene Understanding , 2016, ArXiv.

[34]  Vasiliki Triga,et al.  'Neither agree, nor disagree': a critical analysis of the middle answer category in Voting Advice Applications , 2012 .

[35]  Rossano Schifanella,et al.  The shortest path to happiness: recommending beautiful, quiet, and happy routes in the city , 2014, HT.

[36]  Henriette Cramer,et al.  Aesthetic capital: what makes london look beautiful, quiet, and happy? , 2014, CSCW.

[37]  Thomas Brox,et al.  Generating Images with Perceptual Similarity Metrics based on Deep Networks , 2016, NIPS.

[38]  Marcus Foth,et al.  Lessons from Urban Guerrilla Placemaking for Smart City Commons , 2017, C&T.

[39]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[40]  Honglak Lee,et al.  Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[41]  Pall J. Lindal,et al.  Architectural variation, building height, and the restorative quality of urban residential streetscapes , 2013 .

[42]  R. Kitchin,et al.  The ethics of smart cities and urban science , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[43]  G. Moors,et al.  Exploring the effect of a middle response category on response style in attitude measurement , 2007, Quality & quantity.

[44]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[45]  J. Jacobs The Death and Life of Great American Cities , 1962 .

[46]  Jana Reinhard Walkable City How Downtown Can Save America One Step At A Time , 2016 .

[47]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Tom Minka,et al.  TrueSkillTM: A Bayesian Skill Rating System , 2006, NIPS.

[50]  Yao Shen,et al.  Street-Frontage-Net: urban image classification using deep convolutional neural networks , 2018, Int. J. Geogr. Inf. Sci..

[51]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[52]  Kevin Lynch,et al.  The Image of the City , 1960 .

[53]  Virgílio A. F. Almeida,et al.  Psychological maps 2.0: a web engagement enterprise starting in London , 2013, WWW.

[54]  B. Giles-Corti,et al.  Increasing walking: how important is distance to, attractiveness, and size of public open space? , 2005, American journal of preventive medicine.

[55]  C. Norberg-Schulz Genius Loci: Towards a Phenomenology of Architecture , 1979 .