Predicting human perception of the urban environment in a spatiotemporal urban setting using locally acquired street view images and audio clips

Abstract This study investigates people's perception of visual and auditory landscapes in a mixed-use urban environment. A set of audio and visual data is collected at different intervals during the day in local streets with the help of an audio recorder and camera setup. The High and Low-level features from the collected audio and visual datasets are captured with the help of custom Deep Learning (DL) models and other standard algorithms. The collected data is used in the perception survey, which included human subjects (n = 73). The evaluation of the individual perception is done with the help of eight and six auditory and visual perceptual attributes, respectively. The results from the survey are then studied in relation to the features extracted from algorithms. Finally, a street of 10 km length is chosen within the study area where a spatiotemporal street-level visual and auditory data is collected. Statistical analysis and Machine Learning modeling are performed in the surveyed dataset to predict the human perception of audio and visual scenes in the chosen street. The results helped in understanding specific audio and visual features that are related to individual perceptions. Further, these relationships are utilized to create prediction models, which helped in creating spatiotemporal visual and auditory perception maps.

[1]  R. Kaplan,et al.  The Prediction of Preference for Familiar Urban Places , 1976 .

[2]  Aren Jansen,et al.  Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Jack L. Nasar,et al.  Environmental correlates of evaluative appraisals of central business district scenes , 1987 .

[4]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[5]  Mostafa Refat Ismail,et al.  Sound preferences of the dense urban environment: Soundscape of Cairo , 2014 .

[6]  Dick Botteldooren,et al.  Classification of soundscapes of urban public open spaces , 2019, Landscape and Urban Planning.

[7]  Peter Suedfeld,et al.  Perceived safety from crime in the urban environment , 1993 .

[8]  Austin Troy,et al.  Effects of skeletal streetscape design on perceived safety , 2015 .

[9]  H. Keselman,et al.  Backward, forward and stepwise automated subset selection algorithms: Frequency of obtaining authentic and noise variables , 1992 .

[10]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[11]  Stephen J Mooney,et al.  Street Audits to Measure Neighborhood Disorder: Virtual or In-Person? , 2017, American journal of epidemiology.

[12]  T. R. Herzog,et al.  A cognitive analysis of preference for urban nature , 1989 .

[13]  Félix Gontier,et al.  Estimation of the Perceived Time of Presence of Sources in Urban Acoustic Environments Using Deep Learning Techniques , 2019, Acta Acustica united with Acustica.

[14]  R. Kaplan,et al.  Rated preference and complexity for natural and urban visual material , 1972 .

[15]  Qi Meng,et al.  On the effectiveness of facial expression recognition for evaluation of urban sound perception. , 2019, The Science of the total environment.

[16]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Carlo Ratti,et al.  Mapping the spatial distribution of shade provision of street trees in Boston using Google Street View panoramas , 2018 .

[18]  Catherine Marquis-Favre,et al.  Noise and its effects - : A review on qualitative aspects of sound. Part I: Notions and acoustic ratings , 2005 .

[19]  Jack L. Nasar,et al.  Perception, Cognition, and Evaluation of Urban Places , 1989 .

[20]  Donald Appleyard,et al.  The View From the Road , 1965 .

[21]  Haldun Akoglu,et al.  User's guide to correlation coefficients , 2018, Turkish journal of emergency medicine.

[22]  Rahul Goel,et al.  Estimating city-level travel patterns using street imagery: A case study of using Google Street View in Britain , 2018, PloS one.

[23]  Pyoung Jik Lee,et al.  Perceptual assessment of quality of urban soundscapes with combined noise sources and water sounds. , 2010, The Journal of the Acoustical Society of America.

[24]  Ke-Tsung Han Responses to Six Major Terrestrial Biomes in Terms of Scenic Beauty, Preference, and Restorativeness , 2007 .

[25]  K. Byrka,et al.  Restorative Qualities of and Preference for Natural and Urban Soundscapes , 2017, Front. Psychol..

[26]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[27]  Jingwei Zhao,et al.  A good sound in the right place: Exploring the effects of auditory-visual combinations on aesthetic preference , 2019, Urban Forestry & Urban Greening.

[28]  Kevin Lynch,et al.  The Image of the City , 1960 .

[29]  Daniel Shepherd,et al.  Do Quiet Areas Afford Greater Health-Related Quality of Life than Noisy Areas? , 2013, International journal of environmental research and public health.

[30]  Jin Yong Jeon,et al.  Effects of audio-visual interactions on soundscape and landscape perception and their influence on satisfaction with the urban environment , 2020 .

[31]  Grigori Yourganov,et al.  The Perception of Naturalness Correlates with Low-Level Visual Features of Environmental Scenes , 2014, PloS one.

[32]  Stephen J Mooney,et al.  Development and deployment of the Computer Assisted Neighborhood Visual Assessment System (CANVAS) to measure health-related neighborhood conditions. , 2015, Health & place.

[33]  Ingmar Lissner,et al.  Toward a Unified Color Space for Perception-Based Image Processing , 2012, IEEE Transactions on Image Processing.

[34]  Ramesh Raskar,et al.  Computer vision uncovers predictors of physical urban change , 2017, Proceedings of the National Academy of Sciences.

[35]  Ying Long,et al.  Measuring visual quality of street space and its temporal variation: Methodology and its application in the Hutong area in Beijing , 2019, Landscape and Urban Planning.

[36]  R. D. Bromley,et al.  City-centre Revitalisation: Problems of Fragmentation and Fear in the Evening and Night-time City , 2000 .

[37]  Terry C. Daniel,et al.  Scenic landscape assessment: the effects of land management jurisdiction on public perception of scenic beauty. , 2000 .

[38]  Kate E. Jones,et al.  CityNet—Deep learning tools for urban ecoacoustic assessment , 2018, Methods in Ecology and Evolution.

[39]  Jian Kang,et al.  Spatiotemporal variability of soundscapes in a multiple functional urban area , 2013 .

[40]  Tadayoshi Fushiki,et al.  Estimation of prediction error by using K-fold cross-validation , 2011, Stat. Comput..

[41]  Ricardo Hurtubia,et al.  Explaining subjective perceptions of public spaces as a function of the built environment: A massive data approach , 2019, Landscape and Urban Planning.

[42]  B. Berglund,et al.  A principal components model of soundscape perception. , 2010, The Journal of the Acoustical Society of America.

[43]  Krithi Ramamritham,et al.  Classification and mapping of sound sources in local urban streets through AudioSet data and Bayesian optimized Neural Networks , 2019, Noise Mapping.

[44]  Jin Yong Jeon,et al.  Relationship between spatiotemporal variability of soundscape and urban morphology in a multifunctional urban area: A case study in Seoul, Korea , 2017 .

[45]  Jian Kang,et al.  Relationship between street scale and subjective assessment of audio-visual environment comfort based on 3D virtual reality and dual-channel acoustic tests , 2018 .

[46]  Jin Yong Jeon,et al.  Soundwalk approach to identify urban soundscapes individually. , 2013, The Journal of the Acoustical Society of America.

[47]  C. Sémidor Listening to a City with the soundwalk method , 2006 .

[48]  Bert De Coensel,et al.  Effects of natural sounds on the perception of road traffic noise. , 2011, The Journal of the Acoustical Society of America.

[49]  M. Kamal,et al.  Effect of predictors of visual preference as characteristics of urban natural landscapes in increasing perceived restorative potential , 2014 .

[50]  Tobias Preis,et al.  Using deep learning to quantify the beauty of outdoor places , 2017, Royal Society Open Science.

[51]  Tobias Preis,et al.  Quantifying the Impact of Scenic Environments on Health , 2015, Scientific Reports.

[52]  V. Mehta Lively Streets , 2007 .

[53]  Toon De Pessemier,et al.  Interactive soundscape augmentation by natural sounds in a noise polluted urban park , 2020, Landscape and Urban Planning.

[54]  Jin Yong Jeon,et al.  Influence of urban contexts on soundscape perceptions: A structural equation modeling approach , 2015 .

[55]  Yuhua Qian,et al.  Environmental sound classification with dilated convolutions , 2019, Applied Acoustics.

[56]  R. Simons,et al.  Stress recovery during exposure to natural and urban environments , 1991 .

[57]  Mark Bush,et al.  Anuran call classification with deep learning , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[58]  Krithi Ramamritham,et al.  Machine-based understanding of manually collected visual and auditory datasets for urban perception studies , 2019, Landscape and Urban Planning.

[59]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  S. Stansfeld,et al.  Noise and Health in the Urban Environment , 2000, Reviews on environmental health.

[61]  Jian Kang,et al.  The Impact and Outreach of Soundscape Research , 2018 .

[62]  Paul H. Gobster,et al.  Visual resource stewardship at the neighborhood scale: methods for assessing a vacant land reuse program , 2018 .

[63]  Ramesh Raskar,et al.  Deep Learning the City: Quantifying Urban Perception at a Global Scale , 2016, ECCV.

[64]  Jian Kang,et al.  A model for implementing soundscape maps in smart cities , 2018 .

[65]  C. S. De Silva,et al.  An examination of the temporal effects of environmental cues on pedestrians' feelings of safety , 2017, Comput. Environ. Urban Syst..