Media Device Orchestration for Immersive Spatial Audio Reproduction

Whilst it is possible to create exciting, immersive listening experiences with current spatial audio technology, the required systems are generally difficult to install in a standard living room. However, in any living room there is likely to already be a range of loudspeakers (such as mobile phones, tablets, laptops, and so on). "Media device orchestration" (MDO) is the concept of utilising all available devices to augment the reproduction of a media experience. In this demonstration, MDO is used to augment low channel count renderings of various programme material, delivering immersive three-dimensional audio experiences.

[1]  Jon Francombe,et al.  A Quantitative Evaluation of Media Device Orchestration for Immersive Spatial Audio Reproduction , 2018 .

[2]  Takeshi Nakayama,et al.  Sound-Image Localization in Multichannel Matrix Reproduction , 1972 .

[3]  Hyunkook Lee,et al.  The Effect of Interchannel Time Difference on Localization in Vertical Stereophony , 2015 .

[4]  Stavros Paschalakis,et al.  BRIDGET: an approach at sustainable and efficient production of second screen media applications , 2015 .

[5]  Tim Brookes,et al.  Evaluation of spatial audio reproduction methods (part 2) : analysis of listener preference , 2017 .

[6]  Frank Melchior,et al.  Presenting the S 3 A object based audio drama , 2018 .

[7]  Frank Melchior,et al.  A Subjective Comparison of Discrete Surround Sound and Soundbar Technology by Using Mixed Methods , 2016 .

[8]  Heiko Purnhagen,et al.  Immersive Audio Delivery Using Joint Object Coding , 2016 .

[9]  Francis Rumsey,et al.  Relationships between experienced listener ratings of multichannel audio quality and naïve listener preferences. , 2005, The Journal of the Acoustical Society of America.

[10]  Frank Melchior,et al.  An Audio-Visual System for Object-Based Audio: From Recording to Listening , 2018, IEEE Transactions on Multimedia.

[11]  T. Bachmann,et al.  Investigation on the Quality of 3D Sound Reproduction , 2011 .

[12]  Michael J. Gerzon Periphony: With-Height Sound Reproduction , 1973 .

[13]  Mark D. Plumbley,et al.  Perceptual Evaluation of Source Separation for Remixing Music , 2017 .

[14]  Wieslaw Woszczyk,et al.  Sound Source Localization in a Five-Channel Surround Sound Reproduction System , 1999 .

[15]  Frank Melchior,et al.  Categorization of broadcast audio objects in complex auditory scenes , 2016 .

[16]  Frank Melchior,et al.  Object-Based Reverberation for Spatial Audio , 2017 .

[17]  Francis Rumsey,et al.  Spatial Audio Quality Perception (Part 1): Impact of Commonly Encountered Processes , 2015 .

[18]  James Woodcock,et al.  Personalized Object-Based Audio for Hearing Impaired TV Viewers , 2017 .

[19]  Matti Karjalainen,et al.  Localization, Coloration, and Enhancement of Amplitude-Panned Virtual Sources , 1999 .

[20]  Will Howie,et al.  Subjective Evaluation of Orchestral Music Recording Techniques for Three-Dimensional Audio , 2017 .

[21]  Jan Plogsties,et al.  Design, Coding and Processing of Metadata for Object-Based Interactive Audio , 2014 .

[22]  Methods for the subjective assessment of small impairments in audio systems , 2015 .

[23]  Aaron J. HELLER The Ambisonic Decoder Toolbox : Extensions for Partial-Coverage Loudspeaker Arrays , 2014 .

[24]  Russell Mason How Important Is Accurate Localization in Reproduced Sound , 2017 .

[25]  Adrian Hilton,et al.  Person Tracking Using Audio and Depth Cues , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[26]  Francis Rumsey,et al.  On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality. , 2005, The Journal of the Acoustical Society of America.

[27]  Georg Plenge,et al.  Localization of Lateral Phantom Sources , 1976 .

[28]  Ville Pulkki,et al.  Virtual Sound Source Positioning Using Vector Base Amplitude Panning , 1997 .

[29]  Francis Rumsey,et al.  Localization Curves for a Regularly-Spaced Octagon Loudspeaker Array , 2009 .

[30]  Frank Melchior,et al.  Presenting the S3A object-based audio drama dataset , 2016 .

[31]  V. Braun,et al.  Using thematic analysis in psychology , 2006 .

[32]  Helmut Wittek,et al.  Principles in Surround Recordings with Height , 2011 .

[33]  Tim Brookes,et al.  Evaluation of Spatial Audio Reproduction Methods (Part 1): Elicitation of Perceptual Differences , 2017 .

[34]  Jean-Marc Jot,et al.  Digital Signal Processing Issues in the Context of Binaural and Transaural Stereophony , 1995 .

[35]  José López Vicario,et al.  A Review of Pedestrian Indoor Positioning Systems for Mass Market Applications , 2017, Sensors.

[36]  Francis Rumsey,et al.  Evaluating the Sensation of Envelopment Arising from 5-Channel Surround Sound Recordings , 2008 .

[37]  Norbert Schnell,et al.  Soundworks – A playground for artists and developers to create collaborative mobile web performances , 2015 .

[38]  Koichiro Hiyama,et al.  Reproducing Spatial Impression With Multichannel Audio , 2003 .

[39]  Jan Berg,et al.  The Contrasting and Conflicting Definitions of Envelopment , 2009 .

[40]  Gianmarco,et al.  The Future of Second Screen Experience , 2013 .

[41]  Takehiro Sugimoto,et al.  Downmixing Method for 22.2 Multichannel Sound Signal in 8K Super Hi-Vision Broadcasting , 2015 .

[42]  Etienne Parizet,et al.  Investigation on localisation accuracy for first and higher order ambisonics reproduced sound sources , 2013 .

[43]  Mark A. Poletti Robust Two-Dimensional Surround Sound Reproduction for Nonuniform Loudspeaker Layouts , 2007 .

[44]  Jan Plogsties,et al.  MPEG-H Audio—The New Standard for Universal Spatial / 3D Audio Coding , 2014 .