Paper Information - Multimodal Location Estimation of Videos and Images

Multimodal Location Estimation of Videos and Images

This book presents an overview of the field of multimodal location estimation. The authors' aim is to describe the research results in this field in a unified way. The book describes fundamental methods of acoustic, visual, textual, social graph, and metadata processing as well as multimodal integration methods used for location estimation. In addition, the book covers benchmark metrics and explores the limits of the technology based on a human baseline. The book also outlines privacy implications and discusses directions for future research in the area.

Gerald Friedland | Jaeyoung Choi