Chapter IV Statistical Audio-Visual Data Fusion for Video Scene