Extraction of information of audio-visual contents