Low-level cross-media statistical approach for semantic partitioning of audio-visual content in a home multimedia environment