Spatio-temporal visual consistency for video copy detection