High level forensic voice comparison based on fused long-term fundamental frequency and word n -gram features