On amplitude modulation for monaural speech segregation

We propose a computational auditory scene analysis (CASA) model for monaural speech segregation. It deals with low-frequency and high-frequency signals differently. For high-frequency signals, it generates segments based on the common amplitude modulation (AM) and groups them according to AM repetition rates. This model performs substantially better than previous CASA systems.