With selective auditory attention (SAA), the human brain can cope with dynamic natural environments efficiently by prioritizing relevant auditory information while filtering out the rest. Auditory saliency describes how bottom-up auditory attention is attracted by local acoustic properties and plays an important role in SAA. In existing neuroimaging studies, the neural processing of auditory saliency has usually been explored via strictly controlled laboratory paradigms with abstract stimuli, overlooking the intrinsic functional interactions among spatially distributed brain networks underlying auditory saliency. In addition, it is unclear how auditory saliency related neural responses vary in auditory information in different semantic categories. In this study, we explored functional brain networks underlying auditory saliency via a multivariate brain decoding approach, in which naturalistic auditory excerpts in three semantic categories (pop music, classic music and speech) were used as stimuli in functional magnetic resonance imaging (fMRI). Our experimental results demonstrate that: 1) Dynamic auditory saliency can be decoded with relatively high accuracy from fMRI brain activities; 2) The neural processing of auditory saliency in different types of auditory information on the one hand shares some common brain regions (e.g., the primary auditory cortex), on the other hand recruits distinct brain regions/networks, e.g., the sensorimotor network and working memory network for classic music and the auditory-language network for speech. Our study provides complementary evidence to the neural processing of auditory saliency under naturalistic experience.