Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion