Enhanced Audio Feature Extraction: Combining Mel-Spectrogram with Temporal Derivatives
This paper introduces a multi-stage feature extraction approach that combines Mel-spectrograms with temporal derivative analysis. The resulting multi-dimensional Mel-spectrograms capture the temporal variations of both the target signals and the ambient noise, which makes the characteristics that distinguish them easier to separate. An additional spatial dimension allows reverberation effects to be modeled, further enriching the feature representation.
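To illustrate the temporal-derivative step, here is a minimal sketch in Python using only NumPy. It assumes a log-Mel spectrogram of shape `(n_mels, n_frames)` has already been computed elsewhere, and it stacks that spectrogram with its first- and second-order time derivatives as separate channels. The function name `stack_temporal_derivatives`, the use of finite differences via `np.gradient`, and the three-channel layout are illustrative assumptions, not details given in the paper. The spatial dimension mentioned above is not covered here.

```python
import numpy as np


def stack_temporal_derivatives(log_mel: np.ndarray) -> np.ndarray:
    """Stack a log-Mel spectrogram with its first and second time derivatives.

    Hypothetical helper for illustration only.
    log_mel: array of shape (n_mels, n_frames).
    Returns an array of shape (3, n_mels, n_frames):
    channel 0 = static features, 1 = delta, 2 = delta-delta.
    """
    if log_mel.ndim != 2 or log_mel.shape[1] < 2:
        raise ValueError("expected shape (n_mels, n_frames) with at least 2 frames")

    # Finite-difference derivative along the time axis (axis=1):
    # central differences in the interior, one-sided at the edges.
    delta = np.gradient(log_mel, axis=1)
    # Differentiate once more to approximate the second derivative.
    delta2 = np.gradient(delta, axis=1)

    return np.stack([log_mel, delta, delta2], axis=0)


# Synthetic stand-in for a real log-Mel spectrogram (assumed sizes).
rng = np.random.default_rng(0)
log_mel = rng.standard_normal((40, 100))

features = stack_temporal_derivatives(log_mel)
print(features.shape)  # (3, 40, 100)
```

Stacking the derivatives as channels keeps the time-frequency grid intact, so a downstream model that expects image-like input can consume the static and dynamic information together. Other derivative estimators, such as regression over a wider window of frames, would fit the same interface.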