Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Abstract: Monitoring of prevalent airborne diseases such as COVID-19 characteristically involves respiratory assessments. While auscultation is a mainstream method for preliminary screening of disease ...
CATS is a signal processing technique and framework for detecting and denoising sparse signals in the time-frequency domain. It is particularly useful for processing earthquake data. This work is still in ...
All the datasets must be located in the datasets folder. This folder should contain the following subfolders after downloading the datasets: GTZAN Speech_Music: Contains the GTZAN Speech Music dataset ...