We present audio samples for the causal CleanUNet model proposed in
Speech Denoising in the Waveform Domain with Self-Attention.
We use CleanUNet with N=5 self attention blocks in the bottleneck layer and L1 plus high-band STFT losses.
We compare CleanUNet to other SOTA models including the FAIR-denoiser and FullSubNet.
The official PyTorch implementation can be found in this link