Text this: NSE-CATNet: deep neural speech enhancement using convolutional attention transformer network