Text this: Improving image denoising methods for PDF image files