Text this: Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance