Text this: An Improve k-NN Classifier using Similarity Distance Plot-Data Reduction and Dask for Big Datasets