Classification of machine learning engines using latent semantic indexing