Text this: Automatic document clustering and indexing of multiple documents using KNMF for feature extraction through Hadoop and lucene on big data