Text this: A review of visualization techniques for duplicate detection in cancer datasets