Text this: A Review of Similarity Measurement for Record Duplicate Detection