Scalability and performance in duplicate detection : relational vs. graph database