Improved BVBUC algorithm to discover closed itemsets in long biological datasets