Text this: Variable extractions using principal component analysis and multiple correspondence analysis for large number of mixed variables classification problems