Text this: A phonetically rich and balanced lexical corpus using zipfian distribution for an under resourced language / Aminath Farshana