A phonetically rich and balanced lexical corpus using Zipfian distribution for an under-resourced language