Text this: Classification for large number of variables with two imbalanced groups