Text this: A probabilistic classifier for imbalanced dataset problems