Text this: Human activity recognition in low quality videos using spatio-temporal features