Text this: Improved pattern extraction scheme for clustering multidimensional data