Text this: Duplicates detection approach within incomplete data sets using dynamic sorting key and hot deck compensation method