Optimized subtractive clustering for cluster-based compound selection