Text this: Arabic language script and encoding identification with support vector machines and rough set theory