Levenshtein distance calculation, character comparison, string comparison, tokenization, etc. can all be performed on UTF-8 strings. However, language/locale-dependent methods such as toLowerCase() and toUpperCase() will have no effect on Hindi and Malayalam strings, because the Devanagari and Malayalam scripts have no distinction between upper and lower case.
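
As a rough illustration, here is a minimal Python sketch; Python's str.lower() and str.upper() stand in for toLowerCase() and toUpperCase(). The function name levenshtein and the sample words are assumptions made for this example, not something from the question. The distance is counted in Unicode code points, so a conjunct that looks like one character on screen can count as several.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings, counted in Unicode code points."""
    # Keep the shorter string in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


# Hypothetical sample words, chosen only for illustration.
HINDI_A = "नमस्ते"
HINDI_B = "नमस्कार"
MALAYALAM = "നമസ്കാരം"

# The distance works on Hindi text just as it does on ASCII text.
hindi_distance = levenshtein(HINDI_A, HINDI_B)

# Case conversion returns these strings unchanged.
hindi_unchanged = HINDI_A.lower() == HINDI_A == HINDI_A.upper()
malayalam_unchanged = MALAYALAM.lower() == MALAYALAM == MALAYALAM.upper()
```

If the comparison should follow user-perceived characters instead of code points, the strings would first have to be split into grapheme clusters, which this sketch does not attempt.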