There are several token filters available which try to normalize special
characters of a certain language.

You can currently choose between arabic_normalization and
persian_normalization normalization in your token filter
configuration. For more information check the
ArabicNormalizer
or the
PersianNormalizer
documentation.