You must provide a minimum of three words for language identification. However, you can improve the accuracy by providing more text. The amount of text that you must provide for accurate language identification depends on both the language and the type of text. For UTF-8 encoded languages that use a unique script, the Language Identification API might be able to identify the language using only a few characters. For other languages, the API might need a few sentences to accurately identify the language, and it might need a large paragraph to distinguish between two similar languages.

The amount of text required also depends on the type of text. For example, it is difficult to identify the language from a list of places, numbers, and names. If your text contains these things, you might need to provide more text to identify the language. For natural language text, such as a news article, the API can usually detect the language from fewer characters.

Haven OnDemand uses cookies to enhance and improve the experience it provides. By continuing to use this site or pressing Continue,
we will assume that you accept receiving all cookies. If you would like to change which cookies are set, you can change your settings.