Engkoo is a cloud-based linguistic technology stack, born from over a decade of natural language processing research, that powers various Microsoft products and features.

Engkoo is developed in China, so its initial focus has been on linguistic tasks relevant to the Chinese market: Chinese-English dictionary lookup, machine translation, language learning, and now input and writing assistance.

Engkoo Pinyin is an Input Method Editor (IME) that uses the cloud to help people input what is on their mind, whether that is Chinese, English, or a mix of both, and whether it is plain text or richer content such as images, videos, and maps.

At a system level, Engkoo supports a range of NLP and speech technologies, including cross-language retrieval, alignment, sentence classification, statistical machine translation, text-to-speech, and phonetic search. The data set behind the system is built primarily by mining a massive set of bilingual terms and sentences from across the web. Specifically, web pages that contain both Chinese and English are discovered and analyzed for parallelism, and the parallel content is extracted and turned into clear term definitions and sample sentences. This approach allows us to build the world's largest lexicon linking Chinese and English, while also covering the most up-to-date terms as they appear on the web. In addition, our data set is merged with licensed data from sources including Microsoft Office and Encarta. Finally, the resulting ranked, high-quality composite data set is analyzed by a machine-learning-based classifier, which lets users filter sample sentences by categories that can be combined.
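
To make the mining step more concrete, here is a minimal Python sketch of one common heuristic for finding bilingual term candidates on mixed-language pages: a Chinese term followed by its English gloss in parentheses. This is an illustration only, not Engkoo's actual pipeline; the names `PAIR_PATTERN`, `is_mixed_language`, and `extract_term_pairs`, as well as the thresholds, are assumptions made for the example.

```python
import re

# Assumed pattern: a short run of Chinese characters followed by a parenthesized
# Latin phrase, e.g. "机器翻译 (machine translation)".
# Both ASCII and full-width parentheses are accepted.
PAIR_PATTERN = re.compile(
    r"([\u4e00-\u9fff]{2,8})\s*[(（]\s*([A-Za-z][A-Za-z \-]{1,40}?)\s*[)）]"
)


def is_mixed_language(text, min_ratio=0.05):
    """Return True if Chinese and Latin letters each make up at least
    min_ratio of the letters in text (hypothetical threshold)."""
    cjk = sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")
    latin = sum(1 for ch in text if ch.isascii() and ch.isalpha())
    total = cjk + latin
    if total == 0:
        return False
    return cjk / total >= min_ratio and latin / total >= min_ratio


def extract_term_pairs(text):
    """Return (chinese, english) candidate pairs found in text.

    Pages that are not mixed-language are skipped entirely.
    """
    if not is_mixed_language(text):
        return []
    return [(zh, en.strip().lower()) for zh, en in PAIR_PATTERN.findall(text)]


if __name__ == "__main__":
    sample = "统计机器翻译 (statistical machine translation) 是一种方法。"
    for zh, en in extract_term_pairs(sample):
        print(zh, "->", en)
```

The pairs produced this way are only candidates. The Chinese side can pick up neighboring characters that are not part of the term, so a real system would still need the parallelism analysis and ranking described above before anything reaches the lexicon.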