About our term extraction

OneClick Terms is a simple term extractor interface that
gives easy access to terminology extraction functionality.
It is powered by Sketch Engine technology, which keeps
processing fast. Its linguistic analysis uses part-of-speech
tagging and lemmatization to produce clean term extraction
results that require hardly any manual cleaning. The
extracted terms
are ready for import into a CAT (Computer-Assisted
Translation) tool or a term management system.

OneClick Terms can also be used for text analytics and topic
modelling: the extracted keywords and terms serve as
indicators of the main topic(s) of a large quantity of text.

The term extraction quality is achieved by using
language-specific criteria that describe the terminology
structures allowed in each language. For example, a term in
English will most likely take the form of (noun+)noun+noun or
adjective+noun, while a term in Spanish will most likely take
the form of noun+adjective(+adjective) or noun+de+noun. Each
language has a more complex set of rules than these examples,
which keeps noise out of the results. Because candidates are
filtered by structure, this approach does not require
blacklists or stoplists.
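
The idea of structure-based filtering can be illustrated with
a minimal Python sketch. It is not the actual Sketch Engine
term grammar: the one-letter tags (N = noun, A = adjective,
D = the word "de", O = anything else), the PATTERNS table and
the candidate_terms function are hypothetical names chosen
for this example, and the input is assumed to be already
part-of-speech tagged. It only encodes the four example
structures mentioned above.

```python
import re

# Hypothetical, simplified patterns over one-letter POS tags.
# They cover only the example structures from the text:
#   English: (noun+)noun+noun, adjective+noun
#   Spanish: noun+adjective(+adjective), noun+de+noun
PATTERNS = {
    "en": re.compile(r"N?NN|AN"),
    "es": re.compile(r"NAA?|NDN"),
}

MAX_TERM_LENGTH = 3  # longest structure in the patterns above


def candidate_terms(tagged, lang):
    """Return word sequences whose tag sequence matches a pattern.

    tagged: list of (word, tag) pairs, tag in {"N", "A", "D", "O"}.
    lang:   key into PATTERNS ("en" or "es").
    """
    pattern = PATTERNS[lang]
    tags = "".join(tag for _, tag in tagged)
    terms = []
    for start in range(len(tagged)):
        # Try every window of 2..MAX_TERM_LENGTH tokens from this position.
        last_end = min(start + MAX_TERM_LENGTH, len(tagged))
        for end in range(start + 2, last_end + 1):
            if pattern.fullmatch(tags[start:end]):
                terms.append(" ".join(word for word, _ in tagged[start:end]))
    return terms


english = [("the", "O"), ("machine", "N"), ("translation", "N"),
           ("system", "N"), ("is", "O"), ("fast", "A")]
spanish = [("memoria", "N"), ("de", "D"), ("traducción", "N"),
           ("automática", "A")]

print(candidate_terms(english, "en"))
print(candidate_terms(spanish, "es"))
```

Function words such as "the" and "is" never appear in the
output because no pattern admits their tag, which is why a
structure-based filter of this kind can work without a
stoplist. A real extractor would also need lemmatization and
some ranking of the candidates, both of which this sketch
leaves out.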

OneClick Terms can extract terminology from a number of
common document formats (TMX, XLIFFv2, PDF, DOC, DOCX, HTML,
TXT) and export the results into plain text, CSV or TBX formats.