T3.2 - Fine-tuning to specific languages

The fine-tuning of the categorization tool for each specific language requires that we have identified the best feature combination for a predefined classification. In addition, if we want to have classification of multi-lingual document sets to a cross-linguistic classifier/taxonomy, we need to experiment with the translation and linguistic processing of the documents, or if we avoid translation and use LSA, we need to tune matrix properties of extracted document matrix.