Scientific texts in the balanced corpus

This subcorpus contains 5 million words of scientific texts. PhD dissertations make up about half of it; the remaining half consists of scientific journals such as „Eesti Arst“, „Arvutitehnika ja Andmetöötlus“ and „Agraarteadus“, the yearbooks of Emakeele Selts and Eesti Matemaatika Selts, and similar publications. The full list of the included texts can be found in this table.