HIMERA Corpus

The HIMERA annotated corpus contains a set of published historical medical documents that have been manually annotated with semantic information that is relevant to the study of medical history and public health. Specifically, annotations correspond to seven different entity types and two different event types (which encode relationships amongst entities), chosen based on extensive discussions with medical historians.

Attribution Details:
a) The annotations in HIMERA were created by the National Centre for Text Mining (NaCTeM), School of Computer Science, University of Manchester. Please attribute NaCTeM and cite the following article:
Paul Thompson, Riza Theresa Batista-Navarro, Georgios Kontonatsios, Jacob Carter, Elizabeth Toon, John McNaught, Carsten Timmermann, Michael Worboys and Sophia Ananiadou (2015). Text Mining the History of Medicine. PLOS ONE.
b) The British Medical Journal (BMJ) kindly consented to the use of the 35 articles from the BMJ archive.
c) The Wellcome Trust made available the Medical Officer of Health (MOH) reports.