The training dataset from the Epigenetics and Post-translational Modifications (EPI) task in the BioNLP Shared Task 2011.
The core entities of the task are genes and gene products (RNA and proteins), identified in the data simply as "Protein" annotations.