Code

The code I have published is generally available on my GitHub profile. The most important contributions are listed below.

CorefAnnotator

CorefAnnotator is a tool for annotating coreference in texts. It was developed to cope with long texts that contain many different coreference chains. The core idea is not to annotate binary relations between mentions (e.g., cataphoric), but to treat all mentions that co-refer as members of one equivalence set. Consequently, the tool displays a list of entities on the right and allows text spans to be assigned freely to these entities.
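To illustrate the entity-centred model described above, here is a minimal Python sketch. It is not CorefAnnotator's actual code or data format; the names Mention, Entity, CorefDocument, assign and surface_forms are hypothetical and chosen only for this example. The sketch assumes that a mention is a character span and that each span belongs to at most one entity.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Mention:
    """A text span, identified by character offsets (hypothetical type)."""
    begin: int
    end: int


@dataclass
class Entity:
    """An equivalence set of co-referring mentions (hypothetical type)."""
    label: str
    mentions: set = field(default_factory=set)


class CorefDocument:
    """Holds a text and the entities annotated on it (illustrative only)."""

    def __init__(self, text):
        self.text = text
        self.entities = {}   # label -> Entity
        self._owner = {}     # Mention -> label of the entity it belongs to

    def assign(self, label, begin, end):
        """Assign a span to an entity, moving it if it was assigned before."""
        mention = Mention(begin, end)
        previous = self._owner.get(mention)
        if previous is not None:
            self.entities[previous].mentions.discard(mention)
        entity = self.entities.setdefault(label, Entity(label))
        entity.mentions.add(mention)
        self._owner[mention] = label
        return mention

    def surface_forms(self, label):
        """Return the covered text of an entity's mentions in text order."""
        ordered = sorted(self.entities[label].mentions, key=lambda m: m.begin)
        return [self.text[m.begin:m.end] for m in ordered]


doc = CorefDocument("Anna met Ben. She greeted him.")
doc.assign("Anna", 0, 4)    # "Anna"
doc.assign("Anna", 14, 17)  # "She"
doc.assign("Ben", 9, 12)    # "Ben"
doc.assign("Ben", 26, 29)   # "him"
print(doc.surface_forms("Anna"))
print(doc.surface_forms("Ben"))
```

Because no links between individual mentions are stored, re-assigning a span only moves it from one set to another; no chain of pairwise relations has to be repaired.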

Citation

Nils Reiter. CorefAnnotator - A New Annotation Tool for Entity References. In Abstracts of EADH: Data in the Digital Humanities, December 2018.

DramaAnalysis

This R package is developed in the context of the QuaDramA project and is a collection of frequently used functions for the analysis of dramatic texts. As input, it takes texts in a CSV-like format, as produced by DramaNLP.

SimpleXmiViewer

A very simple viewer for XMI files, which is an extension and re-packaging of the UIMA Annotation Viewer. On Mac OS X, it integrates nicely with Finder.app, so that XMI files can be opened and inspected with a double click or with the open command on the command line.

TreeAnno

TreeAnno is a web-based tool to annotate tree structures on texts. Instead of drawing relations with a pointing device, sentences or tokens can be indented. This makes the tool similar to an outliner (which was also its inspiration). Under the hood, indentations are stored as trees on text snippets encoded via their character offsets. The tool has been used to annotate information structure in interactive texts.
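The mapping from indentation to a tree over character offsets can be sketched as follows. This is a minimal illustration in Python, not TreeAnno's actual storage format; Node, build_tree and render are hypothetical names, and the sketch assumes that segments arrive in text order with non-negative indentation levels.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A text snippet given by character offsets, plus its sub-snippets."""
    begin: int
    end: int
    children: list = field(default_factory=list)


def build_tree(segments):
    """Turn (begin, end, indent_level) triples into a tree.

    A segment becomes a child of the closest preceding segment that has
    a smaller indentation level, as in an outliner.
    """
    root = Node(begin=-1, end=-1)   # artificial root covering no text
    stack = [(-1, root)]            # (level, node) path to the current node
    for begin, end, level in segments:
        node = Node(begin, end)
        # Pop until the top of the stack is a valid parent.
        while stack[-1][0] >= level:
            stack.pop()
        stack[-1][1].children.append(node)
        stack.append((level, node))
    return root


def render(node, text, depth=0):
    """Return the tree as indented lines, reading snippets from the text."""
    lines = []
    for child in node.children:
        lines.append("  " * depth + text[child.begin:child.end])
        lines.extend(render(child, text, depth + 1))
    return lines


text = "Intro. Point A. Detail. Point B."
segments = [(0, 6, 0), (7, 15, 1), (16, 23, 2), (24, 32, 1)]
tree = build_tree(segments)
print("\n".join(render(tree, text)))
```

Storing offsets rather than copies of the text keeps the tree independent of the text itself, so the same document can carry several such annotations.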

GenericXmlReader

DramaNLP

DramaNLP is a collection of UIMA components that we use to process dramatic texts in the project QuaDramA. The collection contains importers from various TEI dialects as well as NLP components specific to the text type. This package is developed jointly with Janis Pagel.