An LDA dataset is a collection of items; each item has an ID
(retrieved with the getID function) and a sequence of terms
(retrieved with the getTerms function). See static constructors
in companion object.