CharStream adds correctOffset(int)
functionality over Reader. All Tokenizers accept a
CharStream instead of Reader as input, which enables
arbitrary character-based filtering before tokenization.
The correctOffset(int) method corrects offsets to account for
removal or insertion of characters, so that the offsets
reported in the tokens match the character offsets of the
original Reader.
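
The offset correction described above can be illustrated with a small, self-contained sketch. It is not the library's implementation: the class name OffsetCorrectingReaderSketch, its constructor, and the choice to filter by deleting a single character are all assumptions made for illustration. It only shows the idea of a Reader that filters characters and maps offsets in the filtered text back to offsets in the original text.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Arrays;

// Hypothetical illustration only; the class name and constructor are assumptions.
class OffsetCorrectingReaderSketch extends Reader {
    private final Reader filtered;
    // originalOffsets[i] is the offset in the original text of filtered character i.
    // The last entry maps the end of the filtered text to the end of the original.
    private final int[] originalOffsets;

    OffsetCorrectingReaderSketch(String original, char removed) {
        StringBuilder kept = new StringBuilder();
        int[] map = new int[original.length() + 1];
        for (int i = 0; i < original.length(); i++) {
            char c = original.charAt(i);
            if (c != removed) {
                map[kept.length()] = i; // remember where this character came from
                kept.append(c);
            }
        }
        map[kept.length()] = original.length();
        this.originalOffsets = Arrays.copyOf(map, kept.length() + 1);
        this.filtered = new StringReader(kept.toString());
    }

    // Translates an offset in the filtered text into an offset in the original text.
    int correctOffset(int currentOff) {
        return originalOffsets[currentOff];
    }

    @Override
    public int read(char[] cbuf, int off, int len) throws IOException {
        return filtered.read(cbuf, off, len);
    }

    @Override
    public void close() throws IOException {
        filtered.close();
    }
}
```

For the input "a-b cd" with '-' removed, a tokenizer reading the filtered text "ab cd" would see the token "cd" at offsets 3 to 5. Passing those through correctOffset gives 4 and 6, which are the positions of "cd" in the original text.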