original data of the NP chunking experiments by Lance Ramshaw and Mitch Marcus

data contains one word per line and each line contains six fields of which only the first three fields are relevant: the word, the part-of-speech tag assigned by the Brill tagger, and the correct IOB tag