There are approximately 113,000 annotated verb tokens. These verb tokens include all those occurring in over one million words of the Wall Street Journal section of the Penn Treebank, excluding 'be' and auxiliary uses of 'do' and 'have.' There are annotations for over 3,200 unique verbs. These annotations are stored in a single file in standoff format, totalling ~9.6 MB of uncompressed data

Given nominalization/verb mappings, the combination of NomBank and PropBankallows for generalization of arguments across parts of speech.

Given nominalization/verb mappings, the combination of NomBank and PropBankaffords even greater generalization.

The PDTB is being built directly on top of the Penn Treebank and Propbank, thus supporting the extraction of useful syntactic and semantic features and providing a richer substrate for the development and evaluation of practical algorithms.

PTB and Propbank provide a sort of shallow semantic representation (predicate-argument structure, frames, and role sets), which can permit a level of inference in various NLP tasks, such as IE, QA, summarization, and MT tasks.