The simple_pattern tokenizer uses a regular expression to capture matching
text as terms. The set of regular expression features it supports is more
limited than the pattern tokenizer, but the
tokenization is generally faster.

This tokenizer does not support splitting the input on a pattern match, unlike
the pattern tokenizer. To split on pattern
matches using the same restricted regular expression subset, see the
simple_pattern_split tokenizer.