1. Character filter: “tidy up” a string before it is tokenize. Example: remove html tags
2. Tokenizer: MUST have a single tokenizer. It’s used to break up the string into individual terms or tokens
3. Token filter: change, add or remove tokens. Stemmer is a token filter, it is used to get base of word, for example: “happy”, “happiness” => “happi” (Snowball demo)

We don’t store PII(Personally Identifiable Information). From time to time, the software will collect
anonymous usage data. None of it is being sold to anyone, so chill. The data collected gives us the
information we need to
customize the software for our users.