Alex will now, by default, parse a UTF-8-encoded byte sequence. To
disable this behaviour, there is a new flag, --latin1, which we will be
using in GHC.
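
For context, here is a minimal sketch of the standard UTF-8 encoding of a single code point. It is not taken from the Alex sources, and the name utf8Encode is a hypothetical helper used only for illustration. It shows that one character becomes one to four bytes, so a lexer working on bytes needs a chain of byte transitions for each non-ASCII character.

```haskell
import Data.Bits (shiftR, (.&.), (.|.))
import Data.Char (ord)
import Data.Word (Word8)

-- Encode one Unicode code point as its UTF-8 byte sequence (1 to 4 bytes).
-- Hypothetical helper for illustration; not Alex's own encoder.
-- Surrogate code points are not rejected here, which a real encoder may do.
utf8Encode :: Char -> [Word8]
utf8Encode c
  | n < 0x80    = [fromIntegral n]                            -- 0xxxxxxx
  | n < 0x800   = [0xC0 .|. hi 6, cont 0]                     -- 110xxxxx 10xxxxxx
  | n < 0x10000 = [0xE0 .|. hi 12, cont 6, cont 0]            -- 1110xxxx + 2 continuation bytes
  | otherwise   = [0xF0 .|. hi 18, cont 12, cont 6, cont 0]   -- 11110xxx + 3 continuation bytes
  where
    n = ord c

    hi, cont :: Int -> Word8
    -- Payload of the leading byte: the bits above the given shift.
    hi s = fromIntegral (n `shiftR` s)
    -- Continuation byte: 10xxxxxx carrying six bits at the given shift.
    cont s = 0x80 .|. fromIntegral ((n `shiftR` s) .&. 0x3F)
```

For example, utf8Encode 'A' is the single byte 0x41, while utf8Encode '\x20AC' (the euro sign) is the three bytes 0xE2 0x82 0xAC.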
Because parsing UTF-8 generates a large state machine, I added DFA
minimisation (source file DFAMin.hs). This makes Alex itself somewhat
slower, but the generated lexers are smaller - dramatically so for
UTF-8 lexers.
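
To illustrate what DFA minimisation does, here is a minimal sketch using Moore-style partition refinement. This is an assumption made for illustration and not necessarily the algorithm in DFAMin.hs. The DFA type, minimiseClasses and example are hypothetical names, and the sketch assumes a complete DFA with a Boolean accepting predicate. A lexer DFA would presumably need to start from a partition that also separates states with different accept actions.

```haskell
import qualified Data.Map as Map
import Data.List (nub, sort)

type State = Int

-- A complete DFA: every state has a transition on every symbol.
-- Hypothetical type for illustration; not Alex's internal representation.
data DFA sym = DFA
  { states    :: [State]
  , alphabet  :: [sym]
  , delta     :: State -> sym -> State
  , accepting :: State -> Bool
  }

-- Map each state to an equivalence-class id. States with the same id are
-- indistinguishable and can be merged into a single state.
minimiseClasses :: DFA sym -> Map.Map State Int
minimiseClasses dfa = go initial
  where
    -- Start with two blocks: non-accepting (0) and accepting (1).
    initial = Map.fromList [ (s, fromEnum (accepting dfa s)) | s <- states dfa ]

    -- Refine until the number of blocks stops growing.
    go cls
      | numBlocks cls' == numBlocks cls = cls
      | otherwise                       = go cls'
      where cls' = refine cls

    -- Two states stay together only if they are in the same block and
    -- their successors on every symbol are in the same blocks.
    refine cls = renumber [ (s, signature cls s) | s <- states dfa ]
    signature cls s =
      (cls Map.! s, [ cls Map.! delta dfa s a | a <- alphabet dfa ])

    -- Give each distinct signature a small integer id.
    renumber sigs = Map.fromList [ (s, ids Map.! sig) | (s, sig) <- sigs ]
      where ids = Map.fromList (zip (nub (sort (map snd sigs))) [0 ..])

    numBlocks m = length (nub (Map.elems m))

-- Accepts strings over "ab" that end in 'a'. States 0 and 2 behave
-- identically, so minimisation should place them in the same class.
example :: DFA Char
example = DFA
  { states    = [0, 1, 2]
  , alphabet  = "ab"
  , delta     = \_ c -> if c == 'a' then 1 else 2
  , accepting = (== 1)
  }
```

In this example, minimiseClasses example puts states 0 and 2 in one class and state 1 in another, so the three-state automaton collapses to two states.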