21
Likelihood ratio test  To decide if the multi-set A has been generated according to a physico-chemical group G or not by a likelihood ratio test:  Given a threshold, we test the expansion of A to G and reject it when LR G/A <

27
From short automata to long automata  Previous experiment only the first SFPs of the ordered list of SFPs short automaton first common fragment automaton  Next experiment larger cut-offs in the list of SFPs Protomat-L is able to create longer automata with more common subparts Long patterns are closed of the topoly (3D-structure) of the family

30
Error Correcting Cost The error correcting cost of a sequence S represents the distance (blossum similarity) between S and the closest sequence given by the automaton A. Distibution of sequences with long automata (size Approx. 100)