Figure 6.

The observed and expected number of new gene clusters found at the addition of each
genome to the clustering dataset. Modeling predictions are based on the eight strain
training set (see 'Mathematical development of a finite supragenome model').