This comment has been minimized.

Unfortunately we cannot know a priori how many columns are needed to display the results. We could expand this to 3 digits, but you would still get the same problem if you had over 999 tests. The same issue also occurs if you have more than 99 warnings or skips for a given context (admittedly less rare).

The simple solution would be to use 3 digits for tests and 2 for everything else, which should handle the majority of cases.