The “charcount” files help ensure
that no strange character makes it into the data, and make it easy
to spot ambiguous characters. We do not count individual characters,
but rather clusters of characters, which are more or less combining
sequences.
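
As a rough illustration of counting clusters rather than single characters, here is a minimal Python sketch. It is not the tool used to produce the “charcount” files. It assumes that a cluster is a base character followed by any characters whose general category is a mark (Mn, Mc or Me), which only approximates combining sequences. The names clusters and count_clusters are hypothetical.

```python
import unicodedata
from collections import Counter


def clusters(text):
    """Yield each character together with the mark characters that follow it.

    Assumption: a "cluster" is approximated as one character plus any
    following characters whose general category starts with "M".
    """
    current = ""
    for ch in text:
        if current and unicodedata.category(ch).startswith("M"):
            # A mark attaches to the cluster being built.
            current += ch
        else:
            # Any other character starts a new cluster.
            if current:
                yield current
            current = ch
    if current:
        yield current


def count_clusters(text):
    """Return a Counter mapping each cluster to its number of occurrences."""
    return Counter(clusters(text))
```

With this sketch, a text containing "e" followed by U+0301 COMBINING ACUTE ACCENT twice contributes a count of two to that single cluster, instead of two counts for "e" and two for the accent. A cluster that is unexpected for the language of the text then stands out in the resulting table.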

For the PDF files, we selected fonts which we believe are
available without a fee to anyone working on this project. Many
thanks to the individuals and foundries who generously made those
fonts available. If we misinterpreted a license, please let us know
and accept our apologies. This site does not provide the fonts
themselves, but here are the places where we found them: