types123: Type accumulation curves

Jukka Suomela

types123 is a set of freely available corpus tools for comparing the frequencies of words, types, and hapax legomena across subcorpora. These tools use accumulation curves and the statistical technique of permutation testing to compare the subcorpora with a “typical” corpus of a similar size, in order to visualize the frequencies and to identify statistically significant findings.

Three generations of this software are available: