In particular, DBEF tests can yield different conclusions for two samples in which the numerals appear with the same relative frequencies, but sample sizes differ. The null hypothesis of no fraud is more likely to be rejected for larger sample sizes if the data have the same probability densities over digits and significance threshold.