Values for some general parameters
parameter |
value |
number of sentences |
9806 |
average sentence length in characters |
120.0310 |
average sentence length in words |
12.4615 |
number of distinct word forms |
18094 |
percentage of lower case word forms |
67.9673 |
percentage of multi word units |
2.3102 |
number of running word forms |
148190 |
percentage of lower case running words |
78.4825 |
average word form length |
10.8298 |
average running word length |
8.54441459 |
percentage of word forms with frequency=1 |
65.4416 |
number of sentence based co-occurrences |
27070 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
3537.17 |
number of neighbour based co-occurrences |
4034 |
minimal likelihood ratio |
3.85 |
maximal likelihood ratio |
6513.18 |
average number of sentence based co-occurrences per sentence |
49.6861 |
average number of neighbour co-occurrences per sentence |
5.7571 |
most frequent word |
éí |
frequent word's frequency |
12949 |
1064 msec needed at 2018-01-06 17:30