Values for some general parameters
Parameter |
Value |
Number of sentences |
21964 |
Average sentence length in characters |
67.3992 |
Average sentence length in words |
11.1800 |
Number of distinct word forms |
61554 |
Number of distinct word forms (without multiwords) |
61554 |
Percentage of lower case word forms |
45.1847 |
Number of multi word units |
0 |
Percentage of multi word units |
0.0000 |
Number of running word forms |
243856 |
Number of running word forms (without multiwords) |
243856 |
Percentage of lower case running words |
66.2046 |
Average word form length |
8.0390 |
Average running word length |
4.97575619 |
Percentage of word forms with frequency=1 |
73.1715 |
Number of sentence based co-occurrences |
50312 |
- minimal likelihood ratio |
6.63 |
- maximal likelihood ratio |
1347.93 |
Number of neighbour based co-occurrences |
6178 |
- minimal likelihood ratio |
3.84 |
- maximal likelihood ratio |
1697.51 |
Average number of sentence based co-occurrences per sentence |
16.6879 |
Average number of neighbour co-occurrences per sentence |
1.6290 |
Most frequent word |
und |
Frequent word's frequency |
4327 |
2512 msec needed at 2018-06-01 11:30