Corpus: sna-zw_web_2019

1.1 Summary

Values for some general parameters

Parameter                                                     Value
Number of sentences                                           3876
Average sentence length in characters                         86.1850
Average sentence length in words                              10.9835
Number of distinct word forms                                 16473
Number of distinct word forms (without multiwords)            16473
Percentage of lower case word forms                           82.1040
Number of multi word units                                    0
Percentage of multi word units                                0.0000
Number of running word forms                                  42479
Number of running word forms (without multiwords)             42479
Percentage of lower case running words                        84.9479
Average word form length                                      8.4854
Average running word length                                   6.76779115
Percentage of word forms with frequency=1                     73.2593
Number of sentence based co-occurrences                       4868
  - minimal likelihood ratio                                  6.63
  - maximal likelihood ratio                                  1043.86
Number of neighbour based co-occurrences                      749
  - minimal likelihood ratio                                  3.90
  - maximal likelihood ratio                                  2320.25
Average number of sentence based co-occurrences per sentence  6.3044
Average number of neighbour co-occurrences per sentence       0.9959
Most frequent word                                            kuti
Most frequent word's frequency                                1099
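
The sketch below illustrates how several of the parameters above could be derived from a list of sentences. It is not the pipeline that produced this page: the function name `corpus_summary`, the whitespace tokenization, and the reading of "lower case" as "first character is lower case" are all assumptions made for illustration. It also shows why the average word form length (each distinct form counted once) can exceed the average running word length (each form weighted by its frequency): frequent words tend to be short.

```python
from collections import Counter


def corpus_summary(sentences):
    """Compute a few summary parameters for a non-empty list of sentences.

    Assumptions (not taken from the source page):
    - tokens are whitespace-separated;
    - a "lower case" word form is one whose first character is lower case.
    """
    tokens = [tok for s in sentences for tok in s.split()]
    freq = Counter(tokens)  # word form -> number of occurrences

    n_sentences = len(sentences)
    n_tokens = len(tokens)   # running word forms
    n_types = len(freq)      # distinct word forms
    top_word, top_count = freq.most_common(1)[0]

    return {
        "sentences": n_sentences,
        "avg_sentence_length_chars": sum(len(s) for s in sentences) / n_sentences,
        "avg_sentence_length_words": n_tokens / n_sentences,
        "distinct_word_forms": n_types,
        "running_word_forms": n_tokens,
        "pct_lower_case_word_forms": 100 * sum(1 for w in freq if w[0].islower()) / n_types,
        # Unweighted mean over distinct forms.
        "avg_word_form_length": sum(len(w) for w in freq) / n_types,
        # Mean over all occurrences, so frequent forms count more.
        "avg_running_word_length": sum(len(t) for t in tokens) / n_tokens,
        # Share of distinct forms that occur exactly once.
        "pct_frequency_1": 100 * sum(1 for c in freq.values() if c == 1) / n_types,
        "most_frequent_word": top_word,
        "most_frequent_word_frequency": top_count,
    }


if __name__ == "__main__":
    # Toy input, invented for illustration only.
    summary = corpus_summary(["kuti a b", "kuti c"])
    for name, value in summary.items():
        print(f"{name}: {value}")
```

On the toy input, the two occurrences of "kuti" pull the running word average (2.2) above the word form average (1.75); in the corpus above the relation is reversed (6.77 versus 8.49), which is the usual pattern when the most frequent forms are short.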
Generated in 819 msec on 2022-02-20 at 14:00.