Korpus: nav_wikipedia_2014

Weitere Korpora

1.1 Summary

Values for some general parameters

parameter value
number of sentences 9806
average sentence length in characters 120.0310
average sentence length in words 12.4615
number of distinct word forms 18094
percentage of lower case word forms 67.9673
percentage of multi word units 2.3102
number of running word forms 148190
percentage of lower case running words 78.4825
average word form length 10.8298
average running word length 8.54441459
percentage of word forms with frequency=1 65.4416
number of sentence based co-occurrences 27070
minimal likelihood ratio 6.63
maximal likelihood ratio 3537.17
number of neighbour based co-occurrences 4034
minimal likelihood ratio 3.85
maximal likelihood ratio 6513.18
average number of sentence based co-occurrences per sentence 49.6861
average number of neighbour co-occurrences per sentence 5.7571
most frequent word éí
frequent word's frequency 12949
1064 msec needed at 2018-01-06 17:30