Korpus: ast_web_2012

Weitere Korpora

1.1 Summary

Values for some general parameters

Parameter Value
Number of sentences 468
Average sentence length in characters 93.8611
Average sentence length in words 15.1154
Number of distinct word forms 2757
Number of distinct word forms (without multiwords) 2745
Percentage of lower case word forms 77.8020
Number of multi word units 12
Percentage of multi word units 0.4353
Number of running word forms 7073
Number of running word forms (without multiwords) 7060
Percentage of lower case running words 80.3195
Average word form length 7.2539
Average running word length 5.13470255
Percentage of word forms with frequency=1 71.7446
Number of sentence based co-occurrences 1566
- minimal likelihood ratio 6.65
- maximal likelihood ratio 277.96
Number of neighbour based co-occurrences 164
- minimal likelihood ratio 3.86
- maximal likelihood ratio 510.23
Average number of sentence based co-occurrences per sentence 22.5171
Average number of neighbour co-occurrences per sentence 2.0641
Frequent word's frequency 340
262 msec needed at 2018-04-04 04:50