Korpus: orm_wikipedia_2018

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
an Qananiisaan 29
An Qananiisaan 29
an Aanan 16
An Aanan 16
ha dhahaa 13
Al walaloo 13
al walaloo 13
is fiizisistii 11
an shanan 9
an shananii 9
Subword Length 2 - Most frequent subwords
Subword Count
an 37
An 37
Al 30
al 30
in 18
In 18
ta 17
am 17
si 16
Da 16
Amount of words containing repeated subwords of length 2 - per mille
Per mille
9.2626
Subword Length 3 - most frequent words
Subword Word Frequency
fin Finfinnee 71
fin Finfinneetti 30
gar gargar 17
gar gargari 16
gar gargara 14
fin Finfinneerraa 11
yaa yaayaa 10
Yaa yaayaa 10
gal galgala 8
iin Pirootiiniin 4
Subword Length 3 - Most frequent subwords
Subword Count
fin 19
iin 14
IIn 14
gar 10
dha 9
ala 6
aka 5
Laa 5
gal 4
aga 3
Amount of words containing repeated subwords of length 3 - per mille
Per mille
3.7893
Subword Length 4 - most frequent words
Subword Word Frequency
gara garagara 10
Gara garagara 10
gara garagaraa 5
Gara garagaraa 5
dhaa dhadhaadhaan 2
dhaa hidhaadhaa 2
bara barabaraan 1
gara garagaraatti 1
gara garagarat 1
Bara barabaraan 1
Subword Length 4 - Most frequent subwords
Subword Count
gara 4
Gara 4
dhaa 2
bara 1
Bara 1
gala 1
itti 1
Itti 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.5344
Subword Length 5 - most frequent words
Subword Word Frequency
garaa garaagaraa 2
Garaa garaagaraa 2
garaa garaagaraatti 1
Garaa garaagaraatti 1
Subword Length 5 - Most frequent subwords
Subword Count
garaa 2
Garaa 2
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.2270
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Da badda-daree 2
Da badda-dareen 2
da badda-daree 2
da badda-dareen 2
an Jean-André 1
An Jean-André 1
ti fiildi-iffekti-tiraanzisterii 1
Subword Length 2 - Most frequent subwords
Subword Count
Da 2
da 2
an 1
An 1
ti 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1425
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
81748 msec needed at 2024-04-09 14:01