Korpus: mri_wikipedia_2018

Weitere Korpora

3.12.13 Compounds

Compounds among the top 10.000 words

Quantity of compounds without interfix
Count
32
Most frequent compounds
compound structure w_id compound w_id word1 w_id word2
Kawairirangi Kawa-irirangi 2427 5433 1904
kohikohinga kohi-kohinga 2918 1046 1489
tinimoutere tini-moutere 3142 940 196
Manamotuhake Mana-motuhake 5668 1373 384
Rakiwhakaputa Raki-whakaputa 6286 785 587
motupapatea motu-papatea 7718 207 3020
nekenekehanga neke-nekehanga 7774 689 2989
tuhituhinga tuhi-tuhinga 8320 700 755
wetetāmitanga wete-tāmitanga 8461 8460 8348
whaitikanga whai-tikanga 8468 249 242
whakamāramatanga whaka-māramatanga 1084 3212 1950
whakawhitiwhiti whaka-whitiwhiti 2084 3212 3261
whakamarumaru whaka-marumaru 3225 3212 2960
whakanohoanga whaka-nohoanga 3231 3212 1241
whakatakotoranga whaka-takotoranga 3245 3212 8185
whakawhirinaki whaka-whirinaki 3254 3212 954
Rongowhakaata Rongo-whakaata 6396 2627 1080
whakaahuatanga whaka-ahuatanga 8470 3212 2796
whakaakoranga whaka-akoranga 8472 3212 7071
whakamōhiotia whaka-mōhiotia 8531 3212 498
whakapahupahu whaka-pahupahu 8553 3212 7856
whakapūmautanga whaka-pūmautanga 8568 3212 3066
whakarangatira whaka-rangatira 8570 3212 223
whakararuraru whaka-raruraru 8573 3212 641
whakarerekētanga whaka-rerekētanga 8581 3212 8081
whakarāpopoto whaka-rāpopoto 8591 3212 8114
whakarārangihia whaka-rārangihia 8592 3212 8115
whakatairanga whaka-tairanga 8597 3212 3108
whakaturituri whaka-turituri 8627 3212 2039
whakatutukitanga whaka-tutukitanga 8629 3212 8337
Quantity of compounds with interfix length 1
Count
52
Most frequent compounds
compound structure w_id compound w_id word1 w_id word2
West Virginia West- -Virginia 2761 2760 1435
Kāti Hinetewai Kāti- -Hinetewai 5538 478 5236
Kāti Huirapa Kāti- -Huirapa 5539 478 5280
Kāti Irakehu Kāti- -Irakehu 5540 478 5337
Kāti Rakiwhakaputa Kāti- -Rakiwhakaputa 5544 478 6286
Kāti Ruahikihiki Kāti- -Ruahikihiki 5545 478 6409
Kāti Tahupōtiki Kāti- -Tahupōtiki 5546 478 6560
Kāti Tūahuriri Kāti- -Tūahuriri 5550 478 6812
Liliʻuokalani Lili-ʻ-uokalani 5601 1739 2049
Ngāi Tamaoki Ngāi- -Tamaoki 5884 274 6588
Ngāi Tamapare Ngāi- -Tamapare 5885 274 6590
Ngāi Tamarāwaho Ngāi- -Tamarāwaho 5886 274 6591
Ngāi Tamawera Ngāi- -Tamawera 5887 274 6593
Ngāi Tuariki Ngāi- -Tuariki 5888 274 2704
Rewi Maniapoto Rewi- -Maniapoto 6366 677 536
Witi Ihimaera Witi- -Ihimaera 7023 7022 5312
Wīwī-Puruhia Wīwī---Puruhia 7042 308 6229
raro-paparahi raro---paparahi 8056 279 288
tino-hangarau tino---hangarau 8268 198 1461
wahi-tūmatanui wahi---tūmatanui 8439 610 1272
āhua-perehitini āhua---perehitini 8751 232 7900
Ngāti Maniapoto Ngāti- -Maniapoto 723 143 536
Ngāti Tūwharetoa Ngāti- -Tūwharetoa 994 143 379
Ngāti Kahungunu Ngāti- -Kahungunu 2534 143 2409
Ngāti Ranginui Ngāti- -Ranginui 2535 143 1393
Ngāti Tūrangitukua Ngāti- -Tūrangitukua 2539 143 1195
Islas Baleares Islas- -Baleares 5338 2399 2274
Islas Canarias Islas- -Canarias 5339 2399 1339
Ngāti Hikakino Ngāti- -Hikakino 5894 143 5224
Ngāti Hinematua Ngāti- -Hinematua 5895 143 5232
Quantity of compounds with interfix length 2
Count
2
Most frequent compounds
compound structure w_id compound w_id word1 w_id word2
whakakotahitanga whaka-ko-tahitanga 8508 3212 8154
hangaia-kaiwhakamahi hangai-a--kaiwhakamahi 7232 2837 2892
Most frequent interfixes of length 1
interfix count
- - 42
--- 9
-ʻ- 1
Most frequent interfixes of length 2
interfix count
-a-- 1
-ko- 1
183426 msec needed at 2019-04-23 00:03