The data are taken from the Phonological Textual Sub-corpus. The domain is the phonological word.
· Abbreviations and symbols
· Detailed description of the Phonological Corpus (including transcription)
CV Pattern | Occ Pgm/PhWto | Rat Pgm/PhWto | Occ Pgm/PhWty | Rat Pgm/PhWty |
CV | 564365 | 0.54817 | 522404 | 0.55040 |
CCV | 171713 | 0.16679 | 161680 | 0.17034 |
CVC | 142849 | 0.13875 | 131027 | 0.13805 |
CCVC | 44195 | 0.04293 | 40789 | 0.04298 |
CVCC | 30923 | 0.03004 | 29316 | 0.03089 |
V | 29069 | 0.02823 | 23670 | 0.02494 |
CCCV | 16144 | 0.01568 | 15080 | 0.01589 |
VC | 14749 | 0.01433 | 11435 | 0.01205 |
CCVCC | 7549 | 0.00733 | 7121 | 0.00750 |
CCCVC | 3604 | 0.00350 | 3431 | 0.00362 |
CCCVCC | 1144 | 0.00111 | 1077 | 0.00114 |
VCC | 847 | 0.00082 | 617 | 0.00065 |
C | 785 | 0.00076 | 3 | <0.00001 |
CVCCC | 727 | 0.00071 | 672 | 0.00071 |
CCCCV | 465 | 0.00045 | 450 | 0.00047 |
CCVCCC | 199 | 0.00019 | 169 | 0.00018 |
CCCCVC | 154 | 0.00015 | 143 | 0.00015 |
VCCC | 21 | 0.00002 | 18 | 0.00002 |
CCCVCCC | 20 | 0.00002 | 17 | 0.00002 |
CCCCVCC | 6 | <0.00001 | 6 | <0.00001 |
CVCCCC | 3 | <0.00001 | 3 | <0.00001 |
CC | 2 | <0.00001 | 2 | <0.00001 |
CCCCCVC | 2 | <0.00001 | 2 | <0.00001 |
CCCCCV | 1 | <0.00001 | 1 | <0.00001 |
Total | 1029536 | 1 | 949133 | 1 |
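
The ratio columns appear to be each count divided by the sum of its own column, printed to five decimals, with "<0.00001" standing in for anything smaller. The following is a minimal sketch under that assumption, not the corpus authors' own procedure; the names `COUNTS`, `column_ratios` and `format_ratio` are hypothetical and introduced only for illustration. The counts are copied from the rows above. Values recomputed this way may differ from the printed ones in the last digit because of rounding.

```python
# Counts copied from the table rows:
# pattern -> (Occ Pgm/PhWto, Occ Pgm/PhWty)
COUNTS = {
    "CV": (564365, 522404),
    "CCV": (171713, 161680),
    "CVC": (142849, 131027),
    "CCVC": (44195, 40789),
    "CVCC": (30923, 29316),
    "V": (29069, 23670),
    "CCCV": (16144, 15080),
    "VC": (14749, 11435),
    "CCVCC": (7549, 7121),
    "CCCVC": (3604, 3431),
    "CCCVCC": (1144, 1077),
    "VCC": (847, 617),
    "C": (785, 3),
    "CVCCC": (727, 672),
    "CCCCV": (465, 450),
    "CCVCCC": (199, 169),
    "CCCCVC": (154, 143),
    "VCCC": (21, 18),
    "CCCVCCC": (20, 17),
    "CCCCVCC": (6, 6),
    "CVCCCC": (3, 3),
    "CC": (2, 2),
    "CCCCCVC": (2, 2),
    "CCCCCV": (1, 1),
}


def column_ratios(counts, column):
    """Return {pattern: count / column_sum} for one count column (0 or 1)."""
    total = sum(row[column] for row in counts.values())
    return {pattern: row[column] / total for pattern, row in counts.items()}


def format_ratio(value):
    """Format a ratio the way the table does (assumed): five decimals,
    with '<0.00001' for values below the smallest printable step."""
    return "<0.00001" if value < 0.00001 else f"{value:.5f}"


if __name__ == "__main__":
    # Print the recomputed ratios for both count columns, row by row.
    for column, label in enumerate(("PhWto", "PhWty")):
        ratios = column_ratios(COUNTS, column)
        for pattern, value in ratios.items():
            print(label, pattern, format_ratio(value))
```

Summing each count column in this way is also a quick consistency check on a table of this kind, since every ratio depends on its column sum.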