Frequency of the Czech phonemes

The data are taken from the Phonological Textual Sub-corpus. The domain is the phonological word. Symbol "lL" stands for the phoneme /l/ and symbol "rR" for the phoneme /r/. In the corpora, symbols "l" and "r" are used for non-nuclear /l/ and /r/, and symbols "L" and "R" for nuclear /r/ and /l/.

· Abbreviations and symbols
· Detailed description of the Phonological Corpus (including transcription)

Phoneme Occ PhWto Rat PhWto Occ PhWty Rat PhWty
e 1485699 0.09816 316790 0.09243
a 1146783 0.07577 222064 0.06479
o 1029996 0.06805 217540 0.06347
i 929895 0.06144 214252 0.06251
Ll 810157 0.05353 172321 0.05028
n 632573 0.04180 156221 0.04558
s 555712 0.03672 141215 0.04120
m 489198 0.03232 126079 0.03679
ī 504197 0.03331 125581 0.03664
v 509243 0.03365 124686 0.03638
t 531905 0.03514 122413 0.03572
Rr 440009 0.02907 108029 0.03152
T 491063 0.03244 107972 0.03150
j 461664 0.03050 106104 0.03096
p 437317 0.02889 102280 0.02984
S 398572 0.02633 100689 0.02938
k 421260 0.02783 97827 0.02854
u 357177 0.02360 83489 0.02436
d 402679 0.02661 79877 0.02331
ā 329891 0.02180 78910 0.02302
ň 273582 0.01808 57342 0.01673
z 215306 0.01422 55179 0.01610
b 261286 0.01726 54180 0.01581
h 228752 0.01511 50813 0.01483
š 216579 0.01431 50311 0.01468
ř 185079 0.01223 42565 0.01242
ē 145032 0.00958 33763 0.00985
Š 161404 0.01066 33573 0.00980
ť 140937 0.00931 32971 0.00962
K 174466 0.01153 32107 0.00937
ö 111518 0.00737 29701 0.00867
F 112064 0.00740 28038 0.00818
X 83579 0.00552 25078 0.00732
ž 122217 0.00807 22408 0.00654
ū 67777 0.00448 16676 0.00487
x 71755 0.00474 16069 0.00469
ď 87243 0.00576 14750 0.00430
P 26629 0.00176 6924 0.00202
f 16603 0.00110 5762 0.00168
M 34117 0.00225 4397 0.00128
g 10459 0.00069 4305 0.00126
Ť 17684 0.00117 4139 0.00121
ō 3428 0.00023 1060 0.00031
ä 2684 0.00018 819 0.00024
ë 127 <0.00001 71 0.00002
Total 15135297 1 3427340 1