Frequency of the Czech phonemes

The data are taken from the Phonological Lexical Sub-corpus. The domain is the phonological word. Symbol "lL" stands for the phoneme /l/ and symbol "rR" for the phoneme /r/. In the corpora, symbols "l" and "r" are used for non-nuclear /l/ and /r/, and symbols "L" and "R" for nuclear /r/ and /l/.

· Abbreviations and symbols
· Detailed description of the Phonological Corpus (including transcription)

Phoneme Occ PhWto Rat PhWto Occ PhWty Rat PhWty
o 208815 0.08124 192294 0.08083
a 196059 0.07628 181195 0.07616
i 192516 0.07490 177361 0.07455
e 174913 0.06805 158491 0.06662
v 124487 0.04843 117079 0.04921
ī 115520 0.04494 109241 0.04592
T 120864 0.04702 108671 0.04568
Rr 115343 0.04487 107019 0.04498
n 112412 0.04373 103892 0.04367
Ll 104308 0.04058 97927 0.04116
k 97745 0.03803 92554 0.03890
t 97157 0.03780 89966 0.03782
S 92295 0.03591 85167 0.03580
p 82945 0.03227 76919 0.03233
ť 62305 0.02424 59463 0.02499
s 67063 0.02609 58688 0.02467
m 50650 0.01970 46685 0.01962
d 48453 0.01885 45212 0.01900
ā 47252 0.01838 44856 0.01885
ň 45325 0.01763 42816 0.01800
u 46777 0.01820 42691 0.01795
z 46736 0.01818 40916 0.01720
j 39220 0.01526 36431 0.01531
š 34792 0.01354 32758 0.01377
ř 33482 0.01303 31676 0.01331
b 33348 0.01297 31190 0.01311
h 25458 0.00990 24102 0.01013
Š 22500 0.00875 21801 0.00916
K 20927 0.00814 19656 0.00826
ö 16540 0.00643 15521 0.00652
x 13076 0.00509 12385 0.00521
ž 12577 0.00489 11891 0.00500
f 11472 0.00446 10572 0.00444
g 10267 0.00399 9267 0.00390
F 9430 0.00367 8950 0.00376
ď 7446 0.00290 7124 0.00299
P 6816 0.00265 6109 0.00257
ē 6740 0.00262 5975 0.00251
ū 6455 0.00251 5736 0.00241
M 2574 0.00100 2484 0.00104
X 2501 0.00097 2371 0.00100
ō 2621 0.00102 2016 0.00085
ä 1255 0.00049 1069 0.00045
Ť 528 0.00021 513 0.00022
ë 407 0.00016 342 0.00014
Total 2570372 1 2379042 1