These tables were generated from the following randomly selected texts:
Word lengths (sample contains 1226563 words):
1 - 53963 4.40%
2 - 212605 17.33%
3 - 304910 24.86%
4 - 222728 18.16%
5 - 132060 10.77%
6 - 98057 7.99%
7 - 78570 6.41%
8 - 51606 4.21%
9 - 33874 2.76%
10 - 20215 1.65%
11 - 8768 0.71%
12 - 5504 0.45%
13 - 2159 0.18%
14 - 1202 0.10%
15 - 262 0.02%
16 - 65 0.01%
17 - 12 0.00%
18 - 1 0.00%
19 - 2 0.00%
20 - 0 0.00%
>20 - 0 0.00%

Monogram frequencies (sample contains 6442495 letters)
Alphabetical order | Ordered by frequency
A - 425718 6.61% | (space) - 1226563 19.04%
B - 74218 1.15% | E - 655257 10.17%
C - 123682 1.92% | T - 474521 7.37%
D - 237658 3.69% | A - 425718 6.61%
E - 655257 10.17% | O - 392957 6.10%
F - 112983 1.75% | I - 362252 5.62%
G - 103528 1.61% | N - 358902 5.57%
H - 349152 5.42% | H - 349152 5.42%
I - 362252 5.62% | S - 326957 5.08%
J - 5329 0.08% | R - 294810 4.58%
K - 42573 0.66% | D - 237658 3.69%
L - 209675 3.25% | L - 209675 3.25%
M - 131917 2.05% | U - 146596 2.28%
N - 358902 5.57% | M - 131917 2.05%
O - 392957 6.10% | C - 123682 1.92%
P - 84603 1.31% | W - 122200 1.90%
Q - 4923 0.08% | F - 112983 1.75%
R - 294810 4.58% | Y - 106312 1.65%
S - 326957 5.08% | G - 103528 1.61%
T - 474521 7.37% | P - 84603 1.31%
U - 146596 2.28% | B - 74218 1.15%
V - 56626 0.88% | V - 56626 0.88%
W - 122200 1.90% | K - 42573 0.66%
X - 9205 0.14% | X - 9205 0.14%
Y - 106312 1.65% | J - 5329 0.08%
Z - 3378 0.05% | Q - 4923 0.08%
Å - 0 0.00% | Z - 3378 0.05%
Ä - 0 0.00% | Å - 0 0.00%
Ö - 0 0.00% | Ä - 0 0.00%
(space) - 1226563 19.04% | Ö - 0 0.00%

In the tables below, an entry with fewer letters than the n-gram length also contains one or more spaces; a one-letter entry in a bigram table, for example, is that letter next to a space.

Most common bigrams including space (sample includes 6442495 bigrams)
E - 245521 3.81% | M - 54707 0.85% | H - 34308 0.53%
T - 188459 2.93% | AT - 54679 0.85% | ME - 33498 0.52%
HE - 158681 2.46% | ON - 54317 0.84% | P - 33488 0.52%
TH - 155382 2.41% | B - 52647 0.82% | NT - 33309 0.52%
D - 151912 2.36% | HI - 51487 0.80% | EA - 33115 0.51%
A - 137885 2.14% | EN - 50680 0.79% | AL - 31638 0.49%
T - 131548 2.04% | TO - 48934 0.76% | L - 31413 0.49%
S - 127468 1.98% | NG - 48452 0.75% | L - 31271 0.49%
H - 103608 1.61% | C - 46867 0.73% | A - 31181 0.48%
S - 97862 1.52% | IS - 46795 0.73% | LL - 30942 0.48%
IN - 94900 1.47% | IT - 46750 0.73% | NE - 29606 0.46%
N - 90466 1.40% | F - 44074 0.68% | N - 28561 0.44%
AN - 89239 1.39% | OR - 43306 0.67% | TI - 27954 0.43%
W - 87123 1.35% | F - 42456 0.66% | DE - 27149 0.42%
ER - 84372 1.31% | AS - 41550 0.64% | NO - 27144 0.42%
I - 78395 1.22% | G - 40856 0.63% | BE - 25716 0.40%
R - 71433 1.11% | TE - 40346 0.63% | RO - 25665 0.40%
RE - 69581 1.08% | ES - 40152 0.62% | R - 25511 0.40%
O - 69365 1.08% | D - 39144 0.61% | WA - 25409 0.39%
Y - 69357 1.08% | AR - 38194 0.59% | WH - 25352 0.39%
ND - 64917 1.01% | ST - 38056 0.59% | M - 24953 0.39%
O - 61336 0.95% | LE - 37620 0.58% | HO - 24900 0.39%
OU - 59917 0.93% | SE - 36629 0.57% | Y - 24563 0.38%
HA - 58931 0.91% | OF - 35593 0.55% | EL - 24556 0.38%
ED - 56774 0.88% | VE - 35534 0.55% | AD - 24154 0.37%

Most common bigrams at the beginning of words (sample includes 1226563 trigrams)
TH - 125714 10.25% | SO - 12480 1.02% | SI - 6781 0.55%
AN - 50095 4.08% | MO - 12065 0.98% | GO - 6575 0.54%
TO - 40128 3.27% | AS - 12000 0.98% | MY - 6421 0.52%
HE - 39426 3.21% | WE - 11936 0.97% | SU - 6383 0.52%
OF - 34439 2.81% | SE - 11028 0.90% | DA - 6012 0.49%
IN - 28313 2.31% | CA - 10927 0.89% | FI - 5343 0.44%
HI - 26851 2.19% | BU - 10719 0.87% | CH - 5325 0.43%
HA - 26660 2.17% | ME - 10697 0.87% | LA - 5276 0.43%
WH - 24883 2.03% | ST - 10569 0.86% | PE - 5042 0.41%
A - 23513 1.92% | DO - 10360 0.84% | EX - 4975 0.41%
BE - 22023 1.80% | AT - 9867 0.80% | FE - 4805 0.39%
WA - 20721 1.69% | LI - 9455 0.77% | PO - 4757 0.39%
YO - 20708 1.69% | DE - 9078 0.74% | BY - 4756 0.39%
NO - 19878 1.62% | PR - 9064 0.74% | MI - 4720 0.38%
CO - 19722 1.61% | WO - 9033 0.74% | UP - 4719 0.38%
WI - 19434 1.58% | IS - 8833 0.72% | GR - 4691 0.38%
I - 18192 1.48% | FR - 8512 0.69% | NE - 4654 0.38%
SH - 16490 1.34% | HO - 8188 0.67% | OU - 4632 0.38%
SA - 15659 1.28% | DI - 8171 0.67% | UN - 4629 0.38%
IT - 15521 1.27% | LO - 7779 0.63% | CR - 4578 0.37%
FO - 15241 1.24% | LE - 7583 0.62% | EV - 4517 0.37%
RE - 15029 1.23% | AR - 7413 0.60% | TR - 4428 0.36%
ON - 14957 1.22% | S - 7372 0.60% | BR - 4323 0.35%
MA - 14752 1.20% | FA - 7149 0.58% | BA - 4295 0.35%
AL - 12594 1.03% | PA - 6801 0.55% | TA - 4134 0.34%

Most common bigrams at the end of words (sample includes 1226563 trigrams)
HE - 101821 8.30% | TH - 14891 1.21% | UR - 5982 0.49%
ED - 53080 4.33% | AD - 14338 1.17% | MY - 5978 0.49%
ND - 51591 4.21% | VE - 14022 1.14% | TY - 5944 0.48%
NG - 39647 3.23% | ST - 13369 1.09% | TS - 5844 0.48%
ER - 38873 3.17% | NT - 13130 1.07% | ET - 5778 0.47%
TO - 37868 3.09% | LE - 13047 1.06% | SO - 5498 0.45%
AT - 33811 2.76% | LD - 12476 1.02% | RT - 5286 0.43%
OF - 32699 2.67% | ID - 12256 1.00% | KE - 5192 0.42%
IS - 29806 2.43% | CH - 12086 0.99% | DE - 5097 0.42%
AS - 26232 2.14% | CE - 11760 0.96% | AL - 5047 0.41%
IN - 25271 2.06% | OT - 11697 0.95% | BY - 4857 0.40%
RE - 24297 1.98% | SE - 11433 0.93% | IR - 4769 0.39%
A - 23513 1.92% | NE - 10613 0.87% | LF - 4555 0.37%
ON - 22656 1.85% | OW - 9434 0.77% | US - 4472 0.36%
EN - 19830 1.62% | AY - 8627 0.70% | DS - 4406 0.36%
LL - 19094 1.56% | IM - 8566 0.70% | HO - 4228 0.34%
ES - 18196 1.48% | RY - 7904 0.64% | AR - 4211 0.34%
I - 18192 1.48% | S - 7372 0.60% | NS - 4183 0.34%
LY - 17917 1.46% | HT - 7283 0.59% | EE - 4178 0.34%
OR - 17357 1.42% | RS - 7167 0.58% | NO - 4178 0.34%
ME - 17309 1.41% | SS - 7124 0.58% | RD - 3814 0.31%
UT - 16237 1.32% | OM - 7054 0.58% | WN - 3793 0.31%
IT - 15953 1.30% | TE - 7045 0.57% | GE - 3681 0.30%
OU - 15459 1.26% | EY - 6965 0.57% | CK - 3635 0.30%
AN - 15178 1.24% | BE - 6501 0.53% | DO - 3421 0.28%

Most common bigrams not including space (sample includes 5215931 bigrams)
TH - 167258 3.21% | TE - 42514 0.82% | SI - 26473 0.51%
HE - 159235 3.05% | TI - 40982 0.79% | SO - 26287 0.50%
IN - 95194 1.83% | SE - 39804 0.76% | RA - 26255 0.50%
ER - 90930 1.74% | AR - 39143 0.75% | EC - 26225 0.50%
AN - 90006 1.73% | LE - 38271 0.73% | YO - 25772 0.49%
RE - 71383 1.37% | OF - 37388 0.72% | BE - 25717 0.49%
ND - 66692 1.28% | SA - 36088 0.69% | AD - 25681 0.49%
ED - 66683 1.28% | VE - 35538 0.68% | SS - 25358 0.49%
HA - 64086 1.23% | ME - 33804 0.65% | DA - 25316 0.49%
ES - 63216 1.21% | AL - 33710 0.65% | LI - 24618 0.47%
OU - 60474 1.16% | NO - 32644 0.63% | OM - 24394 0.47%
TO - 58346 1.12% | NE - 31669 0.61% | RT - 24148 0.46%
AT - 56683 1.09% | LL - 31649 0.61% | EW - 24054 0.46%
EN - 55832 1.07% | EL - 31405 0.60% | DI - 24030 0.46%
ON - 55755 1.07% | SH - 30650 0.59% | CO - 23975 0.46%
EA - 55459 1.06% | OT - 30566 0.59% | EE - 23940 0.46%
NT - 54694 1.05% | TT - 30218 0.58% | MA - 23817 0.46%
ST - 54195 1.04% | RO - 29790 0.57% | EM - 23453 0.45%
HI - 53885 1.03% | DE - 29619 0.57% | AI - 22856 0.44%
NG - 49388 0.95% | TA - 28744 0.55% | UT - 22840 0.44%
IS - 49156 0.94% | DT - 28373 0.54% | WI - 22502 0.43%
IT - 48057 0.92% | RI - 28017 0.54% | CE - 22365 0.43%
AS - 45974 0.88% | WA - 26889 0.52% | OW - 22174 0.43%
OR - 45043 0.86% | WH - 26749 0.51% | CH - 22152 0.42%
ET - 42573 0.82% | HO - 26702 0.51% | RS - 21231 0.41%

Most common trigrams including space (sample includes 6442494 trigrams)
TH - 125714 1.95% | WH - 24883 0.39% | OR - 17357 0.27%
HE - 101821 1.58% | RE - 24297 0.38% | ME - 17309 0.27%
THE - 98530 1.53% | A - 23513 0.36% | E H - 17282 0.27%
ED - 53080 0.82% | E S - 23064 0.36% | D A - 16997 0.26%
ND - 51591 0.80% | HAT - 22861 0.35% | SH - 16490 0.26%
AN - 50095 0.78% | ON - 22656 0.35% | FOR - 16426 0.25%
AND - 48312 0.75% | E A - 22344 0.35% | UT - 16237 0.25%
TO - 40128 0.62% | BE - 22023 0.34% | S T - 16139 0.25%
NG - 39647 0.62% | N T - 21385 0.33% | IT - 15953 0.25%
HE - 39426 0.61% | HIS - 20975 0.33% | ERE - 15807 0.25%
ER - 38873 0.60% | T T - 20809 0.32% | SA - 15659 0.24%
ING - 38182 0.59% | WA - 20721 0.32% | IT - 15521 0.24%
TO - 37868 0.59% | YO - 20708 0.32% | OU - 15459 0.24%
OF - 34439 0.53% | YOU - 20678 0.32% | FO - 15241 0.24%
AT - 33811 0.52% | E W - 19929 0.31% | AN - 15178 0.24%
OF - 32699 0.51% | NO - 19878 0.31% | WAS - 15122 0.23%
IS - 29806 0.46% | EN - 19830 0.31% | RE - 15029 0.23%
D T - 28343 0.44% | CO - 19722 0.31% | E C - 15001 0.23%
IN - 28313 0.44% | WI - 19434 0.30% | ON - 14957 0.23%
HI - 26851 0.42% | THA - 19227 0.30% | TH - 14891 0.23%
HA - 26660 0.41% | LL - 19094 0.30% | MA - 14752 0.23%
E T - 26459 0.41% | ES - 18196 0.28% | AD - 14338 0.22%
AS - 26232 0.41% | I - 18192 0.28% | D H - 14309 0.22%
HER - 26208 0.41% | LY - 17917 0.28% | E O - 14113 0.22%
IN - 25271 0.39% | S A - 17434 0.27% | VE - 14022 0.22%

Most common trigrams not including space (sample includes 5215930 trigrams)
THE - 104376 2.00% | VER - 12279 0.24% | ESA - 9302 0.18%
AND - 48638 0.93% | TER - 12274 0.24% | EVE - 9271 0.18%
ING - 38500 0.74% | ALL - 12021 0.23% | NCE - 9249 0.18%
HER - 30219 0.58% | ION - 11289 0.22% | EDA - 9239 0.18%
THA - 24760 0.47% | FTH - 11247 0.22% | AID - 9213 0.18%
HAT - 23177 0.44% | STH - 11210 0.21% | HIN - 9203 0.18%
HIS - 21322 0.41% | OFT - 11144 0.21% | NDT - 9190 0.18%
YOU - 20873 0.40% | HAD - 11113 0.21% | HEN - 9184 0.18%
ERE - 20173 0.39% | REA - 11110 0.21% | BUT - 9178 0.18%
DTH - 18382 0.35% | EST - 10757 0.21% | OME - 9149 0.18%
ENT - 17684 0.34% | ERS - 10698 0.21% | ILL - 9120 0.17%
ETH - 16638 0.32% | GHT - 10475 0.20% | AST - 9111 0.17%
FOR - 16484 0.32% | ESS - 10280 0.20% | RTH - 9067 0.17%
NTH - 16221 0.31% | HIM - 10191 0.20% | OUL - 8901 0.17%
THI - 15782 0.30% | EAR - 10173 0.20% | ATT - 8848 0.17%
SHE - 15440 0.30% | EAN - 9983 0.19% | STO - 8836 0.17%
WAS - 15277 0.29% | AVE - 9720 0.19% | SAI - 8753 0.17%
HES - 14937 0.29% | ONE - 9672 0.19% | ATH - 8683 0.17%
ITH - 14829 0.28% | HEC - 9606 0.18% | OUN - 8664 0.17%
TTH - 14454 0.28% | TIN - 9590 0.18% | ERT - 8579 0.16%
OTH - 14352 0.28% | RES - 9485 0.18% | SAN - 8556 0.16%
INT - 13802 0.26% | HEW - 9480 0.18% | HOU - 8465 0.16%
NOT - 13411 0.26% | ONT - 9445 0.18% | OUR - 8460 0.16%
WIT - 13084 0.25% | ATI - 9437 0.18% | OUT - 8436 0.16%
EDT - 12922 0.25% | HEM - 9363 0.18% | HEA - 8393 0.16%
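
Counts like the ones in these tables can be reproduced with a short script. The following is a minimal sketch in Python, not the program that produced the tables. The function names (normalize, word_lengths, ngrams, word_edge_bigrams, table) are made up for illustration, and the normalization rules are assumptions: text is uppercased, only A-Z and Å, Ä, Ö are kept, and every run of other characters becomes a single space. The "not including space" tables contain cross-word entries such as DTH and FTH, so the sketch assumes that spaces are removed before counting in that mode.

```python
from collections import Counter
import re


def normalize(text: str) -> str:
    """Uppercase the text and turn every run of non-letters into one space.

    Assumption: only A-Z plus the letters Å, Ä, Ö count as letters.
    """
    return re.sub(r"[^A-ZÅÄÖ]+", " ", text.upper()).strip()


def word_lengths(stream: str) -> Counter:
    """Count how many words there are of each length."""
    return Counter(len(word) for word in stream.split())


def ngrams(stream: str, n: int, keep_spaces: bool = True) -> Counter:
    """Count all overlapping n-grams in the stream.

    With keep_spaces=False the spaces are removed first, so n-grams
    run across word boundaries (assumed from entries such as DTH).
    """
    if not keep_spaces:
        stream = stream.replace(" ", "")
    return Counter(stream[i:i + n] for i in range(len(stream) - n + 1))


def word_edge_bigrams(stream: str) -> tuple[Counter, Counter]:
    """Count the first and last two characters of every word.

    A one-letter word is padded with a space, giving "A " at the
    beginning and " A" at the end (an assumption of this sketch).
    """
    words = stream.split()
    begin = Counter((word + " ")[:2] for word in words)
    end = Counter((" " + word)[-2:] for word in words)
    return begin, end


def table(counts: Counter, top: int = 5) -> list[str]:
    """Format the most common entries as 'GRAM - count percent%'."""
    total = sum(counts.values())
    return [f"{gram} - {count} {100 * count / total:.2f}%"
            for gram, count in counts.most_common(top)]


if __name__ == "__main__":
    stream = normalize("The cat and the hat.")
    for row in table(ngrams(stream, 3, keep_spaces=False)):
        print(row)
```

Percentages are computed against the total number of counted items, which matches the tables above (for example, 125714 of 1226563 word beginnings is 10.25%). Totals may differ by one or two from the figures given in the headings, depending on how the start and end of the text are handled.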