Mono-, Bi- and Trigram Frequencies for English

These tables were generated from randomly selected texts, all taken from Project Gutenberg.

Word lengths (sample contains 1226563 words):
  1 -  53963   4.40%
  2 - 212605  17.33%
  3 - 304910  24.86%
  4 - 222728  18.16%
  5 - 132060  10.77%
  6 -  98057   7.99%
  7 -  78570   6.41%
  8 -  51606   4.21%
  9 -  33874   2.76%
 10 -  20215   1.65%
 11 -   8768   0.71%
 12 -   5504   0.45%
 13 -   2159   0.18%
 14 -   1202   0.10%
 15 -    262   0.02%
 16 -     65   0.01%
 17 -     12   0.00%
 18 -      1   0.00%
 19 -      2   0.00%
 20 -      0   0.00%
>20 -      0   0.00%
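
A word-length table of this shape can be reproduced with a short script. The following is a minimal sketch, not the program that produced the figures above. It assumes that a word is a maximal run of the letters A-Z after upper-casing, which may differ from the original tokenization (for example in how apostrophes, hyphens or Å, Ä, Ö are handled). The function name word_length_table and the sample sentence are illustrative only.

```python
import re
from collections import Counter

def word_length_table(text, max_len=20):
    """Count words by length; lengths above max_len share one '>max_len' row."""
    # Assumption: a word is a maximal run of A-Z after upper-casing.
    words = re.findall(r"[A-Z]+", text.upper())
    # Clamp long words to max_len + 1 so they land in the overflow bucket.
    counts = Counter(min(len(w), max_len + 1) for w in words)
    total = len(words)
    rows = []
    for length in range(1, max_len + 2):
        label = f">{max_len}" if length > max_len else str(length)
        n = counts.get(length, 0)
        pct = 100.0 * n / total if total else 0.0
        rows.append((label, n, pct))
    return rows

# Print only the non-empty rows, in the same "length - count percent" layout.
for label, n, pct in word_length_table("The quick brown fox jumps over the lazy dog"):
    if n:
        print(f"{label:>3} - {n:6d} {pct:6.2f}%")
```

Each percentage is the row count divided by the total number of words, which matches how the figures above relate to the stated sample size.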

Monogram frequencies (sample contains 6442495 characters, including 1226563 spaces)
Alphabetical order		Ordered by frequency
A -  425718   6.61%		  - 1226563  19.04%
B -   74218   1.15%		E -  655257  10.17%
C -  123682   1.92%		T -  474521   7.37%
D -  237658   3.69%		A -  425718   6.61%
E -  655257  10.17%		O -  392957   6.10%
F -  112983   1.75%		I -  362252   5.62%
G -  103528   1.61%		N -  358902   5.57%
H -  349152   5.42%		H -  349152   5.42%
I -  362252   5.62%		S -  326957   5.08%
J -    5329   0.08%		R -  294810   4.58%
K -   42573   0.66%		D -  237658   3.69%
L -  209675   3.25%		L -  209675   3.25%
M -  131917   2.05%		U -  146596   2.28%
N -  358902   5.57%		M -  131917   2.05%
O -  392957   6.10%		C -  123682   1.92%
P -   84603   1.31%		W -  122200   1.90%
Q -    4923   0.08%		F -  112983   1.75%
R -  294810   4.58%		Y -  106312   1.65%
S -  326957   5.08%		G -  103528   1.61%
T -  474521   7.37%		P -   84603   1.31%
U -  146596   2.28%		B -   74218   1.15%
V -   56626   0.88%		V -   56626   0.88%
W -  122200   1.90%		K -   42573   0.66%
X -    9205   0.14%		X -    9205   0.14%
Y -  106312   1.65%		J -    5329   0.08%
Z -    3378   0.05%		Q -    4923   0.08%
Å -       0   0.00%		Z -    3378   0.05%
Ä -       0   0.00%		Å -       0   0.00%
Ö -       0   0.00%		Ä -       0   0.00%
  - 1226563  19.04%		Ö -       0   0.00%
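
In the table above, the number of spaces equals the number of words in the sample, which is consistent with a text normalized to exactly one space per word. The sketch below counts single characters under that assumption. The normalization rules (upper-casing, keeping only A-Z, one space after every word) and the names normalize and monogram_table are assumptions made for illustration, not a description of the original program.

```python
import re
from collections import Counter

def normalize(text):
    """Upper-case the text and emit each word followed by exactly one space."""
    # Assumption: only A-Z count as letters; everything else separates words.
    words = re.findall(r"[A-Z]+", text.upper())
    return "".join(w + " " for w in words)

def monogram_table(text):
    """Return (character, count, percent) rows, most frequent first."""
    stream = normalize(text)
    counts = Counter(stream)
    total = len(stream)
    return [(ch, n, 100.0 * n / total) for ch, n in counts.most_common()]

for ch, n, pct in monogram_table("to be or not to be"):
    print(f"{ch} - {n:7d} {pct:6.2f}%")
```

Sorting the same rows by character instead of by count gives the alphabetical column.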


Most common bigrams including space (sample includes 6442495 bigrams; a blank beside a letter stands for the space)
 E  - 245521   3.81% |   M -  54707   0.85% |  H  -  34308   0.53%
  T - 188459   2.93% |  AT -  54679   0.85% |  ME -  33498   0.52%
 HE - 158681   2.46% |  ON -  54317   0.84% |   P -  33488   0.52%
 TH - 155382   2.41% |   B -  52647   0.82% |  NT -  33309   0.52%
 D  - 151912   2.36% |  HI -  51487   0.80% |  EA -  33115   0.51%
  A - 137885   2.14% |  EN -  50680   0.79% |  AL -  31638   0.49%
 T  - 131548   2.04% |  TO -  48934   0.76% |   L -  31413   0.49%
 S  - 127468   1.98% |  NG -  48452   0.75% |  L  -  31271   0.49%
  H - 103608   1.61% |   C -  46867   0.73% |  A  -  31181   0.48%
  S -  97862   1.52% |  IS -  46795   0.73% |  LL -  30942   0.48%
 IN -  94900   1.47% |  IT -  46750   0.73% |  NE -  29606   0.46%
 N  -  90466   1.40% |   F -  44074   0.68% |   N -  28561   0.44%
 AN -  89239   1.39% |  OR -  43306   0.67% |  TI -  27954   0.43%
  W -  87123   1.35% |  F  -  42456   0.66% |  DE -  27149   0.42%
 ER -  84372   1.31% |  AS -  41550   0.64% |  NO -  27144   0.42%
  I -  78395   1.22% |  G  -  40856   0.63% |  BE -  25716   0.40%
 R  -  71433   1.11% |  TE -  40346   0.63% |  RO -  25665   0.40%
 RE -  69581   1.08% |  ES -  40152   0.62% |   R -  25511   0.40%
  O -  69365   1.08% |   D -  39144   0.61% |  WA -  25409   0.39%
 Y  -  69357   1.08% |  AR -  38194   0.59% |  WH -  25352   0.39%
 ND -  64917   1.01% |  ST -  38056   0.59% |  M  -  24953   0.39%
 O  -  61336   0.95% |  LE -  37620   0.58% |  HO -  24900   0.39%
 OU -  59917   0.93% |  SE -  36629   0.57% |   Y -  24563   0.38%
 HA -  58931   0.91% |  OF -  35593   0.55% |  EL -  24556   0.38%
 ED -  56774   0.88% |  VE -  35534   0.55% |  AD -  24154   0.37%

Most common bigrams at the beginning of words (sample includes 1226563 trigrams, one per word, counting the leading space)
 TH - 125714  10.25% |  SO -  12480   1.02% |  SI -   6781   0.55%
 AN -  50095   4.08% |  MO -  12065   0.98% |  GO -   6575   0.54%
 TO -  40128   3.27% |  AS -  12000   0.98% |  MY -   6421   0.52%
 HE -  39426   3.21% |  WE -  11936   0.97% |  SU -   6383   0.52%
 OF -  34439   2.81% |  SE -  11028   0.90% |  DA -   6012   0.49%
 IN -  28313   2.31% |  CA -  10927   0.89% |  FI -   5343   0.44%
 HI -  26851   2.19% |  BU -  10719   0.87% |  CH -   5325   0.43%
 HA -  26660   2.17% |  ME -  10697   0.87% |  LA -   5276   0.43%
 WH -  24883   2.03% |  ST -  10569   0.86% |  PE -   5042   0.41%
 A  -  23513   1.92% |  DO -  10360   0.84% |  EX -   4975   0.41%
 BE -  22023   1.80% |  AT -   9867   0.80% |  FE -   4805   0.39%
 WA -  20721   1.69% |  LI -   9455   0.77% |  PO -   4757   0.39%
 YO -  20708   1.69% |  DE -   9078   0.74% |  BY -   4756   0.39%
 NO -  19878   1.62% |  PR -   9064   0.74% |  MI -   4720   0.38%
 CO -  19722   1.61% |  WO -   9033   0.74% |  UP -   4719   0.38%
 WI -  19434   1.58% |  IS -   8833   0.72% |  GR -   4691   0.38%
 I  -  18192   1.48% |  FR -   8512   0.69% |  NE -   4654   0.38%
 SH -  16490   1.34% |  HO -   8188   0.67% |  OU -   4632   0.38%
 SA -  15659   1.28% |  DI -   8171   0.67% |  UN -   4629   0.38%
 IT -  15521   1.27% |  LO -   7779   0.63% |  CR -   4578   0.37%
 FO -  15241   1.24% |  LE -   7583   0.62% |  EV -   4517   0.37%
 RE -  15029   1.23% |  AR -   7413   0.60% |  TR -   4428   0.36%
 ON -  14957   1.22% |  S  -   7372   0.60% |  BR -   4323   0.35%
 MA -  14752   1.20% |  FA -   7149   0.58% |  BA -   4295   0.35%
 AL -  12594   1.03% |  PA -   6801   0.55% |  TA -   4134   0.34%

Most common bigrams at the end of words (sample includes 1226563 trigrams, one per word, counting the trailing space)
 HE - 101821   8.30% |  TH -  14891   1.21% |  UR -   5982   0.49%
 ED -  53080   4.33% |  AD -  14338   1.17% |  MY -   5978   0.49%
 ND -  51591   4.21% |  VE -  14022   1.14% |  TY -   5944   0.48%
 NG -  39647   3.23% |  ST -  13369   1.09% |  TS -   5844   0.48%
 ER -  38873   3.17% |  NT -  13130   1.07% |  ET -   5778   0.47%
 TO -  37868   3.09% |  LE -  13047   1.06% |  SO -   5498   0.45%
 AT -  33811   2.76% |  LD -  12476   1.02% |  RT -   5286   0.43%
 OF -  32699   2.67% |  ID -  12256   1.00% |  KE -   5192   0.42%
 IS -  29806   2.43% |  CH -  12086   0.99% |  DE -   5097   0.42%
 AS -  26232   2.14% |  CE -  11760   0.96% |  AL -   5047   0.41%
 IN -  25271   2.06% |  OT -  11697   0.95% |  BY -   4857   0.40%
 RE -  24297   1.98% |  SE -  11433   0.93% |  IR -   4769   0.39%
  A -  23513   1.92% |  NE -  10613   0.87% |  LF -   4555   0.37%
 ON -  22656   1.85% |  OW -   9434   0.77% |  US -   4472   0.36%
 EN -  19830   1.62% |  AY -   8627   0.70% |  DS -   4406   0.36%
 LL -  19094   1.56% |  IM -   8566   0.70% |  HO -   4228   0.34%
 ES -  18196   1.48% |  RY -   7904   0.64% |  AR -   4211   0.34%
  I -  18192   1.48% |   S -   7372   0.60% |  NS -   4183   0.34%
 LY -  17917   1.46% |  HT -   7283   0.59% |  EE -   4178   0.34%
 OR -  17357   1.42% |  RS -   7167   0.58% |  NO -   4178   0.34%
 ME -  17309   1.41% |  SS -   7124   0.58% |  RD -   3814   0.31%
 UT -  16237   1.32% |  OM -   7054   0.58% |  WN -   3793   0.31%
 IT -  15953   1.30% |  TE -   7045   0.57% |  GE -   3681   0.30%
 OU -  15459   1.26% |  EY -   6965   0.57% |  CK -   3635   0.30%
 AN -  15178   1.24% |  BE -   6501   0.53% |  DO -   3421   0.28%
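
The two word-edge tables list one entry per word, and one-letter words appear padded with a space ("A " at the beginning, " A" at the end). A minimal sketch of that counting scheme follows. It assumes words are maximal runs of A-Z after upper-casing, and the name edge_bigrams is hypothetical.

```python
import re
from collections import Counter

def edge_bigrams(text):
    """Return (initial, final) Counters of two-character word edges."""
    # Assumption: words are maximal runs of A-Z after upper-casing.
    words = re.findall(r"[A-Z]+", text.upper())
    # Padding with a space lets one-letter words contribute "A " and " A".
    initial = Counter((w + " ")[:2] for w in words)
    final = Counter((" " + w)[-2:] for w in words)
    return initial, final

initial, final = edge_bigrams("The cat and the hat sat on a mat")
print(initial.most_common(3))
print(final.most_common(3))
```

Because every word contributes exactly one entry to each counter, the totals equal the word count, as they do in the sample sizes quoted above.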


Most common bigrams not including space (sample includes 5215931 bigrams)
 TH - 167258   3.21% |  TE -  42514   0.82% |  SI -  26473   0.51%
 HE - 159235   3.05% |  TI -  40982   0.79% |  SO -  26287   0.50%
 IN -  95194   1.83% |  SE -  39804   0.76% |  RA -  26255   0.50%
 ER -  90930   1.74% |  AR -  39143   0.75% |  EC -  26225   0.50%
 AN -  90006   1.73% |  LE -  38271   0.73% |  YO -  25772   0.49%
 RE -  71383   1.37% |  OF -  37388   0.72% |  BE -  25717   0.49%
 ND -  66692   1.28% |  SA -  36088   0.69% |  AD -  25681   0.49%
 ED -  66683   1.28% |  VE -  35538   0.68% |  SS -  25358   0.49%
 HA -  64086   1.23% |  ME -  33804   0.65% |  DA -  25316   0.49%
 ES -  63216   1.21% |  AL -  33710   0.65% |  LI -  24618   0.47%
 OU -  60474   1.16% |  NO -  32644   0.63% |  OM -  24394   0.47%
 TO -  58346   1.12% |  NE -  31669   0.61% |  RT -  24148   0.46%
 AT -  56683   1.09% |  LL -  31649   0.61% |  EW -  24054   0.46%
 EN -  55832   1.07% |  EL -  31405   0.60% |  DI -  24030   0.46%
 ON -  55755   1.07% |  SH -  30650   0.59% |  CO -  23975   0.46%
 EA -  55459   1.06% |  OT -  30566   0.59% |  EE -  23940   0.46%
 NT -  54694   1.05% |  TT -  30218   0.58% |  MA -  23817   0.46%
 ST -  54195   1.04% |  RO -  29790   0.57% |  EM -  23453   0.45%
 HI -  53885   1.03% |  DE -  29619   0.57% |  AI -  22856   0.44%
 NG -  49388   0.95% |  TA -  28744   0.55% |  UT -  22840   0.44%
 IS -  49156   0.94% |  DT -  28373   0.54% |  WI -  22502   0.43%
 IT -  48057   0.92% |  RI -  28017   0.54% |  CE -  22365   0.43%
 AS -  45974   0.88% |  WA -  26889   0.52% |  OW -  22174   0.43%
 OR -  45043   0.86% |  WH -  26749   0.51% |  CH -  22152   0.42%
 ET -  42573   0.82% |  HO -  26702   0.51% |  RS -  21231   0.41%

Most common trigrams including space (sample includes 6442494 trigrams)
 TH - 125714   1.95% |  WH -  24883   0.39% | OR  -  17357   0.27%
HE  - 101821   1.58% | RE  -  24297   0.38% | ME  -  17309   0.27%
THE -  98530   1.53% |  A  -  23513   0.36% | E H -  17282   0.27%
ED  -  53080   0.82% | E S -  23064   0.36% | D A -  16997   0.26%
ND  -  51591   0.80% | HAT -  22861   0.35% |  SH -  16490   0.26%
 AN -  50095   0.78% | ON  -  22656   0.35% | FOR -  16426   0.25%
AND -  48312   0.75% | E A -  22344   0.35% | UT  -  16237   0.25%
 TO -  40128   0.62% |  BE -  22023   0.34% | S T -  16139   0.25%
NG  -  39647   0.62% | N T -  21385   0.33% | IT  -  15953   0.25%
 HE -  39426   0.61% | HIS -  20975   0.33% | ERE -  15807   0.25%
ER  -  38873   0.60% | T T -  20809   0.32% |  SA -  15659   0.24%
ING -  38182   0.59% |  WA -  20721   0.32% |  IT -  15521   0.24%
TO  -  37868   0.59% |  YO -  20708   0.32% | OU  -  15459   0.24%
 OF -  34439   0.53% | YOU -  20678   0.32% |  FO -  15241   0.24%
AT  -  33811   0.52% | E W -  19929   0.31% | AN  -  15178   0.24%
OF  -  32699   0.51% |  NO -  19878   0.31% | WAS -  15122   0.23%
IS  -  29806   0.46% | EN  -  19830   0.31% |  RE -  15029   0.23%
D T -  28343   0.44% |  CO -  19722   0.31% | E C -  15001   0.23%
 IN -  28313   0.44% |  WI -  19434   0.30% |  ON -  14957   0.23%
 HI -  26851   0.42% | THA -  19227   0.30% | TH  -  14891   0.23%
 HA -  26660   0.41% | LL  -  19094   0.30% |  MA -  14752   0.23%
E T -  26459   0.41% | ES  -  18196   0.28% | AD  -  14338   0.22%
AS  -  26232   0.41% |  I  -  18192   0.28% | D H -  14309   0.22%
HER -  26208   0.41% | LY  -  17917   0.28% | E O -  14113   0.22%
IN  -  25271   0.39% | S A -  17434   0.27% | VE  -  14022   0.22%

Most common trigrams not including space (sample includes 5215930 trigrams)
THE - 104376   2.00% | VER -  12279   0.24% | ESA -   9302   0.18%
AND -  48638   0.93% | TER -  12274   0.24% | EVE -   9271   0.18%
ING -  38500   0.74% | ALL -  12021   0.23% | NCE -   9249   0.18%
HER -  30219   0.58% | ION -  11289   0.22% | EDA -   9239   0.18%
THA -  24760   0.47% | FTH -  11247   0.22% | AID -   9213   0.18%
HAT -  23177   0.44% | STH -  11210   0.21% | HIN -   9203   0.18%
HIS -  21322   0.41% | OFT -  11144   0.21% | NDT -   9190   0.18%
YOU -  20873   0.40% | HAD -  11113   0.21% | HEN -   9184   0.18%
ERE -  20173   0.39% | REA -  11110   0.21% | BUT -   9178   0.18%
DTH -  18382   0.35% | EST -  10757   0.21% | OME -   9149   0.18%
ENT -  17684   0.34% | ERS -  10698   0.21% | ILL -   9120   0.17%
ETH -  16638   0.32% | GHT -  10475   0.20% | AST -   9111   0.17%
FOR -  16484   0.32% | ESS -  10280   0.20% | RTH -   9067   0.17%
NTH -  16221   0.31% | HIM -  10191   0.20% | OUL -   8901   0.17%
THI -  15782   0.30% | EAR -  10173   0.20% | ATT -   8848   0.17%
SHE -  15440   0.30% | EAN -   9983   0.19% | STO -   8836   0.17%
WAS -  15277   0.29% | AVE -   9720   0.19% | SAI -   8753   0.17%
HES -  14937   0.29% | ONE -   9672   0.19% | ATH -   8683   0.17%
ITH -  14829   0.28% | HEC -   9606   0.18% | OUN -   8664   0.17%
TTH -  14454   0.28% | TIN -   9590   0.18% | ERT -   8579   0.16%
OTH -  14352   0.28% | RES -   9485   0.18% | SAN -   8556   0.16%
INT -  13802   0.26% | HEW -   9480   0.18% | HOU -   8465   0.16%
NOT -  13411   0.26% | ONT -   9445   0.18% | OUR -   8460   0.16%
WIT -  13084   0.25% | ATI -   9437   0.18% | OUT -   8436   0.16%
EDT -  12922   0.25% | HEM -   9363   0.18% | HEA -   8393   0.16%
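
The "not including space" tables contain entries such as DTH, FTH and NTH, which do not occur inside ordinary English words. One plausible reading, stated here as an assumption, is that spaces were removed and a sliding window was run over the joined letters, so that n-grams can straddle word boundaries ("AND THE" yields DTH). The sketch below counts overlapping n-grams either way. The names ngram_counts and top_rows are illustrative, and the handling of the very start and end of the text is a guess that may not reproduce the exact sample sizes above.

```python
import re
from collections import Counter

def ngram_counts(text, n, keep_space=True):
    """Count overlapping n-grams in the normalized text."""
    # Assumption: words are maximal runs of A-Z after upper-casing.
    words = re.findall(r"[A-Z]+", text.upper())
    # keep_space=True joins words with one space; False glues them together,
    # so a window can straddle a word boundary.
    stream = (" " if keep_space else "").join(words)
    return Counter(stream[i:i + n] for i in range(len(stream) - n + 1))

def top_rows(counts, k):
    """Return the k most common n-grams as (ngram, count, percent) rows."""
    total = sum(counts.values())
    return [(g, c, 100.0 * c / total) for g, c in counts.most_common(k)]

sample = "and the end of the road"
for gram, count, pct in top_rows(ngram_counts(sample, 3, keep_space=False), 3):
    print(f"{gram} - {count:6d} {pct:6.2f}%")
```

Calling ngram_counts(sample, 2) or ngram_counts(sample, 3) with the default keep_space=True gives the "including space" variants, where entries such as " T" or "D T" mark word boundaries.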