How would one go about finding the relative frequencies of two-letter combinations in different languages*, ideally even distinguishing between uppercase and lowercase letters? I'm essentially looking for a list that would say that an "r" is (I'm making this up for the sake of illustration) 13% likely to be followed by an "e" in English, but only 2% likely to be followed by a "k".
I'm aware there is no Final Verdict to be found on this, but there must be some approximate data to work with?
* I'm most interested in German and English for the moment.
A quick-ish web search has brought up charts of relative single-letter frequencies in different languages, as well as word frequency lists, but not a lot about pairs of letters. Wikipedia has a list of bigram frequencies in the English language, but it is single-case and lists only the most common combinations, while I'm looking for something more in the vein of a full "pairing" frequency table.
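To make concrete what kind of table I mean, here is a minimal sketch in Python of how it could be computed from any body of text. The function name `follower_frequencies` and the sample sentence are made up for illustration, and the result is only as representative as the text fed into it. Case is preserved, so "R" and "r" get separate rows, and pairs that span a space or punctuation mark are skipped.

```python
from collections import Counter, defaultdict


def follower_frequencies(text):
    """Return {first_letter: {next_letter: share}} for adjacent letter pairs.

    Hypothetical helper for illustration: the shares in each row sum to 1
    and are relative to the counted letter-letter pairs only.
    """
    pair_counts = defaultdict(Counter)
    for first, second in zip(text, text[1:]):
        # Keep only letter-letter pairs; case is preserved on purpose.
        if first.isalpha() and second.isalpha():
            pair_counts[first][second] += 1

    table = {}
    for first, followers in pair_counts.items():
        total = sum(followers.values())
        table[first] = {second: n / total for second, n in followers.items()}
    return table


# Made-up sample; a real table would need a large corpus per language.
sample = "Der Regen fällt, the rain keeps falling"
table = follower_frequencies(sample)
for second, share in sorted(table["r"].items(), key=lambda kv: -kv[1]):
    print(f"r -> {second}: {share:.0%}")
```

Because pairs across word boundaries are ignored, a letter that mostly ends words will have its row based on few pairs; counting a space as a "letter" would be one way to capture that instead.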
Any good, reasonably reliable references/literature you guys can recommend?
I'm done looking at wonky charts that don't say what data they're based on…
Of course, please redirect me if this has been discussed before. I couldn't find anything, but then the search has been hiccuping.