Breaking Caesar Ciphers with Frequency Analysis

Learn how frequency analysis can crack a Caesar cipher by comparing letter frequencies in the ciphertext to expected frequencies in the target language.

Caesar Cipher Variants

Detailed Explanation

Frequency Analysis Attack

Frequency analysis is a more elegant method of breaking the Caesar cipher than brute force. Instead of trying all shifts, you analyze the statistical distribution of letters in the ciphertext.

English Letter Frequencies

In standard English text, letters appear with predictable frequencies:

E: 12.7%  T: 9.1%   A: 8.2%   O: 7.5%
I: 7.0%   N: 6.7%   S: 6.3%   H: 6.1%
R: 6.0%   D: 4.3%   L: 4.0%   C: 2.8%

The Attack Method

  1. Count frequencies: Tally the occurrence of each letter in the ciphertext
  2. Find the most common: The most frequent letter is likely the encryption of E
  3. Calculate the shift: If the most common ciphertext letter is H, then H − E = 3, suggesting a shift of 3
  4. Verify: Apply the derived shift and check if the result is readable

Worked Example

Ciphertext: "WKH TXLFN EURZQ IRA MXPSV RYHU WKH ODCB GRJ"

Letter frequencies in this ciphertext:

  • Most common: R (appears 3 times)
  • Second: H, W, K (appear 2–3 times each)

If R corresponds to E: R(17) − E(4) = 13... but trying shift 13 gives gibberish. If W corresponds to T: W(22) − T(19) = 3... trying shift 3 gives:

"THE QUICK BROWN FOX JUMPS OVER THE LAZY DOG" — success!

Limitations

Frequency analysis requires:

  • Sufficient text length: Short messages may not have representative frequencies
  • Known language: You need to know (or guess) what language the plaintext is in
  • Single-alphabet cipher: Does not directly apply to polyalphabetic ciphers like Vigenère

Historical Significance

Al-Kindi, a 9th-century Arab polymath, first documented frequency analysis in his treatise on cryptography. This technique remained the primary codebreaking method for nearly a thousand years, until polyalphabetic ciphers were developed in the Renaissance.

Use Case

Frequency analysis is a core topic in cryptography education and competitive programming. It teaches statistical thinking, pattern recognition, and demonstrates why simple substitution ciphers fail against even basic statistical attacks.

Try It — ROT13 / Caesar Cipher

Open full tool