Learning is compression.
Memorization is not.
In Zone 03, Q13, you measured the entropy of your own text. That number was a real-time estimate of Kolmogorov complexity. You were using his definition without knowing his name.
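The meter's internals are not described here, but a minimal sketch of one plausible version, character-level Shannon entropy in Python, looks like this:

```python
import math
from collections import Counter

def entropy_bits_per_char(text: str) -> float:
    """Shannon entropy of the text's own character distribution, in bits per character."""
    counts = Counter(text)
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

print(entropy_bits_per_char("ababababababab"))                     # ~1.0: highly predictable
print(entropy_bits_per_char("the quick brown fox jumps over it"))  # higher: more surprise per character
```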
Two sequences. Same length. Write the shortest rule you can that generates each. Watch which one resists.
For one of them, your rule is shorter than the sequence. The pattern compressed. That sequence is learnable.
For the other, no rule shorter than the sequence exists. It can only be memorized.
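You can rerun the exercise crudely with a general-purpose compressor standing in for the shortest rule you can write. The two sequences below are illustrative stand-ins, not the originals, and the compressed size is only an upper bound on the true shortest description:

```python
import zlib
import secrets

length = 100_000

patterned = bytes(i % 7 for i in range(length))   # generated by a one-line rule
random_data = secrets.token_bytes(length)         # no generating rule at all

# Compressed size stands in for "length of the shortest rule";
# it is an upper bound, never the true Kolmogorov complexity.
print("patterned:", len(zlib.compress(patterned, 9)), "bytes")    # a few hundred bytes
print("random:   ", len(zlib.compress(random_data, 9)), "bytes")  # ~100,000 bytes, no gain
```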
Kolmogorov defined complexity in 1965 as the length of the shortest description of a thing.
A truly random sequence cannot be compressed: its shortest description is itself. A learnable pattern can always be compressed.
This is why neural networks work. They are compression engines, finding short descriptions of the structure in their training data. A model that has truly learned can capture a million examples in far fewer parameters than one that merely memorized them.
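A toy illustration, with an assumed linear rule rather than a real network: a million examples generated by a simple law collapse into two fitted numbers, while memorization keeps every pair.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1_000_000

# A million examples produced by a simple underlying rule plus a little noise.
x = rng.uniform(-10, 10, n)
y = 3.0 * x + 2.0 + rng.normal(0.0, 0.1, n)

# "Learning": the whole dataset collapses into two fitted parameters.
slope, intercept = np.polyfit(x, y, 1)
learned_bytes = 2 * 8                      # two float64 values

# "Memorizing": keep every example verbatim.
memorized_bytes = x.nbytes + y.nbytes      # roughly 16 MB of float64 pairs

print(f"learned rule: y ~ {slope:.3f}*x + {intercept:.3f} ({learned_bytes} bytes)")
print(f"lookup table: {memorized_bytes:,} bytes")
```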
Kolmogorov worked in Moscow under Stalin. He navigated ideological pressure on mathematics while axiomatizing probability theory, founding the statistical theory of turbulence, and inventing algorithmic complexity.
In 1965 he defined algorithmic complexity: the length of the shortest computer program that can generate a given string. Truly random data has maximum Kolmogorov complexity: it cannot be compressed. Learnable patterns have low complexity: they can be described in fewer rules than examples.
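The definition can be read almost literally in code. The short generator below is an illustrative stand-in for "the shortest computer program"; Kolmogorov's formal version fixes a universal machine and is uncomputable in general:

```python
import secrets

# A patterned string: the short program that prints it bounds its complexity.
patterned_program = 'print("01" * 500_000)'
patterned_output_chars = 2 * 500_000

# A random string of the same length: the only program we can exhibit
# is one that carries the string inside itself, character for character.
random_string = "".join(secrets.choice("01") for _ in range(1_000_000))
random_program_chars = len('print("")') + len(random_string)

print(f"patterned: program of {len(patterned_program)} chars generates {patterned_output_chars:,} chars")
print(f"random:    program of {random_program_chars:,} chars generates {len(random_string):,} chars")
```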
This distinction is the difference between memorization and learning.
The entropy meter in Zone 03 of The Inquiry estimates Kolmogorov complexity in real time. You were using his definition before you knew his name.