'Cause a fear of disrepair can grow overblown and trite
And surely less sustainable than some ailment more defined
So give me tribulations, so give me trials
And errors, and errors, and errors, causes, and "whys?"

What in the...

I returned to an old textbook on Error Correcting Codes¹ due to some bafflement caused by one of the explanations for the simple ISBN-10 algorithm for error detection and tripped and fell down a rabbit hole.

Pragmatically, the motivation for the algorithms discussed in this post centers around error detecting codes (but not correcting codes, unlike the Galois code covered in this post² which goes into more depth on the nature of both types of codes and some of the requisite Group Theory basics I needed to grokk the Galois).

I stumbled across an arcane piece of code at work that employed a related algorithm to validate a debit card number before passing it along to a third party payment processing network since it's both faster and more affordable to attempt this on the "client" side before passing it along to e.g. Visa who could munge it over for hundreds of milliseconds before returning an error and an invoice t'boot.

In this post I'll briefly summarize:

The ISBN-10 code
The aforementioned Luhn algorithm widely used for computing a check-digit for debit cards
A superior code developed (independently) by both Jacobus Verhoeff³ and Peter Gumm⁴ which detects more errors than Luhn's imperfect-but-simpler method
- This is where I'll spend most of the words (and symbols) allotted for this post, providing an alternate geometric interpretation of the proof(s) provided by the original authors

ISBN-10

Cumulative Sum of Digits

As mentioned, Todd Moon¹ describes the International Standard Book Number as a sequence of digits^† whose sum-of-the-sum of digits must be equal to $0 \mod 11$ and provides following tabular example of the confusing computation:

\underbrace{0}_{\text{country}} - \underbrace{20}_{\text{publisher}} - \underbrace{1 - 36186}_{\text{book \#}} - \underbrace{8}_{check}

digit	cum. sum	cum. sum of sum
0	0	0
2	2	2
0	2	4
1	3	7
3	6	13
6	12	25
1	13	38
8	21	59
6	27	86
8	35	121

The final sum is $121 = 0 \mod 11$ .

† not restricted to strictly numerical digits since summation $\mod 11$ requires a digit to represent $10$ , so this code also employs the digit $X$ . The same method can be generalized to other bases as well such as alphanumeric sequences.

Weighted Sum of Digits

Conventional implementations of ISBN-10 describe a different process relying on a weighted single sum of the digits, assigning descending weights to each index:

\begin{aligned} ISBN_{10}(d) &= \Big[\sum_{i=0}^{9} (10-i) \cdot d_i\Big] + c = (10 \cdot 0) + (9 \cdot 2) + \dots + (2 \cdot 6) + c &= 113 + c \end{aligned}

for some minimum positive value of $c$ constrained by:

ISBN_{10}(d) \mod 11 = 0

yielding an expression that indicates a valid check-digit is $c = 8$ :

\begin{aligned} ISBN_{10}(d) &= 113 + c \\ &= 113 + 8 = 121 \mod 11 = 0 \checkmark \end{aligned}

Curse the Senior Engineer and their Lizard Brains 🧠

It wasn't intuitively obvious at first how weighted modular arithmetic was equivalent to Moon's sum of a sum of digits, but we can observe that in the erstwhile cumulative summation, the first digit is accumulated in the sum 10 times, the 2nd digit 9 times, and so on – effectively weighting each digit according to its index in the same way the straightforward weighted summation does. Here's a sample implementation of the cumulative summation which neatly illustrated this property for me courtesy of the homie Will who's brain is sufficiently wrinkled to see the matrix:

def psychoDoubleCumSum(isbn: String): Int =
    isbn.filter(_ != '-')
        .map(_.toString.toInt)
        .foldLeft((0, 0)) { case (acc, next) =>
            val (middle, next) = acc
            (middle + next, right + middle + next)
        }._2

So, charged with the primitive ISBN-10 check-digit algorithm, we can approach the self-checkout at the library, and if a book (or any item) is scanned or otherwise processed for some arbitrary end, and it doesn't validate according to this check, we know that something's gone wrong. In the exempli bibliothecae, we're kind of just screwed (what're you gonna do, scan the book again 5head?), but there are myriad scenarios where error detection proves useful and where we might have more agency.

The Luhn Algorithm

Luhn's Algorithm is another basic modulo check-digit error detecting code developed by Hans Peter Luhn for IBM in 1960.⁵ At a high level, this algorithm works on a numerical sequence $S$ of length $|S|$ to generate an $|S| + 1$ th check-digit by doubling every other digit, summing over the residual digits and adding the smallest positive value $c$ such that the resultant residual sum $\mod 10 = 0$ :

(10 - \sum_i S_i') \mod 10 = c

It is widely used to validate credit/debit card numbers (a scenario where a user might misread or fatfinger a digit).

For example, given hypothetical credit card with a Physical Account Number (PAN) of $S = 7992739871$ , we compute the following $S'$ :

$i$	0	1	2	3	4	5	6	7	8	9
$S_i$	7	9	9	2	7	3	9	8	7	1
$S_i'$	7	$\color{red} 18$	9	$\color{red} 4$	7	$\color{red} 6$	9	$\color{red} 16$	7	$\color{red} 2$
$S_i'$	7	$\color{red} 1 + 8$	9	$\color{red} 4$	7	$\color{red} 6$	9	$\color{red} 1 +6$	7	$\color{red} 2$

Doubling every other digit, splitting those digits greater than 9 into two additional digits, and summing modulo ten we get:

\begin{aligned} (10 - \sum_i {S_i}') \mod 10 &= c \\ \implies \sum_i {S_i}' &= 67 \\ (10 - 67) \mod 10 &= c \\ 3 &= c \end{aligned}

It's worth noting that the implementation of modulo for negative number varies from language to language 🙃,⁶ so be careful and leverage associative operations to avoid the possibility of mistake. For example, we can instead prefer

10 - (\sum_i {S_i}' \mod 10) = c

performing modular summation before subtraction to guarantee that we avoid any zane (fairly assuming $S_i \in \N_0| \forall S_i \in S$ ). Associating operations to avoid calamity in this way will prove important later.

Shortcomings of the Luhn Algorithm

I once again refer back to my lengthier post about Error Correcting Codes which details more properties of such codes, but to summarize – Luhn's code detects:

All single digit errors
Most –but not all– adjacent transpositions provided one of the numbers involved isn't zero
- This "most" here is the crux of this post. Verhoeff set out to obliterate all missed adjacent transposition errors

Examples

As we can see, the Luhn code is susceptible to missing transpositions involving a 0-digit:

Let $S = x\;y\;9\;0\;z$ , and an erroneous transposition of $S$ be $S_{bad} = x\;y\;\color{red}0\;9\color{black}\;z$ . The every-other-digit-doubled versions of these strings will be:

\begin{aligned} S &= x\;y\;9\;0\;z \\ Luhn(S) &= \sum_i S_i' \\ S' &\leftarrow x + (2 \cdot y) + 9 + (2 \cdot 0) + z \\ &= x + 2y + 9 + z \\ &= x + y + z + 11 \\ \\ S_{bad} &= x\;y\;\color{red}0\;9\;\color{black}z \\ Luhn(S_{bad}) &= \sum_i S_{{bad}_i}' \\ S_{bad}' &\leftarrow x + (2 \cdot y) + \color{red}0\color{black} + (2 \cdot \color{red}9\color{black}) + z \\ &= x + (2 \cdot y) + 0 + \color{red}1\color{black} + \color{red}8\color{black} + z \\ &= x + 2y + \color{red}9\color{black} + z \\ &= x + y + z + 11 \\ &\implies Luhn(S) = Luhn(S_{bad}) \end{aligned}

which means that a payment request in the common example of credit card validation will fail after hundreds/thousands of milliseconds rather than a dozen or so for the non-edge cases of transposition errors. Good thing "0" and "9" are nowhere near on the keyboard 🙃.⁷

In his own derivation of a superior check-digit generation technique, Gumm notes that:

Various check-digit methods have been designed for the decimal number system. Each method is able to detect all "single-error mistakes," but they fail to detect all transposition errors. At least one (and often not more than one) erroneous transposition is undetectable with those methods.

Errors

To motivate his own publication (which is really just an blog post ~~peer reviewed~~ gatekept by the establishment if you think about it), Verhoeff researched the frequency and category of all the types of errors which can occur in a code. Verhoeff identified the following types of errors:

transcription: a straight up typo
- i.e. "Peter Gumm" -> "🅱️eter Gumm"
transposition: two or more digits swapping positions
- i.e. "Peter Gumm" -> "Geter Pumm"
twin transcription errors: a double fatfinger
- i.e. "Peter Gumm" -> "Peter Gunn"
jump transpositions: a transposition of non-adjacent letters
- i.e. "Peter Gumm" -> "Peter muGm"
phonetic errors: using a prohibited character
- i.e. "Peter Gumm" -> "Peter Gu44" (assuming a strictly alphabetic domain)
jump twin errors: non-adjacent double typos
- i.e. "Peter Gumm" -> "Patar Gumm"

As well as their observed frequencies in real life scenarios:

digits in error	error class	frequency (%)
1	transcription	79.05
2	transpositions	10.21
2	twins	0.55
2	phonetic	0.49
2	other adjacent	1.92
2	jump transpositions	0.82
2	jump twins	0.29
2	other jumps	0.36
2	other	0.81
3	(errors of the dyslexic variety)	1.40
4	bruh	0.97
5	BRUH	1.81
6	BRUHHHH	1.34

From this table, we can see how Gumm is able to make the at first lofty-seeming claim that every single-error check-digit code immediately detects about 90% of all format errors, since about 90% of common errors are either single digit transcriptions (~80%) or mundane transpositions (~10%) for which – of the two digits, one will necessarily be detected.

Relying on Verhoeff's research which showed that the vast majority of real errors are effectively detected by Luhn's algorithm, Gumm also notes that –despite how tempting it may be to just tack on additional check-digits to capture the remaining 2% of transposition errors–

The use of two check-digits ... is not advised since the previously mentioned studies have also shown that the absolute number of errors that occur roughly doubles when the number of digits increases by two. ... We therefore concentrate on methods to detect single-digit errors and transposition errors.

That is – each additional check-digit increases the literal and statistical surface area for a fatfingered input.

Gumm asserts that all error detecting codes of the form $\mod 2k$ are flawed for this reason.

Criteria for an Improved Error Detecting Code

Gumm introduces the following preqrequisites for an improved check-digit code. Borrowing his notation which I'll discard shortly, he defines a check-digit method base $r$ as a set $D$ with $|D| = r$ together with a family $F \overset{\Delta}{=} (f_1, f_2, ..., f_n, ... )$ of operations $f_n : D^n \rightarrow D$ . An operation $f_k$ on the elements $d_k, ..., d_1 \in D$ is interpreted as the check-digit for a number with decimal representation $d_k, ..., d_1$ in base $r$ . To improve on $\mod 2k$ methods, the following axioms must satisfied for all $n \in \N$ :

Axiom 1: Typos are Detected

\begin{aligned} f_n(x_n, ..., x_i,...,x_1) &= f_n(x_n, ..., \color{red}{x_i'}\color{black},...,x_1) \\ &\implies x_i = \color{red}{x_i'}\color{black} \end{aligned}

If the check-digit calculation is identical for two sequences involving $x_i, \color{red}{x_i'}\color{black}$ , then the two "differing" digits must be not actually be different. This axiom guarantees single-error detection.

Axiom 2: Transpositions are Detected

\begin{aligned} f_n(x_n, ..., x_i, \color{red}{x_{i-1}}\color{black},...,x_1) &= f_n(x_n, ..., \color{red}{x_{i-1}}\color{black}, x_i, ... ,x_1) \\ &\implies \color{red}{x_{i-1}}\color{black} = x_i \end{aligned}

If two digits are transposed, and the check-digit remains equivalent, then the two digits must be identical.

Axiom 3: Transpositions Involving the Check-Digit and its Left Neightbor Are Still Detected

\begin{aligned} f_n(x_n, ..., x_2, f_n(x_n, ..., x_1)) &= x_1 \\ &\implies f_n(x_n, ..., x_1) &= x_1 \end{aligned}

The Verhoeff-Gumm Algorithm

With these criteria in mind, Verhoeff and Gumm both sought out to develop a better code which could detect 100% of all single-digit and transposition errors. Verhoeff proved the existence of such a code, and Gumm provided a concrete permutation of digits in the Dihedral Group of order 10 ( $D_5$ ) which satisfy these criteria. Let's take a look at how we might go about trying to do the same. We'll use a simple 4-digit example throughout: $S = 3529$ and determine a means of generating a 5th digit $c$ such that the whole code is impervious to the two aforementioned classes of errors.

Looking at the arithmetic toolbox, we might reach for simple operations such as addition, subtraction, multiplication, or division. However, by themselves, each of these operations fails for either single-digit or transposition errors, or both (lookin' at you division):

operation / error	code	detected?
$+$	$3 + 5 + 9 + 2 = 19$
single-digit	$3 + \color{red}7\color{black} + 9 + 2 = \color{red}21\color{black}$	✅
transposition	$3 + \color{red}9\color{black} + \color{red}5\color{black} + 2 = 19$	❌ addition is commutative
$\times$	$3 \times 5 \times 9 \times 2 = 270$
single-digit	$3 \times \color{red}7\color{black} \times 9 \times 2 = \color{red}378\color{black}$	✅
transposition	$3 \times \color{red}9\color{black} \times \color{red}5\color{black} \times 2 = 270$	❌ multiplication is commutative
$-$	$3 - 5 - 9 - 2 = -13$
single-digit	$3 - \color{red}7\color{black} - 9 - 2 = -15$	✅
transposition	$\color{red}5\color{black} - \color{red}3\color{black} - 9 - 2 = \color{orange}-9\color{black}$	❌ subtraction is commutative^†
$\div$	$3 / 5 / 9 / 2 = 0.0\overline{33}$
single-digit	$3 / \color{red}7\color{black} / 9 / 2 = 0.0238...$	✅ but bad in general since we'd like $c \in \N_0$
transposition	not even gonna bother

† subtraction fails to detect transposition errors unless a distinct, adjacent pair not including the leading digit is transposed: e.g.

\begin{aligned} 3 - 5 - 9 - 2 &= 13 \\ \color{red}5\color{black} - \color{red}3\color{black} - 9 - 2 &= \color{red}-9\color{black} \\ 3 - 5 - \color{red}2\color{black} - \color{red}9\color{black} &= -13 \\ \end{aligned}

which is a false negative!

Consider: the Square

So, we'll have to plumb the depths of our operations for something less trivial. Let's reach for geometry. Consider the square and the things we can do to it.

If we take a square, we can rotate it counter clockwise about its center by 90º yielding a new square:

0 $\quad \overset{90º}{\circlearrowleft} \quad$ 1

we can also flip it along its vertical axis yielding yet another new square:

0 $\quad \overset{90º}{\circlearrowleft} \quad$ 1 $\quad \leftrightarrows \quad$ 7

What happens if we flip first, then rotate?

0 $\quad \leftrightarrows \quad$ 4 $\quad \overset{90º}{\circlearrowleft} \quad$ 5

BAHA yet another new square – like a caveman discovering fire, we've defined two non-commutative, primitive operations on our square: $Rotate$ and $Flip$ . From these, we can get 8 distinct squares:

$S_0$ : 0: our arbitrarily chosen progenitor
$S_1$ : 1: one 90º rotation from $S_0$
$S_2$ : 2: two 90º rotations from $S_0$
$S_3$ : 3: three 90º rotations from $S_0$ (note that four 90º rotations from $S_0$ is $S_0$ )
$S_4$ : 4: $S_0$ flipped about its vertical axis
$S_5$ : 5: $S_1$ flipped about its vertical axis
$S_6$ : 6: $S_2$ flipped about its vertical axis
$S_7$ : 7: $S_3$ flipped about its vertical axis

Each square can also be thought of as an operation rather than just the element, with $S_0$ being the identitive element (a 0º rotation with no flips leaves any square unchanged), $S_1$ being a single rotation (I'll drop the directional notation since a -90º rotation is equivalent to three +90º degree rotations which is $S_3$ , so if we ever wanted to perform, say, an $R_{-1}$ we'd instead do $R_{3}$ ), and each of the flips $S_{4-7}$ are just combinations of some amount of rotations before flipping about the axis (implicitly the vertical axis since we can make the horizontal axis the vertical axis via rotation).

Using our Square, we can potentially map 8 distinct digits to our arrangements of the Square which are formed by non-commutative primitive operations. However, modern society (since the 7th century Indian Vedic mathematicians pioneered the base 10 system commonly used today) has embraced a decimal system rather than an octal system (and even then, sometimes 10 digits still ain't enough, see ISBN-10, and written language, and the languid existence of culture), so we're going to need a couple more squares...

Consider: The Pentagon

Luckily, we can also generalize our same primitive operations to the Pentagon, which has 5 distinct rotations of 72º each, and 5 axes of symmetry, which conveniently gives us 10 arrangements (See the Appendix on $D_5$ for more details).

Define the elements of our set of Pentagons (which I'll refer to as $D_5$ for no reason in particular😈) to be the following:

$P_0$ : 0: an arbitrarily chosen, promethean identitive element from which we'll anchor the definitions of all our other Pentagons
$P_1$ : 1: the 1st rotation by 72º
$P_2$ : 2: the 2nd rotation by 144º
$P_3$ : 3: the 3rd rotation by 216º
$P_4$ : 4: the 4th rotation by 288º
$P_5$ : 5: the 1st flip, along the blue/red axis (axis 1) of $P_0$
$P_6$ : 6: the 2nd flip, along the yellow/green axis (axis 2) of $P_0$
$P_7$ : 7: the 3rd flip, along the red/orange axis (axis 3) of $P_0$
$P_8$ : 8: the 4th flip, along the green/blue axis (axis 4) of $P_0$
$P_9$ : 9: the 5th flip, along the orange/yellow axis (axis 5) of $P_0$

We can interpret each pentagon as a digit according to its index subscript.

How to use it tho?

Returning to our example string of $S = 3592$ for which we still desperately need a check-digit and interpretting it as a sequence of pentagons, we get

$S =$ 3 $\quad ? \quad$ 5 $\quad ? \quad$ 2 $\quad ? \quad$ 9

But even armed with the knowledge that our Shapes themselves can be interpreted as operations, how are we to combine them with either operation?

Composition

Composition babyyyy! We can treat each Pentagon as a simple algebraic composition of functions akin to $f(g(x)) = f \circ g$ :

$S =$ 3 $\quad \circ \quad$ 5 $\quad \circ \quad$ 2 $\quad \circ \quad$ 9

In general, Group Theoretic operations compose from right to left. The Pentagonal analog for a Luhnean check-digit is whatever Pentagon we start the chain of operations on (so, still the rightmost digit) that results in $P_0 = 0$

So what Pentagon can we prepend the composition with such that our result is $P_0$ ?

Computing a Pentagonal Check-Digit

Though our family of operations $(R, F)$ isn't commutative, it is associative, so we can group the operations with parentheses however we please. Knowing that a sequence of Pentagons is equivalent to the composition of operations on a starting Pentagon, and that each operation is itself a Pentagon, we can group and reduce our previous equation:

\begin{aligned} S||c = P_0 &= P_3 \circ P_5 \circ P_2 \circ P_9 \circ P_c\\ &= R_3(F_1(R_2(P_9)))(P_c) \\ &= R_3(F_1(P_6))(P_c) \\ &= R_3(P_4)(P_c) \\ &= P_2 \circ P_c \\ \end{aligned}

We introduce the inverse of $P_2$ to find the Pentagon corresponding to our check-digit:

\begin{aligned} P_0 &= R_2(P_c) \\ R_2^{-1}(P_0) &= P_c \\ R_3(P_0) &= P_c \\ P_3 &= P_c \\ \end{aligned}

Intuitively, the inverse of of $R_2$ is $R_3$ since that's 5 rotations == 0 rotations, so our check-digit is $P_3$ .

We can verify this mechanism is resistant to transposition by swapping operations to see that the result of composing $S \circ P_c$ is no longer $P_0$ :

$S || \color{blue}P_c\color{black} =$ 3 $\quad \circ \quad$ 5 $\quad\circ \quad$ 2 $\quad \circ \quad$ 9 $\quad \circ \quad$ 3 $\quad = \quad$ 0

\begin{aligned} &= P_3 \circ P_5 \circ P_2 \circ P_9 \circ \color{blue}P_3\color{black} = P_0\\ &\overset{?}{=} P_3 \circ P_5 \circ \color{red}P_9\color{black} \circ \color{red}P_2\color{black} \circ P_3 = P_0 \\ &\neq P_3 \circ P_5 \circ P_9 \circ P_2 \circ P_3 = \color{red}P_4\color{black} \\ \end{aligned}

Can we prove this works in general though? Or are there any edge cases (like adjacent 0,9 Pentagons that would cause trouble as in Luhn's algorithm?) for which we might accidentally wind up with a false positive green flag $P_0$ .

Proving Pentagonal Correctness

We can verify with a tabular illustration of all mappings of Pentagons to their corresponding elements under each operation – this is known as a Cayley Table.

$\circ$	$P_0$ 0	$P_1$ 1	$P_2$ 2	$P_3$ 3	$P_4$ 4	$P_5$ 5	$P_6$ 6	$P_7$ 7	$P_8$ 8	$P_9$ 9
$P_0$ 0	$P_0$ 0	$P_1$ 1	$P_2$ 2	$P_3$ 3	$P_4$ 4	$P_5$ 5	$P_6$ 6	$P_7$ 7	$P_8$ 8	$P_9$ 9
$P_1$ 1	$P_1$ 1	$P_2$ 2	$P_3$ 3	$P_4$ 4	$P_0$ 0	$P_6$ 6	$P_7$ 7	$P_8$ 8	$P_9$ 9	$P_5$ 5
$P_2$ 2	$P_2$ 2	$P_3$ 3	$P_4$ 4	$P_0$ 0	$P_1$ 1	$P_7$ 7	$P_8$ 8	$P_9$ 9	$P_5$ 5	$P_6$ 6
$P_3$ 3	$P_3$ 3	$P_4$ 4	$P_0$ 0	$P_1$ 1	$P_2$ 2	$P_8$ 8	$P_9$ 9	$P_5$ 5	$P_6$ 6	$P_7$ 7
$P_4$ 4	$P_4$ 4	$P_0$ 0	$P_1$ 1	$P_2$ 2	$P_3$ 3	$P_9$ 9	$P_5$ 5	$P_6$ 6	$P_7$ 7	$P_8$ 8
$P_5$ 5	$P_5$ 5	$P_9$ 9	$P_8$ 8	$P_7$ 7	$P_6$ 6	$P_0$ 0	$P_4$ 4	$P_3$ 3	$P_2$ 2	$P_1$ 1
$P_6$ 6	$P_6$ 6	$P_5$ 5	$P_9$ 9	$P_8$ 8	$P_7$ 7	$P_1$ 1	$P_0$ 0	$P_4$ 4	$P_3$ 3	$P_2$ 2
$P_7$ 7	$P_7$ 7	$P_6$ 6	$P_5$ 5	$P_9$ 9	$P_8$ 8	$P_2$ 2	$P_1$ 1	$P_0$ 0	$P_4$ 4	$P_3$ 3
$P_8$ 8	$P_8$ 8	$P_7$ 7	$P_6$ 6	$P_5$ 5	$P_9$ 9	$P_3$ 3	$P_2$ 2	$P_1$ 1	$P_0$ 0	$P_4$ 4
$P_9$ 9	$P_9$ 9	$P_8$ 8	$P_7$ 7	$P_6$ 6	$P_5$ 5	$P_4$ 4	$P_3$ 3	$P_2$ 2	$P_1$ 1	$P_0$ 0

which looks like troglodytic vomit. A heatmap is easier to digest:

$\circ$	$P_0$	$P_1$	$P_2$	$P_3$	$P_4$	$P_5$	$P_6$	$P_7$	$P_8$	$P_9$
$P_0$	$P_0$	$P_1$	$P_2$	$P_3$	$P_4$	$P_5$	$P_6$	$P_7$	$P_8$	$P_9$
$P_1$	$P_1$	$P_2$	$P_3$	$P_4$	$P_0$	$P_6$	$P_7$	$P_8$	$P_9$	$P_5$
$P_2$	$P_2$	$P_3$	$P_4$	$P_0$	$P_1$	$P_7$	$P_8$	$P_9$	$P_5$	$P_6$
$P_3$	$P_3$	$P_4$	$P_0$	$P_1$	$P_2$	$P_8$	$P_9$	$P_5$	$P_6$	$P_7$
$P_4$	$P_4$	$P_0$	$P_1$	$P_2$	$P_3$	$P_9$	$P_5$	$P_6$	$P_7$	$P_8$
$P_5$	$P_5$	$P_9$	$P_8$	$P_7$	$P_6$	$P_0$	$P_4$	$P_3$	$P_2$	$P_1$
$P_6$	$P_6$	$P_5$	$P_9$	$P_8$	$P_7$	$P_1$	$P_0$	$P_4$	$P_3$	$P_2$
$P_7$	$P_7$	$P_6$	$P_5$	$P_9$	$P_8$	$P_2$	$P_1$	$P_0$	$P_4$	$P_3$
$P_8$	$P_8$	$P_7$	$P_6$	$P_5$	$P_9$	$P_3$	$P_2$	$P_1$	$P_0$	$P_4$
$P_9$	$P_9$	$P_8$	$P_7$	$P_6$	$P_5$	$P_4$	$P_3$	$P_2$	$P_1$	$P_0$

Cayley Tables are useful since they remove the need for guess and checking (or thinking whatsoever) about inverse operations: cells with $P_0$ entries indicate inverses. Additionally, we can detect transpositions by noting that e.g.

\begin{aligned} R_1 \circ F_1 &\neq F_1 \circ R_1 \\ P_1 \circ P_5 &\neq P_5 \circ P_1 \\ P_6 &\neq P_9 \\ \end{aligned}

If we fold the whole table along the major diagonal, we can note that most of the table has asymmetric results (highlighted in green), indicating that transpositions produce different $P_c$ check-digits (you're gonna have to flip your phone sideways for this one – sorry not sorry):

$\circ$	$P_1$	$P_2$	$P_3$	$P_4$	$P_5$	$P_6$	$P_7$	$P_8$	$P_9$
$P_0$	$P_1$ $P_1$	$P_2$ $P_2$	$P_3$ $P_3$	$P_4$ $P_4$	$P_5$ $P_5$	$P_6$ $P_6$	$P_7$ $P_7$	$P_8$ $P_8$	$P_9$ $P_9$
$P_1$		$P_3$ $P_3$	$P_4$ $P_4$	$P_0$ $P_0$	$P_9$ $P_6$	$P_5$ $P_7$	$P_6$ $P_8$	$P_7$ $P_9$	$P_8$ $P_5$
$P_2$			$P_0$ $P_0$	$P_1$ $P_1$	$P_8$ $P_7$	$P_9$ $P_8$	$P_5$ $P_9$	$P_6$ $P_5$	$P_7$ $P_6$
$P_3$				$P_2$ $P_2$	$P_7$ $P_8$	$P_8$ $P_9$	$P_9$ $P_5$	$P_5$ $P_6$	$P_6$ $P_7$
$P_4$					$P_6$ $P_9$	$P_7$ $P_5$	$P_8$ $P_6$	$P_9$ $P_7$	$P_5$ $P_8$
$P_5$						$P_1$ $P_4$	$P_2$ $P_3$	$P_3$ $P_2$	$P_1$ $P_4$
$P_6$							$P_1$ $P_4$	$P_2$ $P_3$	$P_3$ $P_2$
$P_7$								$P_1$ $P_4$	$P_2$ $P_3$
$P_8$									$P_1$ $P_4$
$P_9$

Alternatively, here's an image for mobile plebians.

The non-green cells are symmetric entries indicating undetected transpositions e.g. $P_3 \circ P_4 = P_4 \circ P_3 = P_2$ . In general, any combination of two rotations, or any operation with the identity $P_0$ has an insecured symmetry.

Blasted Symmetries

We can circumvent these blind spots by adding a pre-processing step to introduce further asymmetry across the table, just like how the Luhn algorithm doubles every other digit before summation, and ISBN utilizes index weights. We'll call our asymmetric "activation function" $\sigma$ .

We'll apply $\sigma$ to pairs of Pentagons that are about to be combined. We need to define our $\sigma$ s.t. for pairs $P_i, P_j$ appearing in a sequence:

\begin{aligned} \cdots \circ P_i &\circ P_j \circ \cdots \\ \sigma(P_i) \circ P_j &\neq \sigma(P_j) \circ P_i, \text{ unless } i = j \end{aligned}

Implying that $\sigma$ should produce a different check-digit for two sequence differing by a transposition.

To prove this property, we have to deviate from our colorful geometric fantasy land of Pentagons, and instead model our elements as germane integer tuples (this is how $D_5$ is conventionally tracted). Each Pentagon can be uniquely described by a tuple of the form:

P = (f, r) \begin{cases} f \in \{-1, 1\}, \\ r \in \{0, 1, 2, 3, 4, \} \end{cases}

where negative $f$ indicates a flip, and $r$ indicates how many rotations have been applied to $P$ . Our new exhaustive list of Pentagons is:

$P_0 = (1,0)$ : 0
$P_1 = (1,1)$ : 1
$P_2 = (1,2)$ : 2
$P_3 = (1,3)$ : 3
$P_4 = (1,4)$ : 4
$P_5 = (-1,0)$ : 5
$P_6 = (-1,1)$ : 6
$P_7 = (-1,2)$ : 7
$P_8 = (-1,3)$ : 8
$P_9 = (-1,4)$ : 9

We combine tupled Pentagons according to the formula:

\begin{aligned} P_i \circ P_j &= (\color{blue}f_i\color{black}, \color{blue}r_i\color{black}) \circ (\color{red}f_j\color{black}, \color{red}r_j\color{black}) \\ &= (\color{blue}f_i\color{black}\color{red}f_j\color{black}, \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}r_i\color{black}) \mod 5 \\ P_2 \circ P_8 &= (f_2, r_2) \circ (f_8, r_8) \\ &= (1, 2) \circ (-1, 3) \\ &= (1 \cdot -1, 1 \cdot 3 + 2) \\ &= (-1, 5) = (-1, 0) \\ &= P_5 \\ \end{aligned}

which checks out according to our Cayley Table! Note that we constrain our arithmetic $\mod 5$ to bound the number of rotations to the number of smmetrical axes in our polygon (where $D_5$ get the name). With our new (gross) algebraic model of our Pentagons, we can devise a $\sigma$ .

Bad Sigma

Intuitively, we might just try adding some constant $b$ and see if that works, or at least how it behaves:

\sigma(P) = \sigma((f, r)) = (f, r + b)

To verify, we just have to show that this sigma function detects transpositions:

\begin{aligned} \cdots \circ P_i &\circ P_j \circ \cdots \\ \sigma(P_i) \circ P_j &\neq \sigma(P_j) \circ P_i \\ (\forall P_i, P_j &\in S; i \neq j) \\ \sigma((\color{blue}f_i\color{black}, \color{blue}r_i\color{black})) \circ (\color{red}f_j\color{black}, \color{red}r_j\color{black}) &\neq \sigma((\color{red}f_j\color{black}, \color{red}r_j\color{black})) \circ (\color{blue}f_i\color{black}, \color{blue}r_i\color{black}) \\ (\color{blue}f_i\color{black}\color{red}f_j\color{black}, \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}r_i\color{black} + b) &\neq (\color{red}f_j\color{black}\color{blue}f_i\color{black}, \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}r_j\color{black} + b) \end{aligned}

We know that $\color{blue}f_i\color{red}f_j\color{black} = \color{red}f_j\color{blue}f_i$ for all values since arithmetic multiplication is commutative, so that will cancel the first element from both sides, leaving us to examine the second:

\begin{aligned} (\cancel{\color{blue}f_i\color{black}\color{red}f_j\color{black}}, \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}r_i\color{black} + b) &\neq (\cancel{\color{red}f_j\color{black}\color{blue}f_i\color{black}}, \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}r_j\color{black} + b) \\ \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}r_i\color{black} + \cancel{b} &\neq \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}r_j\color{black} + \cancel{b} \\ \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}r_i\color{black} &\neq \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}r_j\color{black} \\ \end{aligned}

Breaking the remaining possibilities out into cases:

\begin{aligned} \text{case } \color{blue}f_i\color{black} &= \color{red}f_j\color{black} = 1 \\ \color{red}r_j\color{black} + \color{blue}r_i\color{black} &\neq \color{blue}r_i\color{black} + \color{red}r_j\color{black} \\ &\text{ and i oop } \end{aligned}

⚔️ well shucks, looks like adding a constant $b$ produces a contradiction right away – back to the drawing board.

Gud Sigma

A better $\sigma$ posited by Gumm is:

\sigma((f, r)) = (f, f \times (a - r) + b), a \neq 0

Geometrically, we can interpret this to mean

f \begin{cases} -1: r \implies r + -a + b \text{ rotate a flipped pentagon } a \text{ times CW and } b \text { times CCW}, \\ \\ 1: r \implies a + -r + b \text{ the pentagon is inversed, then rotated } a + b \text{ times CCW } \end{cases}

Proof

\begin{aligned} \sigma((\color{blue}f_i\color{black}, \color{blue}r_i\color{black})) \circ (\color{red}f_j\color{black}, \color{red}r_j\color{black}) &\neq \sigma((\color{red}f_j\color{black}, \color{red}r_j\color{black})) \circ (\color{blue}f_i\color{black}, \color{blue}r_i\color{black}) \\ (\color{blue}f_i\color{black}, \color{blue}f_i\color{black} \times (a - \color{blue}r_i\color{black}) + b) \circ (\color{red}f_j\color{black}, \color{red}r_j\color{black}) &\neq (\color{red}f_j\color{black}, \color{red}f_j\color{black} \times (a - \color{red}r_j\color{black}) + b) \circ (\color{blue}f_i\color{black}, \color{blue}r_i\color{black}) \\ (\color{blue}f_i\color{black}\color{red}f_j\color{black}, \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}f_i\color{black}a -\color{blue}f_ir_i\color{black} + b) &\neq (\color{red}f_j\color{black}\color{blue}f_i\color{black}, \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}f_j\color{black}a -\color{red}f_jr_j\color{black} + b) \\ \end{aligned}

Again, only having to consider the second term of each tuple:

\begin{aligned} \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}f_i\color{black}a -\color{blue}f_ir_i\color{black} + \cancel{b} &\neq \color{red}f_j\color{black}\color{blue}r_i\color{black} + \color{red}f_j\color{black}a -\color{red}f_jr_j\color{black} + \cancel{b} \\ \color{blue}f_i\color{black}a - \color{red}f_j\color{black}a &\neq \color{red}f_j\color{black}\color{blue}r_i\color{black} -\color{red}f_jr_j\color{black} - \color{blue}f_i\color{black}\color{red}r_j\color{black} + \color{blue}f_ir_i\color{black} \\ a(\color{blue}f_i\color{black} - \color{red}f_j\color{black}) &\neq (\color{blue}f_i\color{black} + \color{red}f_j\color{black})(\color{blue}r_i\color{black} - \color{red}r_j\color{black})\\ \end{aligned}

we have two cases:

\begin{aligned} \text{case 1: } \color{blue}f_i\color{black} &= -\color{red}f_j\color{black} &\qquad \text{case 2: } \color{blue}f_i\color{black} &= \color{red}f_j\color{black} \\ a(2\color{blue}f_i\color{black} ) &\neq 0(\color{blue}r_i\color{black} - \color{red}r_j\color{black}) &\qquad a(\color{blue}f_i\color{black} - \color{blue}f_i\color{black}) &\neq (\color{blue}f_i\color{black} + \color{blue}f_i\color{black})(\color{blue}r_i\color{black} - \color{red}r_j\color{black})\\ 2a &\neq 0 \mod 5 &\qquad a(0) &\neq 2\color{blue}f_i\color{black} (\color{blue}r_i\color{black} - \color{red}r_j\color{black}) \\ &\; &\; \color{blue}r_i\color{black} &\neq \color{red}r_j\color{black} \end{aligned}

Case 1 is always true since we require $a \neq 0$ , and case 2 is only true if $i = j$ which is by definition not a transposition, so it's always true.

By choosing some fixed $a$ , we can pre-compute a $\sigma$ table of permutations of the elements of $D_5$ like the following:

\begin{aligned} \sigma((f,r)) &= \begin{Bmatrix} (1,0) \rightarrow (1,0), & (-1,0) \rightarrow (-1, 1) \\ (1,1) \rightarrow (1,4), & (-1,1) \rightarrow (-1, 2) \\ (1,2) \rightarrow (1,3), & (-1,2) \rightarrow (-1, 3) \\ (1,3) \rightarrow (1,2), & (-1,3) \rightarrow (-1, 4) \\ (1,4) \rightarrow (1,1), & (-1,4) \rightarrow (-1, 0) \\ \end{Bmatrix}\\ \sigma(P) &= \begin{Bmatrix} P_0 \rightarrow P_0, & P_5 \rightarrow P_6 \\ P_1 \rightarrow P_4, & P_6 \rightarrow P_7 \\ P_2 \rightarrow P_3, & P_7 \rightarrow P_8 \\ P_3 \rightarrow P_2, & P_8 \rightarrow P_9 \\ P_4 \rightarrow P_1, & P_9 \rightarrow P_5 \\ \end{Bmatrix} \\ \sigma(d) &= \begin{Bmatrix} 0 \rightarrow 0, & 5 \rightarrow 6 \\ 1 \rightarrow 4, & 6 \rightarrow 7 \\ 2 \rightarrow 3, & 7 \rightarrow 8 \\ 3 \rightarrow 2, & 8 \rightarrow 9 \\ 4 \rightarrow 1, & 9 \rightarrow 5 \\ \end{Bmatrix} \end{aligned}

Does this technique scale to more than one transposition/more than two elements?

Yes! If we prove that applying $\sigma$ to the 2nd digit rather than the 1st also holds for our transposition property, then we can borrow the preprocessing pattern from Luhn and apply $\sigma$ to every other element in a sequence. E.g.

P_i \circ P_j \circ P_k \circ P_\ell \neq P_i \circ \color{red}P_k\color{black} \circ \color{red}P_j\color{black} \circ P_\ell

Leveraging the associativity of our group operation, we can ~group~ some terms to conveniently ~do something tbd to them~

P_i \circ \sigma(P_j) \circ P_k \circ \sigma(P_\ell) \neq P_i \circ \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ \sigma(P_\ell)

Next, we'll introduce inverses to start deleting our problems.

\begin{aligned} \color{blue}P_i^{-1}\color{black} \circ P_i \circ \sigma(P_j) \circ P_k \circ \sigma(P_\ell) &\neq \color{blue}P_i^{-1}\color{black} \circ P_i \circ \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ \sigma(P_\ell) \\ P_0 \circ \sigma(P_j) \circ P_k \circ \sigma(P_\ell) &\neq P_0 \circ \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ \sigma(P_\ell) \\ \sigma(P_j) \circ P_k \circ \sigma(P_\ell) &\neq \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ \sigma(P_\ell) \\ \end{aligned}

Being extra tricky, since we can't introduce an inverse $P_k$ nor $P_j$ into the middle of either side, we'll simply apply the sigma function on the last term on each side, yielding some $P_\ell'$ , and then we'll append $[P_\ell']^{-1}$ (brackets added for readability) to both sides:

\begin{aligned} \sigma(P_j) \circ P_k \circ P_\ell' &\neq \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ P_\ell' \\ \sigma(P_j) \circ P_k \circ P_\ell' \circ [P_\ell']^{-1} &\neq \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ P_\ell' \circ [P_\ell']^{-1} \\ \sigma(P_j) \circ P_k \circ P_0 &\neq \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \circ P_0 \\ \sigma(P_j) \circ P_k &\neq \sigma(\color{red}P_k\color{black}) \circ \color{red}P_j\color{black} \\ \end{aligned}

which leaves us with the very case that we have already proven for which $\sigma$ will indicate a transposition.

The Complete Verhoeff-Gumm Algorithm

Permute the digits^† in our sequence $S$ according to our permutation function/table $\sigma$ :

\begin{aligned} S &= 3\; 5 \;2 \; 9 \\ S' &= \begin{cases} S_i \text{ if } i \mod 2 = 0, \\ \\ \sigma(S_i) \text { o.w. } \\ \end{cases}\\ S' &= 3 \; \sigma(5) \; 2 \; \sigma(9) \\ S' &= 3 \; \color{blue}6\color{black} \; 2 \; \color{blue}5\color{black} \\ \end{aligned}

Map the digits to their corresponding Pentagons/elements in $D_5$ :

\begin{aligned} S' &= 3 \; \color{blue}6\color{black} \; 2 \; \color{blue}5\color{black} \\ &= P_3 \circ P_6 \circ P_2 \circ P_5 \\ &= R_3(F_2(R_2(F_1(P_0)))) \end{aligned}

Combine

\begin{aligned} S' &= R_3(F_2(R_2(F_1(P_0)))) &= P_2 \end{aligned}

Find/lookup $c$ = $[S']^{-1}$ according to the precomputed Cayley Table and append the inverse of the result to the end

\begin{aligned} c &= [S']^{-1} \\ &= P_2^{-1} \\ &= P_3 &\;\\ S||c &= 3\; 5 \;2 \; 9 \; \color{blue}3\color{black} \end{aligned}

† I opted to permute every other digit, similar to Luhn, however this deviates from the proposed permutations of the original authors. Verhoeff's version of the algorithm applied $\sigma$ once per index:

\sigma^1(S_1) \circ \sigma^2(S_2) \circ \cdots \circ \sigma^n(S_n) \circ c = 0

And Gumm (a known ISBN shill) applied $\sigma$ once per index in reverse order:

\sigma^n(S_1) \circ \sigma^{n-1}(S_2) \circ \cdots \circ \sigma^1(S_n) \circ c = 0

Conclusion

Gumm's version is slightly superior since it allows for any amount of leading 0s in the sequence without throwing off the check, but if you're a credit card issuer minting cards with several leading zeroes, YTA.

Both methods account for the remaining ~2% of transposition errors that Luhn's algorithm overlooks.

In pursuit of Mentat medulla oblongata⁸, Cartesian Set-Set⁹ Synapses, or just good ol' brain candy, this visualization of the proof is just some good stuff.

Appendix: Taxonomy of $D_5$

$D_5$ has 10 elements:

An Identity: $e$ (or $P_0$ )
4 rotations: $R_1,R_2,R_3,R_4$
5 reflections: $F_1,F_2,F_3,F_4, F_5$

It can also be interpreted as a graph:

where edges are bidirectional, with the back-edge imlicitly being the inverse of the labeled forward-edge.

Or, more comprehensively:

Footnotes

Error Correction Coding: Mathematical Methods and Algorithms, Todd Moon ↩ ↩²
Voyager 2, ↩
Verhoeff, Jacobus. "Error Detecting Decimal Codes." Mathematical Centre Tracts, 118 S. Amsterdam, 1969. ↩
Gumm, Peter. "A new class of check-digit methods for arbitrary number systems." IEEE Transactions on Information Theory, 31(1). 1985. ↩
Patent granted on August 23rd, 1960, nearly 43 years to the day after I started delving into Verhoeff's algorithm – just in time to observe the anniversary by celebrating its theoretical obsolescence (though Luhn's algorithm is *cough* clearly still used in the industry today) ↩
https://torstencurdt.com/tech/posts/modulo-of-negative-numbers/ ↩
On keypads designed for entering numeric digits, "0" and "9" are usually separated for this very reason (amongst others) ↩
Hilbert's "Mentat" ↩
Palmer's "Set-sets" ↩

Librarians, Luhn, and Lizard Brain

Tags