Undefined Behavior: Not Just for Programming Languages

This is an oldie but goodie. Start with this premise:
a = b
Multiply both sides by a:
a² = ab
Subtract b² from both sides:
a² – b² = ab – b²
Factor the left side:
(a + b)(a – b) = ab – b²
Factor the right side:
(a + b)(a – b) = b(a – b)
Divide both sides by (a – b) and cancel:
a + b = b
Substitute b for a:
b + b = b
Finally, let b = 1 and simplify:
2 = 1

I ran into this derivation when I was nine or ten years old and it made me deeply uneasy. The explanation, that you’re not allowed to divide by (a – b) because this term is equal to zero, seemed to raise more questions than it answered. How are we supposed to keep track of which terms are equal to zero? What if something is equal to zero but we don’t know it yet? What other little traps are lying out there, waiting to invalidate a derivation? This was one of many times where I noticed that in school they seemed willing to teach the easy version, and that the real world was never so nice, even in a subject like math where — you would think — everything is clean and precise.

Anyway, the point is that undefined behavior has been confusing people for well over a thousand years — we shouldn’t feel too bad that we haven’t gotten it right in programming languages yet.

October 11, 2016

regehr

Computer Science, Education, Software Correctness

10 responses to “Undefined Behavior: Not Just for Programming Languages”

Jeff says:

October 11, 2016 at 11:38 am

In “(a + b)(a â€“ b) = b(a â€“ b)”, where does the leftmost “a” go? Should it not be “(a + b)(a â€“ b) = ab(a â€“ b)”?
Jeff says:

October 11, 2016 at 11:39 am

Disregard my second sentence. I typed wrong.
regehr says:

October 11, 2016 at 12:25 pm

Hi Jeff, I split out the steps for factoring the left side and the right side, does that help?
Jeff says:

October 11, 2016 at 12:48 pm

I guess my issue is that the equality is wrong before you divide both sides by (a-b). You can’t factor out a zero either. Maybe I’m just stupid. I have not drank coffee yet today.
adam says:

October 11, 2016 at 1:24 pm

there is no undefined behaviour in mathematics, as dividing both sides of an equation is no mathematical transformation, you have a theorem that for all x,y,z real numbers where z!=0 if zx=zy then x=y, the only thing you can do is evaluate the bounded variables and apply modus ponens to acquire a “new” formula, if you don’t have z!=0 you simply CAN’T get the formula, this is the difference between math and a programming language (or other system) specification which generates a state subspace in a bigger state space. when this state subspace isn’t closed that’s when undefined behaviour occures, whereas the space of proveable formulas is exactly the formulas which are proveable with the given deductive apparatus, so no escape there

for a more comprehensive view take a look at “A Concise Introduction to Mathematical Logic” or the Coq or Isabelle proof assistants
Eden Segal says:

October 11, 2016 at 2:50 pm

Hi, i want to comment on the last paragrath you wrote in this post.
A lot of problems in geometry arise because you draw things incorrectly, for example https://www.youtube.com/watch?v=Yajonhixy4g. The feel when you discover the reason for your problem is your own drawing disability was horrible for me.
Kurt says:

October 11, 2016 at 4:29 pm

If it is not known that the divisor is nonzero, this has to be added as a premise. The final “theorem” is not a = b -> 1 = 2 but a = b & a-b != 0 -> 1 = 2, which is trivially true.
regehr says:

October 11, 2016 at 7:01 pm

Yes, folks, it’s a bit of a lighthearted post about something that I found confusing when I was 10.
Sanjoy Das says:

October 11, 2016 at 11:41 pm

Another one on the same vein:

What is 1 – 1 + 1 – 1 … ?

Let 1 – 1 + 1 – 1 … = C

Then C = 1 – (1 – 1 + 1 – 1 …) => C = 1 – C => C = 1/2.

But you can prove C = 1/4 etc. using the same “technique”. 🙂
Olivier Galibert says:

October 12, 2016 at 2:09 am

Actually, it’s the other way around: you’re allowed to do the divide only if you can prove it is not by zero. So it’s not about remembering what could be zero, it’s about proving what can’t be. Entirely different point of view.

One problem with CS is that it’s horribly hard to prove anything substantial.