Introduction to Mathematical Proof

Chapter 2: What is a proof?

Simply stated

A proof is an explanation of why a statement is objectively valid.

Thus, we have two goals for our proofs.

However, these two goals are sometimes in conflict. So how to achieve both?

The Proof Spectrum

To be certain that our proof is correct, we need to be exceedingly careful and rigorous. To be clear in our exposition, we need to be succinct and elegant.

To obtain elegant clarity without sacrificing correctness, we will begin with proofs that are objectively correct by virtue of the fact that they can be verified by a machine. This style of proof is called a formal proof. Then we will use a well-defined set of proof shortcuts to eliminate tedious, repetitive, and uninteresting parts of our proofs. Thus, we will construct a bridge between our formal proofs and the more traditional proofs found in journals, textbooks, and problem solutions.

Proof Bridge

Rigor and Elegance

On the one hand, mathematical proofs need to be rigorous. Whether submitting a proof to a math contest or submitting research to a journal or science competition, we naturally want it to be correct. One way to ensure our proofs are correct is to have them checked by a computer. (Note that checking to see if a proof is correct is much easier for a computer to do than finding a proof in the first place.)

There is much discussion in mathematics today about the value of computer verified proofs and their counterparts - rigorous, detailed, formal proofs. Many mathematicians and computer scientists have been making a strong case for formal, rigorous, computer-verified proofs.

On the other hand, most mathematicians are attracted to mathematics because of its intrinsic beauty. A proof that communicates the key ideas of a proof to the reader in a succinct and beautiful way is very effective for its expository properties, even if it is not as rigorous as a formal proof. The legendary mathematician Paul Erdős always spoke of “The Book”, an imaginary book in which God had written down the best and most elegant proofs for mathematical theorems. When he saw any particularly inspiring proof, he would exclaim “That proof is from ‘The Book’!”

We will strive for both rigor and elegance in our proofs by building a bridge between highly rigorous formal proofs and more elegant traditional proofs. We begin with formal proofs.

Math is a cross between art and law. Law is about the reasoning and proving. And the art is because what we’re trying to prove are statements that are somehow elegant. That’s where the artist decides what is art.
– US IMO Coach Po-Shen Loh, after his team won the 2015 IMO

Toy Proofs

We begin by playing with some simple “toy” proof systems that illustrate some aspects of formal proofs.

Definition . A Formal Proof System consists of

  1. A set of expressions and sentences called statements.
  2. A subset of those statements called theorems.

Thus, to specify formal proof system we must provide two pieces of information.

The goal of our game will be to play according to the rules in order to determine when a statement in the system is a theorem. The theorems are, by definition, the valid statements in a formal system. They don’t need to be true about something else, via some interpretation of the statements. To determine a statement is valid in a formal system we only need to verify that it is a theorem.

That is not to say that there is no value in interpreting our statements to be talking about something. Indeed, our intuition about what the statements might be talking about is an essential aspect of mastering the art of theorem proving. But such interpretations are not required in order to determine that a particular statement is a theorem in a formal system. This allows us to interpret the statements individually however we like, while maintaining the ability to mechanically, and therefore objectively, show that they are theorems.

We do not place any restrictions on what a rule can be in a formal system, other than to say that the rules must provide a mechanism for knowing that a statement is a theorem. There are many ways to do that. Let’s illustrate a few.

Example . Consider a language with only the two statements Gina loves chocolate and Mocha is chocolate. There are four possible distinct formal systems we can define with this language.

  1. No statement is a theorem.
  2. Only Gina loves chocolate is a theorem.
  3. Only Mocha is chocolate is a theorem.
  4. Every statement is a theorem.

In the first and third formal systems Gina loves chocolate is not a theorem, but in the second a fourth it is. Whether it is a theorem, and thus valid, only depends on what formal system we are considering, not whether it is ‘true’ about some actual person named Gina or actual yummie foods called mocha or chocolate.

This example illustrates one of the first and simplest mechanisms for determining if a statement is a theorem in a formal system - just list the theorems! Such statements, which are explicitly stated to be theorems, are called axioms of the system. We can also refer to a rule that simply lists axioms as an axiom as well.

For small finite languages like the one in the example, specifying the theorems by simply listing them is perfectly reasonable. But in mathematics our formal systems usually have infinitely many statements and infinitely many theorems, so listing them all as axioms individually becomes impractical. But fear not! There are many ways to overcome this limitation. For example, we might give some criterion that tells us which statements are theorems.

Example . Consider the formal system whose statements are the positive integers, and whose theorems are the even integers. Then we know that in this system $42$ is a theorem but $67$ is not. This system has infinitely many theorems, but we have specified them all by a rule that tells us how to easily determine if a statement is or is not a theorem very easily in a finite amount of time.

In more sophisticated mathematical systems there might not be such a simple direct criterion to immediately identify theorems.

Recursively Defined Sets of Theorems

Another way to specify the set of theorems in a formal system is to do it recursively.

Example . Consider the formal system whose set of statements is the set of all English words. The theorems are defined by the following rules.

  • Rule 0. The word cat is a theorem.
  • Rule 1. Any English word that is obtained by changing only one letter of a theorem is also a theorem.

To claim that dog is a theorem we can write the following.

Theorem. dog

and we can prove it as follows.

Proof.

  1. The statement cat is a theorem by Rule 0.
  2. The statement cot is a theorem by applying Rule 1 to cat which is a theorem by line 1.
  3. The statement dot is a theorem by applying Rule 1 to cot which is a theorem by line 2.
  4. The statement dog is a theorem by applying Rule 1 to dot which is a theorem by line 3.

$\square$

The proof in the box is an explanation of why dog is theorem. Notice that there might be more than one way to prove the same theorem.

Alternatively, we could specify the theorems in Example 2.3 recursively by saying that $2$ is a theorem, and that whenever $n$ is a theorem, so is $n+2$. Of course, it is much easier to define the theorems explicitly by just saying they are the even integers, but we obtain the same set of theorems either way.

Conditional Theorems

In a recursively defined system it is also often useful to show that a statement $Q$ follows from some other statements $P_1,\ldots,P_n$ in the sense that if those statements were theorems, then we could prove that $Q$ is a theorem, too. We refer to such a fact as a conditional theorem.

Example . Consider the formal system in the previous example. Prove that cool follows from from math.

Theorem. If math then cool.

Proof.

  1. Suppose math is a theorem.
  2. Then mats is a theorem by Rule 1 applied to mats which is a theorem by line 1.
  3. Then mass is a theorem by Rule 1 applied to mass which is a theorem by line 2.
  4. Then moss is a theorem by Rule 1 applied to moss which is a theorem by line 3.
  5. Then most is a theorem by Rule 1 applied to most which is a theorem by line 4.
  6. Then cost is a theorem by Rule 1 applied to cost which is a theorem by line 5.
  7. Then coot is a theorem by Rule 1 applied to coot which is a theorem by line 6.
  8. Then cool is a theorem by Rule 1 applied to cool which is a theorem by line 7.

$\square$

Notice that math is definitely not a theorem in the formal system defined in Example 2.4 (can you explain why?), but if it were a theorem, then cool would be too. In many formal systems, use of this type of rule allows us to reduce a potentially hard problem of proving that Q is a theorem to a potentially easier problem of proving the statements that Q follows from are theorems.

Problems

Toy Proof Systems

There are several examples of simple Formal Proof Systems available online at

Scrambler! is a formal proof system where the statements are finite sequences of colors. The Rules are permutations of these sequences (and so have one premise and one conclusion each). The goal is to apply the Rules to show that a given sequence of colors is provable from another given sequence of colors.

Try to find a strategy for reliably beating each of the difficulty levels (Weeny, Easy, Fun, Three Ring Circus, Frogs, Dizzy, Mutant Frogs, and Death!) of Scrambler! They are listed in increasing order of difficulty. Warning: it can be both addictive and hard!

Trix Game is a formal proof system where the statements are positive integers. There are only two Rules, both of which take a single positive integer as a premise, and return a single positive integer as their conclusion. This system illustrates a rule that has a condition on when you can use it. The goal is to show that a given positive integer is provable from the premise $1$ in the system.

Try to find a strategy for winning. Warning: if you can prove that you will always win this game no matter what integer you have for the goal, you will win money and be famous forever!

Circle-Dot is a formal proof system where the statements are just finite sequences of one or more circles and dots. This formal system has many of the features in common with actual mathematical formal axiom systems. There are five rules, two of which are axioms. The goal is to prove various circle-dot strings in the system.

  1. Try to prove Theorem A thru Theorem R in the Circle-Dot web application.
  2. Bonus: Can you explain why every circle-dot expression can be proven in the Circle-Dot toy system, or if not, determine with absolute certainty exactly those expressions which can?

Define a toy formal system as follows. The statements are all nonempty words that only contain the character $\star$. There are two rules.

  • Rule 0: The string $\star\!\star\!\star$ is a theorem.
  • Rule 1: The string $\star\!\star\!\star\!\star\!\star$ is a theorem.
  • Rule 2: The concatenation of any two theorems is a theorem.
  1. Prove all theorems that are less than 12 characters long.
  2. What are all of the theorems in this system?