Mathematical Reasoning
Introductory Notes on Mathematical Proofs and Discrete Math

Hermish Mehta
University of California, Berkeley
(March 21, 2026)
Abstract

This document presents a compilation of notes I took while taking Discrete Math & Probability Theory (CS70) at the University of California at Berkeley, with an emphasis on Discrete Math. These notes were inspired by the book Introduction to Mathematical Proofs.

Note 1 Numbers, Sets, and Functions

1.1 Quadratic Equations

One of the simplest classes of equations we regularly solve are linear equations, equations of the form below, where the greatest power of our variable is one. As a matter of convention, we write the general equation as

ax+b=0. (1.1)

Solving this equation can be done by rearranging and then dividing both sides by a, yielding

x=ba.

Naturally, we may want to solve some more complicated quadratic equations, where the greatest power of the variable is two. Such equations take the general form

ax2+bx+c=0. (1.2)

To solve this equation, we need to isolate x and in doing so, reduce the number of terms with the variable from two to one. This can be accomplished factoring through a problem solving strategy called wishful thinking. First, we being by taking out the coefficient of the first term, a.

a(x2+bax)+c

Writing the left-hand side of the equation like this, we can see by adding the right constant inside the parentheses, this can be reduced to a square. Returning to our original equation, this results in the following steps.

0 =ax2+bx+c
=a(x2+bax)+c
=a(x2+bax+b24a2)b24a+c
=a(x+b2a)2(b24ac)

We’ve derived the exact type of equation we can easily solve, one in which the variable x is restricted to a single term.

a(x+b2a)2(b24ac) =0
a(x+b2a)2 =b24ac
(x+b2a)2 =b24ac4a2
x+b2a =±b24ac4a2

We needed to include the plus-minus (±) sign since squaring removes the sign and therefore the term

x+b2a

could be either positive or negative and have the same square. By simplifying the square root, and moving the constant beside x to the other side, we derive the quadratic formula which gives the solutions to a quadratic equation in terms of its coefficients.

x =b2a±b24ac4a2 (1.3)
=b±b24ac2a (1.4)

From this, we can construct our first theorem in this course.

Theorem 1.

A quadratic equation ax2+bx+c=0 has at least one solution when

b24ac0 (1.5)
Proof.

In the case where b24ac0, we have explicitly constructed a solution to the quadratic equation. To verify this, we will plug the positive root into the equation to ensure the result is 0.

ax2+bx+c =a(b+b24ac2a)2+b(b+b24ac2a)+c
=(b22abb24ac2ac)+(bb24ac2ab22a)+c
=0

Therefore, we have show a solution exists when b24ac0 for a quadratic equation. More than that, we have determined the specific form of such a solution. End of proof

We can actually show a stronger result about the solutions of a quadratic equation, though the proof is to the reader.

Theorem 2.

If b24ac is positive, a quadratic equation has two solutions; if it is negative, the equation has no real solutions; and if it is exactly equal to zero there is only one unique solution. This can be express in the following way.

{b24ac>0: two real solutionsb24ac=0: one real solutionsb24ac<0: no real solutions

The techniques used to solve this problem and prove its solution provide a preview of the kinds of thinking and problem solving which will be invaluable to this course. More than any one theorem or result, this course is an introduction to mathematical thinking.

1.2 Inequalities

Our discussion so far has revolved around statements of equality, and so naturally we might wonder about inequalities. To clearly define our terminology going forward, we will use the following definitions.

Terminology Definition
Positive x>0
Negative x<0
Nonnegative x0

In order to work with these, it is important to review some basic properties of inequalities.

  1.   i.

    For any real numbers a and b, one of the following statements is true:

    a>b or a=b or a<b
  2.   ii.

    For real numbers a, b, and c

    if a>b and b>c, then a>c
  3.   iii.

    Given that a>b, adding or subtracting constants from both sides preserves equality, meaning for any c, the inequalities below are also true.

    a+c >b+c
    ac >bc
  4.   iv.

    If a>b, the effect of multiplying by another number c0 depends on the sign of c.

    ac>bc if c>0
    ac<bc if c<0

    In other words, if c is negative, the sign of the inequality is flipped.

  5.   v.

    The square of any real number is nonnegative. Furthermore, the square of a number is 0 if and only if the number itself is 0.

These five fundamental properties are essential in proving some foundational inequalities, the first of which is called the Arithmetic-Geometric Mean Inequality. In the presentation of the theorem, we explain how to approach proofs.

Theorem 3 (AGM Inequality).

If x and y are real numbers, then:

(x+y2)2xy (1.6)
Rough Work.

A mathematical proof begins with true statements and derives the desired result through a series of logical sound steps. In looking for an appropriate place to start, it often helps to rearrange the statement we want to prove until we reach a true statement, working backwards to construct the proof. In this case:

(x+y2)2 xy
x2+2xy+y24 xy
x2+2xy+y2 4xy
x22xy+y2 0
(xy)2 0

We know the last line is true, since the square of any number must be nonnegative, hence writing the proof is now easy.

Proof.

For any real numbers x and y, we know the square of the difference must be positive.

(xy)2 0
x22xy+y2 0
x2+2xy+y2 4xy
x2+2xy+y24 xy
(x+y2)2 xy

Therefore, the Arithmetic-Geometric Mean holds for any real numbers. End of proof

Furthermore, in the case where both x and y are nonnegative, we can take positive square roots to obtain another form of the inequality.

x+y2xy for x,y0 (1.7)

This form elucidates the name of this inequality; the left side is the arithmetic mean of x and y, while the right-hand side represents the geometric mean, while powerful inequality relates the two types of averages. Our proof also provides us some additional insight, since it begins with the claim

(xy)20

Remember the square of any real number is equal to 0 if and only if the number is 0. Therefore we have a simple condition for equality in the AGM Inequality.

Theorem 4.

Equality hold in the AGM Inequality if and only if (xy)=0 or x=y. In other words

(x+y2)2=xy if and only if x=y. (1.8)

To introduce the second major inequality, the absolute value of a number, traditionally denoted as |x| needs to be defined.

|x|={x:x0x:x<0

The absolute value can be thought of as the positive distance to the origin. For example, 2 is exactly 2 units away from the origin. The absolute value can also be visualized through its graph.

422424
Figure 1.1: The graph y=|x|

Once again, we present fundamental properties of absolute values, in order to provide a foundation for working with these mathematical functions.

  1.   i.

    For any real number x, |x|x and |x|0.

  2.   ii.

    Given real numbers x and y, |xy|=|x||y|.

  3.   iii.

    Squaring a number strips its negative sign, and taking the positive root ensures the result will be positive. Therefore the [positive] square root of the square is the absolute value.

    |x|=x2

The last property is rather interesting, and can be proved. Notice for proving these simple properties of absolute values, a proof by cases is often most effective, a proof technique in which we break all possibilities into distinct cases or scenarios and prove the claim for each of them. Since the absolute value function behaves different for positive and negative numbers, these are naturally our cases.

Theorem 5.

For any real number x, we have:

|x|=x2 (1.9)
Proof by Cases.

Consider the cases when x is positive, zero and negative separately. Note formally, one of these cases must be true from the trichotomy property (see the first property of inequalities above).

Case 1. x>0

x2 =x
=|x|

Case 2. x=0

02 =0
=|0|

Case 3. x<0

Assume that x=a for some positive real number a. Notice that |x|=a Plugging this into the expression of the square root of the square:

x2 =(a)2
=a2
=a
=|x|

End of proof

At this point, we can go ahead and introduce the next major inequality. Note that we mentioned that the absolute value of the product of two numbers is the product of the absolute values, or

|xy|=|x||y|.

Unfortunately, the same does not hold true for sums–the analogous statement is false.

|2+(1)||2|+|1|

However, there is a relationship between the two quantities which can be expressed as an inequality.

Theorem 6 (Triangle Inequality).

Given real numbers x, y the sum of the absolute values is always greater than or equal to the absolute value of the sum.

|x+y||x|+|y| (1.10)
Rough work.

Once again, we begin with rough work to determine how to progress on the proof. In order to remove the absolute value sides, it seems reasonable to square both sides.

|x+y| |x|+|y|
|x+y|2 (|x|+|y|)2
(x+y)2 (|x|+|y|)2
x2+2xy+y2 |x|2+2|x||y|+|y|2
x2+2xy+y2 x2+2|xy|+y2
xy |xy|

Notice we arrived at a statement which is clearly true. Remember that the absolute value of any real number is always greater than or equal to the number. At this point we can write the proof.

Proof.

Consider any real number x and y. From the basic properties of absolute values we know the product of these numbers must be less than or equal to its absolute value.

xy |xy|
x2+2xy+y2 x2+2|xy|+y2
x2+2xy+y2 |x|2+2|x||y|+|y|2
(x+y)2 (|x|+|y|)2
|x+y|2 (|x|+|y|)2

Both terms being squared are positive, hence we can safely square root both sides without affecting the inequality.

|x+y||x|+|y|

End of proof

Once again in this proof, we used rough work to arrive at a statement we knew was true, and then worked backward to derive the result we wanted from this true statement. We had to be more careful working backward in this example since we square rooted an inequality. Though this works when both terms inside the square are positive, it may fail spectacularly otherwise.

41
(2)2 (1)2
 2 1 false!

1.3 Sets

So far we have been relying on intuition to define the real numbers, the focus of our attention. Although the formal definitions of the real numbers are outside the scope of these notes, it is worth paying attention to this fundamental notion of a group of numbers. In mathematics, this concept is represented as a set, a well-defined collection of objects. Some examples are listed below.

S={Saturday,Sunday}
T={1,2,3,}
R={n2:n is even}
ϕ or {}

The last example presents two ways to denote the empty set, the set with no elements. Notice R is the set of squares of all even numbers. The ‘:’ introduces a condition in the set definition. Since every set must be well-defined, every object is either an element of the set or not. If set A contains element a but does not contain b, we can write

aA and bA.

In defining sets, we often specify the elements to be included from some universe of objects. However, the elements of elements not in A clearly make a set as well, one which is intrinsically linked to A. This set is denoted the complement of A, written as Ac and defined as

Ac={xuniverse :xA}

Though we will explore notions of size in more depth later on, for now we define the cardinality, written as

|A| or card(A),

as its size. For finite sets, sets without an infinite number of elements, the cardinality is simply the number of elements. All such cardinalities are in the set {0,1,2,}, with 0 reserved for the empty set.

Since a set is unordered and cannot contain duplicate items, a natural definition of equality is that both sets contain the same elements. On the other hand, if every element of set B is in set A, then in some sense the latter set contains the former. Mathematically, we write that B is a subset of A or

BA.

Notice under this definition, the empty set is a subset of every set. Similarly, a set is always a subset of itself. In cases where we want to exclude this possibility, that the sets are equal, in explaining the relationship between the two sets, we can write B is a proper subset of A or

BA

which means BA but BA. Along with the real numbers there are a handful of other important sets used to refer to classes of numbers.

Symbol Name Set
Natural Numbers {1,2,3,4,}
Integers {,2,1,0,1,2,}
Rational Numbers {pq:p,q}
Real Numbers

Notice we defined the natural numbers to exclude 0, but the symbol 0 could be used to refer to all naturals along with 0. Based on the descriptions of each of these sets, each contains the previous plus some additional elements. Therefore, we can write

0. (1.11)

Since each set contains the previous, we may wonder what elements are in one but not the other. In some cases, this set itself might be valuable. For example all real numbers which are not rational are referred to as as irrational, and the set which contains all such numbers the irrational numbers. This set then is a result of taking the real numbers and subtracting away the rational numbers. This operation, along with a few more are formalized before for arbitrary sets A and B.

Subtraction. AB ={xA:xB}
Intersection. AB ={x:xA and xB}
Union. AB ={x:xA or xB}
Cartesian Product. A×B ={(a,b):aA and bB}

The intersection is the set of all elements in both, while the union is the set of all elements in either. Two sets are called disjoint if their intersection is the empty set meaning they have no common elements. Written out this would mean for disjoint sets S and T,

ST=ϕ

The cartesian product of two sets outputs a set with a completely different structure: a set of tuples containing all possibly pairs of elements from A and B. Unlike sets, tuples are ordered lists of elements which could contain duplicates. This idea of product can be extended beyond two sets.

A×B ={(a,b):aA,bB}
B×A ={(b,a):aA,bB}
A×B×C ={(a,b,c):aA,bB,cC}

Repeated products of the same set can be abbreviated as a power, similar to the real numbers.

Ak =A××Ak times
={(a1,a2,,ak):a1,a2,,akA}

One final concept that must be introduced to complete our discussion on introductory set theory is the power set of a set, which contains all possible subsets. Formally we write this as

Power set. 𝒫(A)={S:SA}.

To ensure this section includes a proof, we end with a simple theorem about the size of a power set. We will explore another proof of this theorem later in the course.

Theorem 7.

For a finite set A, if |A|=n, then its power set 𝒫(A) has cardinality 2n.

Examples.

One way to begin working on a proof is the test the claim with examples, since they may yield insight into why the result is true.

ϕ ={ϕ}
𝒫({0,1}) ={ϕ,{0},{1},{0,1}}

In each of these cases, the result holds. When the sizes of sets were 0 and 2, their power sets had sizes of 1 and 4 respectively. Notice we would also expect the number of subsets to be even, since for every subset SA, (AS) is also one, and this theorem seems to check out in that regard. At this point we can present the proof

Proof.

Since A is finite and |A|=n then the set must have n elements. Let us arbitrarily assign labels so that

A={a1,a2,an}

In constructing a subset, each of these n distinct elements can be chosen to either be included or excluded from the set. Since every subset can be created with these series of n choices, and each choice has two options, the number of subsets is 2n. End of proof

A major class of proofs involves proving two sets are equal. One possible way of doing this involves showing each set is a subset of another. To prove set S is a subset of T, we must show if x is an element of S then it is also an element of T. This would mean every element in S is contained in T and therefore ST. To provide an example, we introduce another theorem.

Theorem 8 (De Morgan’s laws).

Given two sets A and B, the following statements are true.

(AB)c =AcBc (1.12)
(AB)c =AcBc (1.13)
Proof.

Since the proof the statements are nearly identical, only the first will be proved; the second is left as an exercise. Since we are proving the equality of sets, we proceed by showing the left-hand side (LHS) is a subset of the right-hand side (RHS) and vice versa.

Part 1. (AB)c(AcBc)

Assume that x(AB)c, we will show x is also an element of (AcBc) through a series of implications.

x(AB)c x(AB)
xA and xB
xAc and xBc
xAcBc

Part 2. (AcBc)(AB)c

Now assume that x(AcBc), we will show x is also an element of x(AB)c.

x(AcBc) xAc and xBc
xA and xB
x(AB)
x(AB)c

Since we have proved that (AB)c(AcBc) and (AcBc)(AB)c, this is enough to show equality.

(AB)c =AcBc

End of proof

1.4 Functions

At this point we can introduce a very special type of set which has profound uses in mathematics. This, of course, is a function, which is a set of input-output ordered pairs which can represented as

{(x,f(x)):xdomain}
Term Definition
Domain The set of all allowed inputs
Codomain (Target Space) The set of all allowed outputs
Image (Range) The set of all actual outputs

The function, then, takes an input from the domain and returns a single output in the codomain. This property of linking the domain to the codomain explains why function are often called maps. Since the image is the set of outputs, and a subset of the codomain, it would be the set that results from applying the function to every element in the domain. Thus, for a function f with domain A, the image is often denoted as f(A).

f(A) ={f(a):aA}

In defining a function, the domain and codomain must be explicitly specified, hence the following notation is often used.

domainf:ABcodomain

A function is called real-valued if its image is a subset of the real numbers, and thus all its outputs are real numbers. The graph of a function is a plot of all the input-output pairs. In working with functions there are a few elementary ways to combine them.

Addition. (f+g)(x)=f(x)+g(x) (1.14)
Multiplication. (fg)(x)=f(x)g(x) (1.15)
Composition. (fg)(x)=f(g(x)) (1.16)

We are often extremely interested in the image of a real-valued function. In particular, we may wonder about the nature of this set: does it have a maximum or minimum, or does the function go to infinity? Formally, a function f:A is bounded if there exists some M such that

|f(x)|M

for all xA. Notice in this definition M acts both a positive and negative bound since the statement is equivalent to

Mf(x)M.

It turns out that sums and products of bounded functions are bounded as well, results we prove below. Notice in these proofs we clearly state our assumptions and what we plan on showing, a technique that helps inform our approach.

Theorem 9.

If functions f and g are bounded so are the functions (fg) and (f+g), formed by the sum and product of these functions respectively.

Proof.

Since f and g are bounded, there exist M and N for which

|f(x)|M and |g(x)|N.

We hope to show there exists some bound on |(fg)(x)|, therefore our aim should be to express this in terms of the quantities we already have bounds for.

|(fg)(x)| =|f(x)g(x)|
=|f(x)||g(x)|
MN

This shows that we have discovered a bound for the function: the product of the two bounds of our original function. Hence the function is bounded. End of proof

Proof.

Once again, assume there exist M and N for which

|f(x)|M and |g(x)|N.

We wish to find a bound for |(f+g)(x)|. However, in this case it is more tricky to decompose this into |f(x)| and |g(x)|, and requires the triangle inequality.

|(f+g)(x)| =|f(x)+g(x)|
|f(x)|+|g(x)|
M+N

Thus, we have proved (f+g), showing the sum of the bounds of the functions is the bound for the sum of the functions. End of proof

In general, approach bounding proofs by expressing the bounded functions by their definitional inequality. Then work on manipulating these inequalities through inequality or absolute value rules to obtain the desired result.

1.5 Fields

So far in our deals with , we have been implicitly relying on properties or axioms of arithmetic in the real numbers. Many of these properties are not exclusive to the reals, and apply to a class of algebraic structures called fields. A field, F, consists of

(F,0,1,+,) where F={0,1,}.

F is a set of elements in the field in including at least the additive identity 0 and the multiplicative identity 1. A field also includes two predefined binary operations, addition and multiplication, which take two inputs in F and return the sum and product in F respectively. These function must obey the following axioms for x,y,zF.

+:F×FF
:F×FF
Name Addition Multiplication
Closure x+yF xyF
Associativity (x+y)+z=x+(y+z) (xy)z=x(yz)
Commutativity x+y=y+x xy=yx
Identity x+0=x x1=x
Inverses each non-zero element has an additive inverse each element has an multiplicative inverse
Distributivity x(y+z)=xy+xz

To clarify the inverse axioms, the first states that for any xF, there exists a x in the field, such that

x+(x)=0,xF.

In this case, x is simply an element of the field, and is called the additive inverse of x. Commutatively, x is the additive inverse of x. Sometimes, the additive inverse of a field element is itself–the trivial example is 0:

0+0=0.

For non-zero elements in the field, a multiplicative inverse exists as well, which means for xF{0}, there exists a x1, such that

xx1=1,x1F.

Similar to the additive inverse, this inverse is an element of the field and likewise can be the same number. With this axiom, we can define two new operations in the field. Subtraction amounts to adding by the additive inverse and division is multiplication by the multiplicative inverse. For field elements x, y in some field F.

xy :=x+(y) (1.17)
x/y :=xy1 given y0 (1.18)

Notice the := here emphasizes the expression on the left is defined to be equal to the expression on the right. To avoid confusion, when working in fields, we will refrain from using subtraction and division and instead refer to the inverses.

Naturally, we may now wonder, having laid out these axioms, which sets which classes of numbers, with regularly defined addition and multiplication, constitute fields. To prove whether a set with defined operations is a field is simply a matter of checking each of the axioms.

Theorem 10.

The natural numbers () and integers () are not fields.

Proof.

We will prove this theorem in two parts, first dealing with the natural numbers proceeded by the integers.

Part 1.

The additive inverse axiom fails for all integers. Specifically, the number 1, for example, has no additive inverse since for all n,1+n>0. Since 1 had no additive inverse this is not a field.

Part 2.

The multiplicative inverse axiom fails for 2. Once way of rigourously showing this is to notice 2n is even for all n. Therefore this value can never be 1, and hence no multiplicative inverse exists.
End of proof

Theorem 11.

The rational numbers () and real numbers () with regularly defined addition and multiplication constitute fields.

Proof.

The process of proving these are fields are rather tedious, verifying each of the field axioms individually, and left as an exercise. However, note that the inverse axioms that failed for the natural numbers and integers hold for these two sets. The additive and multiplicative inverses are respectively shown below.

pqpq,qp
xx,1x

End of proof

These suite of axioms lead to a host of simple properties, which should feel familiar to us after working with the real numbers. We introduce an expansive theorem which lays out many of these properties.

Theorem 12.

Given a field F defined with addition and multiplication and three elements x, y, and z, the following properties must hold.

x0=0 (1.19)
if xy=0 then x=0 or y=0 (1.20)
x=x(1) (1.21)
(x)(y)=xy (1.22)
if x+y=x+z then y=z (1.23)
if xy=xz then y=z if x0 (1.24)
Proof.

Proving every statement in the theorem would take up a lot of space, hence we will present proofs for the first and last statement, hopefully providing a model for approaching proofs to the remaining statements.

Statement 1.18. x0=0

Notice that although we take this for grated in the real numbers, it is not a field axiom and must be proved. Since 0 is the additive inverse, in searching for a proof we should look to exploit its properties.

x0 =x(0+0) identity
=x0+x0 distributive

At this point notice by closure we know x0 lies within the field. Therefore, by the inverses axiom it must have an additive inverse. Now we add this to both sides of the equation.

x0+x0 =x0
x0+x0+((x0)) =x0+((x0))
x0+(x0+((x0))) =0 associative
x0+0 =0 inverse
x0 =0 identity

This is the result we were looking for. Notice instead of subtracting x0 from each side, as we would in the real numbers, we added the additive inverse to both sides. In this proof we clearly listed which axiom allowed us to make each step, and although this is not necessary, it makes it less likely to perform an operation which is not allowed.

Statement 1.23. if xy=xz then y=z if x0

Once again, we attempt a direct proof based on the field axioms. Since x0 from the inverses axiom, we know there exists some x1F such that

xx1=1

We simply multiply both sides of out equation with this term.

xy =xz
x1xy =x1xz
(x1x)y =(x1x)z associativity
1y =1z inverse
y =z identity

Once again, we have arrived at our result, hence we have proved this this statement for arbitrary fields. Notice we could approach this problem as if we were asked to prove this for real numbers. Instead of dividing both sides by x we multiplied by x1 to arrive at the result. End of proof

The two fields we have discussed so far have had an infinite number elements with predefined addition and multiplication operations that mirror their common usage. However, we may wonder what other fields exists, and in particular, what such a field would look like for a finite number of elements, a finite field. Since a field must have at least two elements, let us attempt to construct a two element field, called the binary field. Consider the field

(S,0,1,+,) where S={0,1}.

Unfortunately, we must define multiplication and addition in this field since otherwise the latter would defy the closure field axiom as

1+1 =2S.

For finite fields with a small number of elements, the most effective method of defining these operations involves writing addition and multiplication tables, providing outputs for all possible pairs of input for each function.

      

Notice, that multiplying by 0 always returns 0 (as we proved above) and multiplying by 1 gives the same number. Similarly, adding 0 returns the same number by the identity property.

      

This only leaves one output to be decided, the sum 1+1. In the next theorem we prove that this is equal to 0. To do this, we introduce a new type of proof, a proof by contradiction.

Theorem 13.

If S is the binary field with S={0,1}, then 1+1=0.

Proof by Contradiction.

By the closure property of fields, we know the sum 1+1 must lie within S and is therefore it is either 0 or 1. Assume for sake of contradiction that it is indeed 1, so that

1+1=1.

By the inverse axiom, we know that 1 has an additive inverse in the field. We now add this inverse to both sides of the equation.

1+1+(1) =1+(1)
1+0 =0
1 =0

This is clearly false since by definition a field must have distinct elements 0 and 1. Since this would be a logical consequence of letting 1+1=1, that assumption must be false. Hence we are left only with the possibility that

1+1=0.

End of proof

In a proof by contradiction, we assume the opposite of the statement we are trying to prove, and show how this leads a result that is clearly false. This contradiction shows that our initial assumption must be false, and therefore its opposite true. In this way, we indirectly prove our claim is true by showing it cannot be false. In the last proof, we demonstrated if 1+1=1 then 0=1, which is clearly false. Therefore, we could conclude 1+11, which meant 1+1=0, what we wanted to show.

Moving on, we can tackle the construction of three-element field T={0,1,ω}. Immediately, we write the addition and multiplication tables and fill out the simple rows and columns.

      

Completing the multiplication table is simple; since every element in the field must have a multiplicative inverse, in the case of ω, this must be itself. In other words, every non-zero row or column in a field multiplication table must have a single 1.

      

This leaves the addition table to be completed, to do this we introduce a new theorem which helps complete the non-diagonal entries.

Theorem 14.

For a three-element field (T,0,1,+,) with T={0,1,ω} the following equation must be true.

1+ω =0
Proof by Contradiction.

Assume for sake of contradiction that 1+ω0. Then, by closure the sum must either be ω or 1. We would show either case results in a contradiction.

Case 1. 1+ω=1

Since 1 is in the field, the inverses property tells us that an additive inverse exists. Adding both sides with this additive inverse we immediately see the contradiction.

1+ω =1
(1)+1+ω =1+(1)
ω =0

Since the elements are distinct this is a contradiction.

Case 2. 1+ω=ω

We approach this case in a very similar way, clearing the ω’s from both sides to expose a contradiction.

1+ω =ω
1+ω+(ω) =ω+(ω)
1 =0

This again, is a contradiction. Therefore in the absence of any alternatives, we have proved that 1+ω=0 End of proof

Note in this proof, we applied a case-by-case approach to a proof by contradiction. This is analogous to a proof by cases where we prove the result in every case, except we actually show a contradiction in each case.

      

The remaining two spaces are rather easy to fill out if we remember the property that in a field

if x+y=x+z then y=z

Fixing x by focusing on an individual role or column, this tells us that different entries must be different. This is because, for example,

if 1+1=1+ω then 1=ω.

Interestingly, this means every row and column of the addition table must contain each of the field elements. Exploiting the analogous rule for multiplication, every non-zero row or column must have every field element appear exactly once. Armed with this knowledge, we can finally complete the operator definitions, showing a three-element field exists.

T={0,1,ω}

      

1.6 Exercises

  1. (a)

    Prove that for two real numbers x and y, |xy||x||y|.

  2. (b)

    Given x+2y=100, determine the maximum value of xy using the arithmetic-geometric mean inequality.

  3. (c)

    For real numbers x,y,u,v, show that

    (xu+yv)2(x2+y2)(u2+v2).
  4. (d)

    When does equality hold for the triangle inequality? In other words, for what values of x and y does

    |x+y|=|x|+|y|?
  5. (e)

    Prove for any a,b>0 the following inequality holds true.

    4ab(1a+1b)2
  6. (f)

    Prove the following for three real numbers x, y, and z.

    |x+y+z||x|+|y|+|z|
  7. (g)

    Show the image of the function f is the set (0, 1], for

    f:,f(x)=1x2+1.
  8. (h)

    Let C and D be sets in the domain of g. Prove the following relationship about the images of the function.

    g(CD)g(C)g(D)
  9. (i)

    Let f and g be two bounded functions. Which of the following functions are also bounded?

    f+g,fg,fg,1f2+1
  10. (j)

    If (fg) and (fg) are bounded, prove that the function (f+g) is bounded.

  11. (k)

    Consider the four element field F={0,1,a,b} with operations + and . Complete the addition and multiplication tables below, if possible, and determine the number of different fields which satisfy the axioms.

          

  12. (l)

    For each of the statements below either provide a proof or counterexample. (1) For any xS, we have x0=0, and (2) y11=y1 for any y0 in S.

Note 2 Language and Proofs

2.1 Statements and Connectives

The atomic unit of mathematical reasoning is the mathematical statement or proposition, a sentence that is either true or false. This requirement is called the principle of bivalence. For a statement, its truth value is whether it is true or false. Examples of statements are given below.

P π is rational
Q x2+x+1 has a real solution
R July 4, 1776 was a Thursday

In our examples, the first statement is false while the other two are true. On the other hand “let them eat cake” is not a mathematical statement. These statements can be combined with logical connectives to express more complicated thoughts, which are listed below.

Name Usage Meaning
Negation ¬P not P
Conjunction PQ P and Q
Disjunction PQ P or Q
Conditional PQ if P then Q
Biconditional PQ P if and only if Q

Similar to defining operators in a finite field, we provide rigorous definitions for these logical connectives by showing whether the compound statement is true of false for every possible set input. Fortunately there are very few possible inputs since each statement can either be true or false. In composing a table, called a truth table, we let each row correspond to a unique input.

Notice this truth table shows completely negation, since it tells us how the statement behaves given every input. In this case, if a statement is true its negation is false and if a statement is false its negation is true. In English, then, the negation of a statement would be its opposite. The truth tables for the remaining connective are given below; notice each require two inputs and thus have four possible inputs.

These truth tables provide complete definitions for each of these operators, but fail to provide very much intuition.

  1.   i.

    Negation: As mentioned before, if R is true its negation is false and vice versa. The simple interpretation of this that ¬R is the opposite of R.

    P π is rational
    ¬P π is not rational
    Q 2>0
    ¬Q 20
  2.   ii.

    Conjunction: PQ is true only when both P and Q are true, which matches with we think about “and.”

    P 2 is irrational
    Q (x2=2) holds when x=2
    PQ 2 is irrational and a solution to x2=2
  3.   iii.

    Disjunction: PQ is true when either P, Q, or both P and Q are true. Another way to think about this is that PQ is only false when both statements P and Q are both false. There is a slight difference between this logical “or” and the one in everyday usage; often times, “or” in English can mean one or the other but not both, which would be called the exclusive-or in mathematics.

    P x is positive
    Q x is negative
    PQ x is positive or negative
    x is nonzero
  4.   iv.

    Conditional: This connective is essential in proofs, and thus is worth taking closer look at.

    hypothesisPQconclusion

    The first statement is called the hypothesis and the second statement the conclusion. The statement (PQ), interpreted as either

    “if P then Q” or

    P implies Q,”

    means when P (the hypothesis) is true, Q (the conclusion) must be true as well. On the other hand, when P is false, it does matter whether Q is true or false, and the statement is true in either case. Once again, the only time a conditional is false is when P is true but Q is false.

  5.   v.

    Biconditional: A biconditional (PQ) is true when both P and Q have the truth value and false they do not. Therefore, if the biconditional is true the statements P and Q are logically interchangeable since both are true or false together. For this reason, P and Q can be said to be “logically equivalent.” This concept can be expressed in two ways as either

    PQ or PQ,

    where the second notation is traditionally used to indicate the two logical compound expressions are equivalent, especially when one side might itself contain the biconditional connective.

It turns out that from these logical connective we can construct many expressions which are identical. We prove a few of the important logical equivalences.

Theorem 15 (De Morgan’s laws).

For two logical statements R and S, the following equivalences are true.

¬(RS) ¬R¬S (2.1)
¬(RS) ¬R¬S (2.2)
Proof.

The general way to prove logical statements are equivalent is to construct truth tables and show both statements have identical outputs for the same inputs. In order to avoid receptiveness, we will do this for the first statement and leave the proof of the second as an exercise.

R S RS ¬(RS)
T T T F
T F F T
F T F T
F F F T

This gives us the truth table for the left-hand side of the equation. We follow the same process, slow building up the right-hand side.

R S ¬R ¬S ¬R¬S
T T F F F
T F F T T
F T T F T
F F T T T

Since (¬R¬S) and ¬(RS) are true and false together, given the same inputs (have identical columns) the two statements are logically equivalent.
End of proof

Theorem 16.

The following important logical equivalences are also true.

PQ (PQ)(QP) (2.3)
PQ (¬Q¬P) (2.4)
PQ ¬(P¬Q) (2.5)

2.2 Predicates and Quantifiers

Even after the application of connectives, statements are rather limited and specific, and so to express classes of statements, predicates are used. A predicate with variables is an expression which is either true or false when its variables are supplied. Since predicates are not statements until its variables are defined, they are sometime called open statements.

P(n)=n is even

Above is an example of such a predicate; although it currently is neither true nor false, as soon as P is given a reasonable input its truth value is defined.

P(0) is true
P(1) is false
P(2) is true

Notice the truth value changes as a function of the input, though we need only consider inputs for which it is well-defined. For example, in this case, the expression P(π) makes no sense since even and odd are not defined for real numbers.

Having introduced this machinery, we can return to DeMorgan’s laws and explain its connection to the identically named rules for sets. For two sets A and B, consider the following statements.

R(x) xA
S(x) xB

From DeMorgan’s laws, we know the following logical equivalence is true.

¬[R(x)S(x)] ¬R(x)¬S(x)

We will translate both sides into the language of set theory.

¬[R(x)S(x)] ¬[(xA)(xB)]
¬[x(AB)]
x(AB)
x(AB)c
¬R(x)¬S(x) ¬(xA)¬(xB)
xAxB
xAcxBc
xAcBc
(AB)c=AcBc

Similarly, with these defined statements,

¬[R(x)S(x)] ¬R(x)¬S(x)
means (AB)c =AcBc

So far we have only discussed whether statements are true for individual inputs, but often times we are interested in the behavior of a predicate over a set of inputs. To do this, we introduce two quantifiers.

Universal. xS  “for all x in S
Existential. xS  “there exists an x in S

Attaching either a universal quantifier or existential quantifier to a predicate makes it a statement.

(xS)P(x) “for all x in S,P(x) is true”
(xS)P(x) “there exists an x in S such that
P(x) is true”

In English these quantifiers can often be disguised, but are commonly used when talking about mathematical statements. A predicate with multiple variables can have multiple quantifiers, where order matters.

S =(x)(M)(|x2|M)
=for all real numbers x, there exists a real M such that |x2|M
T =(M)(x)(|x2|M)
=there exists a real M such that for all real x|x2|M

The first statement means that for any real input of x2, we can find a y-value which is more extreme that it. This is trivially true since any finite input of the function is finite, take for example

M=x2+1.

On the other hand, the second statements says there is some bound M for which all values in the image of f(x)=x2 lie between M and M. Note this is identical to the definition of a bounded function we discussed earlier, only written in the form of a statement. Now consider the negation of this statement. To argue a function is not bounded is to should for all bounds M we can chose at least one point which exceeds this bound.

¬T =(M)(x)(x2>M)

If we compare the definition of bounded with its negation we notice an interesting relationship, where, in addition to the predicate being negated, all universal quantifiers turn into existential ones and vice versa. This is formalized below.

Theorem 17.

When negating a quantified predicate, flip the quantifiers in addition to negating the predicate.

¬[(xS)P(x)] (xS)¬P(x) (2.6)
¬[(xS)P(x)] (xS)¬P(x) (2.7)

2.3 Proving Existential Statements

The simplest class of statements we need to prove are those with existential quantifiers; those of the form

(xS)P(x).

To prove such a statement is rather straightforward and usually involves finding the x for which the claim is true. For example, we can prove the following statement, which is sufficient to show the irrational numbers do not obey closure under regular addition and therefore cannot be considered a field.

Theorem 18.

There exist two irrational numbers r and s for which r + s is rational.

Proof.

Expressed as a logical statement, we are interested in the statement

(r,s)(rs).

We will prove this existentially qualified statement by fining such a pair of irrational r and s for which their sum is irrational. Consider now:

{r=πs=π
rs=π+(π)=0

Since we have found at least one pair of elements for which this statement is true, we have proved that the sum of two irrational numbers can be rational. End of proof

Of course, explicitly finding an example which satisfies the statement may not be as easy. Sometimes it is possible to show such an example must exist without actually finding it in a type of proof called a non-constructive proof. The most famous example of such a proof is given below.

Theorem 19.

There exist two irrational numbers r and s for which rs is rational.

Proof by Cases.

This proof seems nearly impossible to attempt directly. The irrational numbers are a very diverse set and a clear starting point is elusive. Instead, we attempt a non-constructive proof with a trick.

Case 1. 22 is irrational

Notice what happens when this number is raised to the power of 2.

(22)2 =222
=22
=2

Therefore if 22 is irrational, then we have found two irrational numbers for which their power is rational.

Case 2. 22 is rational

It turns out that if this case is correct, then our proof is automatically done, since the square root of two, an irrational number, when raised to the power of itself is rational.

Ultimately, we have shown that either

(22)2 or 22

must be rational and the result of raising one irrational number to the power another. Although we do not know which case is correct, one of them must be true and thus there must exist two irrational numbers who power is rational. (In case you were still curious, it turns out the first case is correct.) End of proof

We will now introduce even and odd integers and use this an an opportunity to prove statements about this property.

Even. x is even means x=2k for some k
Odd. x is odd means x=2k+1 for some k

Notice with definition every integer is either even or odd but not both. The sets of even and odd numbers look as follows

e ={,4,2,0,2,4,}
o ={,3,1,1,3,5,}

where e denotes the even integers and o the odd integers. We will leave our discussion of parity, the property of being either even or odd, here for now.

2.4 Unary Proofs

Despite this discussion about proving existential quantifiers, we are primarily interested in proving or disproving statements with universal quantifiers. That is, we prefer more general results about all natural numbers, all integers, or all real numbers. To disprove a statement, we can prove its negation. For a universally quantified statement, these have the following forms.

P (xS)P(x)
¬P (xS)¬P(x).

Hence, proving such a statement false amounts to finding at least one example for which the result fails to hold. As we showed above, this can either be done explicitly or implicitly (non-constructively). In the scenario where we find such case, we call it a counterexample. Since the negation of a universal quantifier is existential, only one counterexample is needed to prove the entire claim false.

It follows that we may now want to explore techniques to help prove these statements true. Of course, we usually deal with infinite sets and it is impossible to manually verify the predicate holds for all possible inputs. However, once we realize the majority of results fall within one of three classes of statements,

S1 (xS)P(x),
S2 (xS)[P(x)Q(x)], and
S3 (xS)[P(x)Q(x)],

we can apply the tools mathematical logic to discover more efficient methods to approach these proofs. But first, we provide examples of statements of this form statements we have already encountered.

Type Examples
P(x) for all real numbers x, x2=|x|
for all x,y, |x|+|y||x+y|
for any finite set S, the size of 𝒫(S) is 2|S|
for any statements S and T, ¬(ST)¬S¬T
P(x)Q(x) if x=y and x,y>0 then their arithmetic mean equals their geometric mean.
if b24ac0 then a quadrative equation has at least one solution
if F is a field with x,yF, then (x+y)F

We first begin our examination of proof techniques with statements of the form

(xS)P(x).

Notice that these statements are actually of the form

(xS)[H(x)P(x)],

where H(x) represents many of the definitions and assumptions we take for granted. For example, in proving the triangle inequality, we assumed that the absolute value function was defined in a particular way, along with a few axioms of the real numbers. However since the hypotheses are not meaningful in the sense that they provides any significant information apart from the generally accepted rules, they are acceptable to omit. The two main methods of proving such a statement are a direct proof and a proof by contradiction. In the former,we derive the statement directly from those which are true. An example of such a proof is given below.

Theorem 20.

For any real number x the following inequality is always true.

|x|+1|x|2 (2.8)
Proof.

It turns out that this statement is rather useful, and appears frequently in other places. This statement is relatively easy to prove as well, since it appears to be a special case of the arithmetic-geometric inequality. Take:

{a=|x|b=1/|x|

Since both a and b are greater than 0 we can apply the special case of the theorem which hold for positive numbers.

a+b 2ab
=2|x|1/|x|
=2
|x|+1|x| 2

End of proof

While a direct proof is rather straightforward, it can difficult to see how to proceed when confronted with a statement. In this case, a proof by contradiction can be used: assume the negation of the statement and show that it must be wrong. Remember that a either a statement or its negation must be true, therefore showing the negation to be false is equivalent to proving the statement true. An example where a direct proof might be difficult is given below.

Theorem 21.

For any list of real numbers, there must be at least one value which is a large as the average. Alternatively, this can be expressed as:

max(x1,x2,,xn)x1+x2++xnn (2.9)
Proof by Contradiction.

Attempting a direct proof seems unwise; instead, we approach this with a proof by contradiction. Assume there exist a list of real numbers for which every number is less than the average. Therefore we would have

x1,x2,,xn<x1+x2++xnn

Since we have a bound on every single term, we can find a bound on the sum.

x1+x2++xn <(x1++xnn)for each of n terms+
=n(x1++xnn)
=x1+x2++xn

At this point, we have arrive at the conclusion that the sum of the numbers is less than itself. This is a contradiction, and thus such a list cannot exist. End of proof

Notice in this case, the result we where trying to prove was actually doubly quantified. Formally, we would write

((x1,,xn)n)(xi(x1,,xn))[ximean(x1,,xn)]
for all lists of real numbers, there exists some element in the list for which…

In proving this statement via contradiction, we need to be careful in taking the negation, remembering to swap quantifier types. Our negation is then

((x1,,xn)n)(xi(x1,,xn))[ximean(x1,,xn)]
there exists some list of real numbers for which all elements are…

Notice this is precisely what we assumed during the proof, thus making it logically sound. Proofs by contradiction are also rather useful for claims of irrationality. Although rational numbers are well-defined and easy to work with, the irrationals are less so. Using a proof by contradiction would allows to work with the former class of numbers rather than the latter. An example of such a proof is below.

Theorem 22.

The square root of two (2) is irrational.

Proof by Contradiction.

Above we used the fact that the square root of 2 was irrational, but we never proved it. We do so now using a proof by contradiction. Assume, then, that the square root of two is rational so we can write

2=pq

for p and q in lowest terms. We emphasize this point here, that the fraction is in lowest terms, since ever rational number can be written in this way. At this point we square both sides to remove the square root.

2=p2q2
2q2=p2

Notice that p2 can be expressed as two times an integer so it must even. It follows that p must also be even, since if it was odd so would its square. therefore, for some k, we can write

p=2k
2q2=(2k)2
q2=2k2

This means that q2 is also even and so q must be as well. This however is a contradiction, since we assumed the fraction was in reduced form and therefore both the number and denominator cannot be divisible by two. End of proof

2.5 Conditional Proofs

In the last chapter, we also mentioned two other classes of statements we are likely to prove.

S2 (xS)[P(x)Q(x)]
S3 (xS)[P(x)Q(x)]

Despite being slightly more complex, many of the techniques we see in this section are similar to those we explored earlier. We now tackle the second class of statements of the form P(x)Q(x). A summary of the three techniques we will use throughout the course of the section are below.

Method Approach Statement
Direct assume P(x), show Q(x) P(x)Q(x)
Contrapostive assume ¬Q(x), show ¬P(x) ¬Q(x)¬P(x)
Contradiction assume P(x)¬Q(x), show contradiction ¬[P(x)¬Q(x)]

Once again, the most straightforward approach is to attempt a direct proof. For an example, we can prove the following theorem rather simply, assuming the hypothesis and manipulating it to arrive at the conclusion.

Theorem 23.

The sum and product of two even numbers are both even.

Proof.

We proceed with a direct proof; to help see the hypothesis and conclusion we can write this out as a logical statement.

(n,m)[(n,me)(n+m,nme)]

We’re using shorthand here where n,m is a more compact way of writing n and m. Nonetheless, our assumptions and desired conclusion are now clear, and we can proceed. Assume n,m are even so that for some k1,k2:

{n=2k1m=2k2

We will now express both the product and sum in terms of k1 and k2.

n+m =2k1+2k2
=2(k1+k2)
nm =(2k1)(2k2)
=4k1k2
=2(2k1k2)

Since the sum and product can be expressed as 2 times some integer we can conclude they are both even. For sake of completeness, we introduce a more general theorem. The proof of each of its remaining parts are very similar and left as an exercise. End of proof

Theorem 24.

For integers n,m the following conditions determine whether the sum or product is even or odd.

  1.   i.

    if n is even and m is even, n+m is even and nm is even

  2.   ii.

    if n is even and m is odd, n+m is odd and nm is even

  3.   iii.

    if n is odd and m is even, n+m is odd and nm is even

  4.   iv.

    if n is odd and m is odd, n+m is even and nm is odd

This theorem can be summarized with the following two tables,

      

which are oddly reminiscent of the addition and multiplication tables in a binary field. In fact, if we constructed a field

(F,e,o,+,) with F={e,o},

its operations would be defined in exactly this way, with e as the additive identity and o the multiplicative identity. For now, we leave this connection as an unlikely coincidence, but we will revisit this after tackling modular arithmetic.

P(x)Q(x)¬Q(x)¬P(x)contrapositive

Above is one of the logical equivalences we proved earlier, the right-hand side is called is called the contrapositive or contraposition. This fundamental equivalence tells us that to prove a conditional we can instead prove the contrapositive by assuming the negation of the conclusion and deriving the negation of the hypothesis. The canonical example is the set of statements:

Original. if it rains the streets are wet

Contrapositive. if the streets are dry then it did not rain

This is useful when the negations of the hypothesis and conclusions are better defined or easier to work with. This often occurs when we have statements with “not equal to” and in claims about parity.

Theorem 25.

If the n2 is even, then n must be even as well. Similarly, if n2 is odd, then n must be odd. In other words, n2 always has the same parity as n.

Proof by Contrapositive.

We will show the two parts of this theorem separately, and prove the first with the contrapositive. The second part is left as an exercise.

Part 1. n2 is even

We can write out our statement formally, so we can see its hypothesis and conclusion more easily. This also makes it easy to see the contrapositive.

(n)(n2 is even n is even)
(n)(n is odd n2 is odd)

The contrapositive is substantially easier to prove; assume that n is odd so that for some integer k, n=2k+1. We can then express the square in terms of k.

n2 =(2k+1)2
=4k2+4k+1
=2(2k2+2)+1

Since n2 can also be expressed as one more than an even number, it must also be odd. Therefore, we have proved our original statement. Notice we used this fact in the proof that the square root of two was irrational, though only casually proved it. End of proof

Theorem 26.

For any two non-negative numbers x and y,

if xy then x2y2
Proof.

This result means that different positive numbers must have different squares. This fact was actually essential in many of our proofs, since it means that given we are working with positive numbers, we are allowed to square root both sides of the equation without affecting the equality. In other words, knowing each positive number has a unique squares allows us to work backward and determine a number uniquely from its square. It turns out this property is not unique to the squaring function but rather applies to an entire class of functions; but for now we stick with proving this statement, or rather its contrapositive.

(x,y)(x2=y2x=y)

The proof at this point is rather straightforward, and involves simplifying the hypothesis to reach the conclusion.

x2 =y2
x2 =y2
|x| =|y|
x =y, since x,y0

End of proof

2.6 Conditional Statements and Contradiction

The final proof technique, as outlined earlier, is a proof by contradiction. Remember the only case in which a conditional is false is when the hypothesis is true and the conclusion is false. Therefore its negation is as follows.

¬[P(x)Q(x)]P(x)¬Q(x)

Once again, in a proof by contradiction the idea is to assume the negation and show a contradiction. Since we are interested in proving a conditional, we assume the hypothesis and the negation of the conclusion, showing this yields a contradiction. We use this proof technique a revisit and strengthen a theorem we presented earlier.

Theorem 27.

For any prime number p, the kth root of p is irrational, where k is any integer greater than 1.

Proof.

Notice that this statement can be expressed as a conditional; if p is prime then

pk, given k2.

Once again, proving something is irrational positively is difficult, hence it makes sense to approach this via a proof by contradiction. For sake of contradiction then, we assume the hypothesis and the negation of the conclusion, that p is prime and for k2 the kth root of p is rational. By definition, this means

pk=rs, for r,s

were r and s are co-prime integers. Proving this same theorem in the special case where both k and p were 2, we can guess the proof for this will likely be similar. Therefore we raise both sides of the equation to the power of k and rearrange.

p=(rs)k
psk=rk

Clearly the left-hand side of this equation is divisible by p, so therefore rk must be as well. Since p is prime, this implies that r itself must be divisible by p. Hence, for some n, we can write

r=np so
psk =rk
=nkpk

Dividing both sides by p, we reach the following equality, which immediately yields the contradiction.

sk=nkpk1

Now the right-hand side is divisible by p so this implies so is the left-hand side. But since p is a prime, p dividing sk means p also divides s. But this is a contradiction, since this means that both r and s are divisible by p and therefore the fraction

rs

was not in lowest terms after all. Therefore, we have proved what we intended to. End of proof

Notice what we proved could be interpreted as showing the polynomial with integer coefficients

xkp=0

has no rational roots. Naturally, we maybe interested in a more general questions of when polynomials have rational roots; so we now prove a more general result to this effect and see how this above theorem simply follows as a corollary.

Theorem 28 (Rational Root Theorem).

Let p(x) be an arbitrary polynomial with integer coefficients, which can be written as

p(x)=anxn+an1xn1++a1x+a0.

If p has rational roots, meaning there exists some x such that p(x)=0, then if x is expressed as

pq

in lowest terms, then p must divide a0, the constant term, and q must divide an, the leading coefficient.

Proof.

This could be approached with a proof by contradiction, but instead we provide a direct proof. Assume that there exists a rational solution the p(x) which is expressed as a fraction in lowest terms. Then we can write the evaluation of the polynomial at x which should be equal to 0, by definition.

p(pq) =0
anpnqn+an1pn1qn1++a1pq+a0 =0

Once again, similar to the insight we used in the previous few proofs, we want to clear the fractions so we can once again work with integers. This can easily be done by multiplying both sides of the equation with qn, yielding

anpn+an1pn1q+an2pn2q2++a1pqn1+a0qn=0.

We then rearrange this polynomial in two ways by isolating all the p and all the q terms, so we can factor.

anpn+an1pn1q+an2pn2q2++a1pqn1 =a0qn and
an1pn1q+an2pn2q2++a1pqn1+a0qn =anpn

To reach the first equation we moved the last term to the other side, and for the second equation, we moved the leading coefficient. Notice that both left-hand sides now have a common factor so we can simplify even further.

p(anpn1+an1pn2q++a1qn1) =a0qn and
q(an1pn1++a1pqn2+a0qn1) =anpn

In the first equation, we can see that left-hand side is divisible by p, therefore so must the right-hand side. Notice that since

pq

was in lowest terms, qn clearly is not divisible by p, hence this means that p divides a0. By symmetry, the second equation shows that q must divide an. This is exactly what we needed to show, which is that if p(x) has a rational root, then

{p divides a0q divides an.

End of proof

We can now offer a slicker proof to our original statement, using the theorem we just proved. Remember we wanted to prove

pk

where k is any integer greater than 1 and p is a prime number. Above we used a long proof by contradiction to show this, but now we offer a simpler proof.

Proof.

Consider the following polynomial, for which the kth root of p is a solution.

xkp=0

It is sufficient to show that the polynomial has no rational roots, because that implies the root of p must be irrational. Notice that this is a polynomial with rational roots, hence we can apply the rational roots theorem to conclude that if

x=mn for m,n and n0

is a rational root of the polynomial then n must divide 1, the leading coefficient, and m must divide p, the constant term. There the set of all possible rational roots to this polynomial is

n{1,1},m{p,1,1,p}
x{p,1,1,p}

At this point, we simply try all possibilities to conclude no such rational x can exist.

1kp 0
(1)kp 0
pkp 0
(p)kp 0

Notice this is technically a proof by contradiction; we assume that a rational root exists, then use the rational roots theorem to that this root must then be one of

{p,1,1,p}.

Showing that none of these are actually roots results in a contradiction, proving our original statement. End of proof

In some sense, the rational roots theorem we proved was a more general result than showing

f(x)=xkp

has no rational roots. Proving the harder general case allowed us to revisit our original problem and more easily show it was true. In this case it simply amounted to evaluating the polynomial at four points.

We will revisit this technique of proving more general results and then looking back to apply these to more specific cases later. In the next chapter, we will see cases where this may be necessary; but finding more general principles is a virtue in itself. It turns out that the rational roots theorem, once again, is a specific case of Gauss’ Lemma, but this remains out of scope of this set of notes.

Note 3 Mathematical Induction

3.1 Sum and Product Notation

In the last note, we ended up working with polynomials with an arbitrary number of terms, relying on the following notation.

p(x)=anxn+an1xn1++a1x+a0n1 terms

While this was perfectly rigorous, it relies on the reader being able to notice the pattern among terms in the summation. It may be less ambiguous to explicitly state this pattern, for which we introduce the following notation.

i=abf(i) =f(a)+f(a+1)++f(b1)+f(b)
i=abf(i) =f(a)f(a+1)f(b1)f(b)

Using this notation, we would express a generic n degree polynomial with coefficients {ana0} as the following.

p(x) =anxn+an1xn1++a1x+a0
=i=0naixi

Notice that both of these are common computational processes in computer science which can be expressed as with the following pseudo-code.

def sum(f, a, b):
sum = 0
for i in [a, ..., b]:
sum = sum + f(i)
return sum
def prod(f, a, b):
prod = 1
for i in [a, ..., b]:
prod = prod * f(i)
return prod

The following properties of summation and product naturally extend from its definition, and should intuitively be true, though writing out a proof should not be too difficult.

i=abcf(i) =ci=abf(n)
i=ab(f(i)+c) =(ba+1)number of termsc+i=abf(n)

There is an analogous equivalence for a product, though this time the number we we pull out get raised to the power of the number of terms.

i=abcf(i) =cba+1i=abf(n)

These summations (and products) appear frequently in mathematics, especially in calculus where the number of terms goes to infinity, so over the course of this chapter, we help develop tools to help simplify then more useful formula. As an example, consider the sum of the first n natural numbers.

Theorem 29.

For all natural numbers n, the following equation holds.

i=1ni =1+2++n
=n(n+1)2
Proof.

We offer a direct proof of this fact using a trick; first denote the sum itself as S, so we have the expression

S=1+2+3++(n1)+n.

Now we sum the sequence the with itself in reverse order, in essence adding two equations.

S=1+2++(n1)+nS=n+(n1)++2+12S=(n+1)+(n+1)++(n+1)+(n+1)

Notice in the bottom sum, each of the n terms is (n+1). Therefore we can simplify the expression even further to get

2S=n(n+1) or (3.1)
S=n(n+1)2 (3.2)

This is exactly what we need to show, hence we have derived the formula. End of proof

Notice the simplified expression is more useful to us, since it more clearly expresses the relationship between n and the sum. More specifically, from this formula we can tell that the sum is actually a well-defined polynomial in n, a fact which is obfuscated by the summation. However in the above proof we used a trick, and it is not clear how we would approach proving formulae like this more generally.

3.2 The Naturals Numbers

While most of our proofs until how have been focused on the real numbers and integers, we now turn our attention to the natural numbers. To develop another fundamental proof technique for working with these numbers, we need a more precise definition of the natural numbers, which we currently refer to with the set

={1,2,3,}.

To avoid the ambiguity of the ellipses, we can define the natural numbers () as the intersection of all subsets A of the real numbers with the following properties

  1.   i.

    1A

  2.   ii.

    if nA, then s(n)A where s is the successor function which represents s(n)=n+1.

Notice this is consistent with our intuitive definition of the natural numbers. If A obeys these two definitions, then we know 1 must be in A, along with all number in the sequence 1,2,3,

{1,2,3,}A

Furthermore, since this set itself satisfies these conditions, we know that it is an example of one such set A. Therefore the intersection of all such sets is indeed what we expected,

={1,2,3,}.

Taking the intersection of all such sets A is essentially equivalent to taking the “smallest” set which satisfies these properties; otherwise, notice that the real numbers themselves also satisfy both properties. With this definition, we have a more systematic way to understand the natural numbers which yields a new proof technique.

Theorem 30 (Principle of Induction).

Let P(n) be a predicate defined for natural numbers n. If the following two statements hold,

  1.   i.

    P(1) is true

  2.   ii.

    if P(k) is true, then P(k+1) is true

then P(n) is true for all natural numbers n.

Proof.

We consider the set A for which this predicate is true and show that this is equal to the set of natural numbers.

A={n:P(n) is true}

First notice that since P(n) is defined for natural numbers only, hence immediately we know A. Now we know that

  1.   i.

    1A because P(1) is true, and

  2.   ii.

    if P(k) is true, then P(k+1) is true, so if kA then (k+1)A.

Therefore we have shown that A is one of the sets used in the above definition. Since is the intersection of all possible such sets, we know that it must be contained in S so that

A

Since we have showed A is both a subset and a superset of , this means A=. Since A represents the set of number for which P(n) is true, this means P(n) is true for all natural numbers n. End of proof

This means that we now have a proof technique which can be applied to proving statements which are true for over the natural numbers, namely by showing that the statements holds for n=1 and that for all k, if P(k) is true then it implies P(k+1). This can be precisely expressed with the following logical equivalence, which conveys the same idea.

(n)P(n)P(1)(k)[P(k)P(k+1)]

3.3 Inductive Proofs

Through the above manipulation, we have discovered a fundamentally different way to prove universally quantified statements, specifically those which are true over the natural numbers, by exploiting the definition of the natural numbers. The logical equivalence

(n)P(n)P(1)(k)[P(k)P(k+1)],

shows exactly what is required, but we can unwrap this. Proving some statement (n)P(n) using this tool is called a proof by induction. Since the right-hand side of the expression is a conjunction, we need to show both sides are true, meaning a proof by induction has two separate parts.

  1.   i.

    Base Case. Show P(1) is true.

  2.   ii.

    Induction Step. Assume P(k) is true for any natural number k, and use this to show P(k+1) must be true.

The validity of this proof technique follows directly from the principle of mathematical induction. We can use this the approach the problem of proving equalities like the following systematically.

i=1ni=n(n+1)2

First, let us define the statement we are trying to prove more explicitly. If we denote the above expression with the predicate P(n), we want to show that if the free variable n is a natural number, then the statement is true, or

(n)P(n).
Proof.

Since we have a statement we wish to prove the over the natural numbers, we use a proof by induction. This involves showing the base case and then the induction step.

  1.   i.

    Base Case. We want to show P(1) is true, which is easy to show.

    i=11i=1=1(1+1)2
  2.   ii.

    Induction Step. Assume P(k) is true for some natural number k so that we have

    1+2++k=k(k+1)2.

    We want to show that the statement P(k+1) must follow; which written out corresponds to

    1+2++(k+1)=(k+1)(k+2)2.

    Notice the similarity in structure on the right-hand side of both equations, which suggests a obvious substitution. We prove P(k+1) by starting with the right-hand side and deriving the left-hand side.

    1+2++(k+1) =1+2++kleft-hand side of P(k)+(k+1)
    =k(k+1)2+(k+1)
    =k(k+1)2+2(k+1)2
    =(k+1)(k+2)2

    We are allowed to make the substitution in the second line, because we assumed this about the sum of the first k natural numbers. We use this assumption to extend what we know to conclude the same thing about the first (k+1) natural numbers. Therefore, this concludes the proof; we have shown that P(n) is true for all natural numbers n.

End of proof

At this point, we should point out that never do we assume what we are trying to prove. Instead, in the induction step we assume P(k) is true for one particular value of k and show P(k+1). This assumption, that P(k) is true, referred to as the induction hypothesis in the inductive step. Our first inductive proof was rather verbose, but we now offer a more streamlined example to prove something similar.

Theorem 31.

For all non-negative integers n, the summation formulas below have the following closed form solutions.

  1.   i.
    i=1ki2 =k(k+1)(2k+1)6 (3.3)
  2.   ii.
    i=1ki2 =k2(k+1)24 (3.4)
Proof.

We only prove the first statement by induction; the proof for the second statement is very similar.

  1.   i.

    Base Case. We want to confirm the statement is true for n=1.

    i=11i2=1=1(1+1)(2+1)6
  2.   ii.

    Induction Step. Assume that the statement is true for some arbitrary number k, so that

    i=1ki2 =k(k+1)(2k+1)6.

    We want to show that the statement is also true for (k+1). To show this, we write out the sum of the first (k+1) squares and then use the induction to simplify.

    i=1k+1i =12+22++k2induction hypothesis+(k+1)2
    =k(k+1)(2k+1)6+(k+1)2
    =(k+1)(k+2)(2k+3)6

    This is exactly what we needed to show, hence this concludes the proof by induction.

End of proof

3.4 Sequences and Recurrences

In some sense, induction treats the natural numbers as an ordered list of elements, starting with 1 and increasing in the order we write them out.

={1,2,3,}

A natural generalization of this idea is to consider ordered lists of items with arbitrary elements, a notion we denote a sequence, which can be represented as

a=a1,a2,a3,

The subscript represents the term number, which gives the position of a particular element, or term, in the sequence. More formally a sequence is defined as a function from the natural numbers, which maps term numbers to terms.

n1234aa1a2a3a4

Since the natural numbers are ordered, constructing this function ties the order of the natural numbers to those of the terms in the sequence, imposing an order. Two kinds of sequences that appear frequently in mathematics are arithmetic sequences and geometric sequences. The former is a sequence such that that the difference between consecutive terms is constant while the later is a sequence in which consecutive terms have the same ratio.

Arithmetic. a,a+d,a+2d,a+3d,a+4d,
Geometric. a,ar,ar2,ar3,ar4,

In the above sequences, we can specify an arithmetic sequence completely by its first term a and the common difference d. Meanwhile, for a geometric sequence, the first term a is needed along with the common ratio r. By inspection, we can see that the nth term of an arithmetic sequence is

an=a+(n1)d,

while for a geometric sequence, we simply multiply the ratio to the first term (n1) times, giving the formula

bn=arn1.

Given the similarity in structure of sequences to the natural numbers, induction provides a natural way to prove things about these sequences. Consider the following theorem about the sum of first n terms of arithmetic and geometric sequences.

Theorem 32.

Consider the arithmetic sequence a and the geometric sequence b, with the following terms

ai =a+(i1)d
bi =ari1

The sum of the first n terms of each sequence have the following formula, given r1.

i=1nai =a+(a+d)+(a+2d)++(a+(n1)d) (3.5)
=n2(2a+(n1)d) (3.6)
i=1nbi =a+ar+ar2++arn1 (3.7)
=arn1r1 (3.8)
Proof.

We prove both statements in order, using induction, starting with the sum of an arithmetic sequence.

  1.   i.

    Base Case. This is for n=1, because we are simply summing a sequence with 1 term.

    i=1nai=a=12(2a+(11)d)
  2.   ii.

    Induction Step. Assume this statement is true for the sum of the first k terms of an arithmetic sequence. We show that also holds for the first (k+1) terms.

    i=1k+1ai =ak+1+i=1kai
    =ak+1+k2(2a+(k1)d)
    =(a+kd)+k2(2a+(k1)d)
    =k2(2ak+2d)+k2(2a+(k1)d)
    =k2[2ak+1k+(k+1)d]
    =k+12(2a+kd)

This proves the formula for the sum for the first n terms of an arithmetic sequence; we now approach a similar proof for the sum of the first n terms of a geometric sequence.

  1.   i.

    Base Case. Once again, this is trivial for n=1 since that involves summing a one-term sequence.

    i=1nbi=a=ar11r1
  2.   ii.

    Induction Step. Assume this statement is true for the sum of the first k terms. We show that also holds for the first (k+1) terms.

    i=1k+1bi =bk+1+i=1kai
    =bk+1+ark1r1
    =ark+ark1r1
    =ark(r1)r1+ark1r1
    =ark(r1)+rk1r1
    =ark+11r1

End of proof

Notice that the algebra involved in these proofs were much messier that for those we dealt with before, yet we forged ahead. One benefit of induction is that that if the statement is true, then the algebra will almost certainly work out, so we know how exactly to simply the expression. Now consider a different kind of sequence.

Theorem 33.

If we define a sequence a recursively, so that

an={1:n=1an1+n:n>1

then, for all natural numbers n, this sequence is equivalent to the polynomial

an=n(n+1)2. (3.9)
Proof.

We provide an informal proof of this statement, based on our previous result; a more thorough proof would simplify formalize this with induction. Consider an for some arbitrary n, we begin simplifying the expression using the second rule.

an =an1+n
=an2+(n1)+n
=an3+(n2)+(n1)+n
=a1+2++(n2)+(n1)+n
=1+2++(n1)+n
=n(n+1)2

Notice there is however, one additional case we need to check now, which is when n=1, but clearly the formula trivially holds here. End of proof

The sequence we encountered above is rather different from those which we previously encountered. Such a sequence is by a recurrence relation is characterized by two features:

  1.   i.

    Base cases. Explicit values of the sequence for a small number of terms. In this case, by definition, a1=1.

  2.   ii.

    Recursive definition. A relationship between the value of the sequence an to other values, typically smaller. In the above example, for non-base cases, we were given

    an=an1+n.

While this type of recurrence equation describes a perfectly well-defined sequence, a more explicit definition of each term is often desirable. This explicit formula is called closed-form solution of the recurrence, which usually relates an directly to n, rather than other terms. However the recursive definition is sometimess cleaner; we can convey the essence of an arithmetic and geometric sequence with the following recurrence relations.

Arithmetic. an={a:n=1an1+d:n>1
Geometric. bn={a:n=1bn1r:n>1

The closed forms of these are the term expression we began with, which are

an =a+(n1)d
bn =arn1

3.5 Extending Induction

So far the power of induction seems rather limited since it applies to only statements of a very particular form, namely statements which are true over the natural numbers. However it turns out that induction can can applied to prove statements which are true over an entire class of sets, including for all non-negative integers, or all even numbers, or all odd numbers, which is explained with the following theorem.

Theorem 34.

Suppose we want to prove some predicate P(n) for all nA, where A is some set which can be expressed as

A={a1,a2,a3,}.

A more precise condition is that A be countable, but we return to this notion in a later chapter. Then it is sufficient to show

  1.   i.

    Base Case. P(a1) is true.

  2.   ii.

    Induction Step. if P(ak) is true for any natural number k, then P(ak+1) must be true

Proof.

We prove this by reduction, showing that this technique is exactly identical to induction. First consider defining the following new predicate which is related to P(n).

Q(n)P(an)

First notice that the elements of A are indexed by n, so to prove P(an) for all anA is equivalent to proving Q(n) for all n. At this point, we notice that we can apply induction on n to prove Q(n) for all natural numbers, which entails showing the following two things.

  1.   i.

    Base Case. Q(1) is true.

  2.   ii.

    Induction Step. if Q(k) is true for any natural number k, then Q(k+1) must be true

Now if we show these both hold, then this proves Q(n) over the natural numbers, which is equivalent to showing P(a) is true for all a in A. All that is left is to translate these conditions back into terms of P.

  1.   i.

    Base Case. P(a1) is true.

  2.   ii.

    Induction Step. if P(ak) is true for any natural number k, then P(ak+1) must be true

End of proof

This shows that to prove some predicate over a set which can be expressed as an ordered list, we can: (1) show that it is true for the first element, and (2) that if it is true for some element, it is true for the next element.

P(a1) is trueP(a1)P(a2)P(a3)P(a4)P(an) is true for all nA

This suggests the common “domino” analogy for understanding induction. To prove that that a line of dominoes have fallen down, it is sufficient to show that the first one had fallen down and that if a domino has been knocked down, it causes the next one to fall.

With this theorem, we can introduce some common patterns used in induction proofs to prove statements over different sets, and armed with these we can revisit and generalize some of our past results.

Set Base Case Induction Step
={1,2,3,} P(1) P(k)P(k+1)
0={0,1,2,} P(0) P(k)P(k+1)
{0,2,4,} P(2) P(k)P(k+2)
{1,3,5,} P(1) P(k)P(k+2)
{20,21,22,} P(1) P(k)P(2k)
{a,a+d,a+2d,} P(a) P(k)P(k+d)

So, for example, to prove a statement is true for all odd natural number, we first show that it is true for 1, and that if it true for some odd integer k it is also true for k+2. This is a direct application of the theorem we proved where we define

ak=2k1, so A={1,3,5,}

Now, with this technique, we can extend many of the inequalities we showed previously, and solve a more general recurrence.

Theorem 35 (Generalized Triangle Inequality).

For any number of real numbers n2, the absolute value of their sum is bounded by the sum of their absolute values.

|a1+a2++an||a1|+|a2|++|an|
Proof.

We apply a proof by induction on n over the set of natural numbers greater than 1. Our base case should then be n=2.

  1.   i.

    Base Case. In the case where n=2, this reduces into the triangle inequality we proved previously.

    |a1+a2||a1|+|a2|
  2.   ii.

    Induction Step. Assume that this generalized triangle inequality holds for k numbers, so that

    |a1+a2++ak||a1|+|a2|++|ak|.

    We will now show that this also holds for k+1 numbers; consider the absolute value of their sum.

    |a1++ak+ak+1| =|(a1++ak)+ak+1|
    |a1++ak|+|ak+1|
    |a1|+|a2|++|ak|+|ak+1|

    In the last step, we make the substitution from the induction hypothesis, which complete the inductive step and thus the proof.

End of proof

We can now approach more general solutions recurrence relations as well. A sequence of the form

xn={x1:n=1axn1+b:n>1

is called a first-order linear recurrence relation, which is a well-defined sequence, given the base case xn is defined. The “first-order” refers to the fact that xn refers only to one previous term, and the “linear” comes from the fact that xn is a linear function of xn1, plus some constant.

Theorem 36 (First-Order Linear Recurrences).

The solution to a first-order linear recurrence relation defined as

xn={x1:n=1axn1+b:n>1

has closed-form solution

xn=an1x1+ban11a1, a1
xn=x1+(n1)b, a=1
Proof.

Unsurprisingly, we are going to prove this by induction in a long and messy proof. However, we actually have two cases to consider, one where a is 1 and one where it is not. We will first prove the more general of the two results, which is when a1.

  1.   i.

    Base Case. This is easy to check, since it only involves the first erm, which we are given.

    x1=a0x1+ba01a1
  2.   ii.

    Induction Step. Assume that the closed form is correct for the kth term so that

    xk=ak1x1+bak11a1

    Now we show the formula also holds for the next term, namely for the (k+1)th term. Do this this, we apply the recursive definition of this term.

    xk+1 =axk+b
    =a(ak1x1+bak11a1)+b
    =akx1+abak11a1+b
    =akx1+b(aak11a1+1)
    =akx1+b(akaa1+a1a1)
    =akx1+bak1a1

Now, unfortunately, we still have to deal with the case of a=1 separately. In this case, however, this recurrence is easier to work with.

xn={x1:n=1xn1+b:n>1
  1.   i.

    Base Case. Once again, it is easy to verify the closed-form solution works for just one term.

    x1=x1+(11)b
  2.   ii.

    Induction Step. Assume the formula holds for the kth term, and show that it also hold for the term after that.

    xk+1 =xk+d
    =x1+(k1)d+d
    =x1+kd

End of proof

Notice that this immediately proves the equivalence of the recurrence relation definition of arithmetic and geometric sequences to the one we originally used. If we take a=1, then we get

xn={x1:n=1xn1+b:n>1
then xn=x1+(n1)d

which refers to arithmetic sequences, while if we take b=0 then we find

xn={x1:n=1axn1:n>1
then xn=x1an1

which corresponds exactly to geometric sequences. Of course, it may seem odd that we need a separate case at all to account for when a=1, but it turns out, in some sense, the general formula still holds. Rather than being thrown off guard by noticing the (a1) in the denominator, we can instead take the limit as a tends to 1.

lima1xn =lima1(an1x1+ban11a1)
=x1lima1an1+blima1an11a1
=x1+(n1)b

The first limit should be easy to see; the second one is the difference quotient representing derivative of the function

f(a)=an1

at a=1. This is out of the scope of this chapter, but it goes to show that this closed-form solution is not as disjoint as it may appear to be.

3.6 Strong Induction

The power of induction is that it allow use to make stronger assumptions in the process of proving our statement. Instead of proving the statement directly for any arbitrary element n, in the induction step, we assume P(k) is true for some natural number k and simply have to prove P(k+1) to be true. This advantage comes from knowing something about the structure of the natural numbers themselves.

However, it turns out that we are allowed to assume something even stronger during the induction step.

Theorem 37 (Principle of Strong Induction).

Let P(n) be a predicate defined for natural numbers n. If the following two statements hold,

  1.   i.

    P(1) is true

  2.   ii.

    if P(1),P(2),,P(k) are true, then P(k+1) is true

then P(n) is true for all natural numbers n.

Proof.

We prove this by showing that this is essential equivalent to the principle of induction. First, we define the following predicate

Q(n)P(1)P(2)P(n)

so that Q(n) is true if and only if P(i) is true for all i up to n. Notice showing P(n) is true for all n is equivalent to showing Q(n) is true for all natural numbers. Remember the principle of mathematical induction tell us to show this predicate is true, all we need is to show is the following.

  1.   i.

    Q(1) is true

  2.   ii.

    if Q(k) is true, then Q(k+1) is true

Therefore, if we show these two statements hold, then P(n) must be true as well. First consider the first statement; by definition, Q(1) is the same as P(1). Next, we consider the implication, which we express in terms of P.

Q(k) Q(k+1)
P(1)P(k) P(1)P(k)redundantP(k+1)
P(1)P(k) P(k+1)

Therefore, we have shown to prove Q(n) is true far all natural numbers n, and thus P(n) is true, we can show

  1.   i.

    P(1) is true, and

  2.   ii.

    if P(1),P(2),,P(k) are true, then P(k+1) is true.

End of proof

This allows us to augment the amount of information we get for “free” during the induction step, which allows us to prove statements that we otherwise might not be able to. While this unlocks the door to more complicated recurrence relations, we can also re-visit a fact about the natural numbers we have been relying on without proof.

Theorem 38 (Prime Factorization).

Any natural number n can be expressed as a, possibly empty, product of primes; meaning

n=ipiαi

for some set of primes {pi} and corresponding powers {αi}. By convention, we say that the empty product is 1. In other words, every natural number has a prime factorization.

Proof.

We can prove this by strong induction on n, with a base case of n=1.

  1.   i.

    Base case. As explained, the empty product is taken to be 1 so 1 is trivially the product of primes, namely none at all.

  2.   ii.

    Induction Step. Assume that this holds for all numbers less than or equal to some k. We now prove this for (k+1) by casework. Now by definition, every number greater than or equal to 2 is either prime or composite. If (k+1), then we are immediately done since we take

    k+1=(k+1)1prime

    Otherwise, it it composite, which means by definition, it can be written as the product of two numbers less than itself, since it has a divisor that is not equal to 1 or itself. Therefore,

    k+1=ab, for some a,bk.

    By our induction hypothesis, we know both a and b can be expressed as a product of primes. Since (k+1) is the product of these, it can clearly also be expressed as a product of primes.

End of proof

It turns out, as you probably already know, that something even stronger is true–that every natural number has a unique prime factorization. This is a theorem called the fundamental theorem of arithmetic, which is currently outside of our grasp, but which we will prove later. As we will see, what we proved just now is an important building block in the proof of the fundamental theorem. A proof that follows a similar pattern, and is also foundational to computers today, is the following.

Theorem 39.

Every non-negative integer can be expressed in binary so that it is a sum of distinct powers of 2. In other words, there exists a subset of the non-negative integer B0 such that

n=iB2i

Once again, we use the convention that the empty sum is 0, which allows us to include 0 in this theorem.

Proof.

We will once again use strong induction.

  1.   i.

    Base case. Once again, we trivially take B={} represent 0. The reason this convention makes sense is because we do the same in the decimal numbers system; while we can express

    15=1×101+5×100,

    we can only express 0 by letting all the coefficients of powers of ten be 0. The same reasoning motivates our choice here.

  2.   ii.

    Induction step. Assume that all non-negative numbers less than or equal to k can be expressed as the sum of distinct powers of 2. We will now show the same holds for (k+1). Now we have two cases to consider, one where this number is even and one where this is odd. In the former we can write

    k+1 =2m,m0

    Since m must be strictly less than (k+1), from the induction hypothesis, we know that it can be expressed as a sum of distinct powers of two. Clearly, when we multiply each of these by two (raising their exponents by one), we find a new representation. The second case to consider is when

    k+1=k+20

    is odd. This means that k is even and by the induction hypothesis can be represented as a sum of unique powers of 2. Since it is even, this sum cannot include 20, which means we simply add this to find the desired representation.

End of proof

Having shown some unique applications of this proof technique, we return, as promised, to more complicated recurrence relations. In particular, we can now prove closed form representations for recurrences beyond first order, in which each term is a function of more than just the previous term. In the extreme case, we could have a sequence for which each term is defined in terms of all its predecessors.

an={a1:n=1f(an1,an2,,a1):n>1

However, more often, we run into cases where each term is a function of two, three or some other finite number of terms. A famous such sequence is the Fibonacci sequence,

Fn={1:n=11:n=2Fn1+Fn2:n>2

in which the first two terms are 1 and each term after this is the sum of the previous two. The first few terms of this sequence are

F=1,1,2,3,5,8,13,21,35,56,91,

It turns out that there indeed is a closed-form solution to the Fibonacci sequence, albeit one which seems far from intuitive, which we will show below

Theorem 40.

The closed form of the Fibonacci sequence is given by the exact formula,

Fn=15(φnψn)

where φ and ψ are the two roots of the quadratic x2x1, or explicitly

φ=1+52,ψ=152.

The number φ appears in other places and is often referred to as the golden ratio.

Proof.

It may not immediately be obvious that the closed firm solution holds; it seems rather miraculous that this formula, peppered with radicals, should somehow yield whole numbers, and additionally correspond to the Fibonacci sequence. Yet, we do just that.

  1.   i.

    Base Cases. The recursive part of the Fibonacci sequence definition only applies to terms past the fist two, hence we will have to show the formula hold manually; this is will be our base cases.

    F1 =15(φ1ψ1)
    =15(1+52152)
    =15252
    =1

    To show the formula also holds for the seconds term, we use a trick, which is noting that since ϕ and ψ are roots of x2x1,

    φ2φ1=0, so φ2=φ+1 and
    ψ2ψ1=0, so ψ2=ψ+1.

    We can use this to simply the exponents in the canonical closed form representation of the second term.

    F2 =15(φ2ψ2)
    =15(φ+1ψ1)
    =15(φψ)
    =F1
    =1

    This matches with the first few terms of the first two terms of the Fibonacci sequence, which means the base cases hold.

  2.   ii.

    Induction Step. Assume that the formula holds for all terms with number less than or equal to k, we want to show that it will also hold for the next term. Specifically, we will use,

    Fk=15(φkψk)
    Fk115(φk1ψk1).

    We can now plug this into the recursive definition of Fk+1 to show that the closed form also holds for the next term.

    Fk+1 =Fk+Fk1
    =15(φkψk)+15(φk1ψk1)
    =15(φk+φk1ψkψk1)
    =15(φk1(φ+1)ψk1(ψ+1))

    Now we can use the observation we used earlier, in reverse, to swiftly finish the proof. Namely x+1 equals x2 when x is φ or ψ. Making the substitution, we immediately get what we want.

    Fk+1 =15(φk1(φ+1)ψk1(ψ+1))
    =15(φk1φ2ψk1ψ2)
    =15(φk+1ψk+1)

This is what we needed to show to prove our claim by induction. End of proof

3.7 The Well-Ordering Principle

We can now show another equivalent fact about the natural numbers which is inextricably linked to induction. This is called the well-ordering principle, which states that every non-empty subset of the natural numbers has a smallest element.

Theorem 41 (Well-ordering Principle).

Let A be a non-empty subset of the natural numbers, A. A, then, must have a smallest element.

Proof.

We instead prove the contrapositive of this statement, that if A is a subset of the natural numbers which has no smallest element, then it is the empty set. Now consider the relative complement of A in the natural numbers, which we denote

B=A

Notice 1 is an element of B, since 1 cannot be an element of A, otherwise this would be its least element. Now assume

1,2,,kB,

which means that none of these numbers are elements of A. This implies that (k+1) cannot be in A otherwise it would be its smallest element. Therefore we have shown that (k+1) is in B. Therefore, we have

  1.   i.

    1B, and

  2.   ii.

    if 1,2,,kB, then (k+1)B.

Hence from the principle of strong induction, we know that that B contains the natural numbers. However, since B is the result of subtracting A from the natural numbers, we also know that B is a subset of , which together implies

B=.
A =B
=ϕ

This completes the proof of the contrapositive, which means that if A has no smallest element, then it must be the empty set. Equivalently, then, every non-empty subset of the natural numbers must have a smallest element. End of proof

In some sense, the well-ordering principle is the opposite of induction, which makes it a powerful tool for proof by contradiction over the natural numbers. If a predicate P(n) about all natural numbers is false then the set

{n:P(n) is false}

will be non-empty subset of the natural numbers. It must be non-empty because otherwise this means the predicate is true for all natural numbers. By the well-ordering principle, this set must have a smallest element. A common pattern in a proof by contradiction is the following:

  1.   i.

    Assume that the proposition is false;

  2.   ii.

    By the well-ordering principle, there must be a smallest natural number for which the predicate is false, denote this k;

  3.   iii.

    Show that we can somehow derive a smaller number for which this proposition is also false, contradicting the original assumption that k was the smallest such couter-example.