Fermat's Last Theorem: Continued Fractions: The Approximation Algorithm

Today's blog continues the discussion on Pell's Equation. Before jumping into the solution to Pell's Equation, it is necessary to review some fundamental properties of Continued Fractions. I will later show how these properties of continued fractions can be used to solve Pell's Equation. For those who are not familiar with Continued Fractions, start here.

One of the most important ideas in dealing with Continued Fractions is the Continued Fraction Approximation Algorithm. This is an algorithm that can be used to convert a real number into the sequence of integers which make up its continued fraction.
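As a minimal sketch of that conversion, assuming the usual floor-and-reciprocal procedure (the same a₀ = floor(α), α₁ = 1/(α - a₀) steps that appear in Lemma 3 below), the following Python works on exact fractions. The function name continued_fraction_terms and the max_terms cutoff are my own choices for illustration and do not come from Stark's text.

```python
from fractions import Fraction
from math import floor

def continued_fraction_terms(x, max_terms=20):
    """Return the integers a_0, a_1, ... of the continued fraction of x."""
    terms = []
    for _ in range(max_terms):
        a = floor(x)            # integer part of the current value
        terms.append(a)
        remainder = x - a       # fractional part, in [0, 1)
        if remainder == 0:      # nothing left over: the expansion stops
            break
        x = 1 / remainder       # next value to expand
    return terms

print(continued_fraction_terms(Fraction(415, 93)))  # [4, 2, 6, 7]
```

Using Fraction keeps the arithmetic exact; with floating-point input the later terms would eventually be polluted by rounding error.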

The algorithm itself is built on two sequences, which I will label p_n and q_n. Later, I will show that any finite continued fraction which represents an approximation of a real number is equal to the ratio p_n/q_n of these two sequences.

All of the ideas presented in today's blog are based on Harold M. Stark's Introduction to Number Theory.

Lemma 1: If [a₀, a₁, ... a_n, α] is a continued fraction where all values a_i are integers and α is any real number, then there exist sequences p_n and q_n such that, for n ≥ 1:
[a₀, a₁, ... a_n, α ] = (α * p_n + p_n-1)/(α * q_n + q_n-1)

(1) Let [a₀,a₁, ...a_n, α] be a continued fraction such that:
[a₀,a₁,...a_n, α ] = a₀ + 1/[a₁ + 1/(a₂ + ... + 1/α)]

(2) Let p_n be a sequence based on a₀ ... a_n such that:
p₀ = a₀
p₁ = a₀*a₁ + 1
p_n+2 = (p_n+1)(a_n+2) + p_n

For example, p₂ = (p₁)(a₂) + p₀ = (a₀a₁ + 1)(a₂) + a₀

(3) Let q_n be a sequence based on a₀ ... a_n such that:
q₀ = 1
q₁ = a₁
q_n+2 = (q_n+1)(a_n+2) + q_n

For example, q₂ = (q₁)(a₂) + q₀ = (a₁)(a₂) + 1
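The two recurrences in steps #2 and #3 translate directly into a loop. This is only a sketch: the function name convergents and the sample terms [4, 2, 6, 7] are assumptions chosen for illustration.

```python
def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]   # p_0 and p_1
    q = [1, a[1]]                 # q_0 and q_1
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])   # p_k = p_(k-1) * a_k + p_(k-2)
        q.append(q[k - 1] * a[k] + q[k - 2])   # q_k = q_(k-1) * a_k + q_(k-2)
    return p, q

p, q = convergents([4, 2, 6, 7])
print(p)  # [4, 9, 58, 415]
print(q)  # [1, 2, 13, 93]
```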

(4) So, consider the case where n = 1, then we have:
[a₀, a₁, α] = a₀ + 1/[(a₁ + 1/α)] = a₀ + 1/[(a₁α + 1)/α] = a₀ + α/(a₁α + 1) =
[a₀(a₁α + 1) + α]/(a₁α + 1) = [α(a₀a₁ + 1) + a₀]/(a₁α + 1)

(5) Now, since p₁ = a₀a₁ + 1, p₀ = a₀, q₁ = a₁, and q₀ = 1, applying the definition given above for p_n and q_n gives us:
[α(a₀a₁ + 1) + a₀]/(a₁α + 1) = (α*p₁ + p₀)/(α*q₁ + q₀)

(6) So that, for n=1, we have:
[a₀, a₁, α] = (α*p₁ + p₀)/(α*q₁ + q₀)

(7) Let's assume that this is true up to some value n, for any real number α, so that:
[a₀, a₁, ..., a_n, α] = (α*p_n + p_n-1)/(α*q_n + q_n-1)

(8) Now, we know that: [a₀, a₁, ..., a_n, a_n+1, α] = [a₀, a₁, ..., a_n, a_n+1 + 1/α] by the definition of continued fractions.

(9) So by our assumption in #7:
[a₀, a₁, ..., a_n, a_n+1 + 1/α] = [(a_n+1 + 1/α)*p_n + p_n-1]/[(a_n+1 + 1/α)*q_n + q_n-1]=
(a_n+1*p_n + p_n/α + p_n-1)/(a_n+1*q_n + q_n/α + q_n-1)

(10) Now, multiplying the above by α/α gives us:
(a_n+1*p_n + p_n/α + p_n-1)/(a_n+1*q_n + q_n/α + q_n-1) =
=(α*a_n+1*p_n + p_n + α*p_n-1)/(α*a_n+1*q_n + q_n + α*q_n-1) =
=[α(a_n+1p_n + p_n-1) + p_n]/[α(a_n+1q_n + q_n-1) + q_n]

(11) And finally, applying the recurrences p_n+1 = a_n+1p_n + p_n-1 and q_n+1 = a_n+1q_n + q_n-1 gives us:
[α(a_n+1p_n + p_n-1) + p_n]/[α(a_n+1q_n + q_n-1) + q_n] = (αp_n+1 + p_n)/(αq_n+1 + q_n)

(12) Applying the principle of induction, we are done.

QED
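Lemma 1 can be spot-checked numerically. The sketch below evaluates [a₀, a₁, a₂, α] from the inside out and compares it with (α*p₂ + p₁)/(α*q₂ + q₁). The helper names, the terms [4, 2, 6], and α = 22/7 are arbitrary choices for illustration; passing one example is of course not a proof.

```python
from fractions import Fraction

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

def evaluate(terms):
    """Evaluate [t_0, t_1, ..., t_m] exactly, starting from the innermost term."""
    value = Fraction(terms[-1])
    for t in reversed(terms[:-1]):
        value = t + 1 / value
    return value

a = [4, 2, 6]
alpha = Fraction(22, 7)
p, q = convergents(a)
n = len(a) - 1

lhs = evaluate(a + [alpha])                                  # [a_0, ..., a_n, alpha]
rhs = (alpha * p[n] + p[n - 1]) / (alpha * q[n] + q[n - 1])  # formula from Lemma 1
print(lhs, rhs, lhs == rhs)  # 1339/300 1339/300 True
```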

Theorem 1: The Continued Fraction Approximation Algorithm

For any given finite continued fraction [ a₀, a₁, ... a_n ] where all a_i are integers, using Lemma 1 to compute p_n and q_n we find that:
[ a₀, a₁, ... a_n ] = p_n / q_n

(1) Let's define p_n, q_n using the series in Lemma 1.

(2) For case n=1, we have:
[a₀, a₁] = a₀ + 1/a₁ = (a₀*a₁ + 1)/a₁ [By definition of Continued Fractions]
= p₁/q₁ [By definition of p_n and q_n in Lemma 1]

(3) Assume that this is true up to some value n so that [a₀, a₁, ... a_n] = p_n/q_n

(4) Applying Lemma 1 with α = a_n+1 gives us:
[a₀, a₁, ..., a_n, a_n+1] = [(a_n+1)p_n + p_n-1]/[(a_n+1)q_n + q_n-1] = p_n+1/q_n+1

(5) By the Principle of Induction, we are done.

QED
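To see Theorem 1 on a concrete case, the sketch below compares every truncation [a₀, ..., a_k] of a sample continued fraction with p_k/q_k. The terms [4, 2, 6, 7] and the helper names are illustrative assumptions.

```python
from fractions import Fraction

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

def evaluate(terms):
    """Evaluate [t_0, t_1, ..., t_m] exactly, starting from the innermost term."""
    value = Fraction(terms[-1])
    for t in reversed(terms[:-1]):
        value = t + 1 / value
    return value

a = [4, 2, 6, 7]
p, q = convergents(a)
matches = [evaluate(a[:k + 1]) == Fraction(p[k], q[k]) for k in range(len(a))]
print(matches)  # [True, True, True, True]
```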

Now, consider this interesting lemma:

Lemma 2: For n ≥ 1, p_n*q_n-1 - p_n-1*q_n = (-1)^(n-1)

(1) Let's start with the Case: n = 1

p_n*q_n-1 - p_n-1*q_n = p₁* q₀ - p₀*q₁

From Lemma 1 above,

p₀ = a₀
p₁ = a₀*a₁ + 1
p₂ = p₁a₂ + p₀ = (a₀a₁ + 1)a₂ + a₀ = a₀a₁a₂ + a₂ + a₀

q₀ = 1
q₁ = a₁
q₂ = q₁a₂ + q₀ = a₁a₂ + 1

p₁q₀ = (a₀*a₁ + 1)*1 = a₀*a₁+1

p₀*q₁ = a₀*a₁

So:
p₁*q₀ - p₀q₁ = a₀*a₁ + 1 - a₀*a₁ = 1

(2) Let's assume that this is true up to n-1, so that:

p_n-1*q_n-2 - p_n-2*q_n-1 = (-1)^(n-2)

(3) From Lemma 1, step #2 and step #3, we know that:

p_n = p_n-1*a_n + p_n-2

and

q_n = q_n-1*a_n + q_n-2

(4) So,

p_n*q_n-1 - p_n-1*q_n = (p_n-1*a_n + p_n-2)(q_n-1) - (p_n-1)(q_n-1a_n + q_n-2)

= p_n-1*a_n*q_n-1 + p_n-2q_n-1 - p_n-1*a_n*q_n-1 - p_n-1q_n-2 =
= p_n-2q_n-1 - p_n-1q_n-2 =
= (-1)(p_n-1q_n-2 - p_n-2q_n-1) =
= (-1)(-1)^(n-2) [From step #2 above]
= (-1)^(n-1)

(5) By the Principle of Induction we are done.

QED
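A quick numerical check of Lemma 2, as a sketch: the terms [3, 7, 15, 1, 292] are simply the well-known opening terms of the continued fraction of π, used here as sample input.

```python
def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [3, 7, 15, 1, 292]
p, q = convergents(a)

# p_n*q_(n-1) - p_(n-1)*q_n for n = 1, 2, 3, 4
values = [p[n] * q[n - 1] - p[n - 1] * q[n] for n in range(1, len(a))]
print(values)  # [1, -1, 1, -1]
```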

Lemma 3: If α is a positive real number and α = [a₀, a₁ ... a_n-1, α_n ], then a₀ ≥ 0 and all other a_i ≥ 1, and α_n ≥ 1.

(1) a₀ = floor(α), which is clearly ≥ 0 since α is positive.

(2) In case n = 1, α₁ = 1/(α - a₀).

In this case, a₀ is strictly less than α (since α₁ is only defined when α - a₀ is nonzero).

So, clearly α₁ must also be a positive number greater than 1 (since the difference between α and a₀ is positive and less than 1)

(3) In case n=2, a₁ = floor(α₁) is ≥ 1 by the reasoning in step #2. With α₁ ≥ 1 and a₁ ≥ 1, we know that α₂ = 1/(α₁ - a₁) must also be a positive number ≥ 1.

(4) Let's assume that this is true up to n-1.

(5) At this point, we have a value α_n that is a positive real number ≥ 1.

(6) So, a_n = floor(α_n) which means that a_n ≥ 1.

(7) Finally, α_n+1 = 1/(α_n - a_n) which means that it too will be a positive real number ≥ 1.

(8) By the principle of induction we are done.

QED
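The steps of Lemma 3 can be traced on an example. This sketch records both the integers a_i and the intermediate values α_i; the function name expand and the input 355/113 are assumptions for illustration.

```python
from fractions import Fraction
from math import floor

def expand(alpha, steps):
    """Return (a, complete): a holds a_0, a_1, ...; complete holds alpha_1, alpha_2, ..."""
    a, complete = [], []
    x = alpha
    for _ in range(steps):
        a_k = floor(x)
        a.append(a_k)
        if x == a_k:             # x is an integer, so there is no next alpha
            break
        x = 1 / (x - a_k)        # alpha_(k+1) = 1/(alpha_k - a_k)
        complete.append(x)
    return a, complete

a, complete = expand(Fraction(355, 113), 10)
print(a)  # [3, 7, 16]
print(a[0] >= 0, all(a_i >= 1 for a_i in a[1:]), all(x >= 1 for x in complete))
# True True True
```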

Corollary 3.1: If α is a negative real number, and α = [a₀, a₁ ... a_n-1, α_n ], then a₀ ≤ 0 and all other a_i ≤ -1, and α_n ≤ -1.

(1) The reasoning here is the same as in Lemma 3 except that we use a ceiling function instead of a floor function, where ceiling(-5.6) = -5. In this case, each a_i is greater than the corresponding α_i value, so the difference α_i - a_i is still a negative number.

(2) In this way, we can use the same reasoning as in Lemma 3, and the result is the same as in Lemma 3 except that all the a_i values and the α_n value are multiplied by -1.

QED

Lemma 4: For a positive real number, if n is greater than 0, then q_n+1 is greater than q_n

(1) For case n=1:

q₁ = a₁
q₂ = q₁a₂ + q₀ = a₁*a₂ + 1

Since a₁, a₂ are both ≥ 1, we know that:
a₁ ≤ a₁*a₂

And therefore:
a₁ is less than a₁*a₂ + 1

(2) Let's assume that this is true up to n so that q_n+1 is greater than q_n

(3) So, q_n+2 = q_n+1*a_n+2 + q_n

(4) We know that q₁ = a₁ which means that q₁ ≥ 1. [From Lemma 3]

(5) So q_n ≥ 1 (since q₁ ≥ 1 and, by our assumption in #2, the q values do not decrease)

(6) We also know that a_n+2 is ≥ 1 which gives us:
q_n+1 ≤ q_n+1*a_n+2

(7) And finally, applying #5
q_n+1 is less than q_n+1*a_n+2 + 1 ≤ q_n+1*a_n+2 + q_n = q_n+2

(8) By the Principle of Induction, we are done.

QED

Corollary 4.1 For a positive real number where n is greater than 0, q_n ≥ n.

(1) Let's start with n=1

q₁ = a₁ which is ≥ 1.

(2) Let's assume that this is true up to n+1 so that q_n+1 ≥ n+1.

(3) q_n+2 = q_n+1a_n+2 + q_n

(4) Now q_n+2 is greater than q_n+1 [By Lemma 4] and both are integers, which means that:
q_n+2 ≥ q_n+1 + 1 ≥ n + 1 + 1 = n + 2.

(5) By the Principle of Induction, we are done.

QED
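Both Lemma 4 and Corollary 4.1 can be checked on the slowest-growing case, where every a_i equals 1 and the q values are the Fibonacci numbers. The sketch below is illustrative only; the choice of eight terms is arbitrary.

```python
def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [1] * 8
p, q = convergents(a)
print(q)  # [1, 1, 2, 3, 5, 8, 13, 21]

# Lemma 4: q_(n+1) > q_n for n >= 1 (note that q_1 = q_0 here, so n = 0 is excluded)
increasing = all(q[n + 1] > q[n] for n in range(1, len(q) - 1))
# Corollary 4.1: q_n >= n for n >= 1
at_least_n = all(q[n] >= n for n in range(1, len(q)))
print(increasing, at_least_n)  # True True
```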

Corollary 4.2: For a positive real number and all values of n ≥ 2, p_n+1 is greater than p_n.

(1) We see that this is true for n=2:

p₁ = a₀a₁ + 1.
p₂ = p₁a₂ + a₀.
p₃ = p₂a₃ + p₁

Since a₀ is ≥ 0 and a₁ is ≥ 1 (see above), we see that p₁ is ≥ 1, and so p₂ is ≥ 1 as well.
Now a₃ is ≥ 1 so p₂ * a₃ is at least equal to p₂, and since p₁ is ≥ 1, we know that p₃ = p₂a₃ + p₁ must be greater than p₂.

(2) Now, we assume it is true for all values up to n.

(3) p_n+1 = p_na_n+1 + p_n-1

Since p₁ ≥ 1 from (1), and each a_n ≥ 1 for n ≥ 1, we know that every p_n with n ≥ 1 is ≥ 1.

So, we know that p_n+1 is greater than p_n by at least p_n-1 which is ≥ 1.

(4) We have now proven that p_n+1 is greater than p_n for all values of n ≥ 2.

QED

Theorem 2: For a positive irrational number, we can use the approximation algorithm to generate an approximation of any degree of accuracy. In other words:

absolute(α - p_n/q_n) is less than 1/(q_nq_n+1) which is less than 1/(q_n)² which is ≤ 1/n²

(1) From Lemma 1, we know that:
α = (α_n+1p_n + p_n-1)/(α_n+1q_n + q_n-1)

(2) Subtracting p_n/q_n from both sides gives us:
α - p_n/q_n = (α_n+1p_n + p_n-1)/(α_n+1q_n + q_n-1) - p_n/q_n =
[q_n(α_n+1p_n + p_n-1) - p_n(α_n+1q_n + q_n-1)]/[q_n(α_n+1q_n + q_n-1)] =
= (q_nα_n+1p_n - q_nα_n+1p_n + p_n-1q_n - p_nq_n-1)/[q_n(α_n+1q_n +q_n-1) ] =
= (p_n-1q_n - p_nq_n-1)/[q_n(α_n+1q_n + q_n-1)] =
= (-1)ⁿ/[q_n(α_n+1q_n + q_n-1)] [By Lemma 2, since p_n-1q_n - p_nq_n-1 = (-1)(-1)^(n-1)]

(3) We know that for positive irrational numbers, a_n+1 is less than α_n+1 (since a_n+1 is the floor of α_n+1, which is never an integer), which means that:

1/[q_n(a_n+1q_n + q_n-1)] is greater than 1/[q_n(α_n+1q_n + q_n-1)]

Likewise,

-1/[q_n(a_n+1q_n + q_n-1)] is less than -1/[q_n(α_n+1q_n + q_n-1)]

(4) Now, we need to consider two cases to complete the proof:

Case I: n is even

In this case:

α - p_n/q_n is less than 1/[q_n(a_n+1q_n + q_n-1)] = 1/(q_nq_n+1)

We also know that α - p_n/q_n is greater than 0 which means that it is also greater than -1/(q_n)²

Since q_n+1 is greater than q_n, we also know that:

α - p_n/q_n is less than 1/(q_nq_n) = 1/(q_n)²

Putting all this together gives us,

absolute(α - p_n / q_n) is less than 1/(q_n)²

Case II: n is odd

In this case:

α - p_n/q_n is greater than -1/[q_n(a_n+1q_n + q_n-1)] = -1/(q_nq_n+1)

We also know that α - p_n/q_n is less than 0 which means that it is also less than 1/(q_n)²

Since q_n+1 is greater than q_n, we also know that:

α - p_n/q_n is greater than -1/(q_nq_n) = -1/(q_n)²

Putting all this together gives us,

absolute(α - p_n / q_n) is less than 1/(q_n)²

(5) Now, since q₁ ≥ 1 and q_n+1 is greater than q_n for n ≥ 1, we know that:

q_n ≥ n and:

1/(q_n)² is ≤ 1/n²

QED
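Here is a rough numerical illustration of Theorem 2, assuming the standard expansion √2 = [1, 2, 2, 2, ...]. Note that math.sqrt returns a floating-point approximation, so this is a sanity check for small n rather than an exact verification.

```python
from math import sqrt

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [1] + [2] * 8            # first terms of sqrt(2) (assumed known)
p, q = convergents(a)
alpha = sqrt(2)              # floating-point stand-in for the irrational number

bounds_hold = True
for n in range(1, len(a) - 1):
    error = abs(alpha - p[n] / q[n])
    # Theorem 2: error < 1/(q_n * q_(n+1)) < 1/q_n^2 <= 1/n^2
    if not (error < 1 / (q[n] * q[n + 1]) < 1 / q[n] ** 2 <= 1 / n ** 2):
        bounds_hold = False

print(p[7], q[7])    # 577 408
print(bounds_hold)   # True
```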

Lemma 5: Let C_n = p_n/q_n, then:
(a) C_n - C_n-1 = [(-1)^(n-1)]/(q_nq_n-1)
(b) C_n - C_n-2 = [a_n(-1)ⁿ]/(q_nq_n-2)

(1) p_nq_n-1 - q_np_n-1 = (-1)^(n-1). [From Lemma 2 above]

(2) Dividing both sides by q_nq_n-1 gives us (a).

(3) C_n - C_n-2 = p_n/q_n - p_n-2/q_n-2 = (p_nq_n-2 - p_n-2q_n)/(q_nq_n-2)

(4) p_nq_n-2 - p_n-2q_n = (a_np_n-1 + p_n-2)q_n-2 - p_n-2(a_nq_n-1 + q_n-2) =
= a_n(p_n-1q_n-2 - p_n-2q_n-1) = a_n(-1)^(n-2) = a_n(-1)ⁿ

(5) Combining #3 and #4 gives us (b).

QED
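Both parts of Lemma 5 can be confirmed with exact fractions. As before, this is a sketch on sample terms (the opening terms of π), not a general proof.

```python
from fractions import Fraction

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [3, 7, 15, 1, 292]
p, q = convergents(a)
C = [Fraction(p[k], q[k]) for k in range(len(a))]

# (a) C_n - C_(n-1) = (-1)^(n-1) / (q_n * q_(n-1))
part_a = all(C[n] - C[n - 1] == Fraction((-1) ** (n - 1), q[n] * q[n - 1])
             for n in range(1, len(a)))
# (b) C_n - C_(n-2) = a_n * (-1)^n / (q_n * q_(n-2))
part_b = all(C[n] - C[n - 2] == Fraction(a[n] * (-1) ** n, q[n] * q[n - 2])
             for n in range(2, len(a)))
print(part_a, part_b)  # True True
```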

Corollary 5.1 C₁ is greater than C₃ is greater than C₅ ... and C₀ is less than C₂ is less than C₄ ...

(1) From Lemma 5(b), since a_n and the q values are all positive, we know that C_n is less than C_n-2 if n is odd and C_n is greater than C_n-2 if n is even.

QED

Corollary 5.2 Consecutive C_n lie alternately above and below the exact value of the continued fraction.

(1) From Lemma 5, we know that they will alternate. For odd values of n, C_n will be greater than C_n-1 and for even values of n, C_n will be less than C_n-1.

(2) By Theorem 2, we see that each C_n is a value closer to the exact value of the continued fraction.

(3) Finally, from Theorem 2, step #2, we see that α - p_n/q_n is alternately positive and negative. If n is even, then α is above C_n and if n is odd, then α is below C_n.

QED
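The two corollaries can be illustrated without any floating point by again assuming √2 = [1, 2, 2, 2, ...]: since p and q are positive, p/q is below √2 exactly when p² is less than 2q². The variable names below are my own.

```python
from fractions import Fraction

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [1] + [2] * 8            # first terms of sqrt(2) (assumed known)
p, q = convergents(a)
C = [Fraction(p[k], q[k]) for k in range(len(a))]

# Corollary 5.2: even-indexed C_n are below sqrt(2), odd-indexed ones are above
sides = ["below" if p[n] ** 2 < 2 * q[n] ** 2 else "above" for n in range(len(a))]
print(sides[:4])  # ['below', 'above', 'below', 'above']

# Corollary 5.1: C_0 < C_2 < C_4 < ... and C_1 > C_3 > C_5 > ...
evens, odds = C[0::2], C[1::2]
evens_increase = all(x < y for x, y in zip(evens, evens[1:]))
odds_decrease = all(x > y for x, y in zip(odds, odds[1:]))
print(evens_increase, odds_decrease)  # True True
```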

Lemma 6: For any value n ≥ 1, gcd(p_n,q_n) = 1.

(1) Assume that f divides both p_n and q_n

(2) By Lemma 2 above, p_nq_n-1 - p_n-1q_n = (-1)^(n-1)

(3) So, since f divides both p_n and q_n, f divides the left-hand side, which implies that f divides (-1)^(n-1)

(4) But the only way this is true is if f = ±1.

(5) Which proves that the only common factor they can have is 1.

QED
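Lemma 6 is easy to spot-check with math.gcd from the Python standard library. The sample terms are again the opening terms of π and are only an illustration.

```python
from math import gcd

def convergents(a):
    """Return the lists (p, q) built from the terms a[0..n]; needs len(a) >= 2."""
    p = [a[0], a[0] * a[1] + 1]
    q = [1, a[1]]
    for k in range(2, len(a)):
        p.append(p[k - 1] * a[k] + p[k - 2])
        q.append(q[k - 1] * a[k] + q[k - 2])
    return p, q

a = [3, 7, 15, 1, 292]
p, q = convergents(a)

# gcd(p_n, q_n) for n = 1, 2, 3, 4
gcds = [gcd(p[n], q[n]) for n in range(1, len(a))]
print(gcds)  # [1, 1, 1, 1]
```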

5 comments:

Scouse Rob said...: In Lemma 2 (3):
Shouldn't you be factoring the qn+1 and pn+1?

Rob; Sep 28, 2007, 6:54:00 AM
Larry Freeman said...: Hi Rob,

Thanks very much for your question. I've rewritten Lemma 2 to make it clearer. I hope that this answers your question.

Cheers,

-Larry; Sep 30, 2007, 11:40:00 AM
Scouse Rob said...: In step (4) of Theorem 1:

[(an+1)pn+pn-1]/(an+1)qn+qn-1

should be:

[(an+1)pn + pn-1]/[(an+1)qn+qn-1]; Jun 9, 2010, 3:20:00 AM
Scouse Rob said...: In step (4) of Theorem 2

Should

We also know that α-pn is greater than 0

be

We also know that α - pn/qn is greater than 0

Rob; Jun 9, 2010, 4:51:00 AM
Scouse Rob said...: In step (4) of Lemma 5

an(pn-1qn-2-pn-2qn-1)=ak(-1)^n-2

Should be

an(pn-1qn-2-pn-2qn-1)=an(-1)^n-2; Jun 9, 2010, 5:00:00 AM

Sunday, November 27, 2005
