Skip to main content

Full text of "First Course in the Theory of Equations"

See other formats


Google 



This is a digital copy of a book that was preserved for general ions on library shelves before il was carefully scanned by Google as part of a project 

to make the world's books discoverable online. 

Il has survived long enough for the copyright to expire and the book to enter the public domain. A public domain book is one that was never subject 

to copyright or whose legal copyright term has expired. Whether a book is in the public domain may vary country to country. Public domain books 

are our gateways to the past, representing a wealth of history, culture and knowledge that's often diflicult to discover. 

Marks, notations and other marginalia present in the original volume will appear in this file - a reminder of this book's long journey from the 

publisher to a library and finally to you. 

Usage guidelines 

Google is proud to partner with libraries to digitize public domain materials and make them widely accessible. Public domain books belong to the 
public and we are merely their custodians. Nevertheless, this work is expensive, so in order to keep providing this resource, we have taken steps to 
prevent abuse by commercial parlies, including placing technical restrictions on automated querying. 
We also ask that you: 

+ Make non-commercial use of the plus We designed Google Book Search for use by individuals, and we request that you use these files for 
personal, non-commercial purposes. 

+ Refrain from automated querying Do not send automated queries of any sort to Google's system: If you are conducting research on machine 
translation, optical character recognition or other areas where access to a large amount of text is helpful, please contact us. We encourage the 
use of public domain materials for these purposes and may be able to help. 

+ Maintain attribution The Google "watermark" you see on each file is essential for informing people about this project and helping them find 
additional materials through Google Book Search. Please do not remove it. 

+ Keep it legal Whatever your use, remember that you are responsible for ensuring that what you are doing is legal. Do not assume that just 
because we believe a b<x>k is in the public domain for users in the United States, that the work is also in the public domain for users in other 

countries. Whether a book is still in copyright varies from country to country, and we can't offer guidance on whether any specific use of 
any specific book is allowed. Please do not assume that a book's appearance in Google Book Search means il can be used in any manner 
anywhere in the world. Copyright infringement liability can be quite severe. 

About Google Book Search 

Google's mission is to organize the world's information and to make it universally accessible and useful. Google Book Search helps readers 
discover the world's hooks while helping authors ami publishers reach new audiences. You can search through I lie lull text of this book on I lie web 
at |http : //books . qooqle . com/| 



* 1 






V *. ..• 






FIRST COURSE 

IN THE 

THEORY OF EQUATIONS 



WORKS OF L. £. DICKSON 

PUBLI8HKD BT 

JOHN WILEY & SONS, Inc. 

First Course in the Theory of Equations. 
168 pages. 6x9. Illustrated. Cloth. 

Elementary Thecry of Equations. 

184 pages, 6x9. Illustrated. Cloth. 

College Algebra. 

A text- book for colleges and technical schools. 
Second Edition. 214 pages, 5 J x S. Illus- 
trated. Cloth. 

Algebraic Invariants. 

Being No. 14 of the Mathematical Mono- 
graphs. 100 pages, 6x9. Cloth. 

FURTHER WORKS OF L. E. DICKSON 

Algebras and Their Arithmetics. 

241 pages. University of Chicago Press. 

Theory of Numbers. 

183 pages, University of Chicago Press. 

Studies in Theory of Numbers. 

230 pages, University of Chicago Press. 

Modern Algebraic The cries. 
276 pages, Sanborn & Co. 

Researches on Waring's Problem. 

257 pages, Carnegie Institution, Washington, 
D.C. 

Linear Groups. 

312 pages, Teubner, Germany. 

On Invariants and the Theory cf Numbers. 

110 pages, Madison Colloquium, American 
Mathematical Society. 

History of the Theory of Numbers. 

Vol. I, 486 pages. Vol. II, 803 pages. Vol. 
Ill, 313 pages. Carnegie Institution, Wash- 
ington, D. C. 



FIRST COURSE 

IN THE 

THEORY OF EQUATIONS 



BY 
LEONARD EUGENE DICKSON, Ph.D. 

CORRESPO *DANT DE i/lNSTITUT DE FRANCE 
PROFESSOR OF MAI HEMATICS IN THE UNIVERSITY OF CHICAGO 



NEW YORK 

JOHN WILEY & SONS, Inc. 

London: CHAPMAN & HALL, Limited 



-.j 



i 
I 



Copyright, 1922, by 
LEONARD EUGENE DICKSON 



All Rights Reserved 

This book or any part thereof must not 
be reproduced in any form without 
the written permission of the publisher. 



PRINTED IN THE UNITED 8TATE8 OP AMERICA 

10/47 






PREFACE 



The theory of equations is not only a necessity in the subsequent 
mathematical courses and their applications, but furnishes an illumina- 
ting sequel to geometry, algebra and analytic geometry. Moreover, 
: it develops anew and in greater detail various fundamental ideas of 
» calculus for the simple, but important, case of polynomials. The 
■ theory of equations therefore affords a useful supplement to differential 
! calculus whether taken subsequently or simultaneously. 

It was to meet the numerous needs of the student in regard to his 
earlier and future .mathematical courses that the present book was 
planned with great care and after wide consultation. It differs essentially 
from the author's Elementary Theory of Equations, both in regard to 
omissions and additions, and since it is addressed to younger students 
and may be used parallel with a course in differential calculus. Simpler 
: and more detailed proofs are now employed. The exercises are simpler, 
more numerous, of greater variety, and involve more practical applications. 
This book throws important light on various elementary topics. 
For example, an alert student of geometry who has learned how to bisect 
1 any angle is apt to ask if every angle can be trisected with ruler and 
compasses and if not, why not. After learning how to construct regular 
polygons of 3, 4, 5, 6, 8 and 10 sides, he will be inquisitive about the 
missing ones of 7 and 9 sides. The teacher will be in a comfortable position 
if he knows the facts and what is involved in the simplest discussion to 
date of these questions, as given in Chapter III. Other chapters throw 
needed light on various topics of algebra. In particular, the theory 
of graphs is presented in Chapter V in a more scientific and practical 
manner than was possible in algebra and analytic geometry. 

There is developed a method of computing a real root of an equation 
with minimum labor and with certainty as to the accuracy of all the 
decimals obtained. We first find by Horner's itxethod successive trans- 

in 



iv PREFACE 

formed equations whose number is half of the desired number of significant 
figures of the root. The final equation is reduced to a linear equation 
by applying to the constant term the correction computed from the 
omitted terms of the second and higher degrees, and the work is completed 
by abridged division. The method combines speed with control of 
accuracy. 

Newton's method, which is presented from both the graphical and 
the numerical standpoints, has the advantage of being applicable also to 
equations which are not algebraic; it is applied in detail to various such 
equations. 

In order to locate or isolate the real roots of an equation we may 
employ a graph, provided it be constructed scientifically, or the theorems 
of Descartes, Sturm, and Budan, which are usually neither stated, nor 
proved, correctly. 

The long chapter on determinants is independent of the earlier chap- 
ters. The theory of a general system of linear equations is here pre- 
sented also from the standpoint of matrices. 

For valuable suggestions made after reading the preliminary manu- 
script of this book, the author is greatly indebted to Professor Bussey 
of the University of Minnesota, Professor Roever of Washington Uni- 
versity, Professor Kempner of the University of Illinois, and Professor 
Young of the University of Chicago. The revised manuscript was much 
improved after it was read critically by Professor Curtiss of Northwestern 
University. The author's thanks are due also to Professor Dresden of 
the University of Wisconsin for various useful suggestions on the 
proof-sheets. 
Chicago, 1921. 



CONTENTS 

Numbers refer to pages. 



CHAPTER I 

Complex Numbers 

Square roots, 1. Addition, multiplication and division of complex num- 
bers, 2. Cube roots of unity, 3. Geometrical representation,. 3. Product 
and quotient, 4. De Moivre's theorem, 5. Cube roots, 5. nth roots, 7. 
Roots of unity, 8. Primitive roots of unity, 9. 

CHAPTER II 

Elementary Theorems on the Roots of an Equation 

Quadratic equation, 11. Remainder theorem, 12. Synthetic division, 13. 
Factored form of a polynomial, 15. Multiple roots, 16. Identical poly- 
nomials, 16. Relations between the roots and the coefficients, 17. Imaginary 
roots occur in pairs, 19. Upper limit to the real roots, 21. Integral roots, 
24. Rational roots, 27. 

CHAPTER III 

Constructions with Ruler and Compasses 

Graphical solution of a quadratic equation, 29. Analytic criterion for 
constructibility, 30. Cubic equations with a constructible root, 32. Tri- 
section of an angle, 34. Duplication of a cube, 35. Regular polygons of 
7, 9, 17, and n sides, 35-44. Reciprocal equations, 37. 

CHAPTER IV 

Cubic and Quartic Equations 

Algebraic solution of a cubic, 45. Discriminant, 47. Number of real roots 
of a cubic, 48. Trigonometric solution of a cubic, 49. Ferrari's and Descartes' 
solutions of a quartic, 50. Resolvent cubic, 51 . Discriminant of a quartic, 51 . 

CHAPTER V 

The Graph of an Equation 

Use of graphs, 55. Caution in plotting, 55. Bend points, 56. Derivatives, 
58. Horizontal tangents, 60. Multiple roots, 60. Ordinary and inflexion 



VI CONTENTS 

tangents, 62. Real roots of a cubic equation, 65. Continuity, 66. Con- 
dition for a root between a and b, 67. Sign of a polynomial at infinity, 68. 
Rolle's theorem, 69. 

CHAPTER VI 

Isolation of the Real Roots 

Descartes' rule' of signs, 71. Sturm's method, 75. Sturm's functions for 
the general quartic equation, SO. Budan's theorem, 83. 

CHAPTER VII 

Solution of Numerical Equations 

Horner's method, 86. Newton's method, algebraic and graphical discus- 
sion, systematic computation, also for functions not polynomials, 90. Imagi- 
nary roots, 98. 

CHAPTER VIII 

Determinants; Systems of Linear Equations 

Solution of 2 or 3 linear equations by determinants, 101. Even and odd 
arrangements, 103. Definition of a determinant of order w, 105. Interchange 
of rows and columns, 106. Interchange of two columns or two rows, 107. 
Minors, 109. Expansion, 109. Removal of factors, 111. Sum of deter- 
minants, 112. Addition of columns or rows, 113. Rank, 116. System 
of n linear equations in n unknowns,. 114, 116. Homogeneous equations, 119. 
System of m linear equations in n unknowns, matrix and augmented matrix, 
120. Complementary minors, 122. Laplace's development, 122. Product of 
determinants, 124. 

CHAPTER IX 

Symmetric Functions 

Sigma functions, 128. Elementary symmetric functions, 128. Funda- 
mental theorem, 129. Rational functions symmetric in all but one of the 
roots, 132. Sums of like poweis of the roots, Newton's identities, 134. 
Waring's formula, 136. Computation of symmetric functions, 141, 

CHAPTER X 
Elimination, Resultants and Discriminants 
Methods of Sylvester, Euler, and Bezout, 143. Discriminants, 152. 

APPENDIX 
The Fundamental Theorem of Algebra 

Answers 15 

Index 16 



First Course in 
The Theory of Equations 



CHAPTER I 
Complex Numbers 

1. Square Roots. If p is a positive real number, the symbol Vp is 
used to denote the positive square root of p. It is most easily computed 
by logarithms. 

We shall express the square roots of negative numbers in terms of the 
symbol i such that the relation i 2 = — 1 holds. Consequently we denote 

the roots of x 2 = — 1 by i and — L The roots of x* = — 4 are written in the 

form ±2i in preference to ± V— 4. In general, if p is positive, the roots 
of x2= — p are written in the form =b Vp i in preference to =b V— p. 

The square of either root is thus ( V p)H*= —p. Had we used the less desirable 

notation =fcV — V for the roots of x 2 — — p, we might be tempted to find the square of 
either root by multiplying together the values under the radical sign and conclude 

erroneously that ^__ __ 

V— p V — p = Vp*=-f-p. 

To prevent such errors we use v p % and not v — p. 

2. Complex Numbers. If a and b are any two real numbers and 
i 2 = — 1, a+bi is called a complex number x and a— bi its conjugate. Either 
is said to be zero if a=6=0. Two complex numbers a+bi and c+di 
are said to be equal if and only if a = c and 6 = d. In particular, a +6i=0 

1 Complex numbers are essentially couples of real numbers. For a treatment from 
this standpoint and a treatment based upon vectors, see the author's Elementary Theory 
o] Equations, p. 21, p. 18. 



2 COMPLEX NUMBERS [Ch. I 

if and only if a=6=0. If bj*0, a+bi is said to be imaginary. In partic- 
ular, bi is called a pure imaginary. 

Addition of complex numbers is defined by ^ 

(a+bi) + (c+di) = (a+c) + (b+d)i. 

The inverse operation to addition is called subtraction, and consists in 
finding a complex number z such that 

(c+di) +z= a+bi. 
In notation and value, z is 

(a+bi)-(c+di) = (a-c) + (b-d)i. 

Multiplication is defined by 

(a+bi) (c+di)=a£—bd+(ad+bc)i, 

and hence is performed as in formal algebra with a subsequent reduction 
by means of i 2 = — 1. For example, 

(a+bi) (a-bi)=a 2 -V*v>=a 2 +V>. 

Division is defined as the operation which is inverse to multiplication, 

and consists in finding a complex number q such that (a+bi)q=e+fi. 

Multiplying each member by a— bi, we find that q is, in notation and 

value, 

e+fi __ (e+fi) (a— bi) _ ae+bf , af—be . 

a+bi" a 2 +& ""a 2 +6 2 " i " a 2 +V* U 

Since a 2 +6 2 =0 implies a = 6=0 when a and b are real, we conclude that 
division except by zero is possible and unique. 

EXERCISES 
Express as complex numbers 

1. V^9. 2. VI 3. (V25+V325) V^IQ. 4. -f. 

5 . 8+2 Vi. 6.?±^I. 7.|±£ 8.°-±£ 

2+V— 1 2— 3i o— 6t 

9. Prove that the sum of two conjugate complex numbers is real and that their 
difference is a pure imaginary. 

10. Prove that the conjugate of the sum of two complex numbers is equal to the 
sum of their conjugates. Does the result hold true if each word sum is replaced by the 
word difference? 



1 4] GEOMETRICAL REPRESENTATION 3 

11. Prove that the conjugate of the product (or quotient) of two complex numbers 
is equal to the product (or quotient) of their conjugates. 

12. Prove that, if the product of two complex numbers is lero, at least one of them 
is aero. 

13. Find two pairs of real numbers x, y for which 

(x+yi)»--7+24i. 

As in Ex. 13, express as complex numbers the square roots of 

14. -11 4-60 i. 15. 5-12*. 16. 4cd+(2c*-2d*)t. 

3. Cube Roots of Unity. Any complex number x whose cube is equal 
to unity is called a cube root of unity. Since 

a?-l = (z-l)(a?+x+l), 

the roots of x 3 ^ 1 are 1 and the two numbers z for which 

x2+x+l=0, (x+i) 2 --!, x+i=±iV3t. 

Hence there are three cube roots of unity, viz., 

1, «=-i+iV3t, <a'=-t-tV3i. 

In view of the origin of a>, we have the important relations 

« 2 +o)+l=0, or>=l. 

Since a>a/=l and w 3 = l, it follows that w^a) 2 , o) = o)' 2 . 

4. Geometrical Representation of Complex Numbers. Using rect- 
angular axes of coordinates, OX and OY, we represent the complex number 
a+bi by the point A having the coordinates a, b (Fig. 1). 

The positive number r=Va 2 +V* giving 
the length of OA is called the modulus (or 
absolute value) of a+bi. The angle d=X0A f 
measured counter-clockwise from OX to 0A f 
is called the amplitude (or argument) of a+bi. 
Thus cos 0=a/r,. sin 0=6/r, whence 

(1) o+W=r(cos B+i sin 0). 

The second member is called the trigonometric form of a+bi. 

For the amplitude we may select, instead of 0, any of the angles 0rt36O°, 
0=fc72O°, etc. 




COMPLEX NUMBERS 



[Ch. I 




Two complex numbers are equal if and only if their moduli are equal 
and an amplitude of the one is equal to an amplitude of the other. 

For example, the cube roots of unity are 1 and 

=cosl20°+isin 120°, 

= cos240 o +isin240 o , 

and are represented by the points marked 1, «, «' 

at the vertices of an equilateral triangle inscribed 

in a circle of radius unity and center at the origin 

O (Fig. 2). The indicated amplitudes of o> and «* 

_ are 120° and 240° respectively, while the modulus 

* 1G 2 of each is 1. 

The modulus of —3 is 3 and its amplitude is 180° on 180° plus or minus the product 

of 360° by any positive whole number. 

5. Product of Complex Numbers. By actual multiplication, 

[r(cos d+i sin 0)] [r'(cos a+i sin a)] 
= rr'[(cos cos a— sin sin a)+i(sin cos a+cos sin a)] 
= rr , [cos (0+a)+i sin (0+a)], by trigonometry. 

Hence the modulus of the product of two complex numbers is equal to the 
product of their moduli, while the amplitude of the product is equal to the 
sum of their amplitudes. 

For example, the square of <•> =cos 120° +i sin 120° has the modulus 1 and the ampli- 
tude 120°+120° and hence is w 1 =cos 240°+i sin 240°. Again, the product of « and o>* 
has the modulus 1 and the amplitude 120° +240° and hence is cos 360° +t sin 360% 
which reduces to 1. This agrees with the known fact that «' s = 1. 

Taking r=r' = l in the above relation, we obtain the useful formula 
(2) (cos d+i sin 0) (cos a+i sin a) = cos (0+a) +i sin (0+a). 

6. Quotient of Complex Numbers. Taking a=£-0 in (2) and divid- 
ing the members of the resulting equation by cos d+i sin 0, we get 

cos0+tsin0 /a M . . . fa M 

cos 0+t sin 



§ 7] DE MOIVRE'S THEOREM 5 

Hence the amplitude of the quotient of /2(cos 0+i sin ($) by r(cos 6+i sin 0) 
is equal to the difference 0—0 of their amplitudes, while the modulus of the 
quotient is equal to the quotient R/r of their moduli. 
The case = gives the useful formula 

= cos0— isin 0. 



■ 

k 

* 



cos 0+i sin 

7. De Moivre's Theorem. If n is any positive whole number, 

(3) (cos 0+i sin 0) n = cos n$+i sin n0. 

This relation is evidently true when n = l, and when n=2 it follows 
from formula (2) with a = 0. To proceed by mathematical induction, 
suppose that our relation has been established for the values 1, 2, . . . , m 
of n. We can then prove that it holds also for the next value m+1 of n. 
For, by hypothesis, we have 

(cos 6+i sin 0) m = cos md+i sin mO. 

Multiply each member by cos 0+i sin 0, and for the product on the right 
substitute its value from (2) with a = mO. Thus 

(cos 0+i sin 0) m+1 = (cos 0+i sin 0) (cos md+i sin mO), 

= cos (0+m0)+i sin (0+m0), 

which proves (3) when n=m+l. Hence the induction is complete. 

Examples are furnished by the results at the end of § 5: 

(cos 120° +i sin 120°) *= cos 240° +i sin 240°, 
(cos 120° +» sin 120°) 8 =cos 360° +i sin 360°. 

8. Cube Roots. To find the cube roots of a complex number, we first 
express the number in its trigonometric form. For example, 

l 4V2+4V2 i=8(cos 45°+i sin 45°). 

I If it has a cube root which is a complex number, the latter is expressible 
in the trigonometric form 

(4) r(cos 0+i sin 0). 

The cube of the latter, which is found by means of (3), must be equal 



to the proposed number, so that 

r3(cos 30+i sin 30) =8(cos 45% f ^ n 45°). 



6 



COMPLEX NUMBERS 



[Ch. I 



The moduli r 3 and 8 must be equal, so that the positive real number r 
is equal to 2. Furthermore, 30 and 45° have equal cosines and equal 
sines, and hence differ by an integral mulitple of 360°. Hence 30=45°+ 
A; -360°, or 0=15°+Jfcl2O°, where Jk is an integer. 1 Substituting this 
value of 6 and the value 2 of r in (4), we get the desired cube roots. The 
values 0, 1, 2 of A; give the distinct results 

#i=2(cos 15°+tsin 15°), # 2 = 2(cos.l35°+tsin 135°), 

# 3 =2(cos 255°+i sin 255°). 

Each new integral value of k leads to a result which is equal to R\, 
R2 or Rz. In fact, from fc = 3 we obtain R\, from k =4 we obtain R2, from 
fc = 5 we obtain #3, from fc = 6 we obtain R\ again, and so on periodically. 

EXERCISES 

1. Verify that /& = «&, #a = w*#i. Verify that R\ is a cube root of 8 (cos 45°+ 
i sin 45°) by cubing R x and applying De Moivre's theorem. Why are the new expressions 
for R\ and R% evidently also cube roots? 

2. Find the three cube roots of —27; those of — t; those of «o. 

3. Find the two square roots of i; those of — i; those of «. 

4. Prove that the numbers cos 6+i sin and no others are represented by points 
on the circle of radius unity whose center is the origin. 

5. If a+bi and c+di are represented by the points A and C in Fig. 3, prove that 
their sum is represented by the fourth vertex S of the parallelogram two of whose sides 
are OA and OC. Hence show that the modulus of the sum of two complex numbers 
is equal to or less than the sum of their moduli, and is equal to or greater than the dif- 
ference of their moduli. 





1 Here, as elsewhere when the contrary is not specified, zero and negative as well 
as positive whole numbers are included under the term "integer." 



§ 9] ROOTS OF COMPLEX NUMBERS 7 

6. Let r and r' be the moduli and and a the amplitudes of two complex numbers 
represented by the points A and C in Fig. 4. Let U be the point on the x-axis one 

1 unit to the right of the origin 0. Construct triangle OCP similar to triangle OUA and 
"* similarly placed, so that corresponding sides are OC and 0U f CP and UA, OP and OA, 
o while the vertices 0, C, P are in the same order (clockwise or counter-clockwise) as 
3 the corresponding vertices 0, U f A. Prove that P represents the product (J 5) of the 
complex numbers represented by A and C. 

7. If a+bi and e+fi are represented by the points A and S in Fig. 3, prove that 
the complex number obtained by subtracting a+bi from e+fi is represented by the point 
C. Hence show that tin absolute value of the difference of two complex numbers is 
equal to or less than the sum of their absolute values, and is equal to or greater than 

i the difference of their absolute values. 

8. By modifying Ex. 6, show how to construct geometrically the quotient of two 
'complex numbers. 

9. nth Roots. As illustrated in §8, it is evident that the nth roots 
of any complex number p(co& A+i sin A) a r e the products of the nth 
roots of cos A+i sin A. by the positive real nth root of the positive real 
number p (which may be found by logarithms). 

Let an nth root of cos A+i sin A be of the form 

(4) r(cos B+i sin 0). 
Then, by De Moivre's theorem, 

r*(cos nd+i sin nff) = cos A +i sin A. 

The moduli r" and 1 must be equal, so that the positive real number r 
is equal to 1. Since nd and A have equal sines and equal cosines, they 
differ by an integral multiple of 360°. Hence n9=A+k-360°, where k 
is an integer. Substituting the resulting value of 6 and the value 1 of r 
in (4), we get 

(5) cos^ j+tsm^ y 

For each integral value of k, (5) is an answer since its nth power reduces 
to cos A+i sin A by DeMoivre^ theorem. Next, the value n of k gives 
the same answer as the value of k; the value n+\ of k gives the same 
answer as the value 1 of k; and in general the value n+m of k gives the 
same answer as the value m of k. Hence we may restrict attention to 
the values 0, 1, . . . , n— 1 of k. Finally, the answers (5) given by these 



8 



COMPLEX NUMBERS 



[Ch. I 



values 0, 1, . . . , n— 1 of k are all distinct, since they are represented by 
points whose distance from the origin is the modulus 1 and whose ampli- 
tudes are 



n 



A 360° 



A . 2.360' 



n 



n 



n 



n 



A (71-1)360* 



n 



n 



so that these n points are equally spaced points on a circle of radius unity. 
Special cases are noted at the end of § 10. Hence any complex number 
different from zero has exactly n distinct complex nth roots. 

10. Roots of Unity. The trigonometric form of 1 is cos 0°+i sin 0°. 
Hence by § 9 with A =0, the n distinct nth roots of unity are 



(6) 



cos Hsin— (fc=0, 1, . . . , n— 1), 



n 



n 



where now the angles are measured in radians (an angle of 180 degrees 
being equal to ir radians, where tt = 3.1416, approximately). For fc=0, 
(6) reduces to 1, which is an evident nth root of unity. For fc=l, (6) is 



(7) 



~ Zt . . . Ztt 

R = cos — H sm — . 
n n 



By De Moivre's theorem, the general number (6) is equal to the 
fcth power of R. Hence the n distinct nth roots of unity are 

(8) R, R 2 , R z , ...,#"- \ R n =l. 

As a special case of the final remark in § 9, the n complex numbers 
(6), and therefore the numbers (8), are represented geometrically by the 
vertices of a regular polygon of n sides inscribed in the circle of radius 
unity and center at the origin with one vertex on the positive x-axis. 



Forn=3, the numbers (8) are co, o> 2 , 1, which are repre- 
sented in Fig. 2 by the vertices of an equilateral triangle. 

For n = 4, R = cos x/2 +i sin x/2 = i. The four fourth roots 
of unity (8) are i, » 2 =-l, i»=-i, i* = l, which are repre- 
sented by the vertices of a square inscribed in a circle of 
radius unity and center at the origin (Fig, 5), 




- § 11] PRIMITIVE ROOTS OF UNITY 9 

£ EXERCISES 

1. Simplify the trigonometric forms (6) of the four fourth roots of unity. Check 
the result by factoring x 4 — 1. 

2. For n=6, show that R= — o> 2 . The sixth roots of unity are the three cube roots 
of unity and their negatives. Check by factoring x % — 1. 

- 3. From the point representing a+bi, how do you obtain that representing — (a-f W)? 
* Hence derive from Fig. 2 and Ex. 2 the points representing the six sixth roots of unity. 
Obtain this result another way. 

i 

4. Find the five fifth roots of —1. 

I 4 

5. Obtain the trigonometric forms of the nine ninth roots of unity. Which of 
them are cube roots of unity? 

6. Which powers of a ninth root (7) of unity are cube roots of unity? 

11. Primitive nth Roots of Unity. An nth root of unity is called 
i primitive if n is the smallest positive integral exponent of a power of it 
that is equal to unity. Thus p js a primitive nth root of unity if and only 
if p n = l and p'j^l for all positive integers l<n. 

Since only the last one of the numbers (8) is equal to unity, the number 
R, defined by (7), is a primitive nth root of unity. We have shown that 
the powers (8) of R give all of # the nth roots of unity. Which of these 
powers of R are primitive nth roots of unity? 

For n=4, the powers (8) of R=i were seen to be 

i l —i t i 2 = — 1, i 3 = — i, i 4 = t. 

The first and third are primitive fourth roots of unity, and their exponents 1 and 3 
are relatively prime to 4, i.e., each has no divisor >1 in common with 4. But the 
second and fourth are not primitive fourth roots of unity (since the square of —1 and the 
first power of 1 are equal to unity), and their exponents 2 and 4 have the divisor 2 in 
common with n=4. These facts illustrate and prove the next theorem for the case 
n=4. 

Theorem. The primitive nth roots of unity are those of the numbers 
(8) whose exponents are relatively prime to n. 

Proof. If k and n have a common divisor d (d> 1), R k is not a primitive 
nth root of unity, since 

and the exponent n/d is a positive integer less than n. 



10 COMPLEX NUMBERS [Ch. I 

But if k and n are relatively prime, i.e., have no common divisor >1, 
ft* is a primitive nth root of unity. To prove this, we must show that 
(JR*) 1 ?* 1 if I is a positive integer <n. By De Moivre's theorem, 

R u =*coa Hsin . # 

n n 

If this were equal to unity, 2khc/n would be a multiple of 2r, and hence 
kl a multiple of n. Since A; is relatively prime to n, the second factor I 
would be a multiple of n, whereas < I < n. 

EXERCISES 

1. Show that the primitive cube roots of unity are a> and «**. 

2. For A given by (7), prove that the primitive nth roots of unity are (t) for n—6, 
#, #*; (it) for n=8, fl, #», 2**, J* 7 ; (iti) for n = 12, R, R* t R\ R". 

3. When n is a prime, prove that any nth root of unity, other than 1, is primitive. 

4. Let R be a primitive nth root (7) of unity, where n is a product of two different 
primes p and q. Show that R, . . . ,R* are primitive with the exception of R v , R** t . . ., 
R? p , whose gth powers are unity, and R?, R* 9 , . . . , fl** whose pth powers are unity. 
These two sets of exceptions have only R™ in common. Hence there are exactly 
pq— p— g+1 primitive nth roots of unity. 

5. Find the number of primitive nth roots of unity if n is a square of a prime p. 

6. Extend Ex. 4 to the case in which n is a product of three distinct primes. 

7. If R is a primitive 15th root (7) of unity, verify that R*, R* t R*, R li are the primi- 
tive fifth roots of unity, and R* and R 10 are the primitive cube roots of unity. Show 
that their eight products by pairs give all the primitive 15th roots of unity. 

8. If p is any primitive nth root of unity, prove that p, p*, . . . , p n are distinct and 
give all the nth roots of unity. Of these show that p* is a primitive nth root of unity 
if and only if k is relatively prime to n. 

9. Show that the six primitive 18th roots of unity are the negatives of the primi- 
tive ninth roots of unity. 



CHAPTER II 

Elementary Theorems on the Roots of an Equation 

12. Quadratic Equation. If a, b, c are given numbers, a 5*0, 

(1) cux?+bx+c=0 (a^O) 

is called a quadratic equation or equation of the second degree. The 

reader is familiar with the following method of solution by " completing 

the square." Multiply the terms of the equation by 4a, and transpose 

the constant term; then 

4a 2 x 2 +4ofer= — 4ac. 

Adding b 2 to complete the square, we get f, - ~-H + Ls- 

(2ox+6) 2 =A, A = fe 2 -4oc, ** * 

(2) *— aT~ a?2=_ ^ — ^~ atl -~ 



By addition and multiplication, we find that 
(3) z 1+ * 2 =^, **-$ 



-at 






C 

2L 



Hence for all values of the variable x, 

(4) a(x— xi) (x— X2)=ax*— a(xi+X2)x+axiX2^<$+bx+c, 

the sign s being used instead of = since these functions of x are identically 
equal, i.e., the coefficients of like powers of x are the same. We speak 
of a(x— xi) (X—X2) as the factored form of the quadratic function ax?+bx+c, 
and of £— x\ and £— X2 as its linear factors. 

In (4) we assign to x the values xi and X2 in turn, and see that 

0=axi 2 +bxi+c f 0=ax2 2 +bx2+c. 

Hence the values (2) are actually the roots of equation (1). 

We call A = 6 2 — 4oc the discriminant of the function aa?+bx+c or 
of the corresponding equation (1). If A=0, the roots (2) are evidently 
equal, so that, by (4), ax?+bx+c is the square of Va(x— xi), and con* 

11 



12 THEOREMS ON ROOTS OF EQUATIONS [Ch. U 

versely. We thus obtain the useful result that ax 2 +bx+c is a perfect 
square (of a linear function of x) if and only if 6 2 =4ac (i.e., if its discrimi- 
nant is zero). 

Consider a real quadratic equation, i.e., one whose coefficients a, b, c 
are all real numbers. Then if A is positive, the two roots (2) are real. 
But if A is negative, the roots are conjugate imaginaries (§ 2). 

When the coefficients of a quadratic equation (1) are any complex 
numbers, A has two complex square roots (§9), so that the roots (2) of 
(1) are complex numbers, which need not be conjugate. 

For example, the discriminant of x 2 — 2x+c is A =4(1— c). If c = l, then A=0 and 
x 1 — 2x-\-\ = {x — 1)* is a perfect square, and the roots 1, 1 of x 2 — 2x+l— are equal. 
If c =0. A =4 is positive and the roots and 2 of x 2 — 2x==x(x —2) —0 are real. If c =2, » 

A =—4 is negative and the roots 1± v — 1 of x 2 — 2x+2=0 are conjugate complex 
numbers. The roots of x 2 — x+l-fi = are i and 1 — i, and are not conjugate. 

13. Integral Rational Function, Polynomial. If n is a positive integer 
and Co, ci, . . . y Cn are constants (real or imaginary), 

f(x)=coaf l +CiX n ' 1 +\ . . +c n - 1 x+c n 

is called a polynomial in x of degree n, or also an integral rational function 
of x of degree n. It is given the abbreviated notation /(#), just as the 
logarithm of x+2 is written log (x+2). 

If Co 5*^0, f(x) = is an equation of degree n. If n = 3, it is often called 
a cubic equation; and, if n=4, a quartic equation. For brevity, we often 
speak of an equation all of whose coefficients are real as a real equation. 

14. The Remainder Theorem. If a polynomial f(x) be divided by 
x—c until a remainder independent of x is obtained, this remainder is equal 
tof(c) y which is the value of f(x) when x = c. 

Denote the remainder by r and the quotient by q(x). Since the 
dividend is /(a;) and the divisor is x— c, we have 

f(x) = (x-c)q(x)+r, 

identically in x. Taking x = c, we obtain /(c) = r. 

If r=0, the division is exact. Hence we have prdved also the follow- 
ing useful theorem. 

The Factor Theorem. If /(c) is zero, the polynomial f(x) has the 
factor x—c. In other words, if c is a root of f(x)=0, x—c is a factor of 

fix). 



§ 14] REMAINDER AND FACTOR THEOREMS 13 

For example, 2 is a root of x 3 — 3=0, so that x— 2 is a factor of x 1 — 8. Another 
illustration is furnished by formula (4)' 

EXERCISES 

Without actual division find the remainder when 

1. x 4 — 3x 2 — x— 6 is divided by x+3. 

2. x 3 -3x 2 +6x-5 is divided by x-3. 

Without actual division show that 

3. lSx^+lQ^+l is divisible by x+1. 

4. 2x 4 — x 3 — 6x 2 +4x— 8 is divisible by x— 2andx+2. 

5. x 4 -3x 3 +3x 2 -3x+2 is divisible by x-1 and x-2. 

6. r 3 — 1, r 4 — 1, r 6 — 1 are divisible by r— 1. 

7. By performing the indicated multiplication, verify that 

r n -l = (r-l) (f*-i+ f *- , + . . . +r+l). 

8. In the last identity replace r by x/y, multiply by y n , and derive 

x n -y n sz(x-y) (x n ~ 1 +x n - 2 y+ . . . +xy n - 2 +y tl - 1 ). ' 

9. In the identity of Exercise 8 replace y by — y, and derive 

x n +y n ^(x+y) (x nr - l -x"- 2 y+ . . . -x2/ n - 2 +y n_i ), nodd; 
x n -y n * s (x+y) (x n - 1 -x n - 2 2 /+ . . . +x^- 2 -2/ n " 1 ), neven. 

Verify by the Factor Theorem that x+y is a factor. 

10. If o, ar t ar 2 , . . ., ar n ~ l are n numbers in geometrical progression (the ratio of any 
term to the preceding being a constant r^l), prove by Exercise 7 that their sum is 
equal to 

q(r w -l) 

r-1 ' 

11. At the end of each of n years a man deposits in a savings bank a dollars. With 
annual compound interest at 4%, show that his account at the end of n years will be 

-£{(1.04)'-!} 

dollars. Hint: The final deposit draws no interest; the prior deposit will amount to 
a(1.04) dollars; the deposit preceding that will amount to a(1.04) 2 dollars, etc. Hence 
apply Exercise 10 for r = 1.04. 

15. Synthetic Division. The labor of computing the value of a poly- 
nomial in x for an assigned value of x may be shortened by a simple device. 

To find the value of 

ar 4 +3x 3 -2a:-5 



14 THEOREMS ON ROOTS OF EQUATIONS [Ch. H 

* 

for x=2, note that a^=»-x 3 =2x 3 , so that the sum of the first two terms 
of the polynomial is &T 3 . To bs? = 5 -2*x we add the next term — 2x and 
obtain 18x or 36. Combining 36 with the final term —5, we obtain the 
desired value 31. 

This computation may be arranged systematically as follows. After 
supplying zero coefficients of missing powers of x, we write the coefficients 
in a line, ignoring the powers of x. 

13 -2 -5 | 2 

2 10 20 36 

1 5 10 18 31 

First we bring down the first coefficient 1. Then we multiply it by the 
given value 2 and enter the product 2 directly under the second coefficient 
3, add and write the sum 5 below. Similarly, we enter the product of 
5 by 2 under the third coefficient 0, add and write the sum 10 below; etc. 
The final number 31 in the third line is the value of the polynomial when 
x = 2. The remaining numbers in this third line are the coefficients, in 
their proper order, of the quotient 

sHSa^+lOz+lS, 

which would be obtained by the ordinary long division of the given poly- 
nomial by x— 2. 

We shall now prove that this process, called synthetic division, enables 
us to find the quotient and remainder when any polynomial f(x) is divided 

byx—c. Write 

/(a0=a o x n +ai3 n - 1 + . . . +a», 

and let the constant remainder be r and the quotient be 

4^=b x n - 1 +bi3f>- 2 + . . . +&„_!. 
. By comparing the coefficients of f(x) with those in 
(x-c) q(x)+r=b 3f > +(bi-cbo)af i - 1 

+ (&2-c6i)s n - 2 + . . . +(6»-i-c6 n _ 2 )x+r~c6 b . 1 , 
we obtain relations which become, after transposition of terms, 
6o = ao, bi=di+cbo, 62=02+061, . . . , 6 n -i=On-i+c6n-2, r=a n +c6 JI _i. 



S 15] SYNTHETIC DIVISION 15 

The steps in the work of computing the b's may be tabulated as follows: 

do fli #2 ... 0n-l On | c 

cbo cb\ . . . cbn-2 cbn-1 

bo b\ 62 ... 6»_i, T 

In the second space below ao we write 60 (which is equal to ao). We 
multiply 60 by c and enter the product directly under oi, add and write 
the sum fei below it. Next we multiply 61 by c and enter the product 
directly under 02, add and write the sum 62 below it; etc. 

EXERCISES 

Work each of the following exercises by synthetic division. 

1. Divide x 8 +3x»-2x-5 by x-2. 

2. Divide 2x*-x»+2x-l by x+2. 

3. Divide x»+6x 2 +10x-l by x-0.09. 

4. Find the quotient of x s — 5x 2 — 2x-f-24 by x— 4, and then divide the quotient by 
*-3. What are the roots of s'-5z*-2g+24-0? 

5. Given that x 4 — 2x s — 7x*-f8x+ 12=0 has the roots —1 and 2, find the quadratic 
equation whose roots are the remaining two roots of the given equation, and find these 
roots. 

6. If x A — 2x l — 12x* + lOx +3 = has the roots 1 and — 3, find the remaining two roots. 

7. Find the quotient of 2x 4 -x*-6x 2 +4x-8 by x*-4. 

8. Find the quotient of x*-3x»+3x 2 -3x+2 by x*-3x+2. 

9. Solve Exercises 1, 2, 3, 6, 7 of § 14 by synthetic division. 

16. Factored Form of a Polynomial. Consider a polynominal 

/(s)i=coa; n +ciz n - 1 + . . . +C (co^O), 

whose leading coefficient Co is not zero. If f(x)=0 has the root «i, which 
may be any complex number, the Factor Theorem shows that J(x) has the 
factor x— aij so that 

f(z)m(x-ai)Q(x), 0(x)sc X»- I +Cl , X ft - 2 + . . . +(^-1. 

If Q(x) =0 has the root a 2 , then 

Q(x) = (x-ot2)Qi(x), f(x) = (x-ai) (x-a 2 )Qi(x). 

If Qi(x) =0 has the root az, etc., we finally get 

(5) /(x)sco(x-ai) (x-a 2 ) . . . (x-cr,). 

We shall deduce several important conclusions from the preceding 
discussion. First, suppose that the equation f(x) =0 of degree n is known 



16 THEOREMS ON ROOTS OF EQUATIONS [Ch. H 

to have n distinct roots ai, . . . , a n . In f(x) = (x— ai)Q(x) take x=a.i\ 
then = (a2— «i) Q(a2), whence Q(«2)=0 and Q(x)=0 has the root «2. 
Similarly, Qi(rc)=0 has the root <*3, etc. Thus all of the assumptions 
(each introduced by an " if ") made in the above discussion have been 
justified and we have the conclusion (5). Hence if an equation f(x) = 
of degree n has n distinct roots at, . . . , a», f(x) can be expressed in the 
factored form (5). 

It follows readily that the equation can not have a root a different 
from ai, . . . , a». For, if it did, the left member of (5) is zero when 
rc=a and hence one of the factors of the right member must then be zero, 
say a— a>=0, whence the root a is equal to a*. We have now proved 
the following important result. 

Theorem. An equation of degree n cannot have more than n distinct roots. 

17. Multiple Roots. 1 Equalities may occur among the a's in (5). 
Suppose that exactly mi of the a's (including a\) are equal to a%; that 
ct27*ctu while exactly m2 of the a's are equal to a<L\ etc. Then (5) becomes 

(6) f(x)=c (x— ai) m (x— a2) mi . . . (x—a t ) m *, m\+m2+ . . . +m t =n, 

where ai, . . . , a* are distinct. We then call ai a root of multiplicity m\ 
of /(x)=0, «2 a root of multiplicity m2, etc. In other words, a\ is a root 
of multiplicity m\ of f(x) = if /(re) is exactly divisible by (x— ai) m , but is 
not divisible by (x— «i) mi+1 . We call a\ also an mi— fold root. In the 
particular cases mi = 1, 2, and 3, we also speak of ai as a simple root, double 
root, and triple root, respectively. For example, 4 is a simple root, 3 a 
double root, —2a triple root, and 6 a root of multiplicity 4 (or a 4-fold 
root) of the equation 

7(s-4) (x-3) 2 (x+2) 3 (x-6) 4 =0 

of degree 10 which has no further root. This example illustrates the 
next theorem, which follows from (6) exactly as the theorem in § 16 
followed from (5). 

Theorem. An equation of degree n cannot have more than n roots, 
a root of multiplicity m being counted as m roots. 

18. Identical Polynomials. // two polynomials in x, 

a 3*+aix n ~ 1 + . . . +On, 6 x n +feix n - 1 + . . . +fe„, 
1 Multiple roots are treated by calculus in § 58. 



$ 20] RELATIONS BETWEEN ROOTS AND COEFFICIENTS 17 

each of degree n, are equal in value for more than n distinct values of x, they 
are term by term identical, i.e., ao = bo, ai = 6i, . . . , a» = 6 n . 

For, taking their difference and writing co = a — &o> • • • > c fl =a n — b n , 
we have 

C X n + CiX n - 1 + . . . +Cn = 

for more than n distinct values of x. If cq^O, we would have a contra- 
diction with the theorem in § 16. Hence c =0. If cu*0, we would have 
a contradiction with the same theorem with n replaced by n— 1. Hence 
ci = 0, etc. Thus ao = bo, a\ = 6i, etc. 

EXERCISES 

1. Find a cubic equation having the roots 0, 1, 2. 

2. Find a quartic equation having the roots ±1, ±2. 

3. Find a quartic equation having the two double roots 3 and —3. 

4. Find a quartic equation having the root 2 and the triple root 1. 

5. What is the condition that ax 2 +bx+c = shall have a double root? 

6. If aoX n + . . . +a n =0 has more than n distinct roots, each coefficient is zero. 

7. Why is there a single answer to each of Exercises 1-4, if the coefficient of the 
highest power of the unknown be taken equal to unity? State and answer the cor- 
responding general question. 

19. The Fundamental Theorem of Algebra. JSvery algebraic equation 
with complex coefficients has a complex (real or imaginary) root. 

This theorem, which is proved in the Appendix, implies that every 
equation of degree n has exactly n roots if a root of multiplicity m be counted 
as m roots. In other words, every integral rational function of degree n 
is a product of n linear factors. For, in § 16, equations /(#) =0, Q(x)=0, 
Qi(x) =0, . . . each has a root, so that (5) and (6) hold. 

20. Relations between the Roots and the Coefficients. In § 12 we 

found the sum and the product of the two roots of any quadratic equation 
and then deduced the factored form of the equation. We now apply 
the reverse process to any equation 

(7) f(x)=c 3»+ax n - 1 + . . . +c„=0 (Co5*0), 

whose factored form is 

(8) f(x)=Co(x-ai) (x-a 2 ) . . . (x-a n ). 

Our next step is to find the expanded form of this product. The following 
special products may be found by actual multiplication: 



18 THEOREMS ON ROOTS OF EQUATIONS [Ch. U 

(x—a\) (x— 02) sx 2 — (01+02)3+0102, 

(x\ — ai) (x— 02) (x—aa) si 3 - (a\ +02+03)x 2 + (0102+0103+0203)3— 010203. 

These identities are the cases n=2 and n=3 of the following general 
formula: 

(9) (xi-oi) (x-o 2 ) . . . (x-a m )ma»-(pt\ + . . . +a n )a?~ 1 

+ (0102+0103+0203+ . . . +o„_ 1 o n )x ,, ~ 2 

— (010203+010204+ . . . +O n _20„_lO ll )x ,l ~ 8 

+ . . . +( — 1)*oio2 . . . a n , 

the quantities in parentheses being described in the theorem below. If 
we multiply each member of (9) by x— «„+i, it is not much trouble to verify 
that the resulting identity can be derived from (9) by changing n into 
n+1, so that (9) is proved true by mathematical induction. Hence the 
quotient of (7) by Cq is term by term identical with (9), so that 

01+02+ . . . +On=— C1/C0, 

0102+0103+0203+ . . . +O w _lO n = C2/Co, 

(10) 

010203+010204+ . . . +O n -20 n -lO n =— C3/C0, 

• •••••• 

«102 . . . O w -iO ll =(— l) n Cn/Co. 

These results may be expressed in the following words: 

Theorem. // 01, . . . , o„ are the roots of equation (7), the sum of the 
roots is equal to — ci/cq, the sum of the products of the roots taken two at 
a time is equal to C2/C0, the sum of the products of the roots taken three ai a 
time is equal to —cz/cq, etc.; finally, the product of all the roots is equal to 

(-D n Cn/C0. 

Since we may divide the terms of our equation (7) by cq, the essential 
part of our theorem is contained in the following simpler statement: 

Corollary. In an equation in x of degree n, in which the coefficient 
of x? is unity, the sum of the n roots is equal to the negative of the coefficient 
of a?' 1 , the sum of the products of the roots two at a time is equal to the coeffi- 
cient of x n ~ 2 , etc.; finally the product of all the roots is equal to the constant 
term or its negative, according as n is even or odd. 

For example, in a cubic equation having the roots 2, 2, 5, and having unity as the 
coefficient of x*, the coefficient of sis2-2+2. 5+2-5 =24. 



§ 21] IMAGINARY ROOTS OCCUR IN PAIRS 19 

EXERCISES 

1. Find a cubic equation having the roots 1, 2, 3. 

2. Find a quartic equation having the double roots 2 and —2. 

3. Solve x 4 -6x 3 +13.r 2 -12x+4=0, which has two double roots. 

4. Prove that one root of z 3 +px 2 +qz+r=Q is the negative of another root if and 
only if r = pq. 

5. Solve Ax 3 — 16x 2 — 9x+36=0, given that one root is the negative of another. 

6. Solve x 3 — 9x 2 -f23x — 15=0, given that one" root is the triple of another. 

7. Solve s 4 -6x 8 +12x 2 -10x+3 = 0, which has a triple root. 

8. Solve x' — 14x 2 —84x +216=0, whose roots are in geometrical progression, i.e., 
with a common ratio r [say m/r, m, mr\. 

9. Solve x 3 — 3x l — 13s + 15=0, whose roots are in arithmetical progression, i.e., 
with a common difference d [say m— d, m, w?+<fl. 

10. Solve x A — 2x 3 — 21o; 2 -f22x+40=0, whose roots are in arithmetical progression. 
[Denote them by c— 36, c— 6, c-f 6, c+36, with the common difference 2b ) 

11. Find a quadratic equation whose roots are the squares of the roots of 

x 2 — px+</=0. 

12. Find a quadratic equation whose roots are the cubes of the roots of x % — px+q —0. 
Hint: a*+0 3 = (a+0) 3 -3a0(a+0). 

13. If a and p are the roots of x*— px+q=0, find an equation whose roots are (i) a 2 /0; 
and/3 2 /a; (ii) a 8 /3anda/3 3 ; (iii) a+l/fi and 0+1/a. 

14. Find a necessary and sufficient condition that the roots, taken in some order, 
of x 3 +px 2 -f qx+r=0 shall be in geometrical progression. 

15. Solve x l -28x +48 =0, given that two roots differ by 2. 

21. Imaginary Roots occur in Pairs. The two roots of a real quadratic 
equation whose discriminant is negative are conjugate imaginaries (§ 12). 
This fact illustrates the following useful result. 

Theorem. // an algebraic equation with real coefficients has the root a+bi, 
where a and b are real and b^0, it has also the root a—bi. 

Let the equation be f(x) = and divide f(x) by 

(11) {z-ip+Vm(x-a-bQ (x-a+bi) 

until we reach a remainder rx+s whose degree in x is less than the degree 
of the divisor. Since the coefficients of the dividend and divisor are all 
real, those of the quotient Q(x) and remainder are real. We have 

/(*) «Q(*) { {x-a) 2 +b 2 } +rx+s f 

identically in x. This identity is true in particular when x=a+bi t so 

that 

0=r(a+bi)+8=ra+8+rbi. 



20 THEOREMS ON ROOTS OF EQUATIONS [Ch. n 

Since all of the letters, other than i, denote real numbers, we have (§ 2) 
ra+s=0, rb = 0. But b^O. Hence r=0, and then s=0. Hence f(x) 
is exactly divisible by the function (11), so that f(x) =0 has the root a— hi. 

The theorem may be applied to the real quotient Q(x). We obtain 
the 

Corollary. If a real algebraic equation has an imaginary rod of 
multiplicity m, the conjugate imaginary of this root is a root of multiplicity m. 

Counting a root of multiplicity m as m roots, we see that a real equation 
cannot have an odd number of imaginary roots. Hence by .§19, a real 
equation of odd degree has at least one real root. 

Of the n linear factors of a real integral rational function of degree n 
(§ 19), those having imaginary coefficients may be paired as in (11). 
Hence every integral rational function with real coefficients can be expressed 
as a product of real linear and real quadratic factors. 

EXERCISES 

1. Solve x 3 -3x 2 -6x-20=0, one root being -1+V^i. 

2. Solve x 4 — 4x 3 +5x 2 — 2x— 2 = 0, one root being 1— t. 

3. Find a cubic equation with real coefficients two of whose roots are 1 and 3+2i. 

4. If a real cubic equation x 3 — 6x 2 -f ... =0 has the root l+V-5, what are the 
cemaining roots? Find the complete equation. 

5. If an equation with rational coefficients has a root a+ V b, where a and b are 

rational, but V6 is irrational, prove that it has the root o — V b. [Use the method of 
§21.] 

6. Solve x 4 — 4x 3 +4x — 1 =0, one root being 2-fV3. 

7. Solve x 3 -(4-f V3)x 2 +(5+4\/3)x-5V3 = 0, having the root Vjj. 

8. Solve the equation in Ex. 7, given that it has the root 2 -ft. 

9. Find a cubic equation with rational coefficients having the roots J, }+ V2, 

10. Given that x 4 -2x 3 -5x 2 -6x+2=0 has the root 2-V3, find another root and 
by means of the sum and the product of the four roots deduce, without division, the 
quadratic equation satisfied by the remaining two roots. 

11. Granted that a certain cubic equation has the root 2 and no real root different 
from 2, does it have two imaginary roots? 

12. Granted that a certain quartic equation has the roots 2±3i, and no imaginary 
roots di.Terent from them, does it have two real roots? 

13. By means of the proof of Ex. 5, may we conclude as at the end of § 21 that 
every integral rational function with rational coefficients can be expressed as a product 
of linear and quadratic factors with rational coefficients? 



§ 22] UPPER LIMIT TO REAL ROOTS 21 

22. Upper Limit to the Real Roots. Any number which exceeds 
all real roots of a real equation is called an upper limit to the real roots. 
We shall prove two theorems which enable us to find readily upper limits 
to the real roots. For some equations Theorem I gives a better (smaller) 
upper limit than Theorem II; for other equations, the reverse is true. 
Evidently any positive number is an upper limit to the real roots of an 
equation having no negative coefficients. 

Theorem I. //, in a real equation 

f{x)=a^ n +aix n ' l + . . . +a n = (a >0), 

the first negative coefficient is preceded by k coefficients which are positive or 
zero, and if G denotes the greatest of the numerical values of the negative 

coefficients, then each real root is less than l+*y G/aQ. 

For example, in x?+4x*— 7x 2 — 40x-fl=0, (?=40 and k = 3 since we must supply 
the coefficient zero to the missing power x*. Thus the theorem asserts that each root 

is less than 1+ V 40 and therefore less than 4.42. Hence 4.42 is an upper limit to the 
roots. 

Proof. For positive values of x, f(x) will be reduced in value or remain 
unchanged if we omit the terms aix"" 1 , . . . , at-ix""* 4 " 1 (which are 
positive or zero), and if we change each later coefficient at, ... , o» to 
— G. Hence 

f(x)>a&P-G(p-*+a?-*- 1 + . . . +x+l). 

But, by Ex. 7 of § 14, 

af-»+ . . . +x+l=- — -, 

x — i 

if x j± 1 . Furthermore, 

n . „( x»-* +l -l \_ x- t+l {a x>- 1 (x-l)-G\+G 

"<* ~ G { x-1 ) i=I * 

Hence, if x >1, 

JW> x-1 

Thus, for x>l, f(x)>0 and x is not a root if ao(x— 1)*— GZ.0, which 
is true if xhl+^G/ao- 



22 THEOREMS ON ROOTS OF EQUATIONS [Ch. H 

23. Another Upper Limit to the Roots. 

Theorem II. //, in a real algebraic equation in which the coefficient 
of the highest power of the unknown is positive, the numerical value of each 
negative coefficient be divided by the sum of all the positive coefficients which 
precede it, the greatest quotient so obtained increased by unity is an upper 
limit to the roots. 

For the example in § 22, the quotients are 7/(1+4) and 40/5, so that Theorem II 
asserts that 1 +8 or 9 is an upper limit to the roots. Theorem I gave the better upper 
limit 4.42. But for x*+8x*— 9x-fc 2 =0, Theorem I gives the upper limit 4, while 
Theorem II gives the better upper limit 2. 

We first give the proof for the case of the equation 

f(x)spa*— pi^+paX 2 — JhX+p<>=0 

in which each p t is positive. In view of the identities 

x*=(x-l) (x*+x l +;r+l)+l, x*==(x-l) (s+lj+l, 

/ (x) is equal to the sum of the terms 

Pa(x — l)x*+p A (x — l)x l +p4(3-l)x+p 4 (x-l)+P4, 
-pi* 1 +p*(s-l)x+ps(3-l)+p* ) 

— Jhx +po. 

If x>l f negative terms occur only in the first and third columns, while the sum of the 
terms in each of these two columns will be ^ if 

p 4 (x-l)-p,^0, (p4+p*)(s-l)-pi^0. 
Hence /(x)>0 and x is not a root if 

* = H — , * = H ; — • i 

P* P4+PJ 

This proves the theorem for the present equation. 

Next, let f(x) be modified by changing its constant term to — po. We modify the 
above proof by employing the sum (p4+p*)x — p© of all the terms in the corresponding 
last two columns. This sum will be >0 if x>po/(pi+pi), which is true if 

P*+JH 

To extend this method of proof to the general case 

f(x)=a^+ . . . +ao (a*>0), 

we have only to employ suitable general notations. Let the negative 
coefficients be a*„ . . . , a^, where ki>k2> • • . >fe. For each posi- 



S 23] UPPER LIMIT TO REAL ROOTS 23 

tive integer m which is £ n and distinct from k\, . . . , k t , we replace a* 
by the equal value 

where d=x—l. Let F(x) denote the polynomial in x, with coefficients 
involving d, which is obtained from fix) by these replacements. Let 
x>l, so that d is positive. Thus the terms a^x** are the only negative 
quantities occurring in F(x). If fc<>0, the terms of F(x) which involve 
explicitly the power x*< are atfft and the OmdxPi for the various positive 
coefficients a*, which precede a^. The sum of these terms will be ^0 
if a^+dZaw^O, i.e., if 



s£14 



Sa,,,' 



There is an additional case if fci=0, i.e., if Oq is negative. Then the 
terms of F(x) not involving x explicitly are do and the a* (d+1) for the 
various positive coefficients a*,. Their sum, ao+xZom, will be >0 if 

which is true if 

-<*o 



sM-r 



ZOm* 



EXERCISES 

Apply the methods of both § 22 and § 23 to find an upper limit to the roots of 

1. 4^-8x 4 +22x»+98s l -73a;+5=0. 

2. x*-5x*+7x*-&x+l=0. 

3. s 7 +33 8 -4x 8 +5s 4 -6x l -7:r , --8=0. 

4. a*+2x«+4x<-8s l -32=0. 

5. A lower limit to the negative roots of fix) =0 may be found by applying our 
theorems to/( — x) =0, i.e., to the equation derived from /(a;) =0 by replacing 3 by — x. 
Find a lower limit to the negative roots in Exs. 2, 3, 4. 

6. Prove that every real root of a real equation fix) =0 is less than 1+g/cto if oo>0 f 
where g denotes the greatest of the numerical values of a if . . . , a». Hint: if g>0, 

cwf+aix*- l + . . . had*-g&- 1 + . . . +*+l). 

Proceed as in $ 22 with fc = l. 

7. Prove that l+0-*-|ao| is an upper limit for the moduli of all complex roots of any 
equation fix) =0 with complex coefficients, where g is the greatest of the values |oi|, 
. . . , |a»|, and \a\ denotes the modulus of a. Hint: use Ex. 5 of § 8. 



24 THEOREMS ON ROOTS OF EQUATIONS [Ch. II 

24. Integral Roots. For an equation all of whose coefficients are integers, 
any integral root is an exact divisor of the constant term. 
For, if a: is an integer such that 

(12) a&?+ . . . +a n _i:c+a n =0, 

where the a's are all integers, then, by transposing terms, we obtain 

x(-a a: n " 1 - . . . -o„_i)=a n . 

Thus x is an exact divisor of a n since the quotient is the integer given by 
the quantity in parenthesis. 

Example 1. Find all the integral roots of 

x*+x*-3x+9=0. 

Solution. The exact divisors of the constant term 9 are ±1, ±3, ±9. By trial, 
no one of dbl, 3 is a root. Next, we find that —3 is a root by synthetic division (§ 15) : 

11-39 1-3 
-3 6-9 



1-230 

Hence the quotient is x 2 — 2s+ 3, which is zero for x=l± V — 2. Thus —3 is the 
only integral root. 

When the constant term has numerous exact divisors, some device 
may simplify the application of the theorem. 

Example 2. 1 Find all the integral roots of 

2/ 3 -M22/ 2 — 32y— 256=0. 

Solution. Since ail the terms except y 3 are divisible by 2, an integral root y must 
be divisible by 2. Since all the terms except y 3 are now divisible by 2 4 , we have y—4z t 
where z is an integer. Removing the factor 2 6 from the equation in z, we obtain 

3»+3* 2 -2z-4=0. 

An integral root must divide the constant term 4. Hence, if there are any integral 
roots, they occur among the numbers ±1, ±2, ±4. By trial, —1 is found to be a 
root: 

1 3-2-4 1-1 
-1 -2 4 l— 



12-40 
This problem is needed for the solution (§ 48) of a certain quartic equation. 



S 24] INTEGRAL ROOTS 25 

Hence the quotient is 2 2 +2«— 4, which is zero for z=— l=b V5. Thus y=4«=— 4 
is the only integral root of the proposed equation. 

EXERCISES 

Find all the integral roots of 

1. s 8 +8**+ 13* +6=0. 2. s»-5z 2 -2:r+24=0. 

3. x 1 - 10s 2 +27z- 18=0. 4. *<+4a;»+8s+32=0. 

5. The equation in Ex. 4 of § 23. 

25. Newton's Method for Integral Roots. In § 24 we proved that 
an integral root x of equation (12) having integral coefficients must be 
an exact divisor of a». Similarly, if we transpose all but the last two 
terms of (12), we see that a n -ix+a n must be divisible by x 2 , and hence 
a n -i+a n /x divisible by x. By transposing all but the last three terms 
of (12), we see that their sum must be divisible by x 3 , and hence a»_2+ 
(a n -i+a n /x)/x divisible by x. We thus obtain a series of conditions 
of , divisibility which an integral root must satisfy. The final sum 
ao+a\/x+ . . . must not merely be divisible by x f but be actually zero, 
since it is the quotient of the function (12) by x n . 

In practice, we must test in turn the various divisors x of a». If a 
chosen x is not a root, that fact will be disclosed by one of the conditions 
mentioned. Newton's method is quicker than synthetic division since 
it usually detects early and throws out wrong guesses as to a root, whereas 
in synthetic division the decision comes only at the final step. 

For example, the divisor —3 of the constant term of 

(13) f(x) ss 4 -9s 8 +24x* - 23s + 15 = 

is not a root since — 23+15/(— 3) = —28 is not divisible by —3. To show that none 
of the tests fails for 3, so that 3 is a root, we may arrange the work systematically as 
follows: 

1 -9 24 -23 15 

(14) -16-6 5 



3 
(divisor) 



-3 18 -18 

First we divide the final coefficient 15 by 3, place the quotient 5 directly under the coef- 
ficient —23, and add. Next, we divide this sum —18 by 3, place the quotient —6 
directly under the coefficient 24, and add. After two more such steps we obtain the 
sum zero, so that 3 is a root. 

It is instructive to obtain the preceding process by suitably modifying synthetic 
division. First, we replace x by \/y in (13), multiply each term by y 4 , and obtain 

152/ 4 -232/ 8 +242/*-9t/+l =0. 



26 THEOREMS ON ROOTS OF EQUATIONS [Ch. II 

We may test this for the root y s *\, which corresponds to the root x=3 of (13), by 
ordinary synthetic division: 



i 

(multiplier) 



15 -23 24 -9 1 
5-6 6-1 

15 -18 18 -3 

The coefficients in the last two lines (after omitting 15) are the same as those of the last 
two lines in (14) read in reverse order. This should be the case since we have here 
multiplied the same numbers by \ that we divided by 3 in (14). The numbers in 
the present third line are the coefficients of the quotient (§15). Since we equate the 
quotient to zero for the applications, we may replace these coefficients by the numbers 
in the second line which are the products of the former numbers by -J-. The numbers 
in the second line of (14) are the negatives of the coefficients of the quotient of J(x) 
by x-3. 

Example. Find all the integral roots of equation (13). . , 

Solution. For a negative value of x, each term is positive. Hence all the. real 
roots are positive. By § 23, 10 is an upper limit to the roots. By § 24, any integral 
root is an exact divisor of the constant term 15. Hence the integral roots, if any, occur 
among the numbers 1, 3, 5. Since /(l) =8, 1 is not a root. By (14), 3 is a root. Pro- 
ceeding similarly with the quotient by x— 3, whose coefficients are the negatives of the 
numbers in the second line of (14), we find that 5 is a root. 

EXERCISES 

1. Solve Exs. 1-4 of § 24 by Newton's method. 

2. Prove that, in extending the process. (14) to the general equation (12), we may 
employ the final equations in § 15 with r— and write 

do at a» . . . %— 2 «n-i a» c 
— bo — 6i — fa . . . — 6„_2 ~"&n— i (divisor) 

— c&o —cb\ . . . — cb n —s —cbn—2 

Here the quotient, — &*-i, of a n by c is placed directly under a%-\, and added to it to 
yield the sum —cb n -2i e *°- 

26. Another Method for Integral Roots. An integral divisor d of the 
constant term is not a root if d— m is not a divisor of /(w), where m is 
any chosen integer. For, if d is a root of f(x) =0, then 

f(x) = (x-d)Q(x), 

where Q(x) is a polynomial having integral coefficients (§ 15). Hence 
f(m) = (m— d) q, where q is the integer Q(m). 



27] RATIONAL ROOTS 27 

In the example of § 25, take d - 15, m — 1. Since /(l) —8 is not divisible by 15 —1 — 14, 
15 is not an integral root. 

Consider the more difficult example 

S(x) «x 8 -2Oc*+164x-400 =0, 

whose constant term has many divisors. There is evidently no negative root, while 
21 is an upper limit to the roots. The positive divisors less than 21 of 400=2*5* are 
d - 1, 2, 4, 8, 16, 5, 10, 20. First, take m - 1 and note that/(l) - -255 - -3 • 5 • 17. The 
corresponding values of d— 1 are 0, 1, 3, 7, 15, 4, 0, 19; of these, 7, 4, 9, 19 are not divisors 
of /(l), so that d=8, 5, 10 and 20 are not roots, Next, take m»2 and note that /(2) = 
—144 is not divisible by 16—2 = 14. Hence 16 is not a root. Incidentally, d = l and 
d—2 were excluded since /(d) ?*0. There remains only d=4, which is a root. 

In case there are numerous divisors within the limits to the roots, it 
is usually a waste of time to list all these divisors. For, if a divisor is 
found to be a root, it is preferable to employ henceforth the quotient. 
as was done in the example in § 25. 

EXERCISES 

Find all the integral roots of 

1. x 4 -2x 8 -21a; l +22x+40=0. 

2. y»-9y*-242/+216=0. 

3. a; 4 -23x l +187x*-653a;-|-936=0. 

4. s*+473 4 +423x l +140:r l +1213x-420«"0. 

5. x»-34x»+29x l +212x-300=0. 

27. Rational Roots. If an equation with integral coefficients 

(15) c a x*+cix n - 1 + . . . +c n -is+c n =0 

has the rational root a/b, where a and b are integers without a common divisor 
> 1, then a is an exact divisor of Cn, and b is an exact divisor of Co. 

Insert the value a/b of x and multiply all terms of the equation by 
b*. We obtain 

<WL H +Cia*- l b+ . . . +c n -iab H - 1 +c n b*=0. 

Since a divides all the terms preceding the last term, it divides that term. 
But a has no divisor in common with b n ; hence a divides c„. Similarly, 
6 divides all the terms after the first term and hem e divides cq. 

Example. Find all the rational roots of 

2x*-7s*+10x-6»0. 



28 THEOREMS ON ROOTS OF EQUATIONS [Ch. II 

Solution. By the theorem, the denominator of any rational root x is a divisor of 2. 
Hence y — 2x is an integer. Multiplying tne terms of our equation by 4, we obtain 

y*-7y*+20y 24=0. 

There is evidently no negative root. By either oi the tests in §§ 22, 23, an upper limit 
to the positive roots of our equation in x is 1+7/2, so that y<9. Hence the only 
possible values of an integral root y are 1, 2, 3, 4, 6, 8. Since 1 and 2 are not roots, we 
try 3: 

1-7 20 -24 I 3 
-1 4 -8 



-3 12 



Hence 3 is a root and the remaining roots satisfy the equation y*— 4y+8=0 and are 
2±2i. Thus the only rational root of the proposed equation is a? =3/2. 

If Co = 1, then 6= dbl and a/b is an integer. Hence we have the 
Corollary. Any rational root of an equation with integral coefficients, 
that of the highest power of the unknown being unity, is an integer. 

Given any equation with integral coefficients 

a y n +aiy H - 1 -t ♦ . +a,»=0. 

we multiply each term by Oo"" 1 , write a^y—x, and obtain an equation 
(15) with integral coefficients, in which the coefficient cq of of 1 is now unity. 
By the Corollary, each rational root £ is an integer. Hence we need only 
find all the integral roots x and divide them by Oq to obtain all the rational 
roots y of the proposed equation. 

Frequently it is sufficient (and of course simpler) to set ky=x, where 
it is a suitably chosen integer less than a . 

EXERCISES 

Find all of the rational roots of 

l. y 4 - 1 ¥^+-4V-40y+9-o. 

2. 6y-lly l +fy-l=0. 

3. 108y l -270y*-42y+l=0. [Usefc=6J 

4. 32y 8 -6y-l -0/ [Use the least k.] 

5. 96y 3 -lfy l -6y+l=0. 6. 24y 8 -2y l -5y+l=0. 
7. y'-^ 2 -2y+l=0. 8. ^-fy 2 +3j/-2=0. 

9. Solve Exs. 2-6 by replacing y by 1/x. 

Find the equations whose roots are the products of 6 by the roots of 
10. y l -2y-£=0. 11. y»-i2/ 2 -|y+i=0. 



CHAPTER III 



Constructions with Ruler and Compasses 

28. Impossible Constructions. We shall prove that it is not possible, 
by the methods of Euclidean geometry, to trisect all angles, or to con- 
struct a regular polygon of 7 or 9 sides. The proof, which is beyond the 
scope of elementary geometry, is based on principles of the theory of 
equations. Moreover, the discussion will show that a regular polygon 
of 17 sides can be constructed with ruler and compasses, a fact not suspected 
during the twenty centuries from Euclid to Gauss. 

29. Graphical Solution of a Quadratic Equation. If a and b are con- 
structive, and 

(1) x*-ax+b=0 

has real coefficients and real roots, the roots 
can be constructed with ruler and compasses 
as follows. Draw a circle having as a diam- 
eter the line BQ joining the points B = 
(0, 1) and Q = (a, b) in Fig. 6. Then the 
abscissas ON and OM of the points of inter- 
section of this circle with the x-axis are the 
roots of (1). 

For, the center of the circle is (a/2, (6+l)/2); the square of BQ is 
a 2 +(6— l) 2 ; hence the equation of the circle is 




Fig. 6 



HM-¥)'- 



a 2 +(6-l) 2 



2 / 4 

This is found to reduce to (1) when y=0, which proves the theorem. 

When the circle is tangent to the x-axis, so that M and N coincide, 
the two roots are equal. When the circle does not cut the x-axis, or 
when Q coincides with B, the roots are imaginary. 

Another construction follows from § 30. 

29 



30 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. in 

EXERCISES 

Solve graphically: 

1. s*-5s+4=0. 2. x*+5s+4=0. 3. x l +5z-4=0. 

4. s*-5x-4=0. 5. s*-4j;+4=0. 6. x l -3s+4=0. 

30. Analytic Criterion for Constructibility. The first step in our 
consideration of a problem proposed for construction consists in formu- 
lating the problem analytically. In some instances elementary algebra 
suffices for this formulation. For example, in the ancient problem of 
the duplication of a cube, we take as a unit of length a side of the given 
cube, and seek the length a; of a ride of another cube whose volume is 
double that of the given cube; hence 

(2) x* = 2. 

But usually it is convenient to employ analytic geometry as in § 29; 
a point is determined by its coordinates x and y with reference to fixed 
rectangular axes; a straight line is determined by an equation of the 
first degree, a circle by one of the second degree, in the coordinates of the 
general point on it. Hence we are concerned with certain numbers, 
some being the coordinates of points, others being the coefficients of equa- 
tions, and still others expressing lengths, areas or volumes. These num- 
bers may be said to define analytically the various geometric elements 
involved. 

Criterion. A proposed construction is possible by ruler and com- 
passes if and only if the numbers which define analytically the desired geo- 
metric elements can be derived from those defining the given elements by a 
finite number of rational operations and extractions of real square roots. 

In § 29 we were given the numbers a and b, and constructed lines of lengths 

i(a±V a *-4b). 

Proof. First, we grant the condition stated in the criterion and prove 
that the construction is possible with ruler and compasses. For, a rational 
function of given quantities is obtained from them by additions, sub- 
tractions, multiplications, and divisions. The construction of the sum 
or difference of two segments is obvious. The construction, by means 
of parallel lines, of a segment whose length p is equal to the product a-b 
of the lengths of two given segments is shown in Fig. 7; that for the quo- 



830} 



ANALYTIC CRITERION FOR CONSTRUCTIBILITY 



31 



tient q=a/b in Fig. 8. Finally, a segment of length s = Vn may be con- 
structed, as in Fig. 9, by drawing a semicircle on a diameter composed 
of two segments of lengths 1 and n, and then drawing a perpendicular 
to the diameter at the point which separates the two segments. Or we 
may construct a root of x 2 — n=0 by § 29. 






Second, suppose that the proposed construction is possible with ruler 
and compasses. The straight lines and circles drawn in making the con- 
struction are located by means of points either initially given or obtained 
as the intersections of two straight lines, a straight line and a circle, or 
two circles. Since the axes of coordinates are at our choice, we may 
assume that the y-axis is not parallel to any of the straight lines employed 
in the construction. Then the equation of any one of our lines is 



(3) 

Let y 
sects (3). 



y=mx+b. 

m'x+b' be the equation of another of our lines which inter- 
The coordinates of their point of intersection are 



x= 



b'-b 



m—m 



n 



y= 



mb'—m'b 
m—m' J 



which are rational functions of the coefficients of the equations of the 
two lines. 

Suppose that a line (3) intersects the circle 

with the center (c, d) and radius r. To find the coordinates of the points 
of intersection, we eliminate y between the equations and obtain a quad- 
ratic equation for x. Thus x (and hence also mx+b or y) involves no 



32 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. Ill 

irrationality other than a real square root, besides real irrationalities 
present in ra, b, c, d, r. 

Finally, the intersections of two circles are given by the intersections 
of one of them with their common chord, so that this case reduces to the 
preceding. 

For example, a side of a regular pentagon inscribed in a circle of radius unity is 
(Ex. 2 of § 37) 

(4) «=^VlO-2V5, 

which is a number of the type mentioned in the criterion. Hence a regular pentagon 
can be constructed by ruler and compasses (see the example above quoted). 

31. Cubic Equations with a Constructive Root. We saw that the 
problem of the duplication of a cube led to a cubic equation (2). We 
shall later show that each of the problems, to trisect an angle, and to con- 
struct regular polygons of 7 and 9 sides with ruler and compasses, leads 
to a cubic equation. We shall be in a position to treat all of these problems 
as soon as we have proved the following general result. 

Theorem. // is not possible to construct with ruler and compasses a 
line whose length is a root or the negative of a root of a cubic equation with 
rational coefficients having no rational root. 

Suppose that xi is a root of 

(5) 7? +ax 2 + fix + 7 = (a, 0, y rational) 

such that a line of length x\ or — x\ can be constructed with ruler and com- 
passes; we shall prove that one of the roots of (5) is rational. We have 
only to discuss the case in which x\ is irrational. 

By the criterion in § 30, since the given numbers in this problem are 
a, /3, 7, all rational, x\ can be obtained by a finite number of rational 
operations and extractions of real square roots, performed upon rational 
numbers or numbers derived from them by such operations. Thus xi 
involves one or more real square roots, but no further irrationalities. 

As in the case of (4), there may be superimposed radicals. Such a 
two-story radical which is not expressible as a rational function, with 
rational coefficients, of a finite number of square roots of positive rational 
numbers is said to be a radical of order 2. In general, an n-story radical 
is said to be of order n if it is not expressible as a rational function, with 
rational coefficients, of radicals each with fewer than n superimposed 
radicals, the innermost ones affecting positive rational numbers. 



§ 31] CUBIC EQUATIONS WITH A CONSTRUCT I BLE ROOT 33 

We agree to simplify x\ by making all possible replacements of certain 
types that are sufficiently illustrated by the following numerical examples. 

If x\ involves V3, V5, and VT&, we agree to replace VIE by V3- V5. 
If £1=3— 7t 9 where s is given by (4) and 



<-f\ZlO+2V5 f 

so that 8t= V5, we agree to write x\ in the form s— 7 V5/s, which involves 
a single radical of ord er 2 and no new radical of lower order. Finally, 

we agree to replace V 4— 2V3 by its simpler form V3— 1. 

After all possible simplifications of these types have been made, the 
resulting expressions have the following properties (to be cited as our 
agreements): no one of the radicals of highest order n in x\ is equal to 
a rational function, with rational coefficients, of the remaining radicals 
of order n and the radicals of lower orders, while no one of the radicals 
of order n— 1 is equal to a rational function of the remaining radicals of 
order n— 1 and the radicals of lower orders, etc. 

Let Vfc be a radical of highest order n in x\. Then 

c+dVk' 

where a, 6, c, d do not involve Vfc, but may involve other radicals. If 
ri = 0, then C5**0 and we write e for a/c y f for 6/c, and get 

(6) xi = e+fVk, (f*0) 

where neither e nor / involves V7&. If d 5^0, we derive (6) by multiplying 
the numerator and denominator of the fraction for x\ by c— dVfc, which 
is not zero since Vfc = c/d would contradict our above agreements. 

By hypothesis, (6) is a root of equation (5). After expanding the 
powers and replacing the square of Vfc by fc, we see that 

(7) . (e+/v / fc) 3 +a(e+/Vfc) 2 +^(6+/Vfc)+ 7 =A+5Vfc, 

where A and B are certain polynomials in e, /, k and the rational numbers 
a, 0, 7. Thus A +BVk = 0. If B 5* 0, Vjfc = - A /B is a rational function, 
with rational coefficients, of the radicals, other than vfc, in xi, contrary 
to our agreements. Hence B = and therefore A = 0. 

When e— fVk is substituted for x in the cubic function (5), the result 



1 

34 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. m | 

is the left member of (7) with Vk replaced by — Vfc, and hence the result j 
is A - BVk. But A = B = 0. This shows that 

(8) x 2 =e-fVk 

is a new root of our cubic equation. Since the sum of the three roots 
is equal to —a by § 20, the third root is 

(9) £3=— a— x\— X2=— a— 2e. 

Now a is rational. If also e is rational, xs is a rational root and we have 
reached our goal. We next make the assumption that e is irrational 
and show that it leads to a contradiction. Since e is a component part 
of the constructible root (6), its only irrationalities are square roots. 
Let Vs be one of the radicals of highest order in e. By the argument 
which led to (6), we may write e=e'+/'Vs, whence, by (9), 

(90 xs-g+hV^s, (h^O) 

where neither g nor h involves Vs. Then by the argument which led 
to (8), g—hVs is a root, different from xs, of our cubic equation, and hence 
is equal to x\ or X2 since there are only three roots (§16). Thus 

g-hV8 = e±fVk. 

By definition, Vs is one of the radicals occurring in e. Also, by (9') f 
every radical occurring in g or h occurs in xz and hence in e=£(— a— xz) t 
by (9), a being rational. Hence Vk is expressible rationally in terms 
of the remaining radicals occurring in e and /, and hence in x\ f whose value 
is given by (6). But this contradicts one of our agreements. 

32. Trisection of an Angle. For a given angle A, we can construct 
with ruler and compasses a line of length cos A or —cos A, namely the J 
adjacent leg of a right triangle, with hypotenuse unity, formed by dropping F 
a perpendicular from a point in one side of A to the other, produced if I J 
necessary. If it were possible to trisect angle A, i.e., construct the angle 
A/3 with ruler and compasses, we could as before construct a line whose 
length is ±cos CA/3). Hence if we show that this last cannot be done 
when the only given geometric elements are the angle A and a line of 
unit length, we shall have proved that the angle A cannot be trisected. 
We shall give the proof for A = 120°. 

We employ the trigonometric identity 

COS A =4 COS 3 75 — 3costt« 

3 3 



321 TRISECTION OF AN ANGLE 36 

lultiply each term by 2 and write x for 2 cos (A/3). Thus 

10) x 3 -3x=2cosA. 
For A = 120°, cos A = -± and (10) becomes 

11) x 3 -3x+l=0. 

Any rational root is an integer (§27) which is an exact divisor of the 
:onstant term (§24). By trial, neither +1 nor —1 is a root. Hence 

11) has no rational root. Hence (§ 31) it is not possible to trisect all 
ingles with ruler and compasses. 

Certain angles, like 90°, 180°, can be trisected. When A = 180°, the equation 
10) becomes x*— 3s= —2 and has the rational root x==l. It is the rationality of a 
oot which accounts for the possibility of trisecting this special angle 180°. 

33. Regular Polygon of 9 Sides, Duplication of a Cube. Since angle 
120° cannot be trisected with ruler and compasses (§ 32), angle 40°can- 
aot be so constructed in terms of angle 120° and the line of unit length 
is the given geometric elements. Since the former of these elements 
and its cosine are constructive when the latter is given, we may take 
the line of unit length as the only given element. In a regular polygon 
of 9 sides, the angle subtended at the center by one side is £• 360° =40°. 
Hence a regular polygon of 9 sides cannot be constructed with ruler and com- 
passes. Here, as in similar subsequent statements where the given 
elements are not specified, the only such element is the line of unit length. 

A rational root of x 3 = 2 is an integer (§ 27) which is an exact divisor 
of 2. The cubes of ±1 and ±2 are distinct from 2. Hence there is no 
rational root. Hence (§§ 30, 31) it is not possible to duplicate a cube with 
ruler and compasses. 

34. Regular Polygon of 7 Sides. If we could construct with ruler 
and compasses an angle B containing 360/7 degrees, we could so con- 
struct a line of length x=2cos B. Since 7 2? =360°, cos 3 B=cos4 B. 

Jut 

2 cos 3B=2(4 cos 3 B-3 cos B) =a?-3x, 

2cos4B=2(2cos 2 2B-l)=4(2cos 2 B-l) 2 -2=(aj2-2) 2 -2. 

Tpnc6 

0=2*-4a?+2-(a?-3x) = (x-2)(a?+a?-2x--l). 

&ut x=2 would give cos B= 1, whereas B is acute. Hence 

12) a^+x 2 -2x-l=0. 



36 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. HI 

Since this has no rational root, it is impossible to construct a regular 
polygon of 7 sides with ruler and compasses. 

35. Regular Polygon of 7 Sides and Roots of Unity. If 

R = cos -=-+t sin -=-, 

we saw in § 10 that R, fl 2 , /F, ft 4 , R 5 , R 6 , R 7 = l give all the roots of y 7 =l 
and are complex numbers represented by the vertices of a regular polygon 
of 7 sides inscribed in a circle of radius unity and center at the origin of 
coordinates. By § 6, 

p = cos ^ — i sin y, fl+ ^ = 2 cos -=-. 

We saw in § 34 that 2 cos (2tt/7) is one of the roots of the cubic 
equation (12). This equation can be derived in a new manner by utilizing 
the preceding remarks on 7th roots of unity. Our purpose is not primarily 
to derive (12) again, but to illustrate some principles necessary in the 
general theory of the construction of regular polygons. 

Removing from y 7 — 1 the factor y— 1, we get 

(13) yt+yt+tf+lf+tf+y+l =0, 

whose roots are R, R 2 , . . . , R 6 . Since we know that R+l/R is one of 
the roots of the cubic equation (12), it is a natural step to make the sub* 
stitution 

(14) y+±=x 

in (13). After dividing its terms by y 3 , we have 

(13') (^) + (^ + ») + ( H i) + l _. 

« 

By squaring and cubing the members of (14), we see that 

(15) t/ 2 +p=*2-2, y*+±=*>-3x. 

Substituting these values in (13'), we obtain 

(12) a^+x 2 -2x-l=0. 

That is, the substitution (14) converts equation (13) into (12). 



§ 35] REGULAR POLYGON OF SEVEN SIDES 37 

If in (14) we assign to y the six values R, ... , R 6 , we obtain only- 
three distinct values of x: 

(16) Xi = R+±=R+R«, x 2 =R 2 +jp=R 2 +R 5 , £ 3 =/2 3 +^3=/2 3 +i2 4 

In order to illustrate a general method of the theory of regular poly- 
gons, we start with the preceding sums of the six roots in pairs and find 
the cubic equation having these sums as its roots. For this purpose we 
need to calculate 

X1+X2+X3, X\X2-\rX\Xz'\-X2Xz i £1X2X3. 

First, by (16), 

xi+x 2 +x 3 =R+R 2 + . . . +/J6=-i, 

since R, . . . , R e are the roots of (13). Similarly, 

XiX 2 +XiX Z +X2X2 = 2(R+R 2 + . . . +#6) = -2, 

xiX2X Z = 2+R+R 2 + . . . +# 6 = 1. 

Consequently (§ 20), the cubic having xi 9 £2, £3 as roots is (12). 

36. Reciprocal Equations. Any algebraic equation such that the 
reciprocal of each root is itself a root ol the same multiplicity is called a 
reciprocal equation. 

The equation y 7 — 1 =0 is a reciprocal equation, since if r is any root, 1/r is evidently 
also a root. Since (13) has the same roots as this equation, with the exception of unity 
which is its own reciprocal, (13) is also a reciprocal equation. 

If r is any root 5^0 of any equation 

/(y) -»»+... +c«0, 
1/r is a root of f(l/y) =0 and hence of 



2/7(^1+ ... +q/ n =0. 

If the former is a reciprocal equation, it has also the root 1/r, so that every 
root of the former is a root of the latter equation. Hence, by § 18, the 
left member of the latter is indentical with cf(y). Equating the constant 
terms, we have c 2 = 1, c= ± 1. Hence 

(17) yv(J)-±/(y). 



38 CONSTRUCTIONS WITH RULER AND COMPASSES fCfe. HI 

Thus if Pd/*"* is a term of f(y), also dzptf is a term. Hence 

(180 f(y)^y n ±i+Pi(y n - l ^y)+My H " 2 ±y 2 )+ .... 

If n is odd, n=2t+l, the final term is vfyf+ l ±if) 9 and y=fcl is a facto 
of f(y). In view of (17), the quotient 

has the property that 



tf-'Q^) -«<*). 



Comparing this with (17), which implied (180, we see that Q(y)=0k 
a reciprocal equation of the type 

(18) y 2 '+l+c l (y 2 '- 1 +y)+C2(y*- 2 +y>)+ . . . +c t . l (y'+ l +y'- 1 )+ctf=l\ 

If n is even, n=2t, and if the upper sign holds in (17), then (180 
of the form (18). Next, let the lower sign hold in (17). Since a 
p t \f would imply a term —prf, we have p t =0. The final term in (If 
is therefore p t -i(y t * 1 —y t ~ 1 ). Hence f(y) has the factor y 2 — 1. 
quotient g(y) =/(2/)/(i/ 2 — 1) has the property that 

y"~ 2 tf(-)=tf(y). 

Comparing this with (17) as before, we see that ff(y)=0 is of the 
(18) where now 2t=n—2. Hence, at least after removing one or bo( 
of the factors y±l, any reciprocal equation may be given the form (18). 

The method by which (13) was reduced to a cubic equation may 
used to reduce any equation (18) to an equation in x of half the degre^ 
First, we divide the terms of (18) by y 1 and obtain 

(^+i J )+ci(y-+^ T )+ . .+o- 1 (y+£)+«=0. 

Next, we perform the substitution (14) by either of the following methc 
We may make use of the relation 

to compute the values of y*+l/y* in terms of x, starting with the specie 



r 



136] 



RECIPROCAL EQUATIONS 



39 



i 

. cases (14) and (15). For example, 

«H?-*(»H?)-(»* + ?) 

=x(x 3 -3x)-(x 2 -2) =z 4 -4x 2 +2 

| Or we may employ the explicit formula (19) of § 107 for the sum yM-l/fJ 1 
« of the fcth powers of the roots y and 1/y of t/ 2 — xy+1 =0. 

i 37. Regular Polygon of 9 Sides and Roots of Unity. If 

ft = cos -Q-+* sm -Q-, 

the powers ft, iZ 2 , ft 4 , ft 5 , ft 7 , ft 8 , are the primitive ninth roots of unity 
(§11). They are therefore the roots of 

09) "'- 1 



y 3 -! 



=2/ 6 +!/ 3 +l=0. 



XMviding the terms of this reciprocal equation by t/ 3 and applying the sec- 
ond relation (15), we obtain our former cubic equation (11). 

EXERCISES 

1. Show by (16) that the roots of (12) are 2 cos 2x/7, 2 cos 4ir/7, 2 cos 6r/7. 

2. The imaginary fifth roots of unity satisfy t/ 4 -f B 
l^-Hf/'+y+l—Ot which by the substitution (14) be- 
oomes x 2 +x—l=Q. It has the root 

ft+!=2cc*y=i(V5-l). 

Tn a circle of radius unity and center draw two per- 
pendicular diameters A0A' t BOB'. With the middle 
£x>int M of OA' as center and radius MB draw a < ircle 
cutting OA at C (Fig. 10). Show that OC and BC 
the sides «io and «& of the inscribed regular decagon 
id pentagon respectively. Hints: 





Af£=jV5, OC=§(V5-l), BC=Vn-OC 2 =§VlO-2V^ 



«io=2sin 18°=2cos— -=0C, 

5 



f 



(2sin36°)«=2(l-cos^ : ) =7(10-2V5), 

\ 5/4 



*=#C. 



40 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. m 

3. If fl is a root of (19) verify as at the end of § 35 that R+R>, Rt+R 1 , and R*+& 
are the roots of (11). 

4. Hence show that the roots of (11) are 2 cos 2r/9, 2 cos 4x/9, 2 cos 8r/9. 

5. Reduce y n — 1 to an equation of degree 5 in a?. 

6. Solve y 6 — 7y 4 +y* — y*+7y — 1 = by radicals. [One root is 1 .] 

7. After finding so easily in Chapter I the trigonometric forms of the complex roots 
of unity, why do we now go to so much additional trouble to find them algebraically? 

8. Prove that every real root of x*+ax 2 +b=*0 can be constructed with ruler and 
compasses, given lines of lengths a and 6. 

9. Show that the real roots of x?—px—q=0 are the abscissas of the intersections 
of the parabola y=x 2 and the circle through the origin with the center (iq, J-f-Jp). 

Prove that it is impossible, with ruler and compasses: 

10. To construct a straight line representing the distance from the circular base 
of a hemisphere to the parallel plane which bisects the hemisphere. 

11. To construct lines representing the lengths of the edges of an existing rectangular 
parallelopiped having a diagonal of length 5, surface area 24, and volume 1, 2, 3, or 5. 

12. To trisect an angle whose cosine is i, J, }, i or p/q, where p and q (q> 1) are 
integers without a common factor, and q is not divisible by a cube. 

Prove algebraically that it jp possible, with ruler and compasses: 

13. To trisect an angle whose cosine is (4o s — 3abi*)/b* t where the integer a is numer- 
ically less than the integer b; for example, cos -1 11/16 if a= — 1, 6=4. 

14. To construct the legs of a right triangle, given its area and hypotenuse. 

15. To construct the third side of a triangle, given two sides and its area. 

16. To locate the point P on the side BC=\ of a given square ABCD such that 
tne straight line AP cuts DC produced at a point Q for which the length of PQ is a given 
number g. Show that y=*BP is a root of a reciprocal quartic equation, and solve it 
when 0=10. * 

38. The Periods of Roots of Unity. Before taking up the regular 
polygon of 17 sides, we first explain another method of finding the pain 
of imaginary seventh roots of unity R and R 6 f R 2 and R 5 , R z and fl* # 
employed in (16). To this end we seek a positive integer g such that 
the six roots can be arranged in the order 

(20) R, R° f R°\ R*, R*, R* f 

where each term is the gth power of its predecessor. Trying gr= 2, we find 
that the fourth term would then be R S =R. Hence g?*2. Trying 0=3> 
we obtain 

(21) R, iF, fl 2 , #*, #, R*, 
where each term is the cube of its predecessor. 



1 39] REGULAR POLYGON OF 17 SIDES 41 

To define three periods, each of two terms, 

(16') R+R 6 , R 2 +R 5 , IP+R*, 

we select the first term R of (21) and the third term R e after it and add 
them, then the second term R B and the third term R* after it, and finally 
J2 2 and the third term R 5 after it. 

We may also define two periods, each of three terms, 

zi=R+R 2 +R*, z 2 =i2 3 +# 6 +fl 5 , 

by taking alternate terms in (21). 

Since Z\-\-z%- —1, z\ti =34-/2+ . . . +# 6 =2, z\ and z% are the roots of z*+z+2=0. 
Then R, R* t R* are the roots of w i -ziw i +z i w-l=^0. 

39. Regular Polygon of 17 Sides. Let R be a root ^1 of x 17 =l. 
Then 

fil7 ~ 1 =/2 16 +i2 15 + . . . +72+1=0. 



R-l 



f As in § 38, we may take 0=3 and arrange the roots R, . . . , R 16 so that 
each is the cube of its predecessor: 

R, R?,R 9 , R 10 , fi 13 , R 5 , R 15 , R u , R 1 *, R u , R*, R 7 , &, 72 12 , R 2 , R«. 

Taking alternate terms, we get the two periods, each of eight terms, 

3/i =R+R 9 +R l3 +R l5 +R l6 +R*+R*+R 2 , 
y2=R 3 +R l0 +R 5 +R n +R 14 +R 7 +R 12 J-R 6 . 
Hence yi+y 2 = -1. We find that 2/i2fe=4 (R+ . . . +# 16 ) = -4. Thus 
(22) 2/i, V2 satisfy i/ 2 +2/-4=0. 

Taking alternate terms in 2/1, we obtain the two periods 
Zi=R+R lz +R ie +R i t Z2=R 9 +R l5 +R 8 +R*. 
Taking alternate terms in 2/2, we get the two periods 

wi=R*+R 5 +R 14 +R 12 , w 2 =R l0 +R n +R 7 +R*. 
Thus zi+Z2 wz yi, w\ +102 = 2/2. We find that Z1Z2 = W1W2 = — 1. Hence 

(23) zi, Z2 satisfy z 2 — y\z — 1 = 0, 

(24) ii>], 102 satisfy 10 2 — 2/210 — 1 = 0. 



42 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. m 

Taking alternate terms in z\ } we obtain the periods 

vi=R+R 16 , v 2 =R 1 *+R i . 
Now, Vi+V2=zi, v\V2=w\. Hence 

(25) vi, V2 satisfy v 2 — ziv+wi=0, 

(26) R, R 16 satisfy p*-t;ip+ 1=0. 

Hence we can find R by solving a series of quadratic equations. Which 
of the sixteen values of R we shall thus obtain depends upon which root 
of (22) is called y\ and which #2, and similarly in (23)-(26). We shall now 
show what choice is to be made in each such case in order that we shall 
finally get the value of the particular root 



Then 



R = cos yf+t sm y=. 

1 2t . . 2t r> , 1 o 2t 

7j=cosj=-isin j=, t;i=/2 +p =2cos j=, 

08 17 m 17' t;2=S 4 +^4=2cosj=. 
Hence t>i > 02 > 0, and therefore z\ = v\ + V2 > 0. Similarly, 

w\ =# 3 +p3+# 5 +^5 = 2 cos i7+ 2 cos "i7" =2 cos i7~ 2 cos if >0 ' 

#2=2 cos Ty+2 cos Ty+2 cos -^-+2 cos -jy-<0> 

since only the first cosine in y% is positive and it is numerically less than 
the third. But yiy 2 = - 4. Hence y x > 0. Thus (22)-(24) give 

!/i = i(vT7-l), 2/2 = |(-Vl7-l), 

2i = i2/i+Vl+Jt/i 2 , ^i = |t/2+Vl+Jy 2 2 . 

We may readily construct segments of these lengths. Evidently 

Vl7 is the length of the hypotenuse of a right triangle whose legs are of 
lengths 1 and 4, while for the radical in z\ we employ legs of lengths 1 
and \y\. We thus obtain segments representing the coefficients of the 



§401 



REGULAR POLYGONS 



43 



quadratic equation (25). Its roots may be constructed as in § 29. The 
larger root is 

t>i=2 cos j^. 

Hence we can construct angle 2r/17 with ruler and compasses, and there- 
fore a regular polygon of 17 sides. 

40. Construction of a Regular Polygon of 17 Sides. In a circle of 
radius unity, construct two per- 
pendicular diameters AB, CD, 
and draw tangents at A, D, 
which intersect at S (Fig. 11). 
Find the point E in AS for which 
AE=IAS, by means of two bi- 
sections. Then 

AE=l, 0E=iW7. 

Let the circle with center E 
and radius 0E cut AS at F and 
F. Then 




Fig. 11 



AF=EF-EA=0E-\=\y u 
AF' = EF'+EA=OE+\ = -\y 2 , 

0F=V0A 2 +AF 2 =Vl+ly l *, 0F' = Vl+Jt/ 2 2 . 

Let the circle with center F and radius FO cut AS at H y outside of F'F; 
tfcat with center F' and radius F'O cut AS at H' between F' and F. Then 



AH=AF+FH=AF+OF=%yi+Vl+iyi 2 =zi, 
AH' =F'H' -F'A=0F' -AF' =w x . 

It remains to construct the roots of equation (25). This will be done 
as in § 29. Draw HTQ parallel to AO and intersecting OC produced at 
T. Make TQ—AH'. Draw a circle having as diameter the line BQ 
joining B= (0,1) with Q = (zi,w{). The abscissas ON and OM of the inter- 
sections of this circle with the x-axis OT are the roots of (25). Hence 
the larger root v\ is OM =2 cos(2t/17). 



44 CONSTRUCTIONS WITH RULER AND COMPASSES [Ch. m 

Let the perpendicular bisector LP of OM cut the initial circle of unit 
radius at P. Then 

cos LOP=OL=cos yj, LOP=^j. 

Hence the chord CP is a side of the inscribed regular polygon of 17 
sides, constructed with ruler and compasses. 

41. Regular Polygon of n Sides. If n be a prime such that n— 1 is 
a power 2* of 2 (As is the case when n = 3, 5, 17), the n— 1 imaginary nth 
roots of unity can be separated into 2 sets each of 2*" 1 roots, each of these 
sets subdivided into 2 sets each of 2*~ 2 roots, etc., until we reach the pairs 
R, }/R and R 2 , l/R 2 , etc., and in fact l in such a manner that we have a 
series of quadratic equations, the coefficients of any one of which depend 
only upon the roots of quadratic equations preceding it in the series. 
Note that this was the case for n = 17 and for w=5. It is in this manner 
that it can be proved that the roots of x n = l can be found in terms of 
square roots, so that a regular polygon of n sides can be inscribed by ruler 
and compasses, provided n be a prime of the form 2*+l. 

If n be a product of distinct primes of this form, or 2* times such a 
product (for example, n = 15, 30 or 6), or if n=2 m (m> 1), it follows readily 
(see Ex. 1 below) that we can inscribe with ruler and compasses a regular 
polygon of n sides. But this is impossible for all other values of n. 

EXERCISES 

1. If a and b are relatively prime numbers, so that their greatest common divisor 
is unity, we can find integers c and d such that ac+bd = 1. Show that if regular poly- 
gons of a and b sides can be constructed and hence angles 2x/a and 2x/6, a regular 
polygon of a- 6 sides can be derived. 

2. If p =2*4-1 is a prime, h is a power of 2. For h =2°, 2 1 , 2 8 , 2», the values of p 
are 3, 5, 17, 257 and are primes. [Show that h cannot have an odd factor other than 
unity.] 

3. For 13th roots of unity find the least g (§ 38), write out the three periods each 
of four terms, and find the cubic equation having them as roots. 

4. For the primitive ninth roots of unity find the least g and write out the three 
periods each of two terms. 

Solve the following reciprocal equations: 

5. 2/ 4 +4i/ 3 -3!/ 2 +42/+l=0. 6 . ^_4 2/ 4+ y 3+ 2 ,2_4 2 , +1= o # 

7. 2y 8 -5r/*+4!/ 4 --42/ 2 +5y-2 = 0. 8. 2/*+l=31(y+l) 6 . 

1 See the author's article " Constructions with ruler and compasses; regular poly- 
gons/' in Monographs on Topics of Modern Mathematics, Longmans, Green and Co., 
1911, p. 374. 



CHAPTER IV 
Solution op Cubic and Quartic Equations; Their Discriminants 

42. Reduced Cubic Equation. If, in the general cubic equation 

(1) a*+bx 2 +cx+d=0, 

we set x=y— 6/3, we obtain the redu^dj^fcic^giiaiiQn 

(2) y+py+q=0, 
lacking the square of the unknown y, where 

(3) P =c ~3' * =d ~"3 + 2T 

After finding the roots y\, t/2, 2/3 of (2), we shall know the roots of (1): 

/m\ 6 6 6 

(4) 3i = 2/i— g, x 2 = 2/2— g, 33 = ^3-3- 

43. Algebraic Solution of the Reduced Cubic Equation. We shall 
employ the method which is essentially the same as that given by Vieta 
in 1591. We make the substitution 



y 
in (2) and obtain 



(5) y=z 32 



since the terms in z cancel, and likewise the terms in 1/2. Thus 

(6) * + ^_£« . 

Solving this as a quadratic equation for z 3 , we obtain 

CT ,--1*** *-(D 8+ (0- 

45 



46 CUBIC AND QUARTIC EQUATIONS [Ch. IV 

By § 8, any number has three cube roots, two of which are the products 
of the remaining one by the imaginary cube roots of unity: 

(8) <*=-$+iVzi, ^=-1-1^31. 

We can choose particular cube roots 



(9) 



A^-1+VR, B-q-l-VB, 



such that AB= — p/3, since the product of the numbers under the cube 
root radicals is equal to (— p/3) 3 . Hence the six values of z are 

A, a>A, d?Ar bT ^B, ~~v*B: ~~~ 

These can be paired so that th e product of the two in each pair is — p/3 

o o o 

Hence with any root z is paired a root equal to — p/(3z). By (5), the sum 

of the two is a value of y. Hence the three values of y are 

<-_ _._ . . — 

(10) yi=A+B, y 2 = a>A+J*B, y z = J*A+G>B. 

It is easy to verify that these numbers are actually roots of (2) For 
example, since. co 3 = 1, the cube of 2/2 is 

A 3 +B 3 +3coA 2 S+3w 2 AB 2 = -g-p(wA+co 2 B) = -ff-py 2 , 

by (9) andi4B=-p/3. 

The numbers (10) are known as Cardan's formulas for the roots of a 
reduced cubic equation (2). The expression A+B for a root was first 
published by Cardan in his Ars Magna of 1545, although he had obtained 
it from Tartaglia under promise of secrecy. 

Example. Solve y* - 15y - 126 = 0. 

Solution. The substitution (5) is here y=z+b/z. We get 

*•- 126V + 125 = 0, z 3 - 1 or 125. 

The pairs of values of z whose product is 5 are 1 and 5, a> and 5a) 1 , o>* and 5«. Theii 
sums 6, <o-f-5a> 2 , and u 2 -J-5a> give the three roots. 

EXERCISES 

Solve the equations: 

1. y 8 -18y+35=0. 2. s 8 +6x*+3a;+18«0. 

3. y 8 -2i/+4=0. 4. 28x*+9s t -l-0. 



543] 



ALGEBRAIC SOLUTION OF CUBIC EQUATIONS 



46A 



EXERCISES 



(In Exs. 1-35, R is a perfect square.) 



1. y»+9y-6-0, An«. ^5-^3, w^9-w J ^3, w»^9-w^3. 

2. y»— 9y— 12=0, Ana. as in Ex. 1 with all signs +. 



3. y«-6y-6«0. 
6. y»-16y-30»0. 

9. y»-12y-20=0. 
11. y«+18y-30=0. 
13. x»-9x*-9x-15=0. 
15. s*-6x*-4=0. 

17. x»-3x*-18j-36=0. 

19. x»-6x*-12x-8=0. 

21. x»-6x*-6x-14=0. 

23. y«-12y-34=0. 

26. y»+30y+15=0. 

27. y*-18y-75=0. 
30. y«-30y-65=0. 

33. y*-Zkty-t(t+k*)=Q. 



4. y'+18t/+6=0. 5. y»+15t/-20=0. 

7. y*+21y-42=0. 8. y s +12t/+12=0. 

10. j^-lSy-SO^Ana.A^lS, £' = 12. 
12. y*+30y+30=0, Ana. A 8 = 20, 5'= -50. 
14. y*+6y+2=0, Ana. A* =2, £' = -4. 
16. y»+36y+12=0, 4ns. A»=36, £» = -48. 

18. y* +18y +15=0, Ana. A = \ft, B= -2*y 5. 

20. y*+42y+7=0,An*.A = ^y49, £=-2^7. 

22. y*+12y-30=0,Arw.A=2^J,5=-^2. 

24. y 8 +18y+50=0, Ana. A =^4, £= -3^2. 

26. y*-18y-110=0,An*. A =3^4,5 = ^2. 

28. y'+18t/-69=0. 29. y*-18y-33=0. 

31. y 8 +54y-9=0. 32. y'+66y-33=0. 

34. y»-3«ty-**(*+0=0. 



35. y*+3i0y+2tn0=O, 10=1**— v*, u and t; arbitrary. 



EXERCISES ON QUARTIC EQUATIONS 

Factor the following functions or solve the corresponding equations. 

I. Quartics having two real and two imaginary roots. 
1. x*+12x-5, Ana. x*+2x-l, x*-2x+5. 



2. x*+32x-60. 3. x 

4. x*-2x*+12x-8. 5. x 

7. x*-4x*+8x-4. 8. x 

10. s'-lQz'+lox+S. 11. x 

li. x 4 -8x*+24x+7. 14. x 

16. x 4 -7x*+28x+8. 17. x 

19. x*-7x*+14x-10. 20. x 

22. x*+5x*+22x— 10. 23. x 

24. x*-8x*+16x+12. 25. x 

26. x 4 -8x*+16x-12. 27. x 

28. x 4 -5x*+18x-20. 29. x 

30. x*-4x*+20x-25. 31. x 



-2x*-8x-3=0, Ana. l=fc\/2, -l=fc\/^2. 



-6x 2 +12x-8. 

-x 2 +10x-4. 

-12x 2 +24x-5. 

-10x 2 +32x-7. 

-9x 2 +36x-8. 

-x 2 +14x-10. 



6. x 

9. x 

12. x 

15. x 

18. x 

21. x 



-3x*+10x-6. 

-2x 2 +8x-3. 

-9x*-f-20x+6. 

-llx 2 +28x-6. 

-9x 2 +12x-f-10. 

-13x 2 -f-28x-10. 



-25x 2 +54x+10=0, Ana. 3=hi, -3=b\/8. 
-3x 2 +14x-12=0, Ans. l±y/^ = H, -l=h\/7. 
^5x 2 -f-14x-12=0, Ans. l*=b\/^2, -l=fc\/5. 
-3x 2 +18x-20=0, Ana. l±2i, -l=fc\/5. 
-x 2 +2x-l=0, Ana. \(l±y/^$), £(-l=fc\/5). 



46B 



CUBIC AND QUARTIC EQUATIONS 



[Ch. IV 



II. Quartics having four 

32. x 4 -16x*-8x-l. 
35. x 4 -7x*+2x+2. 
38. x 4 -15x»-12x-2. 

41. x 4 -20x l +8x+3. 

43. x 4 -9x 8 -6x+4. 

46. x 4 -10x»-8x+5. 

47. x 4 -21x»+4x+6. 
49. x 4 -10x»-4x+8. 
51. x 4 -23x»+4x+12. 
53. x 4 -27x l +30x+ 14. 



real roots. 

33. x 4 -36x»+12x-l. 
36. x 4 -33x»+6x+2. 
39. x 4 -19x»+4x+2. 

42. x 4 -32x»+12x+3 = 

44. x 4 -21x l +12x+4 = 

46. x 4 -23x 8 +20x+6 = 

48. x 4 -28x»+36x+7 = 

50. x 4 -llx 8 -6x+10 = 

52. x 4 -24x l +16x+ 12 

54. x 4 -25x*+12x+18 



34. x 4 -13x l +4x+2. 
37. x 4 -35x»+ 18x-2. 
40. x 4 -37x»+18x-2. 

*0, Ana. 3=fc\/6, -3=fc\/8. 

=0, Ana. 2=fc\/5i -2dbV§. 

=0, Arts. 2=fc\/5i -2=fc\/l0. 

■0, Ana. 3db y/2 t -3=b V§. 

=0, Ana. l=fc\/6, -ldb\/3. 

=0, Ana. 2dbV5, -2dbVl0. 

=0, Ana. 2dbv / ?i -2=fc\/l0. 



III. Quartics haying four imaginary roots. 



55. x 4 +x*-2x+6. 
58. x 4 +3x»+2x+12. 
61. x 4 +8x*+16x+20. 

64. x 4 -4x*-8x+35. 
66. x 4 -3x*+4x+42. 
68. x 4 -2x*+8x+48. 



56. x 4 +2x*-4x+8. 
59. x 4 +4x»+4x+15. 
62. x 4 -5x*-4x+30. 

65. x 4 -3x*-12x+40 = 



57. x 4 +3x*+6x+10. 
60. x 4 +5x*+2x+20. 

63. x 4 +9x»+14x+30. 

0, Ana. 2±i, -2=b2*. 
=0, Ana. Idb3*, -lzfcV^I. 



67. x 4 +10x»+12x-f 40 

69. x 4 +llx»+10x+50=0, Ana. Idb3*, -l=fc2t. 



§ 44] DISCRIMINANT OF A CUBIC 47 

44. Discriminant. The product of the squares of the differences of 
the roots of any equation in which the coefficient of the highest power of 
the unknown is unity shall be called the discriminant of the equation. 

For the reduced cubic (2), the discriminant is „■ ^ 

/ \ - 

(l l) iyi- V2) 2 (yi - y 3 ) 2 (2/2 - y 3 ) 2 =\^V - 27< ? 2 y 

a result which should be memorized in view of its important applications. 
It is proved by means of (10) and w 3 = 1, w 2 +w+l = 0, as follows: 

yi-y2 = (l-o>)(A-oPB), 2,i-2/3 = (l-co 2 )(A-c*B), 

2/2-2fe = («-« 2 ) (A-B), 
(l-w)(l-w 2 )=3, w-co 2 = \/3i. 

Since 1, «, w 2 are the cube roots of unity, 

(x— l)(x— w)(x— w 2 ) =7?— 1, 
identically in x. Taking x=A/B } we see that 

(A-B)(A-aB)(A-a?B)=A 3 -B*=2\/R, 
by (9). Hence 

(yi - 2/2) (yi - yz) (y 2 - yz) = 6 VsVr i. 

Squaring, we get (11), since -108#= -4jP-27q 2 by (7). For later 
use, we note that the discriminant of the reduced cubic is equal to — 108 R. 

The discriminant A of the general cubic (1) is equal to the discriminant 
of the corresponding reduced cubic (2). For, by (4), 

xi-x 2 = yi-y2, xi-x 3 = yi r y 3 , x 2 -x 3 = y 2 -y 3 . 

Inserting in (11) the values of p and q given by (3), we get 

(12) A = 18bcd -46 3 d+6 2 c 2 - 4c 3 - 27d 2 . 

It is sometimes convenient to employ a cubic equation 

(13) ar»+6x*+C2;+d=0 (a^O), 

in which the coefficient of x l has not been made unity by division. The product P 
of the squares of the differences of its roots is evidently derived from (12) by replacing 
b, c, d by b/a, c/a, d/a. Hence 

(14) o 4 P = 18 a&cd-46»d+ 6 8 c*-4ac»-27a»d». 
expression (and not P itself) is called the discriminant of (13). 



48 CUBIC AND QUARTIC EQUATIONS [Ch. IV 

45. Number of Real Roots of a Cubic Equation. A cubic equation 
with real coefficients has three distinct real roots if its discriminant A is positive, 
a single real root and two conjugate imaginary roots if A is negative, and at 
least two equal real roots if A is zero. 

If the roots x\, X2, £3 are all real and distinct, the square of the differ- 
ence of any two is positive and hence A is positive. 

If x\ and X2 are conjugate imaginaries and hence £3 is real (§ 21), 
(xi— X2) 2 is negative. Since x\— xz and X2— £3 are conjugate imaginaries, 
their product is positive. Hence A is negative. 

If £i=#2, A is zero. If X2 were imaginary, its conjugate would be 
equal to xs by § 21, and X2, £3 would be the roots of a real quadratic 
equation. The remaining factor x—x\ of the cubic would have real 
coefficients, whereas x\=X2 is imaginary. Hence the equal roots must 
be real. 

Our theorem now follows from these three results by formal logic. 
For example, if A is positive, the roots are all real and distinct, since 
otherwise either two would be imaginary and A would be negative, or two 
would be equal and A would be zero. 

EXERCISES 

Compute the discriminant and find the number of real roots of 

1. r/ 3 -2y-4=0. 2. 2/ 3 -15y+4=0. 

3. 2/ 3 -27y+54=0. 4. s 8 +4x 2 -llx+6=0. 

5. Show by means of § 21 that a double root of a real cubic is real. 

46. Irreducible Case. 'When the roots of a real cubic equation are 
all real and distinct, the discriminant A is positive and R=— A/108 is 
negative, so that Cardan's formulas present the values of the roots in a 
form involving cube roots of imaginaries. This is called the irreducible 
case since it may be shown that a cube root of a general complex number 
cannot be expressed in the form a+bi, where a and b involve only real 
radicals. 1 While we cannot always find these cube roots algebraically, 
we have learned how to find them trigonometrically (§ 8). 

Example. Solve the cubic equation (2) when p= —12, q= — 8 V 2. 
Solution. By (7), R — —32. Hence formulas (9) become 

A = V 4 V2+4 ^2 i, B = V4 V2-4 V2 t. 
1 Author's Elementary Theory of Equations, pp 35. 36. 



5 471 TRIGONOMETRIC SOLUTION OF A CUBIC 49 

rhe values of A were found in § 8. The values pf B are evidently the conjugate imagi- 
naries of the values of A. Hence the roots are 

4oosl5°, 4 cos 135°, 4 cos 255°. 



EXERCISES 

1. Solve y»-15y+4=0. 2. Solve y*-2t/-l=0. 

3. Solve y*-7y+7=0. 4. Solve x 3 +3x 2 -2.r-5'=0. 

5. Solve x»+s , -2s-l=0. 6. Solve x s +4x* -7=0. 

47; Trigonometric Solution of a Cubic Equation with A>0. When 
the roots of a real cubic equation are all real, i.e., if R is negative, they 
can be computed simultaneously by means of a table of cosines with much 
less labor than required by Cardan's formulas. To this end we write 
the trigonometric identity 

cos 3A =4 cos 3 ^ — 3 cos A 
in the form 

2 s — fz— J cos3A=0 (z = cos4). 

In the given cubic y 3 +py+g=0 take y=m; then 

rr n 6 

which will be identical with the former equation in z if 

n=V^fp, coa3A = -%q+V-p i /27. 

Since /2 = p 3 /27+g 2 /4 is negative, p must be negative, so that n is real 
and the value of cos 34 is real and numerically less than unity. Hence 
we can find 34 from a table of cosines. The three values of z are then 

cos 4, cos (4+120°), cos (4+240°). 

Multiplying these by n, we obtain the three roots y correct to a number 
of decimal places which depends on the tables used. 

EXERCISES 



1. For y»-2y-l=0, show that n»=8/3, cos 3A = V27/32, 3A=23°17'0", 
cos A =0.99084, cos (4+120°) = -0.61237, cos (A +240°) = -0.37847, and that the 
roots y are 1.61804, -1, -0.61804. 

2. Solve Exs. 1, 3, 4, 5, 6 of § 46 by trigonometry. 



50 CUBIC AND QUARTIC EQUATIONS [Ch. IV 

48. Ferrari's Solution of the -Quartic Equation. The general quartic 
equation 

(15) a*+bx?+cx>+dx+e = 0, 

or equation of degree four, becomes after transposition of terms 

£*+bx?= — ex 2 — dx—e. 

The left member contains two of the terms of the square of x?+\bx. 
Hence by completing the square, we get 

(x?+Jte) 2 =(i&*-c)x?-dx-e. 
Adding {x*+\bx)y+ \y 2 to each member, we obtain 

(16) {x 2 +\bx+\y) 2 = (\V-c+y)7?+{\by-d)x+\yi-e. 

The second member is a perfect square of a linear function of £ if and 
only if its discriminant is zero (§12): 

{\by-d) 2 -±£b 2 -c+y) (iy 2 -e)=0, 
which may be written in the form 

(17) y*-cy 2 +(bd-4e)y-'V>e+4ce--d 2 =0. 

Choose any root y of this resolvent cubic equation (17). Then the 
right member of (16) is the square of a linear function, say mx+n. Thus 

(18) x 2 +\bx+\y=mx+n or x*+%bx+%y= — mx— n. 

The roots of these quadratic equations are the four roots of (16) and 
hence of the equivalent equation (15). This method of solution is due 
to Ferrari (1522-1565). 

Example. Solve x 4 +2x 8 -12x 2 -10x+ 3 =0. 

Solution. Here b =2, c = — 12, d = — 10, e =3. Hence (17) becomes 

y»+12y*-32t,-256=0, 

which by Ex. 2 of § 24 has the root y = — 4. Our quartic may be written in the form 

(x 2 +x)* = 13x*+10a:-3. 

Adding (x 2 +x) (—4) +4 to each member, we get 

(s 2 +x-2)*=9x*+6x+l = (3x+l) 1 , 

x 2 +x-2 = =fc(3x+l), * x 2 -2x-3=0or x 2 +4x-l=0, 

whose roots are 3, —1, —2d: V 5. As a check, note that the sum of the roots is —2. 



§ 50] DISCRIMINANT OF A QUARTIC 51 

EXERCISES 

1. Solve x*-8x*+9x*+&x-10=0. Note that (17) is (y-9) ( y 8 -24) =0. 

2. Solve :r 4 -23»-7a:*+8x+12 =0. Since the right member of (16) is (8+y) (x*-x) 
+iy*-12, usey=-8. 

3. Solve x 4 -3x 1 +6x-2=0. 

4. Solve s 4 -2s»-8a;-3=0. 5. Solve s 4 -10x» -20a; -16=0. 

49. Roots of the Resolvent Cubic Equation. Let t/i be the root y 
which was employed in § 48. Let x\ and £2 be the roots of the first 
quadratic equation (18), and £3 and £4 the roots of the second. Then 

xiX2= z ^yi'-n ) £3£4 = iyi+w, £i£2+£3£4 = yi. 

If, instead of y\, another root #2 or yz of the resolvent cubic (17) had been 
employed in §48, quadratic equations different from (18) would have 
been obtained, such, however, that their four roots are £1, £2, £3, £4, paired 
in a new manner. The root which is paired with x\ is £2 or £3 or £4. It 
is now plausible that the values of the three y's are 

(19) y\ — £l£2 +£3^4, t/2 = £l£3 +£2£4, Jfa = £l£4 +£2£3» 

To give a more formal proof that the y's given by (19) are the roots 
of (17), we employ (§ 20) 

£l +£2+£3 +£4 = ~~ b, £1X2X3 +^l£2£4 +£l£3£4 +£2£3£4 = ~ d, 

X\X2 +£l£3 +£l£4 +£2£3 +£2£4 +£3£4 = C, £1X2X3X4 = 6. 

From these four relations we conclude that 

2/1+^2 + ^3 = 0, 
yi2/2+ytf/3+y2y3 = (Sl+£2+£3+X4)(£l£2£3+ • • • +^£3X4) - 4£i£ 2 £3£4 

=6d— 4e, 

yi2/2y3 = («lX2£3+ . • • ) 2 +£l£2£3«4{(£l+ . . . ) 2 -4(£i£ 2 + . . . )} 

=d 2 +e(6 2 -4c). 
Hence (§ 20) yi, 2/2, 2/3 are the roots of the cubic equation (17). 

60. Discriminant. The discriminant A of the quartic equation 
(15) is defined to be the product of the squares of the differences of its roots: 

A = (£1 — £2) 2 (£l — Z3) 2 (3i — £ 4 ) 2 (£2 — £3) 2 (Z2 — £4) 2 (^3 — £4) 2 . 



52 CUBIC AND QUARTIC EQUATIONS [Ch. IV 

The fact that A is equal to the discriminant of the resolvent cubic 
equation (17) follows at once from (19), by which 

yi-y2-(Xi-X4)(X2-X3) } yi-y3 = (Xi-X3)(X2~X4), 

2/2-2/3 = (zi-X2)(x3-x 4 ), (2/i-y2) 2 (t/i-2te) 2 (y2-y3) 2 =A. 

Hence (§44) A is equal to the discriminant — 4P 3 — 27g 2 of the reduced 
cubic Y 3 +pY+q=0 ) obtained from (17) by setting y=Y+c/3. Thus 

(20) p = bd-4e-%<?, q= -Pe+lbcd+ice-dP-fr?. 

Theorem. The discriminant of any quartic equation (15) is equal to 
the discriminant of its resolvent cubic equation and therefore is equal to the 
discriminant — 4jP — 21q 2 of the corresponding reduced cubic Y 3 +pY+q=^0 9 
whose coefficients have the values (20). 

EXERCISES 

1. Find the discriminant of x 4 — 3x*+x'+3x— 2=0 and show that the equation 
has a multiple root. 

2. Show by its discriminant that x 4 — 8x , +22x* — 24x+9=0 has a multiple root. 

3. If a real quartic equation has two pairs of conjugate imaginary roots, show that 
its discriminant A is positive. Hence prove that, if A<0, there are exactly two real 
roots. 

4. Hence show that x 4 — 3x*-|-3x 2 — 3x+2 =0 has two real and two imaginary roots. 

51. Descartes 9 Solution of the Quartic Equation. Replacing x by 
2—6/4 in the general quartic (15), we obtain the reduced quartic equation 

(21) *+qz?+rz+s = 0, 

lacking the term with z 3 . We shall prove that we can express the left 
member of (21) as the product of two quadratic factors 

(z 2 +2kz+l)(z 2 -2kz+m) =z i +(l+m-4k 2 )z 2 +2k(m-l)z+lm. 

The conditions are 

l+m— 4k 2 = q, 2k(m—l)=r, fon=«. 

If fc^O, the first two give 

2l = q+4k 2 ~ y 2m = q+4kH T 



2k' * ' * 2k' 

Inserting these values in 2Z-2w=4s, we obtain 
(22) 64fc 6 +32gfc 4 +4(g 2 -4s)fc 2 -r 2 =0. 



§51] DESCARTES' SOLUTION OF A QUARTIC 63 

The latter may be solved as a cubic equation for k 2 . Any root fcMO 
gives a pair of quadratic factors of (21) : 

(23) z*±2kz+$q+2k 2 ^. 

The four roots of these two quadratic functions are the four roots of (21). 
This method of Descartes (1596-1650) therefore succeeds unless every 
root of (22) is zero, whence g=s=r= 0, so that (12) is the trivial equation 

For example, consider z 4 — 33*+6z— 2«=0. Then (22) becomes 

64A«-3-32ifc«+4.17ifc*-36=0. 

The value k*=*l gives the factors 3 2 +2z— 1, z* — 2? +2. Equating these to zero, we 
find the four roots -l=fcV2, l=fcV-l. 

52. Symmetrical Form of Descartes' Solution. To obtain this sym- 
metrical form, we use all three roots fa 2 , fa 2 , fa 2 of (22). Then 

fa 2 +fa*+fa 2 = - ±q } fa 2 fa 2 fa 2 = £. 

It is at our choice as to which square root of k\ 2 is denoted by +fa and 
which by —fa, and likewise as to ±fa, ±fa. For our purposes any 
choice of these signs is suitable provided the choice give 

(24) fafafa=-g. 

Let ki7*0. The quadratic function (23) is zero for k = ki if 

(2±fcl) 2 !- fc l 2 ±i^ = fc2 2 +fr3 2 T^^ 3 = (fc 2 =Ffc 3 ) 2 . 

Hence the four roots of the quartic equation (21) are 

(25) fa+fa+fa, ki— fa— fa, — fa+fa— fa, — fa— fa+fa. 

EXERCISES 

1. Solve Exs. 4, 5 of § 48 by the method of Descartes. 

2. By writing yi, y%, y% for the roots fci 2 , fa*, fc 3 * of 

(26) Wy*+32w'+4fa*-48)y-r»=0, 
show that the four roots of (21) are the values of 

(27) 2 = Vyi+VyH-Vy, 



54 CUBIC AND QUARTIC EQUATIONS [Ch. IV 

for all combinations of the square roots for which 

(28) V^.\/^.V^=-^-. 

o 

3. Euler (1707-1783) solved (21) by assuming that it has a root of the form (27). 
Square (27), transpose the terms free of radicals, square again, replace the last factor 

of 8Vyiy*y* ( V t/i+ Vys+Vya) by z, and identify the resulting quartic in z with (21). 
Show that t/i, 2/2, y* are the roots of (26) and that relation (28) holds. 

4. Find the six differences of the roots (25) and verify that the discriminant A of 
(21) is equal to the quotient of the discriminant of (26) by 4 6 . 

5. In the theory of the inflexion points of a plane cubic curve there occurs the 
equation 

z* -Sz* -$Tz -&S* =0. 

Show that (26) now becomes 



H)"- "©'-©•• 



and that the roots of the quartic equation are 

±y/\s+</c±y/\s+<*</c±y/\s+<**</c^ 

where « is an imaginary cube root of unity and the signs are to be chosen so that the 
product of the three summands is equal to +J7\ 

MISCELLANEOUS EXERCISES 

1. Find the coordinates of the single real point of intersection of the parabola 
y—x 1 and the hyperbola xy— 4x-hy -1-6=0. 

2. Show that the abscissas of the points of intersection of y=x* and ax 8 — xy+y s — 
x — (a+5)y — 6 = are the roots of x 4 — x 3 — 5x*— x— 6=0. Compute the discriminant 
of the latter and show that only two of the four points of intersection are real. 

3. Find the coordinates of the two real points in Ex. 2. 

4. A right prism of height h has a square base whose side is b and whose diagonal 

is therefore b V 2. If v denotes the volume and d a diagonal of the prism, v=hb* and 
d*=A 2 -f- (6 V 2)*. Multiply the last equation by h and replace hb* by v. Hence h*-~ d*h 

+2v=0. Its discriminant is zero if d=3V 3, l>=27; find h. 

5. Find the admissible values of h in Ex. 4 when d = 12, t>=332.5. 

6. Find a necessary and sufficient condition that quartic equation (15) shall have 
one root the negative of another root. 

Hint: (xi-f x*) (xj+x*) —q—yi- Hence substitute q for y in (17). 

7. In the study of parabolic orbits occurs the equation tan iw+f tan 8 %v=*t. 
Prove that there is a single real root and that it has the same sign as t. 

8. In the problem of three astronomical bodies occurs the equation x*+ax+2*=0. 
Prove that it has three real roots if and only if a £ —3. 



CHAPTER V 



The Graph of an Equation 

63. Use of Graphs in the Theory of Equations. To find geometrically 
the real roots of a real equation f(x) = 0, we construct a graph of y=f(x) 
and measure the distances from the origin to the intersections of the 
graph and the x-axis, whose equation is y = 0. ^^ 

For example to find geometrically the real 
roots of 



(1) 



x 2 -6x-3 = 0, 



we equate the left member to y and make a 
graph of 



do 



y=x*— 6x— 3. 



We obtain the parabola in Fig. 12. Of the 
points shown, P has the abscissa #=OQ=4 
and the ordinate y=— QP= — 11. From the 
points of intersection of y = (the x-axis OX) 
with the parabola, we obtain the approximate 
values 6.46 and —0.46 of the roots of (1). 

EXERCISES 




Fig. 12 



1. Find graphically the real roots of x 2 — 6:c+7 =0. 

Hint: For each x, y=x 2 — 6x+7 exceeds the y in (1') by 10, so that the new graph 
is obtained by shifting the parabola in Fig. 12 upward 10 units, leaving the axes OX 
and OY unchanged. What amounts to the same thing, but is simpler to do, we leave 
the parabola and OY unchanged, and move the axis OX downward 10 units. 
. 2. Discuss graphically the reality of the roots of x 2 — 6x+12 =0. 

3. Find graphically the roots of x 2 — 6x+9 = 0. 

54. Caution in Plotting. If the example set were 

(2) 2/ = 8ar*-14a?-9x 2 +llz--2, 

one might use successive integral values of x, obtain the points (—2, 180), 

55 



GRAPHS 



(Ch.V 




Fio. 13 



(-1, 0), (0, -2), (1, -6), (2, 0), (3, 220), all but the 
first and last of which are shown (by crosses) in Fig. 
13, and be tempted to conclude that the graph is a ■ 
U-shaped curve approximately like that in Fig. 12 
and that there are just two real roots, — 1 and 2, of 
(20 S^—14a?-9x s +llx-2=0. 

But both of these conclusions would be false. In fact, 
the graph is a W-shaped curve (Fig. 13) and the 
additional real roots are i and }. 

This example shows that it is often necessary to 
employ also values of x which are not integers. The 
purpose of the example was, however, not to point 
out this obvious fact, but rather to emphasize the 
chance of serious error in sketching a curve through a 
number of points, however numerous. The true curve 
between two points below the x-axis may not cross the 
x-axis, or may have a peak and actually cross the x-axis 
twice, or may be an M-shaped curve crossing it four 
times, etc. 



For example, the graph (Fig. 14) of 
(3) y=3?+4z?-U 

crosses the x-axis only once; but this fact can- 
not be established by a graph located by a num- 
ber of points, however numerous, whose abscissas 
arc chosen at random. 

We shall find that correct conclusions regard- 
ing the number of real roots may be -deduced 
from a graph whose bend points (§ 55) have been 
located. 

55. Bend Points. A point (like M or M' in' 
Fig. 14) is called a bend point of the graph of 
y =/(*) if the tangent to the graph at that point 
is horizontal and if all of the adjacent points of 
the graph lie below the tangent or all above the 
tangent. The first, but not the second, condi- 




555] 



BEND POINTS 



57 



tion is satisfied by the point of the graph of y=x 3 given in Fig. 15 (see 
§ 57). In the language of the calculus, f(x) has a (relative) maximum or 
minimum value at the abscissa of a bend point on the graph of t/=/(x). 





Fig. 15 



Fig. 16 



Let P=(x f y) and Q=(x+h, Y) be two points on the graph, sketched 
in Fig. 16, of y=/(x). By the slope of a straight line is meant the tangent 
of the angle between the line and the z-axis, measured counter-clockwise 
from the latter. In Fig. 16, the slope of the straight line PQ is 



(4) 



Y-y _ f(x+h)-f(x) 
h h ' 

For equation (3) , f(x) = x 3 +4X 2 — 1 1 . Hence 
f(x+h) = (x+hf+4(x+h) 2 - 11 

=x 3 +4x 2 - 11 +(3x 2 +8x) h+(3x+4) tf+l?. 

The slope (4) of the secant PQ is therefore here 

3x2+8*+ (3z+4) h+h 2 . 

Now let the point Q move along the graph toward P. Then h approaches 
the value zero and the secant PQ approaches the tangent at P. The 
slope of the tangent at P is therefore the corresponding limit 3a^+8x 
of the preceding expression. We call 3a?+8x the derivative of x 3 +4X 2 —11. 



58 _ GRAPHS [Ch.V 

In particular, if P is a bend point, the slope of the (horizontal) tangent 
at P is zero, whence 3x 2 +8x = 0, x = or x=— f. Equation (3) gives 
the corresponding values of y. The resulting points 

M=(0,-11), Jf'»(-f,-|^ 

are easily shown to be bend points. Indeed, for x>0 and for x between 
—4 and 0, x 2 (x+4) is positive, and hence f(x)> — 11 for such values of 
x, so that the function (3) has a relative minimum at x=0. Similarly, 
there is a relative maximum at x— — f. We may also employ the general 
method of § 59 to show that M and M ' are bend points. Since these bend 
points are both below the x-axis we are now certain that the graph 
crosses the z-axis only once. 

The use of the bend points insures greater accuracy to the graph than 
the use of dozens pf points whose abscissas are taken at random. 

56. Derivatives. We shall now find the slope of the tangent to the 

graph of y=f(x), where /(x) is any polynomial 

(5) f(x)=a x n +aix n " 1 + . . . +a n _ l x+a n . 

We need the expansion of /(x+ft) in powers of x. By the binomial 
theorem, 

a (x+h) n = aox n +7mx n ' 1 h + U ^ ^ ao^^h 2 + . . . , 
ai(x+/ir- I = aia; w - I +(n-l)aix w - 2 /i+ (n ^ 1) (n " 2) aix^ 3 A 2 + 



i 



a n - 2 (x+h) 2 = a n _2P 2 +2a>n-2xh +a»-2h 2 , 

a n _ 1 (x+A)=a n _ 1 x +a n ^ 1 h J 

The sum of the left members is evidently f(x+h). On the right, the 
sum of the first terms (i.e., those free of ft) is f(x). The sum of the coef- 
ficients of ft is denoted by f'(x), the sum of the coefficients of \h 2 is denoted 
by f"(%)> • . • , the sum of the coefficients of 

ft* 
1-2 ... * 



§56] DERIVATIVES, TAYLOR'S THEOREM 59 

is denoted by f {k) (x) . Thus 

(6) / , (x)=no x n - 1 +(n--l)aiX n - 2 + . . . +2a n _ 2 x+a n _ 1 , 

(7) / ,/ (a:)=n(n-l)a x tt - 2 +(n-l) (n-2)a l x n ~ z + . . . +2a B _ 2 , 
etc. Hence we have 

(8) f{x+h)=f{x)+f\x)h+f"{x)^H'"{x) h3 



1-2 " v/ l-2-3 
r! J v nr 



+ . . . +f(€,+ • • • +/»(*&, 



where r! is the symbol, read r factorial, for the product 1-2-3 .. . (r— l)r. 
Here r is a positive integer, but we include the case r=0 by the definition, 

0!«1. 

This formula (8) is known as Taylor 1 s theorem for the present case of 
a polynomial /(x) of degree n. We call /'(a?) the (first) derivative off(x) 9 
and f"(x) the second derivative of f(x), etc. Concerning the fact that 
f"(x) is equal to the first derivative of f'(x) and that, in general, the Arth 
derivative f (t) (x) of f(x) is equal to the first derivative of f*~ l) (x), see 
Exs. 6-9 of the next set. 

In view of (8), the limit of (4) as h approaches zero is f'(x). Hence 
f(x) is the slope of the tangent to the graph of y=f(x) at the point (x, y). 

In (5) and (6), let every a be zero except ao. Thus the derivative of 
aox" is naox"" 1 ! and hence is obtained by multiplying the given term by 
its exponent n and then diminishing its exponent by unity. For example, 
the derivative of 2s 3 is 6x 2 . 

Moreover, the derivative of f(x) is equal to the sum of the derivatives 
of its separate terms. Thus the derivative of rr 3 +4x 2 — 11 is 3x 2 +8x, 
as found also in § 55. 

EXERCISES 

1. Show that the slope of the tangent to ?/ = 8x 3 — 22.r 2 +13x— 2 at {x t y) is 24x 2 - 
443+13, and that the bend points are (0.37, 0.203), (1.46, —5.03), approximately. 
Draw the graph. 

2. Prove that the bend points of t/=x 3 -2x-5 are (.82, -6.09), .(-.82, -3.91), 
approximately. Draw the graph and locate the real roots. 

3. Find the bend points of y = x 3 +6.c 2 +8.c +8. Locate the real roots. 

4. Locate the real roots of f(x) =x 4 +x 3 — x— 2 = 0. 

Hints: The abscissas of the bend points are the roots of f'(x) =4s 3 +3:c 2 — 1=0. 
The bend points of y =/'(*) are (0, — 1) and ( — \, — f ), so that }'{x) =0 has a single real 
root (it is just less than J). The single bend point of y=f(x) is (-J-, — j J), approxi- 
mately. 



60 GRAPHS [Ch.V 

5. Locate the real roots of x*— 7x*— 3x f +7=0. 

6. Prove that/"(x), given by (7), is equal to the first derivative of /'(a:). 

7. If f(x) =fi(x)+ft(x), prove that. the A:th derivative of / is equal to the sum of 
the fcth derivatives of /i and/ 2 . Use (8). 

8. Prove* that f (k) (x) is equal to the first derivative of / ( *~ !) (:r). Hint: prove this 
for/=ox m ; then prove that it is true for /=/i+/« if true for/i and /*. 

9. Find the third derivative of x*+5x* by forming successive first derivatives; 
also that of 2x* — 7x 3 -f x. 

10. Prove that if g and k are polynomials in x,the derivative of gk is g'k+gk'. Hint: 
multiply the members of g(x+h) =g(x)+g'(x)h+ . . .and k(x+h)=k{x)-\-k'(x)h+ . . . 
and use (8) for f-gk. 

67. Horizontal Tangents. If (x, y) is a bend point of the graph of 

y=f(x), then, by definition, the slope of the tangent at (x, y) is zero. 
Hence (§ 56), the abscissa x is a root of /'(x)=0. In Exs. 1-5 of the 
preceding set, it was true that, conversely, any real root of f'(x)=0 is 
the abscissa of a bend point. However, this is not always the case. 
We shall now consider in detail an example illustrating this fact. The 
example is the one merely mentioned in § 55 to indicate the need of the 
second requirement made in our definition of a bend point. 

The graph (Fig. 15) of y=x? has no bend point since x 3 increases when 
x increases. Nevertheless, the derivative 3x 2 of a? is zero for the real 
value x = 0. The tangent to the curve at (0, 0) is the horizontal li^e 
y = 0. It may be thought of as the limiting position of a secant through 
which meets the curve in two further points, seen to be equidistant 
from 0. When one, and hence also the other, of the latter points approaches 
0, the secant approaches the position of tangency. In this sense the 
tangent at is said to meet the curve in three coincident points, their 
abscissas being the three coinciding roots of a^=0. In the language of 
§ 17, x 3 =0 has the triple root x = 0. The subject of bend points, to which 
we recur in § 59, has thus led us to a digression on the important subject 
of multiple roots. 

• 

68. Multiple Roots. In (8) replace x by a, and h by x-a. Then 

(9) /(x)=/(a)+/'(a)(^-«)+/"(«)^^- 2 +/'"(«)^^+ . , . 

+f m - u \«) ( T a) n! +/ (m) («)^ Z ^'+ .... 
(m— 1)! ml 

By definition (§ 17) a is a root of f(x)=0 of multiplicity m if fix) is exactly 



§ 58] MULTIPLE ROOTS 61 

divisible by (x— a) w , but not by (x— a) m+1 . Hence a is a root of multi- 
plicity to o//(x) =0 if and only if 

(10) /(«)=0, /'(«)=0, /"(«)=0, ..., /<»"»(«)- 0, /^(a^O. 

For example, x 4 4-2x ,ss has the triple root x—0 since is a root, and since the 
first and second derivatives 4x 3 +6x 2 and 12x 2 -fl2x are zero for x=0, while the third 
derivative 24a; +12 is not zero for x—0. 

If in (9) we replace / by /' and hence / (t) by / ( * +1) , or if we differentiate 
every term with respect to x, we see by either method that 

(id /'(*) =/»+/" («)(*-«)+ . . . +/ cm - i) («)feT~ 2 



(m-2)! 



+/ (w) (<*) ( ? a n7 + • • • • 



Let/(x) and/'(x) have the common factor (x— a) m_1 , but not the com- 
mon factor (x— a) w , where m>l. Since (11) has the factor (x— a) m_1 , we 
have /'(a)=0, . . . , f im ~ 1) (a) = 0. Since also f(x) has the factor x— a, 
evidently /(a)=0. Then, by (9), /(x) has the factor (x-a) m , which, 
by hypothesis, is not also a factor of /'(x). Hence, in (11), / (m) (a)^0. 
Thus, by (10), a is a root of /(x) =0 of multiplicity m. 

Conversely, let a be a root of /(x)=0 of multiplicity m. Then rela- 
tions (10) hold, and hence, by (11), /'(x) is divisible by (x— a)"*" 1 , but 
not by (x— a) m . Thus/(x) and/'(x) have the common factor (x— a)*" 1 , 
but not the common factor (x— a) m . 

We have now proved the following useful result. 
Theorem. If /(x) and f'(x) have a greatest common divisor g(x) 
involving x, a root of g(x)=0 of multiplicity ra-1 is a root of /(x)=0 of 
multiplicity m, and conversely any root of f(x) =0 o/ multiplicity m is a root 
of g(x)=0 of multiplicity m— 1. 

In view of this theorem, the problem of finding all the multiple roots 
of f(x)=0 and the multiplicity of each multiple root is reduced to the 
problem of finding the roots of g{x) =0 and the multiplicity of each. 

[ For example, let/(x) =x»-2x a -4x+8. Then 

" f'{x) =3x*-4x-4, 9/0r) =/'(xK3z-2) -32(a:-2). 

Since x— 2 is a factor of f'(x), it may be taken to be the greatest common divisor of f(x) 
and f'(x), the choice of the constant factor c in c(x— 2) being here immaterial. Hence 
2 is a double root off(x) —0 t while the remaining root —2 is a simple root. 



62 GRAPHS [Ch. V 

EXERCISES 

* 

1. Prove that x 3 — 7x 2 +15x— 9 = has a double root. 

2. Show that x A — 8x 2 -f 1(3 = has two double roots. 

3. Prove that x A — ftr 2 — Sx — 3 = has a triple root. 

4. Tost x 4 -8.r 3 +22x 2 -24x+9 = for multiple roots. 

5. Test j; 3 -6x 2 +Hx-6=0 for multiple roots. 

6. Test x A - Ox 8 +0x 2 +8lx- 102=0 for multiple roots. 

59. Ordinary and Inflexion Tangents. The equation of the straight 
line through the point (a, p) with the slope s is y—p = s(x—a). The slope 
of the tangent to the graph of y—f{x) at the point (a, /3) on it is s=f(a) 
by § 56. Also, p=f(a). Hence the equation of the tangent is 

(12) y =/(«)+/'(«) (x- a). 

By subtracting the members of this equation from the corresponding 
members of equation (9), we see that the abscissas x of the points of inter- 
section of the graph of y—f{x) with its tangent satisfy the equation 

/"W^^+/'"W^fr- ? + • . • +/ ( "- 1) (g) ( ^~" ) 1 w ) 7 

+/(m)(a) ^-+ • • • =0 - 

Here the term containing / (m_1) (a) must evidently be suppressed if m = 2, 
since the term containing / (m) (a) then coincides with the first term. 

If a is a root of multiplicity m of this equation, i.e., if the left member 
is divisible by (x— a) m , but not by (x— a) w+1 , the point (a, 0) is counted 
as in coincident points of intersection of the curve with its tangent (just 
as in the case of y = a? and its tangent y = in § 57). This will be the case 
if and only if 

(13) /"(a)=0, /"'(«)=0,..., /<»- 1 >(a)=0, /<»>(«)*0, 

in which ra>l and, as explained above, only the final relation /"(a) 5^0 
is retained if in = 2. If in = 3, the conditions are /"(«) = 0, / (3) (a) ?*0. 

For example, if f(x)=x* and a = 0, then /"(0) =/'" (0)=0, / (4) (0) =24*0, so that 
m=4. The graph of y—x 4 is a [/-Shaped curve, whose intersection with the tangent 
(the x-axis) at (0, 0) is counted as four coincident points of intersection. 

Given f(x) and a, we can find, as in the preceding example, the value 
of m for which relations (13) hold. We then apply the 



59| 



ORDINARY AND INFLEXION TANGENTS 



63 



Theorem. If m is even (ra>0), the points of the curve in the vicinity 
of the point of tangency (a, fi) are all on the same side of the tangent, which 
is then called an ordinary tangent. But if m is odd (m> 1), the curve crosses 
the tangent at the point of tangency (a, 0), and this point is called an inflexion 
point, while the tangent is called an inflexion tangent. 

For example, in Fig. 15, OX is an inflexion tangent, while the tangent at any point 
except O is an ordinary tangent. In Figs. 18, 19, 20, the tangents at the points marked 
by crosses are ordinary tangents, but the tangent at the point midway between them 
and on the t/-axis is an inflexion tangent. 

To simplify the proof, we first take as new axes lines parallel to the 
old axes and intersecting at (a, &). In other words, we set x— a=X, 
y— f$=Y, where X, Y are the coordinates of (x, y) referred to the new 
axes. Since 0=/(a), the tangent (12) becomes Y=f'(a)X, while, by (9), 
y=f(x)=P+f'(a) (x— a)+ . . . becomes 



(14) 



V2 Vm 

7=/'(«)X+/"(a)^-+ . . .=/'(a)X+/<"»(«)£ T + . . . , 



after omitting terms which are zero by (13). 
To simplify further the algebraic work, 
we pass to oblique axes, 1 the new y-axis 
coinciding with the y-axis, while the new 
x-axis is the tangent, the angle between 
which and the X-axis is designated by 6. 
Then 

tan 6=f'(a). 
By Fig. 17, 

X=xcos0, Y-y=f'(a)X. 

Hence when expressed in terms of the 
new coordinates x, y, the tangent is y=0, while the equation (14) of the 
curve becomes 




y=cx m +dx m+1 + . . . , 



/ (w) («)cos m ^ 

c= J — ^- f 5*0. 

ml 



For x sufficiently small numerically, whether positive or negative, 
the sum of the terms after ex™ is insignificant in comparison with ex" 1 

1 Since the earlier x, y do not occur in (14) and the new equation of the tangent 
we shall designate the final coordinates by x, y without confusion. 



64 GRAPHS [Ch.V 

so that y has the same sign as ex* (§ 64). Hence, if ra is even, the points 
of the curve in the vicinity of .the origin and on both sides of it are all 
on the same side of the x-axis, i.e., the tangent. But, if m is odd, the points 
with small positive abscissas x lie on one side of the x-axis and those with 
numerically small negative abscissas lie on the opposite side. 

Our transformations of coordinates changed the equations of the 
curve and of its tangent, but did not change the curve itself and its tangent. 
Hence our theorem is proved. 

By our theorem, a is the abscissa of an inflexion point of the graph 
of y=f(x) if and only if conditions (13) hold with m odd (m>l). These 
conditions include neither /(a)=0 nor /'(a)=0, in contrast with (10). 
In the theory of equations we are primarily interested in the abscissas 
a of only those points of inflexion whose inflexion tangents are horizontal, 
and are interested in them, because we must exclude such roots a of 
f'(x) =0 when seeking the abscissas of bend points, which are the important 
points for our purposes. A point on the graph at which the tangent is 
both horizontal and an ordinary tangent is a bend point by the definition 
in §55. Hence if we apply our theorem to the special case /'(a)=0, 
we obtain the following 

Criterion. Any root a of f'(x)=0 is the abscissa of a bend point of 
the graph of y=f(x) or of a point with a horizontal inflexion tangent according 
as the value of m for which relations (13) hold is even or odd. 

For example, if f(x) =x A , then a=0 and m=4, so that (0, 0) is a bend point of the 
{/-shaped graph of y=x*. If /(x)=x 8 , then a— and m = 3, so that (0, 0) is a point 
with a horizontal inflexion tangent (OX in Fig. 15) of the graph of y^x*. 

EXERCISES 

1. If /(x) =3x»+ 5x 8 +4, the only real root of f'(x) =0 is x=0. Show that (0, 4) 
is an inflexion point, and thus that there is no bend point and hence that /(a;) =0 has a 
single real root. 

2. Prove that x 3 — 3x a +3x+c=0 has an inflexion point, but no bend point. 

3. Show that x 6 — 10x 3 — 20x 2 — 15x-f-c=0 has two bend points and no horizontal 
inflexion tangents. 

4. Prove that 3x* — 40x 8 -f240x-f-c=0 has no bend point, but has two horizontal 
inflexion tangents. 

5. Prove that any function x 8 — 3ax 2 -f- ... of the third degree can be written in 
the form /(x) = (x — a)*+ax+b. The straight line having the equation y »ax+6 meets 
the graph of y=J(x) in three coincident points with the abscissa a and hence is an 
inflexion tangent. If we take new axes of coordinates parallel to the old and inter- 
secting at the new origin (a, 0), i.e., if we make the transformation x=X+a, y=*Y, 



560] 



REAL ROOTS OF A CUBIC 



65 



of coordinates, we see that the equation f(z) =0 becomes, a i educed cubic equation 
X a +pX+g=0(542). 

6. Find the inflexion tangent to y=x*+6x 2 — 3x4-1 and transform s'+Gas* — Zx 
+1=0 into a reduced cubic equation. 

60. Real Roots of a Real Cubic Equation. It suffices to consider 

f(x)=2?-3lx+q (1*0), 

in view of Ex. 5 above. Then /' = 3 (x 2 -l), f" = 6x. If Z<0, there is 
no bend point and the cubic equation /(x) =0 has a single real root. 
If l>0, there are two bend points 

(VT, q-2lVT), (-VT, q+2lVT), 
which are shown by crosses in Figs. 18-20 for the graph of y=f(x) in the 






Fig. 18 

three possible cases specified by the inequalities shown below the figures. 
For a large positive x, the term rr 3 in f(x) predominates, so that the graph 
contains a point high up in the first quad- 
rant, thence extends downward to the 
right-hand bend point, then ascends to 
the left-hand bend point, and finally de- 
scends. As a check, the graph contains 
a point far down in the third quadrant, 
since for x negative, but sufficiently large 
numerically, the term s? predominates and the sign of y is negative. 

If the equality sign holds in Fig. 18 or Fig. 19, a necessary and sufficient 
condition for which is g 2 =4J 3 , one of the bend points is on the s-axis, and 
the cubic equation has a double root. The inequalities in Fig. 20 hold 
if and only if g 2 <4P, which implies that l>0. Hence a?-3lx+q=0 
hds three distinel real roots if and only if q 2 <4l s , a single real root if and 
only if q*>4P, a double root (necessarily real) if and only if q 2 =4P and Z?*0, 
and a triple root if q 2 =4P =0. 



Fig. 20 



66 GRAPHS (Ch. V 

EXERCISES 

Find the bend points, sketch the graph, and find the number of real roots of 

1. x*+2x-4=0. 2. x 3 -7x+7 = 0. 

3. x*-2x-l=0. # 4. x a +6x a -3x+l«0. 

5. Prove that the inflexion point of y=x* — Zlx+q is (0, q). 

6. Show that the theorem in the text is equivalent to that in § 45. 

7. Prove that, if m and n are positive odd integers and rw>n, x m +px n +q=*Q has 
no bend point and hence has a single real root if p>0; but; if p<0, it has just two 
bend points which are on the same side or opposite sides of the x-axis according as 

tn—n 



/np\ m / ng V 
\m / \ra— n/ 



is positive or negative, so that the number of real roots is 1 or 3 in the respective cases. 

8. Draw the graph of y — x* — x*. By finding its intersections with the line y = mx+6, 
solve x 4 — x 2 —mx — 6=0. 

9. Prove that, if p and q are positive, x 2m — px 2n -hg = has four distinct real roots, 
two pairs of equal roots, or no real root, according as 



/np\ m / nq\ 
\m J \m—n/ 



tn-n 

>0, =0, or <0. 



10. Prove that no straight line crosses the graph of y=f(x) in more than n points if 
the degree n of the real polynomial /(x) exceeds unity. [Apply § 16.] This fact serves 
as a check on the accuracy of a graph. 

61. Definition of Continuity of a Polynomial. Hitherto we have 
located certain points of the graph of y—f{x) } where f(x) is a polynomial 
in x with real coefficients, and taken the liberty to join them by a con- 
tinuous curve. 

A polynomial f(x) with real coefficients shall be called continuous at 
x = a, where a is a real constant, if the difference 

D=f(a+h)-f(a) 

is numerically less than any assigned positive number p for all real values 
of h sufficiently small numerically. 

62. Any Polynomial fix) with real Coefficients is continuous at x = a 9 
where a is any real Constant. Taylor's formula (8) gives 

D-r(«)*+^W . . . +£^#. 

This polynomial is a special case of 

F = a\h+02h 2 + . . . +a n h*. 



5 63] 



ROOT LOCATED BY CHANGE OF SIGN 



67 



We shall prove that, if a\, . . . , a* are alt real, F is numerically less thav 
any assigned positive number p for all real values of h sufficiently sma^ 
numerically. Denote by g the greatest numerical value of ai, . . . , a n . 
If h is numerically less than k, where A;<1, we see that F is numerically 
less than 

g(k+k*+ ...+*■) Kg^ <p, if *<^. 

Hence a real polynomial f(x) is continuous at every real value of x. But 
the function tan x is not continuous at x =90° (§ 63). 

63. Root between a and b if /(a) and /(&) have opposite Signs. If 

the coefficients of a polynomial f(x) are real and if a and b are real numbers 
such that f(a) and f(b) have opposite signs, the equation f{x) =0 has at least 
one real root between a and b; in fact, an odd number of such roots, if an 
m-fold root is counted as m roots. 

The only argument l given here (other than that in Ex. 5 below) is 
one based upon geometrical intuition. We are stating that, if the points 

(a, /(a)), (6, f{b)) 

lie on opposite sides of the x-axis, the graph of y=f(x) crosses the x-axis 
once, or an odd number of times, between the vertical lines through 
these two points. Indeed, the part of the graph be- 
tween these verticals is a continuous curve having one 
and only one point on each intermediate vertical line, 
since the function has a single value for each value 
of x. 

This would not follow for the graph of y 2 =x, which 
is a parabola with the x-axis as its axis. It may not 
cross the x-axis between the two initial vertical lines, 
but cross at a point to the left of each. 

A like theorem does not hold for f(x) =tan x, when 
x is measured in radians and 0<a<7r/2<6<7r, since 
tan x is not continuous at x=7r/2. When t increases 
from a to r/2, tan x increases without limit. When 
x decreases from b to w/2, tan x decreases without 
limit. There is no root between a and b of tan x = 0. 

1 An arithmetical proof based upon a refined theory of irrational numbers is given 
In Weber's Jjehrbuch tier Algebra, eel. 2, vol. 1, p. 123. 




O 



a JL 

2 




Fig. 21 



68 GRAPHS [Ch. V 

EXERCISES 

1. Prove that 8x*— 4x 2 — 18x+9=0 has a root between and 1, one between 1 and 
2, and one between —2 and —1. 

2. Prove that 16x 4 — 24x*+16x— 3=0 has a triple root between and 1, and a 
simple root between —2 and —1. 

3. Prove that if a<b<c . . . <l, and a, 0, . . . , X are positive, these quantities 

being all real, 

a 8 y X 

x— a x—b x—c x—l 

has a real root between a and b, one between b and c, . . . . one between k and I, and 
if t is negative one greater than /, but if t is positive one less than a. 

4. Verify that the equation in Ex. 3 has no imaginary root by substituting r+si 
and r — si in turn for x, and subtracting the results. 

5. Admitting that an equation /(x) =x n + . . . =0 with real coefficients has n roots, 
show algebraically that there is a real root between a and 6 if /(a) and/(6) have opposite 
signs. Note that a pair of conjugate imaginary roots cdtdi are the roots of 

(x-c) 2 +d 2 =0 

and that this quadratic function is positive if x is real. Hence if xi, . . . , Xr are the 
real roots and 

4>(x)=(x—xi) . . . (x— Xr), 

then <f>(a) and <t>(b) have opposite signs. Thus a— x\ and 6— x\ have opposite signs for 
at least one real root x%. (Lagrange.) 

64. Sign of a Polynomial, (liven a polynomial 

f(x)=a x n +aix n - 1 -+ . . . +dn (ao^O) 

with real coefficients, we can find a positive number P such that f(x) has 
the same sign as aoX n when x>P. In fact, 



/(s)=s»(ao+</>), *=-+£§+ ...+=• 



2± I £2 , , On 

X "V 1 " ' * ' "*>• 



By the result in § 62, the numerical value of </> is less than that of ao 
when 1/x is positive and less than a sufficiently small positive number, 
say 1/P, and hence when x>P. Then ao+</> has the same sign as ao, 
and hence f(x) the same sign as aox n . 

The last result holds also when x is a negative number sufficiently large 
numerically. For, if we set x=— X, the former case shows that /(— X) 
has the same sign as (— l) n aoX n when X is a sufficiently large positive 
number. 



§ 651 ROLLE'S THEOREM 69 

We shall therefore say briefly that, for x = +oo, f(x) has the same 
sign as ao; while, for x= — oo, f(x) has the same sign as ao if n is even, 
but the sign opposite to ao if n is odd. 

EXERCISES 

1. Prove that x*+ax*+bx— 4=0 has a positive real root [use s=0 and x = + oo ]. 

2. Prove that x*4-ac*+6xH-4 =0 has a negative real root [use x =0 and x = — oo ]. 

3. Prove that if ao> and n is odd, Ooa^H- . . . +a n =0 has a real root of sign opposite 
to the sign of On [use x = — oo , 0, + oo ]. 

4. Prove that x 4 -H*r 8 +&£ 2 +cz--4=0 has a positive and a negative root. 

5. Show that any equation of even degree n in which the coefficient of x n and the 
constant term are of opposite signs has a positive and a negative root. 

65. Rolle's Theorem. Between two consecutive real roots a and b off(x) 
=0, there is an odd number of real roots of f'(x) = 0, a root of multiplicity m 
being counted as m roots. 

Let 

f(x) b (x-a) r (x- b)'Q(x), a<6, 

where Q(x) is a polynomial divisible by neither x— a nor x— b. Then 
by the rule for the derivative of a product (§ 56, Ex. 10), 

(x-a)(x-b)f'(x) , . M , , ... w ,,Q'(x) 
}(x) =r(x-b)+s(x-a) + (x-a)(x-b)?^. 

The second member has the value r(a— 6)<0 for x=a and the value 
s(b— a)>0 for x = b f and hence vanishes an odd number of times between 
a and b (§63). But, in the left member, (x—a) (x—b) and/(x) remain 
of constant sign between a and 6, since /(x) — has no root between a 
and 6. Hence f'(x) vanishes an odd number of times. 

Corollary. Between two connective real roots a and & of f'(x)=Q 
there occurs at most one real root of f{x) =0. 

For, if there were two such real roots a and b of f(x) =0, the theorem 
shows that/'(x) =0 would have a real root between a and b and hence be- 
tween a and 0, contrary to. hypothesis. 

Applying also § 63 we obtain the 

Criterion. If a and ft are consecutive real roots off f (x) =0, thenf(x) =0 
has a single real root between a and ft if f(a) and f{&) have opposite signs , 
but no root if they have like signs. At most one real root off(x)=0 is greater 
than the greatest real root off'(x) = 0, and at most one real root of fix) =0 is . 
less than the lead real root of f'{x) =0. 



70 GRAPHS [Ch. V 

If /(a)=0 for our root a of /'(x)=0, a is a multiple root of /(s)=0 

and it would be removed before the criterion is applied. 

... v - - 
Example. For/(x) =3x 5 -25x s + 60x-20, 

T^/'W =x 4 -5x*4-4 = fx»-l) (x*-4). 

Hence the roots of /'(x) =0 are ±1, ±2. Now 

/(-co): oo, /(-2) = -36, /(-l) = -58, /(1)=18, /(2) = -4, /(+oo) = + oo. 

Hence there is a single real root in each of the intervals 

(-1, 1), (1, 2), (2,+ »), 

and two imaginary roots. The three real roots are positive. 

EXERCISES 

1. Prove that x 5 — 5x+2 =0 has 1 negative, 2 positive and 2 imaginary roots. 

2. Prove that x 8 +x— 1 =0 has 1 negative, 1 positive and 4 imaginary roots. 

3. Show that x 6 — 3x*+2x 2 — 5=0 has two imaginary roots, and a real root in each 
of the intervals (-2, -1.5), (-1.5, -1), (1, 2). 

4. Prove that 4s 5 — 3x 4 — 2x 2 +4x — 10=0 has a single real root. 

5. Show that, if / (l) U0 =0 has imaginary roots, /(x) =0 has imaginary roots. 

6. Derive Rolle's theo^m from the fact that there is an odd number of bend points 
between a and 6, the abscissa of each being a root of /'(x) =0 of odd multiplicity, while 
the abscissa of an inflexion point with a horizontal tangent is a root of /'(x) =0 of even 
multiplicity. 



CHAPTER VI 
Isolation of the Real Roots of a Real Equation 

66. Purpose and Methods of Isolating the Real Roots. In the next 
chapter we shall explain processes of computing the real roots of a given 
real equation to any assigned number of decimal places. Each such 
method requires some preliminary information concerning the root to 
be computed. For example, it would be sufficient to know that the root 
is between 4 and 5, provided there be no other root between the same 
limits. But in the contrary case, narrower limits are necessary, such 
as 4 and 4.3, with the further fact that only one root is between these new 
limits. Then that root is said to be isolated. 

If an equation has a single positive root and a single negative root, the real roots 
are isolated, since there is a single root between — oo and 0, and a single one between 
and + oo . However, for the practical purpose of their computation, we shall need 
narrower limits, sufficient to fix the first significant figure of each root, for example 
—40 and -30, or 20 and 30. 

We may isolate the real roots of /(x) =0 by means of the graph of 
y=f(x). But to obtain a reliable graph, we saw in Chapter V that we 
must employ the bend points, whose abscissas occur among the roots 
f'(x)=0. Since the latter equation is of degree n— 1 when f(x)=0 is of 
degree n, this method is usually impracticable when n exceeds 3. The 
method based on Rolle's theorem (§ 65) is open to the same, objection. 

The most effective method is that due to Sturm (§68). We shall, 
however, begin with Descartes' rule of signs since it is so easily applied. 
Unfortunately it rarely tells us the exact number of real roots. 

67. Descartes' Rule of Signs. Two consecutive terms of a real poly- 
nomial or equation are said to present a variation of sign if their coefficients 
have unlike signs. By the variations of sign of a real polynomial or equa- 
tion we mean all the variations presented by consecutive terms. 

Thus, in a^— 2x*— 4x 2 +3=0, the first two terms present a variation of sign, and 
likewise the last two terms. The number of variations of sign of the equation is two. 

71 



72 ISOLATION OF REAL ROOTS [Ch. VI 

Descaktes' Rule. The number of positive real roots of an equation 
with real coefficients is either equal to the number of its variations of sign 
or is less than that number by a positive even integer. A root of multiplicity 
m is here counted as m roots. 



j 



For example, x 6 — 3x*+x+l =0 has either two*>r no positive roots, the exact number 
not being found. But 3x*— x— 1=0 has exactly one positive root, which is a simple 
root. 

Descartes' rule will be derived in § 73 as a corollary to Budan's theorem. 
The following elementary proof 1 was communicated to the author by 
Professor D. R. Curtiss. 

Consider any real polynomial | 

/(^^oo^+ai^-^ . . . +a# n - % (ao5*0, a,^0). 

Let r be a positive real number. By actual multiplication, 

F( x ) = (z-r)f(x)=A x n + 1 +Aix»+ . . . +A H . 1 x»~ l , 
where 

Ao=ao, Ai=ai—rao, A2=a*—ra\, . . . , Ai=a x — ra^i, il, +1 = — ta%, ; 

In f(x) let a tl be the first non-vanishing coefficient of different sign from 
do, let a^ be the first non-vanishing coefficient following a tl and of the 
same sign as ao, etc., the last such term, a^, being either a t or of the same 
sign as a,. Evidently v is the number of variations of sign of f(x). 

For example, iff(x) =2x 8 +3x 8 — 4x*— 6x*+7x, we have v —2, at x -at^ —4, a*,=oe=7. 
Note that eu = since x 2 is absent. 

The numbers Ao, A tv . . . , A*,, il, +1 are all different from zero and 
have the same signs as ao, a kv . . . , a kv , —a h respectively. This is 
obviously true for Ao=ao and A J+1 = —ra t . Next, A^ is the sum of the 
non-vanishing number a^ and the number — ra^-i, which is either zero 
or else of the same sign as a^ since a^-i is either zero or of opposite sign 
to a^. Hence the sum A^ is not zero and has the same sign as a^. 

By hypothesis, each of the numbers ao, a 4l , . . . , a^ after the first 
is of opposite sign to its predecessor, while — aj is of opposite sign to a^ 
Hence each term after the first in the sequence Ao, A* v . . . , At,, iij +1 
is of opposite sign to its predecessor. Thus these terms present v+1 
variations of sign. We conclude that F(x) has at least one more vari- 
ation of sign than f(x). But we may go further and prove the following 

1 The proofs given in college algebras are mere verifications of special cases. 



1 67] DESCARTES' RULE OF SIGNS 73 

Lemma. The number of variations of sign of F(x) is equal to thai of 
f(x) increased by some positive odd irUeger. 

For, the sequence Ao, Ai, . . . , A kl has an odd number of variations 

of sign since its first and last terms are of opposite sign; and similarly 

for the v sequences 

At v <A*i+ij . . . , Aj^i 



t©» sit v +i, . . . , ^z+i« 



The total number of variations of sign of the entire sequence Ao f A\, . . . , 
Ai +1 is evidently the sum of the numbers of variations of sign for the 
v+1 partial sequences indicated above, and is thus the sum of v+1 posi- 
tive odd integers. Since each such odd integer may be expressed as 1 
plus or a positive even integer, the sum mentioned is equal to v+1 plus 
or a positive even integer, i.e., to v plus a positive odd integer. 

To prove Descartes' rule of signs, consider first the case in which /(x) =0 
has no positive real roots, i.e., no real root between and +oo. Then 
/(0) and /(ao) are of the same sign (§ 63), and hence the first and last 
coefficients of f(x) are of the same sign. 1 Thus f{x) has either no vari- 
ations of sign or an even number of them, as Descartes' rule requires. 

Next, let /(a;) =0 have the positive real roots ri, . . . , r t and no others. 
A root of multiplicity m occurs here m times, so that the r's need not be 
distinct. Then 

f(x) = (x-ri) . . . (x-r k )<t>(x) 9 

where 4>fa) is a polynomial with real coefficients such that </>(x)=0 has 
no positive real roots. We saw in the preceding paragraph that <t>(x) 
has either no variations of sign or an even number of them. By the 
Lemma, the product (x— rt)<t>(x) has as the number of its variations of 
sign the number for 4>(x) increased by a positive odd integer. Similarly 
when we introduce each new factor x— r it Hence the number of varia- 
tions of sign of the final product fix) is equal to that of <l>(x) increased 
by k positive odd integers, i.e., by k plus or a positive even integer. 
Since 4>(x) has either no variations of sign or an even number of them, 
the number of variations of sign off(x) is k plus or a positive even integer, 
a result equivalent to our statement of Descartes' rule. 

1 In case f(x) has a factor x n ~ l , we use the polynomial f(x)/x n ~ l instead of f(x) in 
this argument. 



74 ISOLATION OF REAL ROOTS [Ch. VI 

If —p is a negative root of f(x) =0, then p is a positive root of /(— x) =0. 
Hence we obtain the 

Corollary. The number of negative root s of f(x) =0 is either equal 
to the number of variations of sign of /(— x ) or is less than that number 
by a positive even integer. 

For example, x 4 +3x 8 +x— 1=0 has a single negative root, which is a simple root, 
since x 4 — 3x 3 — x— 1 =0 has a single positive root. 

As indicated in Exs. 10, 11 below, Descartes' rule may be used to isolate 
the roots. 

EXERCISES 

Prove by Descartes' rule the statements in Exs. 1-8, 12, 15. 

1. An equation all of whose coefficients are of like sign has no positive root. Why 
is this self-evident? 

2. There is no negative r oot of an equat ion, like x 5 — 2x 4 — 3x 2 +7x — 5=0, in which 
the coefficients of the odd powers of x are of like sign, and the coefficients of the even 
powers (including the constant term) are of the opposite sign. Verify by taking x = — p, 
where p is positive. 

3. x 3 +a 2 x+b 2 = has two imaginary roots if 6 3*0. 

4. For n even, x n — 1 =0 has only two real roots. 

5. For n odd, x n — 1 =0 has only one real root. 

6. For n even, x n -fl =0 has no real root; for n odd, only one. 

7. x 4 -f 12x 2 +5x — 9=0 has just two imaginary roots. 

8. x*+a 2 x 2 +b 2 x — c 2 = (c^0) has just two imaginary roots. 

9. Descartes' rule enables us to find the exact number of positive roots only when 
all the coefficients are of like sign or when 

f(x)=x n +pxX n ~ l + . . . +Pn-*c'-Pn-s+lX 8 ~ 1 - . . . ~Pn=0, 

each pi being £^0. Without using that rule, show that the latter equation has one 
and only one positive root r. Hints: There is a positive root r by § 63 (a=0, 6= oo ). 
Denote by P(x) the quotient of the sum of the positive terms by x*, and by —N(x) 
that of the negative terms. Then N(x) is a sum of powers of l/x with positive coef- 
ficients. 

If x>r, P(x)>P(r), mx)<N(r), f(x)>0; 
If x<r, P(x)<P(r), N(x)>N(r), f(x)<0. 

10. Prove that we obtain an upper limit to the number of real roots of f(x) =0 
between a and 6, if we set 

a+by 



x = 



/ x— a\ 



1+y 
multiply by (l+y) n , and apply Descartes' rule to the resulting equation in y. 



§ 68] STURM'S METHOD 75 

11. Show by the method of Ex. 10 that there is a single root between 2 and 4 of 
x , +x , -17x+15=0. Here we have 272/ 3 +3£ 2 -232/-7=0. 

12. In the astronomical problem of three bodies occurs the equation 

r 5 +(3-M)r 4 +(3-2M)r 3 -Mr l -2 M r-M=0, 

where 0<m<C1. Why is there a single positive real root? 

13. Prove that &-\-x % — x* H-2x— 3=0 has four imaginary roots by applying Des- 
cartes' rule to the equation in y whose roots are the squares of the roots of the former. 
Transpose the odd powers, square each new member, and replace x 2 by y. 

14. As in Ex. 13 prove that a: 3 +x 2 +8x+6 = has imaginary roots. 

15. If a real equation f(x) =0 of degree n has n real roots, the number of positive 
roots is exactly equal to the number V of variations of sign. Hint: consider also 

16. Show that x z —x t +2^+1=0 has no positive root. Hint: multiply by x-\-l. 

68. Sturm's Method. Let fax) = be an equation with real coefficients, 
and f{x) the first derivative of fax). The first step of the usual process 
of finding the greatest common divisor of f(x) and /'(a;), if it exists, con- 
sists in dividing / by /' until we obtain a remainder r(x), whose degree 
is less than that of /'. Then, if q\ is the quotient, we have f=qif+r. 
Instead of dividing /' by r, as in the greatest common divisor process, and 
proceeding further in that manner, we write fa=—r y divide/' by/2, and 
denote by fa the remainder with its sign changed. Thus 

f=Qif-fa, /' = 92/2-/3, /2 = £3/3-/4, . . . . 

The latter equations, in which each remainder is exhibited as the nega- 
tive of a polynomial /«, yield a modified process, just as effective as the 
usual process, of finding the greatest common divisor G of fix) and fix) if 
it exists. 

Suppose that —fa is the first constant remainder. If fa = 0, then /3 = G, 
since fa divides fa and hence also /' and / (as shown by using our above 
equations in reverse order); while, conversely, any common divisor of 
/ and/' divides fa and hence also/3. 

But it fa is" a constant 5*^0, / and /' have no common divisor involving 
x. This case arises if and only if /(x)=0 has no multiple root (§58), 
and is the only case considered in §§ 69-71. 

Before stating Sturm's theorem in general, we shall state it for a 
numerical case and illustrate its use. 



76 ISOLATION OF REAL ROOTS 



[Ck.VI 



Example. /(x)=x 8 +4x'-7. Then f'=3x l +8x, 

/' - («*+** vi -/«, /. - mi. 

For 1 i=l, the signs of /, /', / 2 , f Zf are h + +, showing a single variation of 

consecutive signs. For x=2, the signs are + + + +, showing no variation of sign. 
Sturm's theorem states that there is a single real root between 1 and 2. For x = — oo f 

the signs are 1 h, showing 3 variations of sign. The theorem states that there 

are 3 — 1 =2 real roots between — oo and 1. Similarly, 



X 


Signs 


Variations 


-1 


— — 


+ + 


1 


-2 


+ - 


- + 


2 


-3 


+ + 


- + 


2 


-4 


- + 


- + 


3 



Hence there is a single real root between —2 and —1, and a single one between —4 
and —3. Each real root has now been isolated since we have found two numbers 
such that a single real root lies between these two numbers or is equal to one of them. 

Some of the preceding computation was unnecessary. After isolating a root between 
—2 and —1, we know that the remaining root is isolated between — oo and —2. But 
before we can compute it by Horner's nethod, we need closer limits for it. For that 
purpose it is unnecessary to find the Signs of all four functions, but merely the sign 
of/ (§63). 

69. Sturm's Theorem. Let f(x) = be an equation with real coefficients 
and without multiple roots. Modify the usual process of seeking the great- 
est common divisor of f{x) and its first derivative 2 /i(x) by exhibiting each 
remainder as the negative of a polynomial /<: 

( 1 ) f=<]lfl — /2, /l = 92/2 ~ /3, /2 = 93/3-/4, . . . , /»- 2 = Qn- l/n- 1 — fn, 

where* /„ is a constant?*0. If a and b or*, real numbers, a<b, neither 

1 Before going further, check that the preceding relations hold when x » 1 by insert- 
ing the computed values of /, /', fi f or x = 1 . Experience shows that most students make 
some error in finding f-i, / 3 , . . . , so that checking is essential. 

2 The notation /1 instead of the usual /', and similarly /o instead of /, is used to reg- 
ularize the notation of all the fs, and enables us to write any one of the equations (1) 
in the single notation (3) . • 

* If the division process did not yield ultimately a constant remainder 3^0, / and f» 
would have a common factor involving x, and hence /(x) =0 a multiple root. 



i 69] STURM'S THEOREM 77 

a root of f{x) =0, the number of real roots off(x) = between a and b is equal 
to the excess of the number of variations of sign of 

(2) fix), M*), /»(*), *••> /.-i(«), /» 

/or x=a over the number of variations of sign for x = b. Terms which vanish 
are to be dropped out before counting the variations of sign. 

For brevity, let V x denote the number of variations of sign of the 
numbers (2) when x is a particular real number not a root of f{x) = 0. 

First, if x\ and X2 are real numbers such that no one of the continuous 
functions (2) vanishes for a value of x between x\ and X2 or for x=xi or 
x = X2, the values of any one of these functions for x=x\ and x=X2 are 
both positive or both negative (§63), and therefore V Xl = V^. 

Second, let p be a root of f t (x) = 0, where 1 £ i < n. Then 

(3) /i-i(*)«^«(*)-/i+ifc) 

and the equations (1) following this one show that/i_i(x) and /<(x) have 
no common divisor involving x (since it would divide the constant /»). 
By hypothesis, f t (x) has the factor x— p. Hence /i-i(x) does not have 
this factor x— p. Thus, by (3), 

/<-l(p)=-/<+l(p)5*0. 

Hence, if p is a sufficiently small positive number, the values of 

/<-i(s), fi(*), /i+iCc) 

for x=p— p show just one variation of sign, since the first and third 
values are of opposite sign, and for x = p+p show just one variation of 
sign, and therefore show no change in the number of variations of sign 
for the two values of x. 

It follows from the first and second cases that V a = V fi if a and are 
real numbers for neither of which any one of the functions (2) vanishes 
and such that no root of f(x) = lies between a and 0. 

Third, let r be a root of f(x) = 0. By Taylor's theorem (8) of § 56, 

fir-Pi - -KTW+frTO -..-.,. 
f(r+p)= p/ , W+ip 2 / ,, (r)+ . . . . 

If p is a sufficiently small positive number, each of these polynomials in 
p has the same sign as its first term. For, after removing the factor p f 



78 I SOLA TION OF REAL ROOTS [Ch. VI 

we obtain a quotient of the form ao+s, where s = aip+a2P 2 + ... is 
numerically less than ao for all values of p sufficiently small (-§ 62). Hence 
if /'(r) is positive, f(r—p) is negative and f(r+p) is positive, so that the 

terms /(x), fi(x)=f'(x) have the signs h for x = r—p and the signs 

+ + for'x = r+p. If /'(r) is negative, these signs are H and 

respectively. In each case, f(x), f\(x) show one more variation of sign 
for x = r—p than for x = r+p. Evidently p may be chosen so small that 
no one of the functions /i(x), . . . , f n vanishes for either x = r—p or x=r+p, 
and such that fi(x) does not vanish for a value of x between r—p and 
r+p, so that f(x) = has the single real root r between these limits (§ 65). 
Hence by the first and second cases, /i, ...,/» show the same number 
of variations of sign for x = r—p as for x = r+p. Thus, for the entire 
series of functions (2), we have 

(4) r r _ p -r r+p =i. ' 

The real roots of f(x) = within the main interval from a to b (i.e., the 
aggregate of numbers between a and 6} separate it into intervals. By 
the earlier result, V x has the same value for all numbers in the same 
interval. By the present result (4), the value of V x in any interval exceeds 
the value for the next interval by unity. Hence V a exceeds Y b by the 
number of real roots between a and b. 

Corollary. If a<6, then V a l. V b . 

A violation of this Corollary usually indicates an error in the com- 
putation of Sturm's functions (2). 

EXERCISES 

Isolate by Sturm's theorem the real roots of 

1. x 3 +2x+20=0. 2. x 3 +z-3=0. 

70. Simplifications of Sturm's Functions. In order to avoid fractions, 
we may first multiply f(x) by a positive constant before dividing it by 
fi(x), and similarly multiply /i by a positive constant before dividing it 
by /2, etc. Moreover, we may remove from any /< any factor hi which is 
either a positive constant or a polynomial in x positive for l a^x^b, 
and use the remaining factor F< as the next divisor. 

To prove that Stumps theorem remains true when these modified 

1 Usually we would require that h\ be positive for all values of x, since we usually 
wish to employ the limits — oo and + °° • 



S 70] SIMPLIFICATIONS OF STURM'S FUNCTIONS 79 

functions /, F\ } . . . , F m are employed in place of functions (2), consider 
the equations replacing (1) : 

fi = kiF h c 2 f=qiFi-k 2 F 2 , c^Fi^q 2 F 2 -kzF^ 

C4F 2 = qzFz — k±Fi } . . . , c m Fm-2=q m -iFm-i— kmFm, 

in which c 2 , C3, . . . are positive constants and F m is a constant ?*0. A 
common divisor (involving x) of F<_x and F< would divide f<_ 2 , . . . , 
F 2y F\ r f, fij whereas /(a;) =0 has no multiple roots. Hence if p is a root 
of F,(x)=0, then jFWpJ^O and 

c <+1 F < _ 1 (p) = -fc <+1 (p)F <+1 (p), c <+1 >0, fc <+1 (p)>0. 

Thus Fi-i and F <+1 have opposite signs for x—p. We proceed as in § 69. 

Example 1. If f(x)=x*+§x— 10, /i=3(x l +2) is always positive. Hence we may 
employ / and F\ = \. For x= — 00 , there is one variation of sign; for x = + 00 , no 
variation. Hence there is a single real root; it lies between 1 and 2. 

Example 2. If f(x) =2x 4 -13:r 2 -10:r-19, we may take 

/i=4x 8 -13x-5. 
Then 

2/=tfi-/*, fi = lSx*+15x+SS = mx+i§y+m±. 

Since /1 is always positive, we need go no further (we may take F 2 = l). For x= — 00 , 

the signs are -\ h; for 3= + oo, + + +. Hence the*# are two real roots. The 

signs for x =0 are K Hence one real root is positive and the other negative. 

EXERCISES 

Isolate by Sturm's theorem the real roots of 

1. 3 8 +3s«-2x-5=0. 2. x 4 +12z 2 +5z-9 = 0. 

3. x*-7x-7=0. 4. 3s 4 -6:r 2 +8z-3=0. 

5. x 6 +6x*-30x 2 -12x-9=0 [stop with /J. 

6. x*-Sx*+25x*- 36x+8=0. 

7. For f=x*+px+q (p^0), show that/i = 3x 2 +p, / 2 = -2px-Sq, 

4p 2 /i = ( -6ps+9tf)/2 -/ 3 , / 8 - -4p» -27g«, 

so that ft is the discriminant A (§44). Let [p] denote the sign^of p. Then the signs 
of /, /1, /*, /* are 

- + +[p] [A] forx=-oo, 

+ + — [p] [^] for x = + x . 

For A negative there is a single real root. For A positive and therefore p negative, 
there are three distinct real roots. For A=0, ft is a divisor of /1 and /, so that x = 
—3q/ (2p) is a double root. 



80 ISOLATION OF REAL ROOTS [Ch. VI 

■ 

8. Prove that if one of Sturm's functions has p imaginary roots, the initial equation 
has at least p imaginary roots. 

9. State Sturm's theorem so as to include the possibility of a, or b, or both a and b 
being roots of f(x) =0. 

71. Sturm's Functions for a Quartic Equation. For the reduced quar- 

tic equation f(z) = 0, 

f/ =2 i +qz i +rz+8 9 

(5) < fi=4**+2qz+r, 

k /2=— 2qz 2 — 3rz— 4s. 

Let q?*0 and divide qpfa by /2. The negative of the remainder is 

(6) / 3 =Lz-12rs-rg 2 , L = Sq8-2q*-9r>. 

Let Lj^O. Then fa is a constant which is zero if and only if /=0 has 
multiple roots, i.e., if its discriminant A is zero. We therefore desire fa 
expressed as a multiple of A. By § 50, 

(7) A=-4P3-27Q 2 , P=-4s-^, Q-fc*-r*-*9». 

We may employ P and Q to eliminate 

(8) 4 5 =-P-^, r^-Q-SsP-A* 3 . 

We divide L 2 /2 by 

(9) / 3 =Ls+3rP, L = ?Q+4gP. 

The negative of the remainder * is 

( 10) lSrtqP 2 - 9r2LP+4sL 2 = ^A. 

The left memt>er is easily reduced to q*A. Inserting the values (8) and 
replacing I? by L(9Q+4qP), we get 

- ISqQP 2 - 12<f P*-±£(fP 2 +2qF 2 L+$<f i PL-Sq 2 QL. 

Replacing L by its value (9), we get (fA. Hence we may take 

(11) / 4 = A. 

Hence if qLAj^O, we may take (5), (9), (11) as Sturm's functions. 

1 Found directly by the Remainder Theorem (§ 14) by inserting the root £*■ — 3rP/2. 
of/3=0 into L*/i. 



§ 71] STURM'S FUNCTIONS FOR A QUARTIC 81 

Denote the sign of q by [q]. The signs of Sturm's functions are 

+ - -Iff] -W [A] forx=-oo, 
+ + ~M [L] M for s= +00. 

First, let A>0. If q is negative and L is positive, the signs are 

H 1 h and + ++++, so that there are four real roots. In each 

of the remaining three cases for q and L, there are two variations of sign 
in either of the two series and hence there is no real root. 

Next, let A<0. In each of the three cases in which q and L are not 
both positive, there are three variations of sign in the first series and one 
variation in the second, and hence just two real roots. If q and L are 
both positive, the number of variations is 1 in the first series and 3* in the 
second, so that this case is excluded by the Corollary to Sturm's theorem. 
To give a direct proof, note that, by the value of L in (6), L>0, q>0 
imply 4s><f, i.e., s>0, and hence, by (7), P is negative, so that each term 
of (10) is > 0, whenceA>0. 

Hence, if qLA^O, there are four distinct real roots if and only if A 
and L are positive, and q negative; two distinct real and two imaginary 
roots if and only if A is negative. 

Combining this result with that in Ex. 4 below, we obtain the 

Theorem. If the discriminant A of 2S i +qz i +rz+s = is negative, there 
are two distinct real roots and two imaginary roots; if A>0, tf<0, L>0, 
four distinct real roots; if A >0 and either q > or L £ 0, no real roots. Here 
L = 8^s-2g 3 -9r 2 . 

Our discussion furnished also the series of Sturm functions, which 
may be used in isolating the roots. 

EXERCISES 

1. If gA^O, L*=0, then /s=3rP is not zero (there being no multiple root) and its 
sign is immaterial in determining the number of real roots. Prove that there are just 
two real roots if q<0, and none if q>0. By (10), q has the same sign as A. 

2. If rA 3^0, q=0, obtain — / s by substituting z = — 4s/(3r) in f x . Show that we may 
take/s=rA and that there are just two real roots if A <0, and no real roots if A>0. 

3. If A s*0, g=r=0, prove that there are just two real roots if A<0, and no real 
roots if A>0. Since A =256a 8 , check by solving z 4 +s =0. 

4. If A 5*0, 9^=0, there are just two real roots if A<0, and no real roots if A>0. 
[Combine the results in Exs. 1-3.] 

5. Apply the theorem to Exs. 2, 4, 6 of § 70. 

6. Isolate the real roots of Exs. 3, 4, 5 of § 48. 



82 ISOLATION OF REAL ROOTS [Ch. VI 

/ 72. Sturm's Theorem for the Case of Multiple Roots. We might 
remove the multiple roots by dividing /(x) by 1 /»(x), the greatest com- 
mon divisor of /(x) and f\ =/'(x) ; but this would involve considerable 
work, besides wasting the valuable information in hand. A3 before, we 
suppose /(a) and f(b) different from zero.. We have equations (1) in 
which /„ is now not a constant. 

The difference V a — V b is the number of real roots between a and 6, each 
multiple root being counted only once. 

If p is a root of /<(x) =0, but not a multiple root of /(x) =0, then /<_i(p) 
is not zero. For, if it were zero, x— p would by (1) be a common factor 
of / and f\. We may now proceed as in the second case in § 69. 

The third case requires a modified proof only when r is a multiple root. 
Let r be a root of multiplicity m, m^2. Then /(r), f'(r), . . . , / (m_1) (r) 
are zero and, by Taylor's theorem, 

^ + ^ = i-2. pm (Lir ^ + ---- 

These have like signs if p is a positive number so small that the signs of 
the polynomials are those of their first terms. Similarly, /(r— p) and 
f(r—p) have opposite signs. Hence / and /1 show one more variation 
of sign for x=r— p than for x=r+p. Now (x— r) m_1 is a factor of / 
and /1 and hence, by (1), of /2, . . . , /„. Let their quotients by this 
factor be 4, 0i, ... , </>». Then equations (1) hold after the/'s are replaced 
by the 0's. Taking p so small that </>i(x) = has no root between r— p 
and r+pj we see by the first and second cases in § 69 that </>i, . . . , </> n 
show the same number of variations of sign for x = r— p as for x = r+p. 
The same is true for /1, . . . , /„ since the products of <£i, . . . , <j> n by 
(x— r) m_1 have for a given x the. same signs as <f>\ . . . , </>„ or the same 
signs as — <£i, . . . , — <£ n . But the latter series evidently shows the 
same number of variations of sign as </>i, . . . , </>„. Hence (4) is proved 
and consequently the present theorem. 

1 The degree of f(x) is not n, nor was it necessarily n in § 69. 



ft 73] SUDAN'S THEOREM 83 

EXERCISES 

1. For/=x 4 -8x*+16, prove that F x =x*-4x t F,=x»-4, F x =xF t . Hence n=2. 
Verify that V-*> =2, Foo =0, and that there are just two real roots, each a double 
root. 

Discuss similarly the following equations. 

2. x*-5*»+9a: l -7z+2=0. 3. s 4 +2x l -3x , -4x+4=0. 4. x *-x 1 -2x+2=0. 

73. Budan's Theorem. Let a and b be real numbers, a<b } neither 1 
a root of f(x) = 0, an equation of degree n with teal coefficients. Let V m 
denote the number of variations of sign of 

(12) f{x), f'(x), f'(x) t .'.., /<»>(*) 

for x=a, after vanishing terms have been deleted. Then V a —\\ is either 
the number of real roots of f(x) = between a and b or exceeds the number 
of those roots by a positive even integer. A root of multiplicity m is here 
counted as m roots. 

For example, if /(x)=s 8 -7x-7, then f'=Zx*-l, /"=6s, /"'=6. Their values 
for x=3, 4, —2, —1 are tabulated below. 



X 


/ 


/' 


r r 


Variations 


3 


-1 


20 


18 6 


1 


4 


29 


41 


24 6 





-2 


-1 


5 


-12 6 


3 


-1 


-1 


-4 


-66 


1 



Hence the theorem shows that there is a single real root between 3 and 4, and two 
or no real roots between —2 and —1. The theorem does not tell us the exact number 
of roots between the latter limits. To decide this ambiguity, note that /( — 3/2) = +1/8, 
so that there is a single real root between —2 and —1.5, and a single one between 
— 1.5 and — 1. 

The proof is quite simple if no term of the series (12) vanishes for 
x=a or for rr=6 and if no two consecutive terms vanish for the same 
value of x between a and b. Indeed, if no one of the terms vanishes for 
xi£x£x2, then V Xl =V X2 , since any term has the same sign for x=x\ 
as for x=X2. Next, let r be a root of f (i) (x) =0, a<r<b. By hypothesis, 

1 In case a or & is a root of fix) =0, the theorem holds if we count the number of 
roots > a and =6. This inclusive theorem has been proved, by means of Rolle's 
theorem, by A. Hurwitz, Mathematische Annalen, Vol. 71, 1912, p. 584, who extended 
Budan's theorem from the case of a polynomial to a function /(.r) which is real and 
regular for a £ x <b. 



84 ISOLATION OF REAL ROOTS [Ch. VI 

the first derivative / (<+1) (x) of f (t) (x) is not zero for x=r. As in the third 
step (now actually the case t = 0) in §69, f®(x) and / <l+1) («) show one 
more variation of sign f or x = r — p than for x = r +p, where p is a sufficiently 
small positive number. If i>0, / (<) is preceded by a term / (l-1) in (12). 
By hypothesis, / (<_1) (x)^0 for x=r and hence has the same sign for 
x=r— p and x=r+p when p is sufficiently small. For these values of 
x, / (0 (x) has opposite signs. Hence / (< ~ 1} and / (<) show one more or one less 
variation of sign for x=r— p than for x=r+p, so that Z"" 1 *, / (<> , / (<+1) show 
two more variations or the same number of variations of sign. 

Next, let no term of the series (12) vanish for x=a or for x =6, but 
let several sucessive terms 

(13) /«>(*), /«+*>(*), . . . , /«+'-»(*) 

all vanish for a value r of x between a and 6, while / (<+ -°(r) is not zero, 
but is say positive. 1 Let I\ be the interval between r—p and r, and 72 
the interval between r and r+p. Let the positive number p be so small 
that no one of the functions (13) or / (l+ ^(x) is zero in these intervals, so 
that the last function remains positive. Hence / (<+i-1) (x) increases with 
x (since its derivative is positive) and is therefore negative in 7i 
and positive in h- Thus / (l+ - /_2) (x) decreases in I\ and increases in I2 
and hence is positive in each interval. In this manner we may verify the 
signs in the following table: 

f(D /«+!> y«+2> # # # yw+y-3) y<i+y-2) y«+j-D y«+» 

h (-)> (-)'- 1 (-)>"* . . . • - + - + 

h + + + ...+ + + + 

Hence these functions show j variations of sign in 7i and none in 72. 

If i>0, the first term of (13) is preceded by a function / (<_I) (x) which 

is not zero for x=r, and hence not zero in 7i or 72 if p is sufficiently small. 

If j is even, the signs of / (<_1) and / (i) are + + or \- in both 7i and 72, 

showing no loss in the number of variations of sign. If j is odd, their 

signs are 

7i + - 

or 

7 2 + + - + 

so that there is a loss or gain of a single variation of sign. Hence 

1 If negative, all signs in the table below are to be changed; but the conclusion holds. 



§ 731 SUDAN'S AND DESCARTES' THEOREMS 85 

show a loss of j variations of sign if j is even, and a loss of j± 1 if j is odd, 
and hence always a loss of an jeven number iO of variations of sign. 

If i=0, / w s/ has r as a j-fold root and the functions in the table show 
j more variations of sign for x=r— p than for x=r+p. 

Thus, when no one of the functions (12) vanishes for x = a or for x=6, 
the theorem follows as at the end of § 69 (with unity replaced by the 
multiplicity of a root). 

Finally, let one of the functions (12), other than/(x) itself, vanish for 
m x=a or for x=b. If 8 is a sufficiently small positive number, all of the 
N roots of /(rr)=0 between a and b lie between a+8 and 6—5, and for 
the latter values no one of the functions (12) is zero. By the above 

proof, 

V a + s -V b - s =N+2t, 

Fa-T« + a = 2;, V b - 9 -V b =2s, 

where t, j f s are integers ^ 0. Hence V a — F 6 = N+2(t+j+s). 

Descartes' rule of signs (§ 67) is a corollary to Budan's theor em. Con- 
sider any equation with real coefficients 

f(x)^aox n +aix*- 1 + . . . +a n _ 1 x+a n =0, 

having 0*5*0. For x=0 the functions (12) have the same signs as 

a*, a n _i, . . . , ai, ao. 

Hence Vo is equal to the number V of variations of sign of f(x). 

For rr= + oo, the functions all have the same sign, which is that of ao. 
Thus Vo— Foo = F is either the number of positive roots or exceeds that 
number by a positive even integer. Finally, Descartes' rule holds if 
Ou=0, as shown by removing the factors x. 

EXERCISES 

Isolate by Budan's theorem the real roots of 

1. x»-a; 1 -2a;+l=0. 2. x 8 +3x'-2a:-5=0. 

3. Prove that if f(a) 5^0, V a equals the number of real roots >a or exceeds that 
number by an even integer. 

4. Prove that there is no root greater than a number making each of the functions 
(12) positive, if the leading coefficient of f(x) is positive. (Newton.) 

5. Hence verify that z A — 4x 8 — 3x+23=0 has no root >4. 

6. Show that x*-4x 8 +x 2 +6x-f-2=0 has no root >3. 



[Ch. VI 



ISOLATION OF REAL ROOTS 



85A 



EXERCISES 

1. If g{x) =ax 2 +6x+c has a negative discriminant Z)=6 2 — 4ac, g has the same sign 
as a for all real values of x. Hint: 

. 4o0 = (2ax+&)*-Z). 

2. Hence, when the Sturm's function g(x) of the second degree has a negative 
discriminant, we may replace it by its first coefficient a and discard all later Sturm's 
functions. 

3. Show that x 3 -f 3gx 2 -f 3(p-fg 2 )x+c = has a single real root if p>0. 

Show that three Sturm's functions suffice to prove that there are exactly two rreri 
roots of the following equations. 



4. x 4 +4x 3 +3x 2 -2x-5=0. 

6. x 4 +4x»+3x 2 -2x-8=0. 

8. x 4 +4x 3 +3x 2 -6x-9=0. 
10. x 4 +&x 3 +6x 2 +6x+l=0, b > 4. 
12. x 4 +6x l +30x 2 +56x+25=0, 6* > 80. 
14. 2x 4 -8x 3 -8x-l=0. 
16. 2x 4 -8x l +6x 2 -8x+l=0. 
18. x 4 +6x 3 +x 2 -£=0, b ^ 2, E £ 1. 
20. x 4 +3x 3 +3x 2 -#=0, E k 2. 
22. x 4 +4x l +4x 2 -#=0, E ^ 2. 
24. x 4 +4x l +5x 2 -#=0, E ^ 4. 



5. 


X 4 


7. 


X 4 


9. 


X 4 


11. 


X 4 


13. 


X 4 


15. 


X 4 


17. 


X 4 


19. 


X 4 


21. 


X 4 


23. 


X 4 


25. 


X 4 



4 -4x 3 +3x 2 -6x-3=0. 
4 -4x»+3x 2 -6x-6=0. 
4 -4x 3 +3x 2 -10x-5=0. 
4 +6x»+ 12x 2 +26x+4=0, 6 2 > 32. 
4 -8x l +18x 2 -32x+12=0. 
4 -4x»-4x-i=0,i >- \. 
4 -4x 3 +3x 2 -4x-m = 0, m >-f. 
<+&x 3 +2x 2 -#=0, b £ 3, E ^ 1. 
4 +fex 3 +3x 2 -# = 0, 6 ^ 4, £ £ 1. 
4 +&x 3 +4x 2 -#=0, 6 ^ 5, E ^ 1. 
4 -f 6x 3 +5x 2 -#=0, 6 £ 6, £ £ 1; 
b = 5,E^2. 



For the following equations show that we may stop with Sturm's quartic function 
which is the product of two factors like x 2 +c, c > 0: 



26. x«-2x 4 -4x 2 -2«0. 
28. x«-2x 4 -5x 2 -4 = 0. 



27. x 8 -6x*-30x 2 +12x-9=0. 
29. x«-2x 4 -6x 2 -6=0. 



The preceding equations are ideal for solution by Horner's Method. 



Solve the following equations by Newton's Method: 



30. x»-30x-110=0. 
32. x 3 +78x-65=0. 
34. x 3 +63x-84=0. 
36. x 4 -5x 2 +22x-30 = 0. 
38. x 4 -10x 2 +40x-16 = 0. 



31. x 3 -36x-84 = 0, Ans. 6.9361683. 
33. x 3 -60x- 180=0, Ans. 8.9504582. 
35. x 3 +84x-84 = 0, 4ns. 0.9885012. 
37. x 4 +x 2 +30x-50 = 0. 
39. x 4 -x 2 +44x+26=0. 






\ 



CHAPTER VII 
Solution of Numerical Equations 

74. Homer's Method. 1 After we have isolated a real root of a real 
equation by one of the methods in Chapter VI, we can compute the root 
to any desired number of decimal places either by Horner's method, 
which is available only for polynomial equations, or by Newton's method 
(§75), which is applicable also to logarithmic, trigonometric, and other 
equations. 

To find the root between 2 and 3 of 

(1) x 3 -2x-5=0, 

set x = 2+p. Direct substitution gives the transformed equation for p: 

(2) p 3 +6p 2 +10p-l = 0. 

The method just used is laborious especially for equations of high degree. 
We next explain a s impler method . Since p=x— 2, 

x 3 -2x-5 = (x-2) 3 +6(x-2) 2 +10(x-2)-l, 

identically in x. Hence —1 is the remainder obtained when the given 

polynomial x 3 — 2x— 5 is divided by x—2. By inspection, the quotient 

Q is equal to 

(z-2) 2 +6(x-2) + 10. 

Hence 10 is the remainder obtained when Q is divided by rr— 2. The 
new quotient is equal to (x— 2) +6, and another division gives the 
remainder 6. Hence to find the coefficients 6, 10, — 1 of the terms follow- 
ing p 3 in the transformed equation (2), we have only to divide the given 
polynomial x 3 — 2x— 5 by x— 2, the quotient Q by x— 2, etc., and take 
the remainders in reverse order. However, when this work is performed 
by synthetic division (§ 15) as tabulated below, no reversal of order is 

1 W. G. Horner, London Philosophical Transactions, 1819. Earlier (1804) by P. 
Ruffuu. See Bulletin American Math. Society, May, 1911. 

86 



§74] 



HORNER'S METHOD 



87 



necessary, since the coefficients then appear on the page in their desired 
order. 



1 





-2 


-5 2 




2 


4 


4 


1 


2 


2 


-1 




2 


8 




1 


.4 
2 


10 





6 

Thus 1, 6, 10, — 1 are the coefficients of the desired equation (2). 

To obtain an approximation to the decimal p, we ignore for the moment 
the terms involving p 3 and p 2 ; then by lOp— 1 = 0, p=0.1. But this 
value is. too large since the terms ignored are all positive. For p = 0.09, 
the polynomial in (2) is found to be negative, while for p=0.1 it was just 
seen to be positive. Hence p = 0.09+A, where h is of the denomination 
thousandths. The coefficients 1, 6.27, ... of the transformed equation 
for h appear in heavy type just under the first zigzag line in the following 
scheme: 

[ 0.09 



1 6 
0.09 




10 
0.5481 


-1 
0.949329 


1 6.09 10.5481 
0.09 0.5562 


-0 050671 


1 6.18 
0.09 


11.1043 


0.044517584 


1 6.27 
0.00 


4 0.025096 


1 6.274 11.129396 
0.004 0.025112 


-0.006163416 


1 6.278 
0.004 


11.164608 




1 6.28 


2 







0.05 

11.1 

=0.004 



Hence x— 2.094 +t, where t is a root of 

< 3 +6.282J 2 +11.154508- 0.006153416=0. 

By the last two terms, t is between 0.0005 and 0.0006. Then the value 



88 SOLUTION OF NUMERICAL EQUATIONS [Ch. VII 

oi C^' 3 +6.282Z 2 is found to lie between 0.00000157 and 0.00000227. 
Hence we may ignore C provided the constant term be reduced by an 
amount between these limits. Whichever of the two limits we use, we 
obtain the same dividend below correct to 6 decimal places. 

11.154508 | 0.006151 I 0.000551 =* 

5577 



574 
558 



16 
11 



Since the quotient is 0.0005 +, only two decimal places of the divisor are 
used, except to see by inspection how much is to be carried when making 
the first multiplication. Hence we mark a cross above the figure 5 in 
the hundredths place of the divisor and use only 11.15. Before making 
the multiplication by the second significant figure 5 of the quotient t, 
we mark a cross over the figure 1 in the tenths place of the divisor and 
hence use only 11.1. Thus x = 2.0945514+, with doubt only as to whether 
the last figure should be 4 or 5. 

If we require a greater number of decimal places, it is not necessary 
to go back and construct a new transformed equation from the equation 
in L We have only to revise our preceding dividend on the basis of our 
present better value of L We now know that t is between 0.000551 and 
0.000552. To compute the new value of the correction C, in which we 
may evidently ignore t 3 , we use logarithms. 

log 5.51 = .74115 log 5.52 = .74194 

.-. log 5 . 51 2 = 1 . 48230 /. log 5 . 52 2 = 1 . 48388 

log 6.282= .79810 log 6.282= .79810 

log 190.72 =2.28040 log 191.42 =2.28198 

Hence C is between 0.000001907 and 0.000001915. Whichever of the two 
limits we use, we obtain the same new dividend below correct to 8 decimal 
places. 



§ 74] HORNER'S METHOD 89 

11.154508 | 0.00615150 | 0.00055 148 

557725 

57425 
55773 

1652 
1115 



537 
446 

91 
89 

2 

Hence, finally, x= 2.094551482, with doubt only as to the last figure. 

EXERCISES 

(The number of transformations made by synthetic division should be about half 
the number of significant figures desired for a root.) 

By one of the methods in Chapter VI, isolate each real root of the following equa- 
tions, and compute each real root to 5 decimal places. 

I. x s +2x+20=0. 2. x 3 +3x»-2x-5=0. 

3. x»+x'-2x-l=0. 4. x 4 +4x»-17.5x'-18x-|-58.5-0. 

5. x 4 -ll,727x +40,385=0. 6. x» = 10. 

Find to 7 decimal places all the real roots of 

7. x»+4x 2 -7=0. 8. x 8 -7x-7=0. 

Find to 8 decimal places 

9. The root between 2 and 3ofx s — x— 9=0 (make only 3 transformations). 

10. The real cube root of 7.976. 

II. The abscissa of the real point of intersection of the conies y=x* t xy+x+Zy— 
6=0. 

12. Find to 3 decimal places the abscissas of the points of intersection of x*+ y'=9, 
y=x*-x. 

13. A sphere two feet in diameter is formed of a kind of wood a cubic foot of which 
weighs two-thirds as much as a cubic foot of water (i.e., the specific gravity of the wood 
is 2/3). Find to four significant figures the depth h to which the floating sphere 
will sink in water. 

Hints: The volume of a sphere of radius r is -firr*. Hence our sphere whose radius 



90 SOLUTION OF NUMERICAL EQUATIONS [Ch. VII 

is 1 foot weighs as much as ft- f cubic feet of water. The volume of the submerged 
portion of the sphere is wh^r—^h) cubic feet. Since this is also the volume of the dis- 
placed water, its value for r = 1 must equal £r • § Hence h* — 3A*+§ = 0. 

14. If the specific gravity of cork is 1/4, find to four significant figures how far a 
cork sphere two feet in diameter will sink in water. 

15. Compute cos 20° to four decimal places by use of 

cos3A=4cos s A — 3 cob A, cos 60°=^. 

16. Three intersecting edges of a rectangular parallelopiped are of lengths 6, 8, 
and 10 feet. If the volume is increased by 300 cubic feet by equal elongations of the 
edges, find the elongation to three decimal places. 

17. Given that the volume of a right circular cylinder is otw and the total area of 
its surface is 2/3x, prove that the radius r of its base is a root of r 8 — (ir+a -0. If a = 56, 
0=28, find to four decimal places the two positive roots r. The corresponding altitude 
is a/r*. 

18. What rate of interest is implied in an offer to sell a house for $2700 cash, or 
in annual installments each of $1000 payable 1, 2, and 3 years from date? 

Hint: The amount of $2700 with interest for 3 years should be equal to the sum 
of the first payment with interest for 2 years, the amount of the second payment with 
interest for 1 year, and the third payment. Hence if r is the rate of interest and we 
write x for 1 +r, we have 

2700 x* = 1000 x*+1000 a+1000. 

19. Find the rate of interest implied in an offer to sell a house for $3500 cash, or in 
annual installments each of $1000 payable 1, 2, 3, and 4 years from date. 

20. Find the rate of interest implied in an offer to sell a house for $3500 cash, or 
$4000 payable in annual installments each of $1000, the first payable now. 

75. Newton's Method. Prior to 1676, Newton 1 had already found 
the root between 2 and 3 of equation (1). He replaced x by 2+p and 
obtained (2). Since p is a decimal, he neglected the terms in p 3 and p 2 , 
and hence obtained p=0.1, approximately. Replacing p by 0.1+g in 
(2), he obtained 

g 3 +6.3g 2 +11.23g+0.061=0. 

Dividing —0.061 by 11.23, he obtained —0.0054 as the approximate 
value of q. Neglecting g 3 and replacing q by — 0.0054 +r, he obtained 

6.3^+11. 16196r+0.000541708=0. 

Dropping O.Sr 2 , he found r and hence 

x = 2+0. 1 - 0.0054 - 0.00004853 = 2.09455147, 

r 

1 Isaac Newton, Opuscula, I, 1794, p. 10, p. 37. 



§75] 



NEWTON'S METHOD 



91 



of which all figures but the last are correct (§74). But the method will 
not often lead so quickly to so accurate a value of the root. 

Newton used the close approximation 0.1 to p, in spite of the fact 
that this value exceeds the root p and hence led to a negative correction 
at the next step. This is in contrast with Horner's method in which each 
correction is positive, so that each approximation must be chosen less 
than the root, as 0.09 for p. 

Newton's method may be presented in the following general form, 
which is applicable to any equation /(x)=0, whether /(x) is a polynomial 
or not. Given an approximate value a of a real root, we can usually 
find a closer approximation a+h to the root by neglecting the powers 
h 2 , h 3 , . . . of the small number h in Taylor's formula (§ 56) 

f(a+h) =/(a) +f>(a)h+f"(a)%+ • ■ . 



and hence by taking 



m+rm-o, 



h= 



-m 



We then repeat the process with a\ =a+A in place of the former a. 
Thus in Newton's example, /(x) = a^— 2x— 5, we have, for a =2, 

— f(2) 1 
^TW^IO' <*i = a+A = 2.1, 

A -/(2.D _-Q.061_ 000M 
hl ~ f\2A) " 11.23 ~ °- UU5 * 

76. Graphical Discussion of Newton's Method. Using rectangular 
coordinates, consider the graph of y=f(x) and the point P on it with the 
abscissa OQ = a (Fig. 22). Let the tangent at P meet the x-axis at T 





Fio. 22 



Fig. 23 



92 



SOLUTION OF NUMERICAL EQUATIONS 



fCH. VII 



and let the graph meet the x-ans at S. Take h=QT, the subtangent. 
Then 

QP =/(o), /'(a) = tan X TP = ^^, 

h _ -/(«) 
/'(a)' 

In the graph in Fig. 22, OT=a+h is a better approximation to the 
root OS than OQ=a. The next step (indicated by dotted lines) gives a 
still better approximation OT\. 

If, however, we had begun with the abscissa a of a point Pi in Fig. 22 
near a bend point, the subtangent would be very large and the method 
would probably fail to give a better approximation. Failure is certain 
if we use a point P2 such that a single bend point lies between it and S. 

We are concerned with the approximation to a root previously isolated 
as the only real root between two given numbers a and ft. These should 
•be chosen so nearly equal that/'(x) =0 has no real root between a and ft 
and hence/(x) = y has no bend point between a and ft Further, iff"(x) = 
has a root between our limits, our graph will have an inflexion point with 
an abscissa between a and ft and the method will likely fail (Fig. 23). 

Let, therefore, neither f'(x) nor f"{x) vanish between a and ft Since 
/" preserves its sign in the interval from a to ft while / changes in sign, 
/" and / will have the same sign for one end point. According as the 
abscissa of this point is a or ft we take a=a or a = # for the first step of 
Newton's process. In fact, the tangent at one of the end points meets 
the x-axis at a point T with an abscissa within the interval from a to ft 
If f'{x) is positive in the interval, so that the tangent makes an acute 
angle with the x-axis, we have Fig. 24 or Fig. 25; if/' is negative, Fig. 
26 or Fig. 22. 






Fig. 24 



Fig. 25 



Fig. 26 



§ 76] GRAPHICAL DISCUSSION OF NEWTON'S METHOD 93 

In Newton's example, the graph between" the points with the abscissas a =2 and 
(3 = 3 is of the type in Fig. 24, but more nearly like a vertical straight line. In view 
of this feature of the graph, we may safely take a— a, as did Newton, although our 
general procedure would be to take a=/3. The next step, however, accords with our 
present process; we have a =2, =2.1 in Fig. 24 and hence we now take a =0, getting 

0.061 

inr 00054 

as the subtangent, and hence 2.1 —0.0054 as the approximate root. 

If we have secured (as in Fig. 24 or Fig. 26) a better upper limit to the 
root than 0, we may take the abscissa c of the intersection of the chord 
A B with the x-axis as a better lower limit than a. By similar triangles, 

-/(a) :c-a=/(0) : j8-c, 
whence 
(%s a/(fl-ff(«) 

W °~~ /08)-/(a) * 

This method of finding the value of c intermediate to a and £ is called the 
method of interpolation (regula falsi). 

' In Newton's example, a =2, 0=2.1, 

/(a) = -l, /(0) =0.061, c=2.0942. 

The advantage of having c at each step is that we know a close limit 
of the error made in the approximation to the root. 

We may combine the various possible cases discussed into one: 

If f(x) = has a single real root between a and 0, and f'(x) = 0, f"{x) = 
have no real root between a and /3, and if we designate by ff that one of the 
numbers a and for which f{f$) and /"(/3) have the same sign, then the root 
lies in the narrower interval from c to P—f(ff)/f'(P), where c is given by (3). 

It is possible to prove x this theorem algebraically and to show that by 
repeated applications of it we can obtain two limits a ', P' between which* 
the root lies, such that a'— &' is numerically less than any assigned posi- 
tive number. Hence the root can be found in this manner to any desired 
accuracy. 

Examtlb. /(:r)-x 8 -2x 2 -2, a=2|, 0=2£. Then 

* Weber's Algebra, 2d ed., I, pp. 380-382; Kleines Lehrbuch der Algebra, 1912, p. 163. 



-/(2) = --6 

/'(2) 31 



94 SOLUTION OF NUMERICAL EQUATIONS [Ch. VII 

Neither of the roots 0, 4/3 of /'(x) =0 lies between a and 0, so that /(x) =0 has a single 
real root between these limits ($65). Nor is the root $ of f"(x) =0 within these limits. 
The conditions of the theorem are therefore satisfied. For a<x<fi, the graph is of 1 
the type in Fig. 24. We find that approximately 

c-Mf =2.3487, fr-0-^-2.3714, 

For x=2.3593, /(x) = -0.00003. We therefore have the root to four decimal places 
For a =2. 3593, 

/'fa) - 7.2620, a -~ - 2.3593041, 

f(a) 

which is the value of the root correct to 7 decimal places. We at once verify that the 
result is greater than the root in view of our work and Fig. 24, while if we change the 
final digit from 1 to 0, f(x) is negative. 

EXERCISES 

1. For f(x)=x A +x 9 — 3x 2 — x— 4, show by Descartes' rule of signs that/'(x)=0 
and /"(x) =0 each have a single positive root and that neither has a root between 1 
and 2. Which of the values 1 and 2 should be taken as 0? 

2. When seeking a root between 2 and 3 of «'— x— 9=0, which value should be 
taken as 0? 

77. Systematic Computation of Roots by Newton's Method. By way 

of illustration we shall compute to 7 decimal places a positive root of 

/(x)=z 4 +s 3 -3x 2 -a;-4=q. 
Since /(l) = — 6, /(2) = 6, there is a real root between 1 and 2. Since 
/ f (x)=4xH-3x 2 -te-l, /'(1)=0, - /'(2) = 31, 

the graph of y=f(x) is approximately horizontal near (1, —6) and approxi- 
mately vertical near (2, 6). Hence the root is much nearer to 2 than to 1. 
Thus in applying Newton's method we employ a =2 as the first approxi- 
mation to the root. The correction h is then 



The work of performing the substitutions x = 2+d, d=— 0.2+e, . . . , 
to find the transformed equations satisfied by d, e, . . . , is done by syn- 



§77] 



COMPUTATION OF ROOTS BY NEWTON'S METHOD 



95 



thetic division, exactly as in Hornets method, except that some of the 
\ multipliers are now negative: 

LB 



1 1 


-3 


-1 


-4 


2 


6 


6 


10 


1 3 


3 


5 


6 


2 


10 


26 




1 5 


.13 


31 




2 


14 






1 7 


27 




2 








1 9 





-0.2 



8.8 
-0.2 



8.6 
-0.2 



8.4 
-0.2 



8.2 



-1.76 



-5.048 



25.24 
-1.72 



25.952 
-4.704 



23.52 
-1.68 



21 248 



21 84 



-5.1904 



-0.2 



0.8096 



-0.8096 
21.248 
=*-0.04 



-0.04 


-0.3264 


—0.860544 


-0.81549824 


8.16 
-0.04 


21.5136 
-0.3248 


20.387456 
-0.847552 


-0 00589824 


8.12 
-0.04 


21.1888 
-0.3232 


19 539904 




8.08 
-0.04 

O Ail 


20.8666 


0.00 & 898 =0000302 
19.54 



The root is 2-0.2-0.04+0.000302=1.760302, in which the last 
figure is in slight doubt. Indeed, it can be proved that if the final f radian 
g, when expressed as a decimal, has k zeros between the decimal point and the 
first significant figure, the division may be safely carried to 2k decimal places. 
In our example fc = 3, so that we retained 6 decimal places in g. 

To proceed independently of this rule, we note that g is obviously 
between 0.00030 and 0.00031. Then the value of gf 4 +8.04g 3 +20.8656^ 2 



96 SOLUTION OF NUMERICAL EQUATIONS [Ch. VII 

is found to lie between 0.000001878 and 0.000002006. Whichever of these 
limits we use as a correction by which to reduce the constant term, we 
obtain the same dividend below correct to 6 decimal places. 

19.539904 I 0.005896 I 0.0003017 



005862 

34 
20 

14 
14 

Hence the root is 1.7603017 to 7 decimal places. 

EXERCISES 

1. Find to 8 decimal places the root between 2 and 3 of x 8 — s— 9=0. 

2. Find to 7 decimal places the root between 2 and 3 of x 8 — 2x*— 2=0. 

3. Find the real cube root of 7.976 to 5 decimal places. 

4. Explain by Taylor's expansion of /(2-,-d) why the values of 

/(2). /'(2), if"(2), ^/'"(2), ^7 4 /""(2) 

are in reverse order the coefficients of the transformed equation 

d 4 +9<J 8 +27<J 2 +31(*+6=0, 
obtained in the Example in the text, and printed in heavy type. 

5. The method commonly used to find the positive square root of n by a computing 
machine consists in dividing n by an assumed approximate value a of the square root 
and taking half the sum of a and the quotient as a better approximation. Show that 
the latter agrees with the value of a+h given by applying Newton's method to 

f(x)=x 2 —n. 

78. Newton's Method for Functions not Polynomials. 

Example 1. Find the angle x at the center of a circle subtended by a chord which 
cuts off a segment whose area is one-eighth of that of the circle. 

Solution. If x is measured in radians and if r is the radius, the area of the segment 
is equal to the left member of 

£r 2 (x— sin x) ^inr*, 
whence . 

x— sinx=^ir. 



§ 78] NEWTON'S METHOD FOR FUNCTIONS NOT POLYNOMIALS 97 

By means of a graph of y—sinx and the straight line represented by y=z— ^r, we 
see that the abscissa of their point of intersection is approximately 1.78 radians or 102°. 
Thus a = 102° is a first approximation to the root of 

/(x) sx- sin x — \r = 0. 

By Newton's method a better approximation is a-\-h, where l 

—f(a) — a + sin a +±x 



h = 



f'(a) 1 — cos a 



sin 102° = 0.9781 cos 102° - -0.2079 

£(3.1416) = 0.7854 1 -cos 102°= 1.2079 

1.7635 -0.0167 

A = = -0.0138 

102° - 1 .7802 radians 1 .2079 



-0.0167 



0l =d+/i = 1.7664 



_ -/(a,) -1.766 4 + 0.9809+0.7854 AAAA1 

/»i = —— — — = TTZT'* = — U.UUU1. 

/'(ai) 1.1944 

Hence x =ai+hi = 1.7663 radians, or 101° 12' 

Example 2. 2 Solve x — log x = 7, the logarithm being to base 10. 

Solution. Evidently x exceeds 7 by a positive decimal which is the value of log x. 
Hence in a table of common logarithms, we seek a number x between 7 and 8 whose 
logarithm coincides approximately with the decimal part of x. We read off the values 
in the second column. 



X 


logx 


x— logx 


7.897 
7.898 


0.89746 
0.89752 


6.99954 
7.00048 



By the final column the ratio of interpolation is 46/94. Hence x= 7.8975 to four 
decimal places. 

1 The derivative of sin x is cos x. We need the limit of 

sin (x-\-2k) —sin x 2 cos -|(2x+2fc) sin ^-(2fc) _cos (x-f k) sin A; 
2k = 2k k 

as 2k approaches zero. Since the ratio of sin A; to fc approaches 1, the limit is cos x. 

2 This Ex. 2, which should be contrasted with Ex. 3, is solved by interpolation 
since that method is simpler than Newton's method in this special case. 



98 SOLUTION OF NUMERICAL EQUATIONS [Ch. Vfl 

Example 3. Solve 2x— log $=7, the logarithm being to base 10. 
Solution. Evidently x is a little less than 4. A table of common logarithms shows 
at once that a fair approximation to x is a = 3.8. Write 

/(x)s2x-logx-7, log x=M log. x 9 Af =0.4343. 
By calculus, the derivative of log* x is l/x. Hence 

f(x)~2-— , f(a) =2-0.1143-1.8857, 

x 

/(a) =0.6-log 3.8=0.6-0.57978=0.02022, 
-A =^=0.0107, ai=a+* =3.7893, 
/(a,) =0.000041, /(3.7892) = -0.000148. 
7^X0.0001 =0.000078, 3=3.789278. 
All figures of x are correct as shown by Vega's table of logarithms to 10 places. 

EXERCISES 

Find the angle x at the center of a circle subtended by a chord which cuts off a seg- 
ment whose ratio to the circle is 

1. -j- 2. ■§•. 

When the logarithms are to base 10, 

3. Solve 2x-log x =9. 4. Solve 3x-log x =9. 

5. Find the angle just > 15° for which ^ sin x H-sin 2z=0.64. 

6. Find the angle just > 72° for which x— % sin x-\w. 

7. Find all solutions of Ex. 5 by replacing sin 2x by 2 sin x cos x, squaring, and 
living the quartic equation for cos x. 

8. Solve similarly sin x+sin 2z = 1.2. 

9. Find x to 6 decimal places in sin x=x— 2. 
10. Find x to 5 decimal places in x=3 log e x. 

79. Imaginary Roots. To find the imaginary roots x+yi of an equa- 
tion /(z)=0 with real coefficients, expand f(x+yi) by Taylors theorem; 
we get 

m+r(x)yi^n^-r(x) 1 ^+ . . . =o. 



i 791 IMAGINARY ROOTS 09 



Since x and y are to be real, and 2/5*0, 



-v 



(4) 



/'(x)-/'"(x) I ^3+/<»(x)|^- ... =0. 



In the Example and Exercises below, f(z) is of degree 4 or less. Then 
the second equation (4) is linear in y 2 . Substituting the resulting value 
of y 2 in the first equation (4), we obtain an equation E(x)=0, whose real 
roots may be found by one of the preceding methods. If the degree of 
f(z) exceeds 4, we may find E(x)=0 by eliminating y 2 between the two 
equations (4) by one of the methods to be explained in Chapter X. 

Example. For/(z)=« 4 — z+1, equations (4) are 

x«-:r+l-6xy+y 4 =*0, 4x , -l-4xy , «0. 
Thus 

y 4x' 16 

The cubic equation in x 1 has the single real root 

x* - 0.528727, x = ±0.72714. 
Then y* =0.184912 or 0.87254, and 

z = x +yi =0.72714=b0.43001t, - 0.72714 ±0.93409*. 

EXERCISES 

Find the imaginary roots of 

1. *»-2*-5~0. 2. 28**+92 , -l«0. 

3. «<-3«»-62=2. 4. « 4 -4z*+ll« s -14«+10«a 

5. s«-4**+9* , -16z+20=0. Hint: 

E(x) 3z(x-2)(16x<-64x 8 +136V-144s+65)-^ 

and the last factor becomes (tp*-r-l)(ti;*+9) for 2x*=w+ 2. 

Note. If we know a real root r of a cubic equation f(z) =0, we may remove the 
factor z— r and solve the resulting quadratic equation. When, as usual, r involves 
several decimal places, this method is laborious and unsatisfactory. But we may utilize 
a device, explained in the author's Elementary Theory of Equations, pp. 119-121, §§ 6, 7. 
As there explained, a simila. device may be used when we know two real roots of a 
quartic equation. 



100 SOLUTION OF NUMERICAL EQUATIONS [Ch. VH 

MISCELLANEOUS EXERCISES 

(Give answers to 6 decimal places, unless the contrary is stated.) 

1. What arc of a circle is double its chord? 

2. What arc of a circle is double the distance from the center of the circle to the 
chord of the arc? 

3. If A and B are the points of contact of two tangents to a circle of radius unity 
from a point P without it, and if arc AB is equal to PA, find the length of the arc. 

4. Find the angle at the center of a circle of a sector which is bisected by its chord. 

5. Find the radius of the smallest hollow iron sphere, with air exhausted, which will 
float in water if its shell is 1 inch thick and the specific gravity of iron is 7.5. 

6. From one end of a diameter of a circle draw a chord which bisects the semicircle. 

7. The equation xtaxix=*c occurs in the theory of vibrating strings. Its approxi- 
mate solutions may be found from the graphs of y = cot x, y = x/c. Find x when c = 1. 

8. The equation tan x=x occurs in the study of the vibrations of air in a spherical 
cavity. From an approximate solution x x = 1.5V, we obtain successively better approxi- 
mations x 2 = =tan- 1 £i = 1.4334 x, £ s =tan- l x 2 , .... Find the first three solutions to 
4 decimal places. 

9. Find to 3 decimal places the first five solutions of 

2x 
tanx = 



2-x 2 ' 

which occurs in the theory of vibrations in a conical pipe. 

10. 4tz 3 — (3x — 1) 2 =0 arises in the study of the isothermals of a gas. Find its 
roots when (i) T = 0.002 and (ii) T = 0.99. 

11. Solve x x =100. 12. Solve x = 10 log x. 13. Solve s+log x =z log x. 

14. Solve Kepler's equation M=x-esin x when M = 332° 28' 54.8", e = 14° 3' 20". 

15. In what time would a sum of money at 6% interest compounded annually 
amount to as much as the same sum at simple interest at 8%? 

16. In a semicircle of diameter x is inscribed a quadrilateral with sides a, 6, c, x; 
then x 3 -(a 2 +6 2 +c 2 )x-2a6c=0 (I. Newton). Given a = 2, 6=3, c=4, find x. 

17. What rate of interest is implied in an offer to sell a house for $9000 cash, or 
$1000 down and $3000 at the end of each year for three years? 



CHAPTER VIII 



Determinants; Systems of Linear Equations 

80. Solution of Two Linear Equations by Determinants of Order 2. 

Assume that there is a pair of numbers x and y for which 



(1) 



aix+biy=ki, 
a,2X+b2y=k2. 

Multiply the members of the first equation by 62 and those of the second 
equation by —61, and add the resulting equations. We get 

(aib2—(i2bi)x = kib2—k2bi. 
Employing the respective multipliers — a2 and ai, we get 

(0162 — Ci2bi)y = ai&2 — ctefci. 
The common multiplier of x and y is 
(2) aib2-a,2bi, 

and is denoted by the symbol 

a\ b\ 
a,2 62 

'which is called a determinant of the second order , and also called the deter- 
minant of the coefficients of x and y in equations (1). The results above 
may now be written in the form 

(3) 

We shall call fci and £2 the known terms of our equations (1). Hence, 
if D is the determinant of the coefficients of the unknowns, the product of D by 
any one of the unknowns is equal to the determinant- obtained from D by 
substituting the known terms in place of the coefficients of that unknown. 

101 



(20 



a\ b\ 




fa 61 




a\ b\ 




a\ fa 


fl2 &2 


X — 


k2 62 


1 


ei2 62 


2/ = 


a2 k2 



102 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch. VIII 



If D 5*0, relations (3) uniquely determine values of x and y: 



£ = 



kib2 — k2b] 
D 



y= 



€L\k2~ 0,2^1 

~D J 



and these values satisfy equations (1) ; for example, 

{a\b2— a>2b\)k\. 



aix+biy = 



D 



Hence our equations (1) have been solved by determinants when D^O. 
We shall treat in § 96 the more troublesome case in which Z)=0. 

Example. For 2x — Sy = — 4, 6x — 2y = 2, we have 



14x = 14, x=l f 



2 -3 




-4 -3 


6 -2 


£ = 


2 -2 


1 


Ay- 


2 -4 
6 2 



=28, y=2. 



15 



EXERCISES 

Solve by determinants the following systems of equations: 

1. 8x-y = 34, 2. 3x+4t/ = 10, 3. ax+by=a\ 
x-h8i/ = 53. 4x-f 2/ = 9. 6x— ay—ab. 

81. Solution of Three Linear Equations by Determinants of Order 3. 

Consider a system of three linear equations 

aix+biy+ciz = ki y 

(4) a 2 x+b 2 y+C2z=k2, 

a3X+b3y+Csz = h. 

Multiply the members of the first, second and third equations by 

(5) &2C3 — &3C2, b'sC\ — biCs, b\C2—b2Cx, 

respectively, and add the resulting equations. We obtain an equation 
in which the coefficients of y and z are found to be zero, while the coeffi- 
cient of x is 

(6) ai&2C3~Oi63C2+a2&3Cl — a2&lC3 + a3&iC2 — a3&2Cl» 



k 



fe 



§821 



SIGNS OF TERMS OF A DETERMINANT 



103 



(60 



Such an expression is called a determinant of the third order and denoted 
by the symbol 

a\ 6i a 

0,2 &2 C2 
03 63 C3 

The nine numbers ai, . . . , C3 are called the elements of the determi- 
nant. In the symbol these elements lie in three (horizontal) rows, and 
also in three (vertical) columns. Thus 02, 62, C2 are the elements of the 
second row, while the three c's are the elements of the third column. 

The equation (free of y and z), obtained above, may now be written 
as ' 



a\ fei C\ 




fci 61 C\ 


(12 &2 C2 


£ = 


&2 &2 C2 


03 63 C3 




fa 63 C3 



since the right member was the sum of the products of the expressions 
(5) by ki, fa, fa, and hence may be derived from (6) by replacing the 
d's by the fc's. Thus the theorem of § 80 holds here as regards the 
unknown x. We shall later prove, without the laborious computations 
just employed, that the theorem holds for all three unknowns. 

82. The Signs of the Terms of a Determinant of Order 3. In the 

six terms of our determinant (6), the letters a, 6, c were always written 
in this sequence, while the subscripts are the six possible arrangements 
of the numbers 1, 2, 3. The first term 016203 shall be called the diagonal 
term, since it is the product of the elements in the main diagonal running 
from the upper left-hand corner to the lower right-hand corner of the 
symbol (6') for the determinant. The subscripts in the term —016302 
are derived from those of the diagonal term by interchanging 2 and 3, 
and the minus sign is to be associated with the fact that an odd number 
(here one) of interchanges of subscripts were used. To obtain the arrange- 
ment 2, 3, 1 of the subscripts in the term +026301 from the natural order 
1, 2, 3 (in the diagonal term), we may first interchange 1 and 2, obtaining 
the arrangement 2, 1, 3, and then interchange 1 and 3; an even number 
(two) of interchanges of subscripts were used and the sign of the term 
is plus. 

While the arrangement 1, 3, 2 was obtained from 1, 2, 3 by one inter- 
change (2, 3), we may obtain it by applying in succession the three inter- 



104 DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch. Vm 

• 
changes (1, 2), (1, 3), (1, 2), and in many new ways. To show that the 
number of interchanges which will produce the final arrangement 1, 3, 2 
is odd in every case, note that each of the three possible interchanges, 
viz., (1, 2), (1, 3), and (2, 3), changes the sign of the product 

P = (Xl — X 2 ) (X\ — X 3 ) (X 2 — X3) , 

where the x's are arbitrary variables. Thus a succession of k interchanges 
yields P or — P according as k is even or odd. Starting with the arrange* 
ment 1, 2, 3 and applying k successive interchanges, suppose that we 
obtain the final arrangement 1, 3, 2. But if in P we replace the subscripts 
1, 2, 3 by 1, 3, 2, respectively, i.e., if we interchange 2 and 3, we obtain 
—P. Hence k is odd. We have therefore proved the following rule 
of signs: 

Although the arrangement r, s, t of the subscripts in any term =ka,fc*c* of 
the determinant may be obtained from the arrangement 1, 2, 3 by various 
successio7is of interchanges, the number of these interchanges is either always 
an even number and then the sign of the term is plus or always an odd num- 
ber and then the sign of the term is minus. 

EXERCISES 

Apply the rule of signs to all terms of 

1. Determinant (6). 2. Determinant ai&2— o*6i. 

83. Number of Interchanges always Even or always Odd. We now 

extend the result in § 82 to the case of n variables xi f . . . , Xn. The 
product of all of their differences Xi—Xj(i<j) is 

P=(xi— x 2 )(xi— X3) . . . (si— re,,) 

•(X2 — £3) . . . (#2 — £n) 



•(x n _ i—X n ). 

Interchange any two subscripts i and j. The factors which involve neither 
i nor j are unaltered. The factor (:r<— xj) involving both is changed in 
sign. The remaining factors may be paired to form the products 

± (x t — x k ) (xj — x t ) (h = l, . . . , n; k^i, k?*f). 

Such a product is unaltered. Hence P is changed in sign. 

Suppose that an arrangement i\, 2*2, . . . , i n can be obtained from 



§ 84] DEFINITION OF A DETERMINANT OF ORDER n 105 

1, 2, . . . , n by using m successive interchanges and also by t successive 
interchanges. Make these interchanges on the subscripts in P; the 
resulting functions are equal to (— \) m P and (— 1)7*, respectively. But 
the resulting functions are identical since either can be obtained at one 
step from P by replacing the subscript 1 by i\, 2 by 2*2, . . . , n by i n . Hence 

(-i)-p-(-jyp, 

so that m and t are both even or both odd. 

Thus if the same arrangement is derived from 1, 2, . . . , n by m successive 
interchanges as by t successive interchanges, then m and t are both even or 
both odd. 

84. Definition of a Determinant of Order n. We define a determinant 
of order 4 to be 

a\ 61 ci d\ 



(7) 



02 &2 C2 ^2 

03 63 C3 dz 

04 64 C4 #4 



= ^2w ^^rC^dt 



(24) 



where g, r, s, £ is any one of the 24 arrangements of 1, 2, 3, 4, and the 
sign of the corresponding term is + or — according as an even or odd 
number of interchanges are needed to derive this arrangement q y r, 5, t 
from 1, 2, 3, 4. Although different numbers of interchanges will produce 
the same arrangement q y r, s, t from 1, 2, 3, 4, these numbers are all even 
or all odd, as just proved, so that the sigr* is fully determined. 

We have seen that the analogous definitions of determinants of orders 
2 and 3 lead to our earlier expressions (2) and (6). 

We will have no difficulty in extending the definition to a determinant 
of general order n as soon as we decide upon a proper notation for the n 2 
elements. The subscripts 1, 2, . . -. , n may be used as before to specify 
the rows. But the alphabet does not contain n letters with which to 
specify the columns. The use of e', e" y . . . , e (n) for this purpose would 
conflict with the notation for derivatives and besides be very awkward 
when exponents are used. It is customary in mathematical journals 
and scientific books (a custom not always followed in introductory text 
books, to the distinct disadvantage of the reader) to denote the n letters 
used to distinguish the n columns by 61, ^2, . . . , e n (or some other letter 
with the same subscripts) and to prefix (but see § 85) such a subscript by 



106 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch. VIII 



the new subscript indicating the row. The symbol for the determinant 

is therefore 

en 612 .. . e\ u 



(8) 



Z)= 



621 622 .. • 62» 



By definition this shall mean the sum of the n(n— 1) ... 2-1 terms 
(9) (—1)^1^2 . . . <V 

in which t"i, t*2, . . . , in is an arrangement of 1, 2, ... 9 n, derived from 
1, 2, . . . , n by i interchanges. Any term (9) of the determinant (8) is, 
apart from sign, the product of n factors, one and only one from each col- 
umn, and one and only one from each row. 

For example, if we take n=4 and write a iy bj, Cj, dj for e/i, e#, e#, e&, 
the symbol (8) becomes (7) and the general term (9) becomes the general 
term ( — 1)* a^ b^ c^d^ of the second member of (7). 

EXERCISES 

1. Find the six terms involving a% in the determinant (7). 

2. What are the signs of aib b oad\e<, a&^CscMi in a determinant of order five? 

3. Show that the arrangement 4, 1, 3, 2 may be obtained from 1, 2, 3, 4 by use of 
the two successive interchanges (1, 4), (1, 2), and also by use of the four successive 
interchanges (1, 4\ (1, 3), (1, 2), (2, 3). 

4. Write out the six terms of (8) for n=3, rearrange the factors of each term so that 
the new first subscripts shall be in the order 1, 2, 3, and verify that the resulting six 
terms are those of the determinant D' in § 85 for n= 3. 

85. Interchange of Rows and Columns. Any determinant is not 
altered in value if in its symbol we replace the elements of the first, second, 
. . . , nth rows by the elements which formerly appeared in the same order 
in the first, second, . . . , nth columns, or briefly if we interchange the cor- 
responding rows and columns. For example, 

a c 
= ad — bc= 



a b 
c d 



b d 



We are to prove that the determinant D given by (8) is equal to 

en 621 ... e n i 



Z>'= 



0i2 622 .. . e n 2 



&\n &2n • • • ^nn 



1 85] INTERCHANGE OF ROWS AND COLUMNS 107 

If we give to D' a more familiar aspect by writing e tt = a w for each element 
so that, as in (8), the row subscript precedes instead of follows the column 
subscript, the definition of the determinant in terms of the a's gives D' 
in terms of the e's as the sum of all expressions 

(—1)^1*^2*,. . . £»*„, 

in which k\, fa . . . , &* is an arrangement of 1, 2, . . . , n, derived from 
the latter sequence by i interchanges. 

As for the terms of D, without altering (9), we may rearrange its factors 
so that the first subscripts shall appear in the order 1, 2, . . . , n, and 
obtain 

(—1)^1*1*2*, • • • ent . 

This can be done by performing in reverse order the i successive inter- 
changes of the letters e corresponding to the i successive interchanges 
which were used to derive the arrangement i\, t*2, . . . , i n of the first 
subscripts from the arrangement 1, 2, . . . , n. Thus the new second 
subscripts k\, . . . , k u are derived from the old second subscripts 1, . . . , 
n by i interchanges. The resulting signed product is therefore a term 
of D'. Hence D=D'. 

86. Interchange of Two Columns. A determinant is merely changed 
in sign by the interchange of any two of its columns. For example, 



Z)= 



a b 

c d 



=ad—bc, A = 



b a 
d c 



= 6c— ad=— D. 



Let A be the determinant derived from (8) by the interchange of the 
rth and sth columns. The terms of A are therefore obtained from the 
terms (9) of D by interchanging r and s in the series of second subscripts. 
Interchange the rth and sth letters e to restore the second subscripts 
to their natural order. Since the first subscripts have undergone an 
interchange, the negative of any term of A is a term of D, and A= — D. 

87. Interchange of Two Rows. A determinant D is merely changed 
in sign by the interchange of any two rows. 

Let A be the determinant obtained from D by interchanging the rth 

and sth rows. By interchanging the rows and columns in D and in A, 

we get two determinants D f and A', either of which may be derived from 

the other by the interchange of the rth and sth columns. Hence, by 

§§85,86, 

A = A'=-ZX=-.D. 



108 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch-VHI 



88. Two Rows or Two Columns Alike. A determinant is zero if any 

two of its rows or any two of its columns are alike. 

i 

For, by the interchange of the two like rows or two like columns, the 
determinant is evidently unaltered, and yet must change in sign by §§ 86, i 
87. Hence /)=-/), Z)=0. | 



1. Prove that the equation of the straight line determined by the two distinct 
points (xi f j/i) and (x», yt) is 

x y \ 

xi yi 1 =0. 

xt Vt 1 



2. Show that 



at 



6, 
bi 



Ci 
Ci 
C* 



Qt 


Ci 


b» 




ai 


C\ 


lh 


= 


Of 


c% 


bt 





<h (h <h 
bt bi b% 
c% Ci c% 



By use of the Factor Theorem (§ 14) and the diagonal term, prove that 



3. 



1 1 1 
a b c 
a* 6* c* 



= (6-a)(c-a)(c-6). 



4. 



Xi 



XS 



Xt 



Xt 1 



Xj 



X\ Xt . . . Xn 



n (xi-xj). 



This is known as the determinant of Vandermonde, who discussed it in 1770. 
symbol on the right means the product of all factors of the type indicated. 
5. Prove that a skew-symmetric determinant of odd order is zero: 



The 






a 


b 




—a 





c 


=0, 


-6 


— c 











a 


b 


c 


d 


—a 





e 


/ 


9 


-b 


— e 





h 


• 

J 


—c 


-/ 


-A 





k 



-d -g -i -k 



0. 



§90] 



EXPANSION BY ROW OR COLUMN 



109 



89. Minors. The determinant of order n— 1 obtained by erasing 
(or covering up) the row and column crossing at a given element of a 
determinant of order n is called the minor of that clement. 

For example, in the determinant (6') of order 3, the minors of b h bt, &i are respect- 
ively 



02 C| 


, ft = 


ai C\ 


, A- 


ai ci 


as Ci 




a% c% 




<h Ct 



Again, (6') is the minor of d« in the determinant of order 4 given by (7). 

90. Expansion According to the Elements of a Row or Column. In 

CL\ b\ C\ 

(6') D= a 2 b 2 C2 

03 &3 C3 

denote the minor of any element by the corresponding capital letter, 
so that 61 has the minor B\ f 63 has the minor £3, etc., as in § 89. We 
shall prove that 



D= a\A\— 61B1+C1C1, 
Z)= —02^2+62^2— C2C2, 
D= 03^3— 63B3+C3C3, 



D= a\A 1—02^2+03^3, 
D=-6iJ5i+62S2-6 3 i53, 
D = C1C1-C2C2+C3C3. 



The three relations at the left (or right) are expressed in words by saying 
that a determinant D of the third order may be expanded according to the 
elements of the first, second or third row {or column). To obtain the expan- 
sion, we multiply each element of the row (or column) by the minor of 
the element, prefix the proper sign to the product, and add the signed 
products. The signs are alternately + and — , as in the diagram 

+ - + 
- + - 
+ - + 

For example, by expansion according to the second column, 



= -4X9= -36. 



14 5 




2 3 


2 3 


= -4 


3 9 


3 9 







110 DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VIII 



Similarly the value of the determinant (7) of order 4 may be found by expansion 
according to the elements of the fourth column: 



-d x 



We shall now prove that any determinant D of order n may be expanded 
according to the elements of any row or any column. . 

• 

Let E tJ denote the minor of e«> in D, given by (8), so that .Ey is 
obtained by erasing the ith row and jth column of D. 
(i) We first prove that 



at bt Ct 




<ti bi C\ 


i 


a\ b\ C\ 




Ol bi C\ 


a% bt Ct 


+d* 


at b% ct 


-dt 


a* b% c% 


+4 


at bt Ct 


Oi bi C4 




Oi bi d 




a* bi d 




a% b% Ct 



(10) 



D=eiiEn-e2iE2i+e3iE 3 i- . . . +(-l) w - 1 e nl # 1 



»i? 



so that D may be expanded according to the elements of its first column. 
By (9) the terms of D having the factor en are of the form 

( — l/ene^. . . e inni 

where 1, t*2, . . . , in is an arrangement of 1, 2, . . . , n, obtained from the 
latter by i interchanges, so that ?2, . . . , t n is an arrangement of 2, ... , 
n, derived from the latter by i interchanges. After removing from each 
term the common factor e\\ and adding the quotients, we obtain a sum 
which, by definition, is the value of the determinant 2?n of order n— 1. 
Hence the terms of D having the factor en may all be combined into 
en En, which is the first part of (10). 

We next prove that the terms of D having the factor C21 may be com- 
bined into —e2\E2\, which is the second part of (10). For, if A be the 
determinant obtained from D by interchanging its first and second rows, 
the result just proved shows that the terms of A having the factor e2i 
may be combined into the product of 621 by the minor 

e\2 013 . . . e\ n 

€32 033 • • • £3n 



nn 



&n2 0n3 • • 

of 02i in A. Now this minor is identical with the minor E21 of e2i in D. 
But A= — D (§87). Hence the terms of D having the factor 621 may be 



§911 



REMOVAL OF FACTORS 



111 



combined into— 621-B21 . Similarly, the terms of D having the factor 631 
may be combined into C31-B31, etc., as in (10). 

(it) We next prove that D may be expanded according to the elements 
of its ftth column (fc>l): 



(ID 



D-2(-iy+vA. 



Consider the determinant 8 derived from D by moving the fcth column 
over the earlier columns until it becomes the new first column. Since 
this may be done by k— 1 interchanges of adjacent columns, 8 = (— l)*"" 1 !). 
The minors of the elements 6u, . . . , 4* in the first column of d are evidently 
the minors El k , . . . , Ent of e^, . . . , e„* in D. Hence, by (10), 

S=e n E lk -e 2 *E 2k +. • • • +(-1)*-^,*= £ (-lY+ l ejtEjt. 

Thus D=(-l)*- 1 5 has the desired value (11). 

(iit) Finally, D may be expanded according to the elements of its 
kth row: 

In fact, by Case (it), the latter is the expansion of the equal determinant 
D' in § 85 according to the elements of its Arth column. 

91. Removal of Factors. A common factor of all of the elements of the 
same row or same column of a determinant may be divided out of the elements 
and placed as a factor before the new determinant. 

In other words, if all of the elements of a row or column are divided 
by n, the value of the determinant is divided by n. For example, 



na\ nb\ 


=n 


a\ b\ 


% 


(*2 62 




02 &2 


7 



ai ritn a 




ai bi c\ 


a*i nb% c% 


= n 


02 &2 C2 


az nbz c$ 




az 63 cz 



Proof is made by expanding the determinants according to the elements 
of the row or column in question and noting that the minors are the same 
for the two determinants. Thus the second equation is equivalent to 

-(n6i)J5i+(n6 2 )B2-(n63)B3-n(--6ifii+62fi2--63fi3), 
where /?< denotes the minor of fc« in the final determinant. 



112 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch. VIII 



EXERCISES 



1. 


3a 36 3c 


2. 


2r I Sr 




5a 56 6c 


=0. 


2* m 3s 




d e f 




2t n Zt 


Expand by the shortest method and evaluate 


3. 


2 7 3 


4. 


5 7 




5 9 8 




6 8 






3 




3 9 4 



0. 



5. 



abed 
a* 6* c* d» 
a 3 6* c* d» 
a 4 6 4 c 4 d 4 



=a6cd(a-6)(a-c)(a-d)(6-c)(6-d)(c-d). 



92. Sum of Determinants. A determinant having ai+gi, a2+#2, . . . a* 
Me elements of a column is equal to the sum of the determinant having a\, 
a2, ... as the elements of the corresponding column and the determinant 
having q\ } #2, ... as the elements of that column, while the elements of the 
remaining columns of each determinant are the same as in the given determi- 
nant. 

For example, 



ai+qi 

fl2 + <?2 
03 + 23 



61 
62 
h 



C2 



a\ b\ c\ 




qi b\ c\ 


a2 62 C2 


+ 


q2 62 C2 


a$ 63 C3 




#3 63 cz 



To prove the theorem we have only to expand the three determinants 
according to the elements of the column in question (the first column in 
the example) and note that the minors are the same for all three determi- 
nants. Hence ai+qi is multiplied by the same minor that a\ and q\ 
are multiplied by separately, and similarly for 02+^2, etc. 

The similar theorem concerning the splitting of the elements of an} 
row into two parts is proved by expanding the three determinants accord- 
ing to the elements of the row in question. For example, 

a+r 



b+s 


^^» 


a b 


+ 


r s 


d 




c d 


■ 


c d 



§93] 



ADDITION OF COLUMNS OR ROWS 



113 



93. Addition of Columns or Rows. A determinant is not changed 
in value if we add to the elements of any column the products of the correspond- 
ing elements of another column by the same arbitrary number. 

Let at, <Z2, . . . be the elements to which we add the products of the 
elements 6i, 62, . . . by n. We apply § 92 with qi = nbi, q2 = nb2, .... 
Thus the modified determinant is equal to the sum of the initial determi- 
nant and a determinant having 61, 62, . . • in one column and nb\, nb2, 
... in another column. But (§ 91) the latter determinant is equal to 
the product of n by a determinant with two columns alike and hence 
is zero (§88). For example, 



a\-\-nb\ 61 c\ 




a\ b\ C\ 


t. 


61 fei C\ 


G2+W&2 62 C2 


= 


02 62 C2 


+n 


62 62 C2 


az+nbz 63 C3 




fl3 63 Cz 




63 &3 Cz 



and the last determinant is zero. 

Similarly, a determinant is not changed in value if we add to the elements 
of any row the products of the corresponding elements of another row by the 
same arbitrary number. 



For example, 



a+nc b+nd 
c d 



— 


a b 

c d 


+n 


c d 
c d 


= 


a b 
c d 



Example. Evaluate the first determinant below. 



1 -2 1 




1 1 




12 3 


= 


18 3 


= 


6 4 3 




6 10 3 








1 




-2 8 


2 


8 3 


— 


3 10 


3 


10 3 







= -44. 



Solution. First we add to the elements of the second column the products of the 
elements of the last column by 2. In the resulting second determinant, we add to the 
elements of the first column the products of the elements of the third column by — 1. 
Finally, we expand the resulting third determinant according to the elements of its 
first row. 



EXERCISES 



X Prove that 



b +c c -fa a +b 
b\+Ci ci+ai ai+bi 
bt+Ct Ci+a* a*-\-bi 



=2 



a b e 

CL\ 61 Ci 

<h bi Ct 



114 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VIII 



By reducing to a determinant of order 3, etc., prove that 



2. 



1 


1 


1 1 


a 


b 


c d 


a* 


6* 


c* d* 


a* 


6» 


c* d' 


2 


-1 


3 -2 


1 


7 


1 -1 


3 


5 ■ 


-5 3 


4 


-3 


2 -1 



(a-6)(a-c)(a-<*)(6-c)(6-(i)(c-<*). 



-42. 



1111 
12 3 4 
1 3 6 10 
1 4 10 20 



- 1. 



(12) 



94. System of n Linear Equations in n Unknowns with Dj*0. In 

CL11X1+CL12X2+ . . . +ainX%=ki, 



^nlXl+(haX2+ . . . +atmXn = kn, 

let D denote the determinant of the coefficients of the n unknowns: 



Z> = 



Then 



ZXn = 



an a\2 . . . a ln 



anxi ai2 . . . din 



Q>n\X\ &n2 • • • a nn 



Q nl a *2 • • • Aim 

anXi+ai2a:2+ . . . +aiȣii, 012 ... a 



a n iXi+a n2 a:2+ . . . +ajmX ll , a„2 . . . a** 

where the second determinant was derived from the first by adding to 
the elements of the first column the products of the corresponding elements 
of the second column by £2, etc., and finally the products of the elements 
of the last column by x„. The elements of the new first column are equal 
to fci, . . . , k n by (12). In this manner, we find that 

(13) Dxi~K h Dx 2 = K 2 , ..., Dxn=K u , 

in which #« is derived from D by substituting k\, . . . , k n for the elements 
a 14 , . . . , am of the ith column of Z), whence 

an ... aia-i k\ 



Ki = 



fci ai2 . . . a-in I 

1 *» ^n2 • • • (Inn 



0*1 • • • 0»»-l &» 



§M1 



EQUATIONS WITH DETERMINANT NOT ZERO 



115 



If D^Oy the unique values of x\ f . . . , Xn determined by division 
from (13) actually satisfy equations (12). For instance, the first equation 
is satisfied since 

k\ an d\2 • • . €L\ n 



kiD—auKi—aizKi— . . . — a in K n = 



ki an #12 • • . Clin 
A?2 #21 #22 • * • <hn 

k n 0»1 0«2 • • • <*nn 



as shown by expansion according to the elements of the first row; and 
the determinant is zero, having two rows alike. 

Theorem. If D denotes the determinant of the coefficients of the n 
unknowns in a system of n linear equations, the product of D by any one 
of the unknowns is equal to the determinant obtained from D by substituting 
the known terms in place of the coefficients of that unknown. If D^O, we 
obtain the unique values of the unknowns by division by D. 

We have therefore given a complete proof of the results stated and 
illustrated in §80, § 81. Another proof is suggested in Ex. 7 below. 
The theorem was discovered by induction in 1760 by G. Cramer. 

EXERCISES 

Solve by determinants the following systems of equations (reducing each deter- 
minant to one having zero as the value of every element but one in a row or column, 
in the example in § 03). 



1. x+ y+ 2-11, 
* 2x-6y- z»0, 
3s+4y+2z=0. 

3. x-2y+ z = 12, 

x+2y+3z=48, 

6x+4y+3z=84. 

5. x+ y+ z+ w= 1, 
x+2y+ 3*+ 4u> = ll, 
x+Sy+ 6z+10u>=26, 
x+4y+10z+20u>=47. 



2. x+ y+ z=0, 
s+2y+3z=-l, 
s+3y+6*=0. 

4. 3x-2y=7, 
3y-2z=6, 
3z-2s = -l. 

•6. 2x- y+3z-2u>=4, 

x+7y+ z— t0=2, 

3x+5y-52+3u>=0, 

4x -Zy +2z- u> = 5. 



7. Prove the first relation (13) by multiplying the members of the first equation 
(12) by An,, those of the second equation by — An, . . . , those of the nth equation by 
( — l) n " l A n \, and adding, where Ay denotes the minorof a v in D. Hint : The resulting 
coefficient of x* is the expansion, according to the elements of its first column, of a deter- 
minant derived from D by replacing an by an, ... , a n \ by a^. 



116 DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VIII 

95. Rank of a Determinant. If we erase from a determinant D of 
order n all but r rows and all but r columns, we obtain a determinant 
of order r called an r-rowed minor of D. In particular, any element. is 
regarded as a one-rowed minor, and D itself is regarded as an n-rowed 
minor. 

If a determinant D of order n is not zero, it is said to be of rank n. 
If, for 0<r<n, some r-rowed minor of D is not zero, while every (r+1)- 
rowed minor is zero, D is said to be of rank r. It is said to be of rank 
zero if every element is zero. 

For example, a determinant D of order 3 is of rank 3 if Z)^0; of rank 2 if Z)=0, 
but some two-rowed minor is not zero; of rank 1 if every two -rowed minor is zero, 
but some element is not zero. Again, every three-rowed minor of 

abed 
e f g h 
abed 
e f g h 

is zero since two pairs of its rows are alike. Hence it is of rank 2 if some two-rowed 
minor is not zero. But it is of rank 1 if a, 6, c, d are not all zero and are proportional 
to e, /, g, h, since all two-rowed minors are then zero. 

96. System of n Linear Equations in n Unknowns with D=0. We 

shall now discuss the equations (12) for the troublesome case (previously 
ignored) in which the determinant D of the coefficients of the unknowns 
is zero. In view of (13), the given equations are evidently inconsistent 
if any one of the determinants K\, . . . , K n is not zero. But if D and 
these K's are all zero, our former results (13) give us no information 
concerning the unknowns x if and we resort to the following 

Theorem. Let the determinant D of the coefficients of the unknowns 
in equations (12) be of rank r, r<n. If the determinants K obtained from 
the (r+l)-rowed minor* of D by replacing the elements of any column by 
the corresponding known terms k t are not all zero, the equations are incon- 
sistent. But if these determinants K are all zero, the r equations involving 
the elements of a non-vanishing r-rowed minor of D determine uniquely r 
of the unknowns as linear functions of the remaining n—r unknowns, which 
are independent variables, and the expressions for these r unknowns satisfy 
also the remaining n—r equations. 



1961 



EQUATIONS WITH DETERMINANT ZERO 



117 



Consider for example the three equations (4) in the unknowns x, y, z. Five cases 
te: 

(a) D of rank 3, i.e., Dj*Q. 
(0) D of rank 2 (i.e., D=0, but some two-rowed minor ^0), and 



*1 


bi 


Ci 




ai 


h 


C\ 




ai 


fh 


fa 


kt 


6, 


Ct 


, K t = 


<h 


kt 


Ol 


, K> = 


a* 


6, 


fa 


h 


bt 


c% 




a% 


fa 


c% 




a% 


&i 


fa 



<H ^ 




bi ki 




ct ki 


aj kj 


i 


bj ks 


i 


cj kj 



Kv 



not all aero. 

(y) D of rank 2 and K\, K tf Ki all zero. 

(5) D of rank 1 (i.e., every two-rowed minor =0, but some element ^0), and 



(i, j chosen from 1, 2, 3) 



not all zero; there are nine such determinants A'. 

(c) D of rank 1, and all nine of the two-rowed determinants K zero. 

In case (a) the equations have a single set of solutions (§94). In cases (0) and 
(6) there is no set of solutions. For (0) the proof follows from (13). In case (7) one 
of the equations is a linear combination of the other two; for example, if aik— aA ?*0, 
the first two equations determine x and y as linear functions of z (as shown by trans- 
posing the terms in z and solving the resulting equations for x and y), and the resulting 
values of x and y satisfy the third equation identically as to z. Finally, in case (*), 
two of the equations are obtained by multiplying the remaining one by constants. 

The reader acquainted with the elements of solid analytic geometry will see that 
the planes represented by the three equations have the following relations: 

(a) The three planes intersect in a single point. 

(0) Two of the planes intersect in a line parallel to the third plane. 

(y) The three planes intersect in a common line. 

(5) The three planes are parallel and not all coincident. 

(c) The three planes coincide. 

The remarks preceding our theorem furnish an illustration (the case 
r=n— 1) of the following 

Lemma 1. If every (r+l)-rowed minor M formed from certain r+1 
rows of D is zero, the corresponding r+1 equations (12) are inconsistent 
provided there is a non-vanishing determinant K formed from any M 
by replacing the elements of any column by the corresponding known 
terms I*. 

For concreteness, 1 let the rows in question be the first r+1 and let 

1 All other cases may be reduced to this one by rearranging the n equations and 
relabelling the unknowns (replacing x t by the new x h for example). 



118 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VUI 



K = 



an 



Olr 



fcl 



'r+1 



5*0. 



Let di, . . . , dr+i be the minors of k\, . . 

first r+1 equations (12) by di, — tfe, . . . 

add. The right member of the resulting equation is the expansion of 

ztK. The coefficient of x, is the expansion of 

an • • • a lr a U 



. , k r+ i in K. Multiply the 
(— l) r d r+1 , respectively, and 



| Or+11 • • • Gr+lr Gr+1* 

and is zero, being an M if s>r, and having two columns identical if s^r. 
Hence 0= ±K. Thus if K^0 y the equations are inconsistent. 

Lemma 2. If all of the determinants M and K in Lemma 1 are zero, 
but an r-rowed minor of an M is not zero, one of the corresponding r+1 
equations is a linear combination of the remaining r equations. 

As before let the r+1 rows in question be the first r+1. Let the 
non-vanishing r-rowed minor be 



(14) 



4+1 = 



an . . . a lr 



a T \ . . . On 



5*0. 



Let the functions obtained by transposing the terms A* in (12) be 

L < =a n xi+a i2 a;2+ • • • +a*n£n—fc«. 
By the multiplication made in the proof of Lemma 1, 

diLi-d 2 L 2 + . . . +(-l) r d r+ iL r+1 = T#=0. 
Hence L r+1 is a linear combination of L\ } . . . , L r . 

The first part of the theorem is true by Lemma 1. The second part 
is readily proved by means of Lemma 2. Let (14) be the non-vanishing 
r-rowed minor of D. For s>r, the sth equation is a linear combination 
of the first r equations, and hence is satisfied by any set of solutions of 
the latter. In the latter transpose the terms involving x r+1) . . . , x n . 
Since the determinant of the coefficients of xi, . . .-, x r is not zero, § 94 
shows that xi, . . . , x r are uniquely determined linear functions of x r+1 , 
. . . , x n (which enter from the new right members). 



{ 97] HOMOGENEOUS EQUATIONS 119 

EXERCISES 

Apply the theorem to the following four systems of equations and check the con- 
clusions: 

1. 2x+ y+3z = l, 2. 2x+ y+3z = l, 

4x+2y- 2= -3, 4x+2y- 2=3, 

2x+ y— 42=— 4. 2x-\- y— 42=4. 

3. x- 3y+ 42 = 1, 4. x- 3y+ 42 = 1, 

4s- I2y +16« =3, 4s -12y+ 162=4, 

3z- 9y+12z=3. 3x- 9y+122=3. 

5 Discuss the system 

ox+ 1/+ z=a— 3, 

x+ay+ z= -2, 
x+ y+az=— 2, 

when (i) a=l; (ii) a =—2; (in) a ?* 1,-2, obtaining the simplest forms of the 
unknowns. 

6. Discuss the system 

x+ y+ 2 = 1, 

flX-f &# + C2 = fc, 

a , x-f6*y-^c , 2=A;^ 
when (%) a, 6, c are distinct; (it) a=b^c; (Hi) a—b—c. 

97. Homogeneous Linear Equations. When the known terms fci, . . . , 
k n in (12) are all zero, the equations are called homogeneous. The determi- 
nants K are now all zero, so that the n homogeneous equations are never 
inconsistent. This is also evident from the fact that they have the set 
of solutions xi=0, . . . , x n —0. By (13), there is no further set of solu- 
tions if D^O. If D = 0, there are further sets of solutions. This is 
shown by the theorem of § 96 which now takes the following simpler form. 

If the determinant D of the coefficients of n linear homogeneous equations 
in n unknowns is of rank r,r<n, the r equations involving the elements of a 
now-vanishing r-rowed minor of D determine uniquely r of the unknowns as 
linear functions of the remaining n—r unknowns , which are independent vari- 
ables, and the expressions for these r unknowns satisfy also the remaining 
n—r equations. 

The particular case mentioned is the much used theorem: 

A necessary and sufficient condition that n linear homogeneous equations 
in n unknowns shall have a set of solutions, other than the trivial one in which 
each unknown is zero, is thai the determinant of the coefficients be zero. 



120 DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VIII 

EXERCISES 

Discuss the following systems of equations: 

1. x+ j/4-32=0, 2. 2s- y+ 4*=0, 3. x- 3y+ 4*-0, 

i-f22/+2z=0, x+ 3y- 22=0, 4x-12y+162=0, 

*+5y- z=0. x-lly+142-0. 3x- 9y+12*-0. 

4. fcr+4y+32-84u>=0, 5. 2s+ 3y- 42+ 5to=0, 

x+2y+32-48u>=0, 3x+ 5y- 2+ 2u?-0, 

x-2y+ 2-12w=0, 7x+lly- 0z+12u?=0, 

4x+4y- 2-24w=0. 3x+ 4y-ll2+13u>=0. 

98. System of m Linear Equations in n Unknowns. The case m<n 
may be treated by means of the lemmas in § 96. If m>n, we select 
any n of the equations and apply to them the theorems of §§ 94, 96. If 
they are found to be inconsistent, the entire system is evidently inconsist- 
ent. But if the n equations are consistent, and if r is the rank of the 
determinant of their coefficients, we obtain r of the unknowns expressed 
as linear functions of the remaining n—r unknowns. Substituting these 
values of these r unknowns in the remaining equations, we obtain a 
system of m—n linear equations in n—r unknowns. Treating this sys- 
tem in the same manner, we ultimately either find that the proposed 
m equations are consistent and obtain the general set of solutions, 
or find that they are inconsistent. To decide in advance whether the 
former or latter of these cases will arise, we have only to find the maxi- 
mum order r of a non-vanishing r-rowed determinant formed from the 
coefficients of the unknowns, taken in the regular order in which they 
occur in the equations, and ascertain whether or not the corresponding 
(r+l)-rowed determinants K t formed as in § 96, are all zero. 

The last result may be expressed simply by employing the terminology 
of matrices. The system of coefficients of the unknowns in any set of 
linear equations 

a\\ xi+ . . . +a iH x n -ki, 
(15) 

flml X\+ . . . +Omn Z» = fcm, 

arranged as they occur in the equations, is called the matrix of the coeffi- 
cients, and is denoted by 

an Gi2 . . . ai„ \ 
A= I ). 



i 9S] tn EQUATIONS IN n UNKNOWNS; MATRIX 121 

By annexing the column composed of the known terms A*, we obtain 
the so-called augmented matrix 

(On di2 • • . dfn &i 

Ami <hn2 • • • ©urn *m 

The definitions of an r-rowed minor (determinant) of a matrix and of 
the rank of a matrix are entirely analogous to the definitions in § 95. 

In view of Lemma 1 in § 96, our equations (15) are inconsistent if 
B is of rank r+1 and A is of rank£r. By Lemma 2, if A and B are both 
of rank r, all of our equations are linear combinations of r of them. Noting 
also that the rank r of A cannot exceed the rank of B, since every minor 
of A is a minor of B, and hence a non-vanishing r-rowed minor of A is a 
minor of B f so that the rank of B is not less than r, we have the following 

Theorem. A system of m linear equations in n unknowns is consistent 
if and only if the rank of the matrix of the coefficients of the unknowns is 
equal to the rank of the augmented matrix. If the rank of both matrices is r, 
c&rtain r of the equations determine uniquely r of the unknowns as linear 
functions of the remaining n—r unknowns , which are independent vari- 
ables, and the expressions for these r unknowns satisfy also the remaining 
m—r equations. 

When m=n+l, B has an m-roiwed minor called the determinant of 
the square matrix B. If this determinant is not zero, B is of rank m. 
Since A has no m-rowed minor, its rank is less than m. Hence we obtain 
the 

Corollary. Any system of n+1 linear equations in n unknowns is 
inconsistent if the determinant of the augmented matrix is not zero. 

EXERCISES 

Discuss the following systems of equations: 

1. 2x+ y+3z = l, 2. 2s- y+32=2, 3. 4x- y+ 2=5, 4. 4z-5y=2, 

4x+2y- z=-3, x+7t/+ z = l, 2x-3y-f52 = l, 2x+3y = 12, 

2z+ y-4z=— 4, 3x+5i/-52=a, x+ y-2z = 2, 10x-7y = 16. 

iar+5v-6z=--10. 4x-3i/+2z = l. bx - z=2. 

5. Prove the Corollary by multiplying the known terms by x n+ i = 1 and applying 
§ 97 with n replaced by n-fl- 

6. Prove that if the matrix of the coefficients of any system of linear homogeneous 
equations in n unknowns is of rank r, the values of certain n — r of the unknowns mav b' 



122 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VIII 



assigned at pleasure and the others will then be uniquely determined and satisfy all 
of the equations. 

99. Complementary Minors. The determinant 

ax b\ C\ d\ 

(16) Z>= 



0,2 &2 C2 C?2 

a$ 63 Cz dz 
a± 64 C4 d± 



is said to have the two-rowed complementary minors 



M= 



ai 61 
a$ 63 



ilf' = 



C2 d2 

c\ d± 



since either is obtained by erasing from D all the rows and columns having 
an element which occurs in the other. 

Irl^general, if we erase from a determinant D of order n all but r rows 
and all but r columns, we obtain a determinant M of order r called an 
r-rowed minor of D. But if we had erased from D the r rows and r columns 
previously kept, we would have obtained an (n— r)-rowed minor of D 
called the minor complementary to M. In particular, any element is 
regarded as a one-rowed minor and is complementary to its minor (of 
order n— 1). 

100. Laplace's Development by Columns. Any determinant D is 
equal to the sum of all the signed products ±MM', where M is an r-rowed 
minor having its elements in the first r columns of D, and M f is the minor 
complementary to M, while the sign is + or — according as an even or odd 
number of interchanges of rows of D will bring M into the position occupied 
by the minor Mi whose elements lie in the first r rows and first r columns 
ofD. 

For r = l, this development becomes the known expansion of D according to the 
elements of the first column (§ 90); here Mi = en. 
If r—2 and D is the determinant (16), 



r>= 




• 


Cs d$ 
C4 d 4 


— 


ai 61 
as bz 


• 


Ci d^ 
d di 


+ 


a>\ b\ 
ai bi 


• 


ct dt 
cz di 


+ 


a s 6 a 


• 


C\ di 
Ci di 


— 


a* 62 
04 bi 


• 


c\ di 
Ct dz 


+ 


a 3 bz 
di bi 


• 


ci di 
ct ck 



iiooi 



LAPLACE'S DEVELOPMENT 



123 



The first product in the development is M\M\\ the second product is —MM' (in the 
notations of § 99), and the sign is minus since the interchange of the second and third 
rows of D brings this M into the position of M\. The sign of the third product in 
the development is plus since two interchanges of rows of D bring the first factor 
into the position of Mi. 



If D is the determinant (8), then 



Jlfi = 



en . . . ei r 



Mi' = 



e r+lr+l • • • ^r+1 



^nr+1 • • • • 6\ 



nn 



r 
i 

r 



^rl • • • &rr 

Any term of the product Af iAf i' is of the type 

(17) (-1)^11 e< 2 2 . . . v (-i^+ir+i • • • V> 

where t"i, . . . , i r is an arrangement of 1, . . . , r derived from 1, . . . , r 
by i interchanges, while i T + u . . . , i H is an arrangement of r+1, . . . , n 
derived by j interchanges. Hence i\ } . . . , i % is an arrangement of 
1, . . . , n derived by t+j interchanges, so that the product (17) is a term 
of D with the proper sign. 

It now follows from § 87 that any term of any of the products ±MM' 
mentioned in the theorem is a term of D. Clearly we do not obtain twice 
in this manner the same term of D. 

Conversely, any term t of D occurs in one of the products iilfJlf'. 
Indeed, t contains as factors r elements from the first r columns of D, 
no two being in the same row, and the product of these is, except per- 
haps as to sign, a term of some minor M. Thus t is a term of MM' or 
of — MM'. In view of the earlier discussion, the sign of t is that of the 
corresponding term in iilfilf', where the latter sign is given by the 
theorem. 

101. Laplace's Development by Rows. There is a Laplace develop- 
ment of D in which the r-rowed minors M have their elements in the first 
r rows of D, instead of in the first r columns as in § 100. To prove this, 
we have only to apply § 100 to the equal determinant obtained by inter • 
changing the rows and columns of D. 

TJiere are more general (but less used) Laplace developments in which 
the r-rowed minors M have their elements in any chosen r columns (or 
rows) of D. It is simpler to apply the earlier developments to the determi- 
nant ±D having the elements of the chosen r columns (or rows) in the 
new first r columns (or rows). 



124 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch. VIII 



EXERCISES 



1. Prove that 



a b c 


d 












e f g 


h 




a b 


• 


• 

3 


k 


j 


k 




e f 




I 


m 


I 


m 













2. By employing 2-rowed minors from the first two rows, show that 



1 
2 



abed 
e f g h 
abed 
e f g h 



a b 
ef 



c d 
Oh 



a c 

e g 



b d 
f h 



+ 



a d 

e h 



! b e 
f 



0. 



3. By employing 2-rowed minors from the first two columns of the 4-rowed deter- 
minant in Ex. 2, show that the products in Laplace's development cancel. 

102. Product of Determinants. The jyroduet of two determinants of 
the same order is equal to a determinant of like order in which the element 
of the r th row and c th column is the sum of the products of the elements of 
the r th row of the first determinant by the corresponding elements of the c th 
column of the second determinant. 



For example, 



(18) 



a b 
c d 



e f 
9 h 



ae+bg 
ce+dg 



af+bh 
cf+dh 



While for brevity we shall give the proof for determinants of order 3, 
the method is seen to apply to determinants of any order. By Laplace's 
development with r = 3 (§ 101), we have 



(19) 



a\ 


b\ c\ 





a% 


62 C2 





az 


&3 C3 





-1 





ei h g\ 


- 


-1 


62 h 02 





-1 


ez h 03 



a\ b\ c\ 




ei /1 01 


a2 &2 C2 


• 


£2/2 02 


03 &3 C3 




€3 /3 03 



§102] 



PRODUCT OF DETERMINANTS 



125 



In the determinant of order 6, add to the elements of the fourth, fifth, 
and sixth columns the products of the elements of the first column by 
«it fi> Qu respectively (and hence introduce zeros in place of the former 
elements a, /i, gi). Next, add to the elements of the fourth, fifth, and 
sixth columns the products of the elements of the second column by 
02, /2, 02, respectively. Finally, add to the elements of the fourth, fifth, 
and sixth columns the products of the elements of the third column by 
ez, fz, 03, respectively. The new determinant is 



at 


bi 


C\ 


aiei+bie2+ciea 


aifi+bif2+c\f3 


oifli+friflfe+cifite 


02 


fe 


<to 


aifii +62C2+C2CS 


02/1+62/2+02/3 


O201+&2ff2+C2fl3 


az 


h 


<% 


0361+6362 +cafii 


03/1+63/2+03/3 


03ffl+&3ff2+c3ff3 


1 




















-1 




















-1 












By Laplace's development (or by expansion according to the elements of 
the last row, etc.), this is equal to the 3-rowed minor whose elements 
are the long sums. Hence this minor is equal to the product in the right 
member of (19). 



EXERCISES 



1. Prove (18) by means of § 92. 

2. Prove that, if ««=<** +^+7*, 



111 

« P 7 

a* /J* 7 s 





1 a a* 




3 


*1 *! 


• 


1 P 0' 


= 


«i 


*1 St 




1 y t* 




H 


«a 84 



3. If Ai, Bi, C4 are the minors of o«, 6«, c< in the determinant D defined by the second 
factor below, prove that 



A, 


-A t 


A, 




Oi 61 C\ 




D 


Bi 


Bt 


-B, 


• 


<h 1h Ct 


- 


D 


Ci 


-Ct 


C, 




as bz c% 




D 



Hence the first factor is equal to D* if D^O. 



126 



DETERMINANTS; SYSTEMS OF LINEAR EQUATIONS [Ch.VHI 



4. Express (a , +& , +c , +d , )(e , + < / , -^0 , +fc , ) as a sum of four squares by writing 



a+bi c+di 
— c+di a—bi 



e-f/i g+hi 
—g+hi e—fi 



as a determinant of order 2 similar to each factor. Hint: If k' denotes the conjugate 
of the complex number k f each of the three determinants is of the form 

k I 
-J' k' 



MISCELLANEOUS EXERCISES 

1. Solve 

ax+ by+ cz=k f 

a*x+b*y+c*z=k* 

by determinants for x, treating all cases. 

2. In three linear homogeneous equations in four unknowns, prove that the values 
of the unknowns are proportional to four determinants of order 3 formed from the 
coefficients. 

Factor the following determinants: 



3. 



1 
1 
1 



a be 
b ca 
c ah 



4. 



x x* yz 




X 1 x* 1 


V y* xz 


= 


V* V* 1 


z z* xy 




2 l Z* 1 



5. 



a 


b 


c 


c 


a 


b 


b 


c 


a 



(a +b+c) (a+bv+cco*) (a +b<o t +cu>) 9 



where w is an imaginary cube root of unity. 



6. 



abed 
bade 
c d a b 
d c b a 



7. 



abed 
d a b c 
c d a b 
b c d a 



8. If the points (xi, yi), . . . , (x 4 , y*) lie on a circle, prove that 



S-L*/.2 



xi*+y 



Xi 



yi 



0. 



2-L*j.2 



*4 f +tf 



Xi 



V* 



§102] 



MISCELLANEOUS EXERCISES 



127 



9. Prove that 



aa' +W+ctf 
ae'+bf'+cg* 



ea'+Z&'+yc' 

ee'+ff'+gg* 



a 



b 
f 



a' 



V 

r 



+ 



a c 



a' 



e* 






+ 



b 
f 



V 

r 



& 
t 



10. Prove that the cubic equation 







a—x b c 




D«- 


b f—x g 
c g h—x 


-0 


has only real roots. Hints: 




a , + & , + c , --a; , ab+bf+cg ac+bg+ch 


D(x)-D(-x) = 


ab+bf +cg 6* + /»+ g*-x* be +fg+gh 




ac+bg+c 


h bc+fg+gh 


c* +0 , + A 8 -x« 



= -*«+x*(a*+/*+/i*+2&*+2c*+20 8 ) ~i«(Di-fA+D 8 )+D l (0), 

'where D% denotes the first determinant in Ex. 9 with ail accents removed and with 
e=b, while D\ and D% are analogous minors of elements in the main diagonal of the 
present determinant of order 3 with x=0. Hence the coefficient of — x* is a sum of 
squares. Since the function of degree 6 is not zero for a negative value of x t y D(x) =0 
has no purely imaginary root. If it had an imaginary root r-fst, then Z>(x+r)=0 
would have a purely imaginary root at. But D{x+r) is of the form D(x) with a, /, h 
replaced by a—r, f—r, h—r. Hence D(x)=0 has only real roots. The method is 
applicable to such determinants of order n. 

11. If ai, . . . , On are distinct, solve the system of equations 



Xi 



+ 



Xi 



-+ • • • +; 



x% 



-=1 



(t = l, . . ., n). 



ki—ai hi—a* ' ' ki—an 

Hint: Regard ki, . . . , k n as the roots of an equation of degree n in k formed from 
the typical one above by substituting A; for k% and clearing of fractions; write k=aj—t t 
and consider the product of the roots of t n + ... =0. Hence find xj. 

12. Solve the equation 

a+x x x 
x b+x x =0. 

* « z c+x 



CHAPTER tX 
Symmetric Functions 

103. Sigma Functions, Elementary Symmetric Functions. A rational 

function of the independent variables xi, X2, . . ., x n is said to be symmetric 
in them if it is unaltered by the interchange of any two of the variables. 
For example, 

X\ 2 +X2 2 +Xs 2 +4xi +4X2+4X3 

is a symmetric polynomial in xi, X2, X3; the sum of the first three terms 
is denoted by 2xi 2 and the sum of the last three by 42xi. In general, 
if t is a rational function of xi, . . . , x„, 2J denotes the sum of t and all 
of the distinct functions obtained from t by permutations of the variables; 
such a 2-f unction (read sigma function) is symmetric in xi, . . . , x n . 

For example, if there are three independent variables a, 0, 7, 

2a0=a0+a7+07, 2a 2 /3=a 2 /3+a/3 2 +a 2 7+a7 2 +j8 2 7+l^, 

y 1=1+1+1 y£_£ + ? + £ + i+2 + 2 



2 



c^+ff^+ff 2 c^+t 2 ff+T 2 
aj8 a/3 a7 07 ' 



In particular, 2a=a+0+7, 2a0, and a/37 are called the three elementary 
symmetric functions of a, 0, 7. In general, 

2a 1, 2aia2, 2aia2a3, . . . , 2aia2 • • • a*_i, aia2 . . . a* 

are the elementary symmetric functions of ai, a2, . . . , a». In § 20 they 
were written out more fully and proved to be equal to — ci, C2,— cz, . . . , 
( — l) w c n if ai, . . . , a» are the roots of the equation 

(1) x w +cix n - 1 +c 2 x n - 2 + . . . +Cn=0 

whose leading coefficient is unity. 

128 



I 



I 



I 1041 FUNDAMENTAL THEOREM ON SYMMETRIC FUNCTIONS 129 

EXERCISES 

If a, 0, 7 are the roots of x*+px*+qx+r =0, so that Za = — p, Za0 «g, and a&y — — r, 
prove that 

1. (2a) l = 2a l +22a0, whence 2a 8 =p 8 -2g. 

2. 2a- Sa/^Sa^+iJaPT, whence 2a»0=3r— pg. 

3. Za^Y-pr. 

4. 2a l 0» = (2W)*-2a072a=*g*-2pr. 

If a, 0, 7, 5 are the roots of z i -\-px i -\-qx t -\-rx-{-8^0 f prove that 

yl H[ X^JL-? ^p 1 r*-2q s 
J-4oT 8 ' Z^afT % Z^a*~~ 8* 

Hint: Compute the sum, sum of the products two at a time, and sum of the squares 
of the roots of the equation 

obtained by replacing x by \/y in the given quartic equation. 

9 X^ 7 _ 3r-pg 
<Wa£ 8 



104. Fundamental Theorem on Symmetric Functions. Any poly- 
nomial symmetric in xi, . . . , z» is equal to an integral rational function, 
with integral coefficients, of the elementary symmetric functions 

(2) 2?i = 2xi, 2?2 = 2x1X2, £3 = 2x1X2X3, . . . , En= X1X2 • . . x* 

and fte coefficients of the given polynomial. In particular, any symmetric 
polynomial with integral coefficients is equal to a polynomial in the elementary 
symmetric functions with integral coefficients. 

For example, if n=2, 

rzi'+ra* 8 +«xi +*& =r(#i* -2J? 2 ) +«&. 

In case r and 8 are integers, the resulting polynomial in E x and E t has integral coeffi- 
cients. 

The theorem is most frequently used in the equivalent form: 



130 SYMMETRIC FUNCTIONS [Ch. IX 

Any polynomial symmetric in the roots of an equation, 

x n -Eix*- l +E 2 x n - 2 - . . . + (-l) n E n =Q, 

is equal to an integral rational function, with integral coefficients, of the coeffi- 
cients of the equation and the coefficients of the polynomial. 

It is this precise theorem that is required in all parts of modern algebra 
and the theory of numbers, where attention to the nature of the coefficients 
is vital, rather than the inadequate, oft-quoted, theorem that any sym- 
metric function of the roots is expressible (rationally) in terms of the 
coefficients. 

It suffices to prove the theorem for any homogeneous symmetric poly- 
nomial S, i.e., one expressible as a sum of terms 

h = axi kl X2 ki . . . x*** 

of constant total degree k = k\+k2+ . . . +k n in the x'&. Evidently 
we may assume that no two terms of S have the same set of exponents 
k\, . . . , k n (since such terms may be combined into a single one). We 
shall say that h is higher than the term bxi ll xj* . . . x n ln if ki>li, or if 
&i = Zi, k2>ht or if ki=h, k2 = h y kz>fa, . . . , so that the first one of 
the differences fci — Zi, A*2— fe, Aft — fe, . . . which is not zero is positive. 

We first prove that, if the above term h is the highest term of S, then 

A^rlAft^Aft . . . ^.k n . 

For, if fci<fc2, the symmetric polynomial S would contain the term 

axi**X2 kl xs k * . . . x n kn , 
which is higher than h. If k2<ks, S would contain the term 

axi kl X2 ki xz k * . . . £„*», 
which is higher than h, etc. 

If the highest term in another homogeneous symmetric polynomial 
S'is 

and that of S is h, then the highest term in their product SS' is 

hh' = aa'xi k i+ k < . . . *„*»+*»'. 



§ 104] FUNDAMENTAL THEOREM ON SYMMETRIC FUNCTIONS 131 
Indeed, suppose that SS' has a term, higher than hh', 

(3) CXih+h' . . . X&+ 1 *', 

which is either a product of terms 

t=bx\K . . x n *s t'=b'xi« . . . xj*' 

of S and S' respectively, or is a sum of such products. Since (3) is higher 
than hh' t the first one of the differences 

li+h'-*i-*i',. . • ,ln+ln'-h-k n ' 

which is not zero is positive. But, either all of the differences l\— ki, 
. . . , Z„— fc„ are zero or the first one which is not zero is negative, since 
h is either identical with t or is higher than t. Likewise for the differences 
W— fci'i • • • > ln'—k n '. We therefore have a contradiction. 

It follows at once that the highest term in a product of any number 
of homogeneous symmetric polynomials is the product of their highest 
terms. Now the highest terms in E\, E2, E$ 9 . . . , E n , given by (2), are 

Xi, X\X2j X 1X2X3, . . . , X1X2 . . . X n , 

respectively. Hence the highest term in EfiEtf* . . . E n an is 

Thus the highest term in 

is h. Hence Si=S— <r is a homogeneous symmetric polynomial of the 
same total degree k as S and having a highest term hi not as high as h. 
As before, we form a product <r\ of the E's whose highest term is this hi. 
Then S2=Si— <ri is a homogeneous symmetric polynomial of total degree 
k and with a highest term /12 not as high as hi. We must finally reach 
a difference S<— <r t which is identically zero. Indeed, there is only a 
finite number of products of powers of xi, . . . , x n of total degree k. 
Among these are the parts A 7 , ft'i, h f 2, ... of A, hi, h2, . . . with the coeffi- 
cients suppressed. Since each hi is not as high as h t _ l9 the h' } h\ } fa', . . . are 
all distinct. Hence there is only a finite number of hi. Since S,— <r t =0, 

S = <r-\ m Si s =a-\-ai-\"S2 = = . . . == 0'+<ri+<r2+ . . . +o"<. 

Hence 8 is a polynomial in Ei, E2, . . . , E n and a, b, ... , with integral 
coefficients. 



132 SYMMETRIC FUNCTIONS [Ch. DC 

Example 1. If S — Zxjxfxi and n> 4, we have 

*=EtEi=*S+3Exi*x&&4+10 SxiXiXsO^Ek, 
5i— S—*=— 3 Zx^xtXiXi — 10 2xix»x»X4X» > 
<n=» — 3 E\Ei= —3 (2xi , XsXjX4~|-5 XxiXtXiX&t), 
St =Si — <ri «■ 5 "ExiXtXtX&i = 5 J£», 
5=<r -f*5i m « , -f'« , i+& =E%Ei— 3 2?iJ£4-f"5 &• 

Example 2. If 5 = Zx i t xtXt and n> 4, 

= 2xi , X|Xa+2 2ii*aJt , X3+3 Zzi'z}£|Z« 

+4 (22i*2*rsZ4+5 2zix&&a*), 
Si=S—ff— —2 Zxi^xa— 7 2ii*x»x»X4— 20 Zzixwca*. 
Take <n = — 2 j$^& s and proceed as in Ex. 1. 

Example 3. By examples 1 and 2, if n>4, 

a2xi , x, 8 a;,+62:xi , x^r3=&^i , ^3-(3a+6)Eil?4+(a-26)^ft+5(a+6)ft. 

105. Rational Functions Symmetric in all but One of the Roots. If 

P is a rational function of the roots of an equation f(x) — of degree n and 
if P is symmetric inn— I of the roots, then P is equal to a rational function, 
with integral coefficients, of the remaining root and (he coefficients off(x) and P. 

For example, P=rai-f«i*-f «**+ . . . -fa** is symmetric in at, . . . , a», and 

P — rcti + Son* — ai* =» Ci 1 — 2c% -{-rati — ai*, 

if at, . . . , a n are the roots of equation (1) . 

Since 1 any symmetric rational function is the quotient of two symmet- 
ric polynomials, the above theorem will follow if proved for the case in which 
the words rational function are in both places replaced by polynomial. 

If a\ is the remaining root, the polynomial P is symmetric in the roots 
c*2, . . . , <*n of f(x)/(z— ai)=0, an equation of degree n— 1 whose coeffi- 
cients are polynomials in a\, c\, . . . , c„ with integral coefficients. Hence 
(§ 104), P is equal to a polynomial, with integral coefficients, in a\, ci, . . . , 
Cn and the coefficients of P. 

1 If N/D is symmetric in cn, at, and the polynomials N and D have no common 
factor, while N becomes N' and D becomes D' when an, as are interchanged, then 
ND'=DN'. Thus N divides N' and both are of the same degree. Hence N'=cN, 
iy=cD, where c is a constant. By again interchanging cn, at, we obtain N from N', 
whence N=cN'=c 2 N, c 2 = l. If c=— 1, we take ai=a 2 and see that N=*N'=*—N, 
N=0, whence N has the factor ai—a 2 . Similarly, D has the same factor, contrary to 
hypothesis. Hence c = -f 1 and N and D are each symmetric in cn, cn. 



§105] 



FUNCTIONS SYMMETRIC IN ALL BUT ONE ROOT 



133 



Example. If a, 0, 7 are the roots of f(x) sx s -hpa? 2 -f qx~\- r =0, find 



2 



r 



a*+0* a 2 +0 2 a*+7 2 2 +7* 
_. 1. 1 

ct+0 a-f-0 a+7 0+7 
Solution. Since 0* +7* = p 2 — 2g —a 1 , +7 = — p —a, 



2 



•^-2^-2 (-*£) - ->-^2i? 



But a+P» 0+P, 7+P are the roots y h y%, y% of the cubic equation obtained from 
f(x)=*Q by the substitution s+p=y, i.e., x = y—p. The resulting equation is 

y*-2py*+(p 2 +q)y+r-pq=Q. 

Since we desire the sum of the reciprocals of j/i, 2/1, #», we set y — 1/2 and find the sum 
of the roots z it z*, z» of 

l-2pz+(p»+g)« 2 -f(r-pg)«»=0. 
Hence 

>p_l ^_1_^ P 2 +9 ^« 2 +0* 2g 2 -2p«g+4pr 



pq-r 



2 a 2 +0 2 _ 
«+0 



P9-r 



EXERCISES 



[In Exs. 1-12, a, 0, 7 are the roots of f(x) ^x t -\-px t -\-qx+r^0.] 
Using 07+a(0+7) ~g, find 

<Sp07+a* n '^O 307 -2a* 



2. 



2 



0+7 ^-* 0+7— a 

3. Why would the use of 07 = — r/a complicate Exs. 1, 2? Verify that 

— r f(a)—r 

07 = = =a 2 -fpa-Hf. 

a a 

20 , +7 1 
? 
0T+C 



5. Find 



2 W+7) 1 . 6. Find ^(a+0-7)'. 7. Find ^ (fcr) 1 - 

8. Find a necessary and sufficient condition on the coefficients that the roots, in 

some order, shall be in harmonic progression. Hint: If — | — =-, then —0=0, 

a 7 q 

and conversely. Hence the condition is 

(f-)(f->)(f-Mf)-«- 



3 



134 SYMMETRIC FUNCTIONS [Ch. IX 

9. Find the cubic equation with the roots 07 — , ay — , o0 — . Hint: since 

a 7 

these are (— r— l)/o, etc., make the substitution (— r— l)/x=y. 

Find the substitution which replaces the given cubic equation by one with the roots 

10. o^+otT, «0+07, «7+07. 

11. , etc. 12. — -, etc. 

0+7 -« 0+7 -2o' 

If a, 0, 7, 6 are the roots of x 4 +px 8 +ga; , 4-rx+s=0, find 

s +7 2 +« t ^ , ST s 07+0*+7« 



13. 



-<O fl'+7 2 +a f <sp 



15. Prove that if y h y%, yt are the roots of y*+py+q=0, the equation with the roots* 
fc-te-yj) 1 , ^^(j/i-j/s) 2 , *s = (yi-2fc) 2 is 

« 8 +6p2«+9p 2 2+4p s +27g , =0. 

Hints: since 2i = 2tyi J — 2ys|/s — |/i* = — 2p+2g/yi— yi J , etc., we set «= — 2p+2g/y— y*. 
By the given equation, y 2 +p+g/y =0. Thus the desired substitution is 2= — p+3q/y, 

y=Zq/(z+p). 

16. Hence find the discriminant of the reduced cubic equation. 

17. If x\ % . . . , x n are the roots of f(x) =0, show that 



2: 



1 -/'(c) 



xi-c /(c) 
Hint: a* — c=yi, . . . , a^— c=y n are the roots of 

/(c+y) =/(c) +ief (c) +y 2 ( ) + . . . =0, 

as shown by Taylor's theorem. Or we may employ (5) for x=c. 

106. Sums of Like Powers of the Roots. If ai, . . . , a» are the roots of 

• (1) f(x)=x n +cix n - 1 +c 2 x n ' 2 .+ . . . +c=0, 

we write s\ for 2<*i, $2 for 2<*i 2 , and, in general, 

$ t = 2ai*=ai*+a 2 *+ • • • +<*n*- 
The factored form of (I) is 

(4) f{x) = (x-ai)(x-ct2) . . . (x-ctn). 



§ 106] SUMS OF LIKE POWERS OF THE ROOTS 135 

The derivative /'(x) of this product is found by multiplying the derivative 
(unity) of each factor by the product of the remaining factors md adding 
the results. Hence 

/'(aO = (x-a 2 ) . . . (x-On) + (x-ai)(x-a3) . . . (z-a n )+ . . . , 

(5) mm m.+M. + ...+M.. 

X—a\ X—CL2 %—<*n 

If a is any root of (1), f(a) =0 and 

/(*) /(*)-/(«) _ *»-<*" p a- 1 -*"- 1 , . x-a 

= = hci h . . . -rCn-i 

x—a x—a x—a x—a x—a 

=x H - l +ax H - 2 +c?x n - 3 + . . . +a(x H - 2 +ax n - 3 + . . . ) 

+c 2 (x"~ 3 + ...)+..., 

: (6) ^=x*- l +(a+Ci)x n - 2 +(a 2 +Cia+c 2 )x n - z + . . . 
x—a 

+ (aM-Cia*- 1 +c 2 a*- 2 + . . . +c k -iot+Ct)x n - k - 1 + 

Taking a to be a\, . . . , a»in turn, adding the results, and applying (5), we 
obtain 

f(x)=iix?~ l +(8i+nci)x n ~ 2 +(82+ci8i+nc2)x*~ z + . . . 

+ («*+Cl8*-i+C2S*-2+ . . . +C*-i*l+nC*)05"""*~ 1 + .... 

The derivative of (1) is found at once by the rules of calculus (or by 
§ 56) to be 

f\x) = nx H - 1 +(n-l)ciX n - 2 +(n-2)c 2 x n - z + . . . + (n-k)c k x»- t - 1 + 
Since this expression is identical term by term with the preceding, we have 

Sl+Cl=0, $2 + Cl$l+2c 2 = 0, . . . , 

(7) 

8k+CiS k -i + C2S k . 2 + . . . +C it _ 1 Si+fcc* = (fc£tt-l). 

We may therefore find in turn «i, $2, . . . , *»-i: 

(8) «1=— Cl, S2 = Ci 2 — 2C2, S 3 = — Ci 3 + 3CiC2 — 3 C3, . . . . 

To find s„, replace z in (1) by a\, . . . , a„ in turn and add the resulting 
equations. We get 

(9) Sn+cis»_i+C2$„_2+ • • • +c n _ 1 $i+nc n = 0. 



136 SYMMETRIC FUNCTIONS [Ch. IX 

We may combine (7) and (9) into 

(10) 8 k +CiS t - l +C2S k - 2 + • • • +c*-i8i+fcc* = (fc=l,2, . . . , n). 

This set of formulas (10) will be referred to as Newton's identities. 
The student should be able to write them down from memory and, when 
writing them, should always check the final one (9) by deriving it as above. 

To derive a formula which shall enable us to compute the s t for 
k>n, we multiply (1) by x*~ B , take x=a\, . . . , x=On in turn, and add 
the resulting equations. We get 

(11) s*+ci« t _ 1 +c 2 s*-2+ • • • +c»s*- ll =0 (fc>n). 

Instead of memorizing this formula, it is preferable to deduce it for the 
particular equation for which it is needed, thus avoiding errors of substi- 
tution as well as confusion with (10). 

Example . Find s* f or x n — 1 = 0. 

Solution. Comparing our equation with (1), we have Ci=0, . . . , c*_i=0, c*= — 1. 
Hence in (10) for k <n, each c is zero and s* =0. But, for k=n, (10) becomes s H —n =0. 
We may check the latter by substituting each root a h . . . , a n in our given equation 
and adding. Finally, to find sj when l>n, multiply our equation by x l ~ n In the 
resulting equation x l — x l ~ n =*0 we substitute each root, add, and obtain *|— «|_„=0. 
Hence from s t we obtain an equal s by subtracting n from I. After repeated subtrac- 
tions, we reach a value k for which 1 £ fc £ n. Since s*=0 or n according as k<n or 
k-n, it follows that *j=0 or n according as lis not or is divisible by n. 

EXERCISES 

1. For a cubic equation, *4=ci 4 — 4c i t c i +4ciCi+2ct*. 

2. For an equation of degree 2l 4, «4=ci 4 — ic^+^Ci-f^ 1 — 42|. 

3. Find «i, «j, «4 f ** for s 2 — ps-hg=0: 

4. Finds* for s*-3=0. 

5. Find a*, s*, *, s? for z*— px+q=Q. 

107. Waring's Formula for s k in Terms of the Coefficients. While 

we have learned how to find si, «2, *s> • • • in turn by Newton's identities, 
it is occasionally useful to know an explicit expression for s t , where k 
has an arbitrary value. The formula in question is applied ordinarily 
only to a quadratic equation 

a?+px+q=0. 

Accordingly we shall treat this case in detail. If its roots are a and 0, 

then 

7?+px+q={x— a)(x— 0). 



1 1071 WAKING'S FORMULA 137 

Replace x by 1/y and multiply by y 2 . We get 

(12) l+py+tnf^d-ctyKl-fly). 

Taking derivatives, we have 

p+2qy=-a(l-fiy)-fi(l-ay). 

Change of signs and division by the members of (12) gives 

ri3) -p-2w - « i 

The identity in Ex. 7, § 14, with n changed to k, may be written in 
the form 

(14) -i~sl+r+r 2 + . . .+ r*~ l + * 



— r 1— r 



t Take r=oy and multiply the resulting terms by a; thus 



a =«M+-.+«V-M a 



*+y 



—ay 1— ay 

Similarly, 

jJ- - s+ ^ + . . . -MV-+£& 

To show that on adding, and writing s* for a*+0*, we obtain (15), we need 
the sum of the final fractions, which by (12) is 



<i-J&-w -i+&y *-«"'<i-*)+^*'(.-^). 



Hence 



« _l_ P _.. j _i_ x„.,Mi *2/* 



(15) i — -r.+i — 5^ = *i+ 5 2y+ • • • +*4T + 



1-oy * l-0y y • • • » j+py^.^ 

• 

where the exact expression for <t> is immaterial. 

Next, we seek an expansion of the fraction in the left member of 
(13). Its denominator will be identical with that in (14) if we choose 
r= —py—qy 2 . Evidently (14) may be written in the compact form 



Hence it becomes 



1— r ,.o 1— r 



1 ■S , (-D*(py+w s ) , + ' *** 



l+Ptf+92/ 2 «-o l+py+gy 2 ' 



138 SYMMETRIC FUNCTIONS [Ch. IX 

where ^=(— p— qy)*, although no use will be made of the particular 
form of the polynomial ^. By the binomial theorem, 

(py+«ifl'= 2^r ! (p»)'(^)*» 

where the summation extends over all sets of integers g and h, each £.0, 
for which g+h=t, while g\ denotes the product of 1. 2, . . . , g if g2.1 f 
but denotes unity if gf=0. Hence 

(16) i5^ =(p+ ^ )S( " 1) ' + * +1 ^^^ + ^ + ^ 

1+py+qy 2 ' 

where the summation extends over all sets of integers g and h, each 2:0, 
for which g+h£.k— 1. 

Since the left members of (15) and (16) are identically equal by (13), 
their right members must be identical, so that the coefficients of y*"" 1 
in them must be equal. 1 Hence the coefficient s t of t/*" 1 in (15) is equal 
to the coefficient of y*~ l in (16), which is made up of two parts, correspond- 
ing to the two terms of the factor p+2qy. When we use the constant 
term p, we must employ from 2 in (16) the terms in which the exponent 
of y is equal to k— 1. But when we use the other term 2qy, we must 
employ from 2 the terms in which the exponent of y is equal to k— 2, in 
order to obtain the combined exponent k— 1 of y. Hence «» is equal to 
the sum of the following two parts: 

pS( _l)* + A + i ( -^-W (0+2A=fc-l), 

2gS(-l)^+ 1 ^|^-W (0+2A=fc-2). 

In the upper sum, write i for 0+1, and j for h. In the lower sum, write 
i for g, and j for A+ 1. Hence 

^=2(-i/-|^ P v+2 S (-iy-f^f ! P v, 

1 In fact, the (k— l)th derivatives of the two right members are identical, and we 
obtain the indicated result by substituting y=0 in these two derivatives and equating 
the results. Note that the final terms in both (15) and (16) have y as a factor of their 
(k— l)th derivatives. 



§ 1071 WARING'S FORMULA 139 

where now each summation extends over all sets of integers i and j, each 
^ 0, for which 

(17) t+2j=fc. 

Finally, we may combine our two sums. Multiply the numerator 
and denominator of the first fraction by i, and those of the second fraction 
by j. Thus 

(18) *-*Z(- D w (i+ L X) W , 

since the present fraction occurred first multiplied by i and second multi- 
| plied by 2j, and, by (17), the sum of these multipliers is equal to k. Our 
final result is (18), where the summation extends over all sets of integers 
t and j, each ^0, satisfying (17). 

If we replace t by its value k — 2j t and change the sign of p, we obtain from (18) 
the result that the sum of the k th powers of the roots of x 2 —px+q = Q is equal to 

do) ^ki^iyf^^p'^ 

/-o (k-2j)\j\ 

k i t-2 i *(«— 3) ^4 2 k(k—4){k—5) t _ 6 3 

-v-hp q+-j^-p « — ^ — p «+•• • 

where K is the largest integer not exceeding k/2. 

The product of the roots is equal to q. Hence if x denotes one root, the second 
root is q/x. Thus st ss x k +(q/x) k . Again, the sum of the roots is x-\-q/x-p. Regard 
q as given and p as unknown. Hence, if c is an arbitrary constant, the equation 

(20) V -"OP H — f^tf P — . . . -e 

is transformed by the substitution p-x+q/x into 



*©'- 



Hence equation (20) may be solved for p by radicals by the method employed in § 43 
for a cubic equation. 



140 SYMMETRIC FUNCTIONS [Ch.DC 

The above proof applies l without essential change to any equation 
x?+dx*~ l + . . . +c»=0 and leads to the following formula for the sum 
of the fcth powers of its roots: 

(2i) *-*z(-i)*+ - • • +* {ri \':' +r *'T 1)l ci*. . . cj*, 

where the summation extends over all sets of integers n, . . . , r n , each 
^0, for which ri+2r2+3r3+ . . . +nr n =k. This result (21) is known 
as Waring 9 8 formula and was published by him in 1762. 

Example. Let n-3, ib=4. Then ri+2r 2 +3rj=4 and 

(n, r„ u) = (4, 0, 0), (2, 1, 0), (1, 0, 1), (0, 2, 0), 



/3! 2! 1! 1! \ 

^ = Hii Cl4 "2ir! c ^ + rm CiCi+ ^V 



m a* — 4Ci*C| +4C\C% +20^. 

EXERCISES 

1. For the quadratic x*—px+q=0 write out the expressions for *, « s , «« v *• given by 
(19), and compare with those obtained from Newton's identities (Ex. 3, § 106). 

2. Find s 4 for a quartic equation by Waring's formula. 

3. For k =5, (20) becomes DeMoivre's quintic p*— 5gp , -f5g , p«c. Solve it by 
radicals for p. 

4. Solve (20) by radicals when ib=7. 

108. 2-functions Expressed in Terms of the Functions **. Since 
we have learned two methods of expressing the s k in terms of the coeffi- 
cients, it is desirable to learn how to express any 2-polynomial (and 
hence any symmetric function) in terms of the s t . 

By performing the indicated multiplication, we find that 

where m=l if a ^6, m=2 if a =6. Transposing the first term, which 
is equal to s a + b) and dividing by m, we obtain 

(22) Xai a a 2 b = - (m* - ««+*) • 

m 

1 See the author's Elementary Theory of Equations, pp 72-74, where there is given 
also a shorter proof by means of infinite series. 



§ 1081 COMPUTATION OF SIGMA FUNCTIONS 141 

In order to compute Zai 4 a2 3 a3 2 in terms of the s k , we form the product 
Sai 4 • 2aiW = 2ai 7 a 2 2 + 2ai 6 a 2 3 + 2ai 4 a 2 3 03 2 . 
Making three applications of (22), we get 

«4(«3«2 — ss) = (S7S2 — 89) + (sqSz — 89) + 2ai 4 a2 3 as 2 . 

Hence 

Sai 4 a2 3 a3 2 — 828384 — 8287 — 838b — 84.85+289. 

EXERCISES 

For a quartic equation, express in terms of the s* and ultimately in terms of the 
coefficients c it . . . , c#: 

1. XotW. 2. Wo*. 

3. SotiVejcrj. 4. 2ai\xft\xs s . 

5. If a^6 > c>0, prove that 

2ai% as* =— («a»5«c— «a«ft+c"-«»«o+e""Mo+ft+2«o+d+c)> 
HI 

where m = 1 if a> 5, 111=2 if a =6. 

6. Safi^^^^SoSd 1 — *a*2» — 28&8 a +&-h2*a+2»), a>5>0. 

7. Sa 1 a a,% a =i(« a , -3«a«3o+2«8a), «>0. 

109. Computation of Symmetric Functions. The method last explained 
is practicable when a term of the S-function involves only a few distinct 
roots, the largeness of the exponents not introducing a difficulty in the 
initial work of expressing the 2-function in terms of the s*. 

But when a term of the S-function involves a large number of roots 
with small exponents, we resort to a method suggested by § 104, which 
tells us which auxiliary simpler symmetric fuctions should be multiplied 
together to produce our 2-function along with simpler ones. 

For example, to find Zx^x&iXi, when n> 4, we employ 

E1E4 — Xx\ • ?,X\XiXzX\ = T,X\ i XiX&\ -f-5 22i$i£)24£t, 

Exi 2 XiXiXi = E\E\ —5 Ei. 

To find Zzihcthczhti, employ EiE i = 'ExiXiXf2xiXiXiXi. 

When many such products of Z-functions are to be computed, it will save time 
in the long run to learn and apply the " method of leaders " explained in the author's 
Elementary Theory of Equations, pp. 64-65. 



142 SYMMETRIC FUNCTIONS [Ch. EX 

MISCELLANEOUS EXERCISES 

Express in terms of the coefficients Ci, . . . , c„: 

1. ZaiV&X). 2. ZaiVxs^xi. 3. Zai\x»fe»a4. 4. Sexist Vx» s . 

If a, 0, 7 are the roots of x l +px t +qx+r=O i find a cubic equation with the roots 

2 2 2 

5. a*, 0«, 7*. 6. aft «7, 07- 7. -, -, -. 

a p y 

8. a«+0', a*+7 s , 0*+7*. 9. a«+a0+0*, etc. 

If a, 0, 7, 5 are the roots of x 4 -f pa?' -fgs* +*£-[-«= 0, find 

"•2£~2i-22-2>2>2£ 

12. Express 2cn a at b ai e a4 d in terms of the s* when (t) o>6>c>d>0, and (it) when 

13. By solving the first k of Newton's identities (10) as a system of linear equations, 
find an expression in the form of a determinant (t) for s* in terms of c h . . . , ci, and 
(it) for c* in terms of *i, . . . , 8*. 

14. One set of n numbers is a mere rearrangement of another set if *i, . . . , «» 
have the same values for each set. 



CHAPTER X 
Elimination, Resultants and Discriminants 

110. Elimination. If the two equations 

ax+b=0, cx+d=0 (a^O, c-^0) 

are simultaneous, i.e., if x has the same value in each, then 

h ri 

x= — = — , R=ad— bc=0, 

a c 

and conversely. Hence a necessary and sufficient condition that the 
equations have a common root is #=0. We call R the resultant (or 
eliminant) of the two equations. 

The result of eliminating x between the two equations might equally 
well have been written in the form 6c— ad=0. But the arbitrary selec- 
tion of R as the resultant, rather than the product of R by some constant, 
as — 1, is a matter of more importance than is apparent at first sight. For, 
we seek a definite function of the coefficients a, 6, c, d of the functions 
ax+b, cx+d, and not merely a property J?=0 or R?*0 of the correspond- 
ing equations. Accordingly, we shall lay down the definition in § 111, 
which, as the reader may verify, leads to J? in our present example. 

Methods of elimination which seem plausible often yield not R itself, 
but the product of R by an extraneous function of the coefficients. This 
point (illustrated in § 114) indicates that the subject demands a more 
careful treatment than is often given. 

111. Resultant of Two Polynomials in x. Let 

' f(x) = a x m +aix m - 1 + . . . +a,n (ao^O), 

\ g(x) = box n +bix*- l + . . . +b H (6o^O) 

be two polynomials of degrees m and n. Let a\, . . . , a m be the roots 
of f(x)=0. Since <*i is a root of g(x)=0 only when <7(«i)=0, the two 
equations have a root in common if and only if the product 

9(on)g(a 2 ) . . . 0(«m) 
143 



(1) 



144 ELIMINATION, RESULTANTS, DISCRIMINANTS [C* X 

is zero. This symmetric function of the roots of /(x)=0 is of degree n 
in any one root and hence is expressible as a polynomial of degree n in the 
elementary symmetric functions (§ 104), which are equal to — ai/ao, 
02/00, .... To be rid of the denominators oo, it therefore suffices to 
multiply our polynomial by ao*. We therefore define 

(2) R(f, g) =ao*g(ai)g(a 2 ) . . . 0(O 

to be the resultant of / and g. It equals an integral rational function of 
oo, • • • , On, bo, . . . , b n with integral coefficients. 

EXERCISES 

1. If m-1, n-2, R(f, g)=btfl l *—biaoai+btao i . 

2. If m=2, n-1, R(f, g) « 00(60*1 +&i)(&oo*+&i) -aA l -ak6«bi+«h l , since 

«o(«i "fa*) «* — «i, Ooatiai ■ Of. 

3. If ft, . . . , ft, are the roots of g(x) «0, so that 

p(«<)=5o(eti-/5i) (01— ft) . . . (ai-AO, 
then 

#(/, 0) -aoW-ft) («i-/3t . . . («i-A0 

•(ai-/5i) (aa-ft) . . . (aa-/3j 



•<a»-0i)(a»-0i) . . . (a»-j8ii). 
Multiplying together the differences in each column, we see that 

R(f,g) = (-l) MH bo m f(Pi)m) . . ./(W-C-l)""^./*. 

4. If m=2, n = l, #(?,/)= 6 y(- 61/60) -o^-a^A +0160*, which is equal to R(f,p) 
by Ex. 2. This illustrates the final result in Ex. 3. 

5. If m=n=2, R(f, g) =Oo*6o 2 aiW-|-ao*Wiai«»(ai-|-<»») 

+ a *bM<*i 2 + a% 2 ) +a *&i W,+ao*&iWai+ai) +0* V 
* 60*01' —6061O1O1 +6062(01* —2 aoOs) +61*0002 —6161O0O1 -hoo*6j*. 

This equals i2(^, /), since it is unaltered when the a's and 6's are interchanged. 

6. Prove by (2) that R is homogeneous and of total degree m in 60, . . . , 6»; and 
by Ex. 3, that R is homogeneous and of total degree n in ao, . . . , a». Show that R 
has the terms aoV* and ( - lyW. 

7. R(f,gig i )=R(f,gi)-R(f,g2). 

8. W.^-C-l)""*^/)-^!)"^*. 



1112] 



SYLVESTER'S METHOD OF ELIMINATION 



US 



112. Sylvester's Dialytic Method of Elimination. 1 Let the equations 

/(x)=aox 3 +aia?+a2X+a3=0, g(x)=boa?+b\x+b2=0 

have a common root x. Multiply the first equation by x and the second 
by x 2 and x in turn. We now have five equations 

aoxt+aiaP+axxP+azx =0, 

aoaP+aixP+tw+aa =0, 

btft+biaP+btix? =0, 

&0* 3 +&ix 2 +& a x =0, 
boa?+bix+b2=0, 

which are linear and homogeneous in x 4 , x 3 , x 2 , x, 1. Hence (§ 97) 

ao a\ 02 as 

do ai 02 fl3 

(3) F= bo bi 62 

60 61 62 

60 h 62 

must be zero. Next, if F=0, there exist (§97) values which, when 
substituted for x 4 , x 3 , x 2 , x and 1, satisfy the five equations. But why is 
the value for x 4 the fourth power of the value for x, that for x 3 the cube of 
the value for x, etc.? Since the direct verification of these facts would 
be very laborious, we resort to a device to show that, conversely, if F=0 
the two given equations have a root in common. 

In (3) replace az by 03— z and consider the equation 



(4» 



00 


01 


02 


az— t 


! U 





00 


<*1 


<*2 


0,%—Z 


bo 


61 


62 











60 


61 


fe 











60 


h 


62 



=0. 



To prove that it has the roots /(/Si) and /(/fe), where 0i and fc are the roots 
of 0(20=0, we take z=f(fii) and prove that the determinant is then equal 
to zero. For, if we add to the last column the products of the elements 

1 Given without proof by Sylvester, Philosophical Magazine, 1840, p. 132. 



146 



ELIMINATION, RESULTANTS, DISCRIMINANTS 



[Ch.X 



of the first four columns by ft 4 , ft 3 , ft 2 , ft, respectively, we find that all 
of the elements of the new last column are zero. 
Since (4) reduces to (3) for 2=0, it is of the form 

b<?Z 2 +kz+F = 0, 

in which the value of k is immaterial. By considering the product of 
the roots of this quadratic equation, we see that 

F«W/G8i)/(jh). 

Hence the Sylvester determinant F is the resultant R(g,f) and hence 
is the resultant R(f } g) y since mn is here even (Ex. 3, § 111). 

In general, if the equations are 

/(x)=o x m + . . . +o,»=0, g(x)=box?+ . . . +&»=0, 

we multiply the first equation by x n ~ l , x n ~ 2 , . . . , x, 1, in turn, and the 
second by x m ~ l } a^ -2 , . . . , x } 1, in turn. We obtain n+m equations 
which are linear and homogeneous in the m+n quantities x m+n ~ 1 , . . . , 
«, 1. Hence the determinant 

ClQ a\ 0,2 . . . dm 

ao a>i a>2 . . . (bn 

Oq dl 0,2 ... Om ... 



(5) 



F= 








. bo bi b, 



> n rows 



m rows 



is zero. It may be shown to be equal to the resultant R(f, g), whether 
mn is even or odd, by the method employed in the above case m=3, n=2. 

We may also prove as follows that if F=0 the equations /=0 and 
0=0 have a common root. Since F was obtained as the determinant 
of the coefficients of 



x /,..., xf , /, 



x"*-^, 



>xg> g> 



F=0 implies, by § 96, Lemma 2, the existence of a linear relation 
B x n ~ 1 f+ . . . +B n - 2 xf+B n - l f+A x m - 1 g+ . . . +A m -&g+A m - 1 g=0, 



5112] 



SYLVESTER'S METHOD OF ELIMINATION 



147 



identically in x, with constant coefficients So, . . . , A m - 1 not all zero. 
In other words, 0f+ag=O, where 

(6) as4 * m - 1 + . . . +A m -&+A m - u p=Box n ' l + . . . +B n -&+B n _ 1 . 

Neither a nor is identically zero. For, if a=0, for example, then 
fif se=0 and 0=0, whereas the A t and B t are not all zero. 

Consider the factored forms of /, g y a, 0. Suppose that / and g have 
no common linear factor. The highest power of each linear factor occur- 
ring in / divides ag=—f$f and hence divides a. Thus / divides a, whereas 
/ is of higher degree than a. Hence our assumption that /=0 and g=0 
have no common root has led to a contradiction. 

A similar idea is involved in the method of elimination due to Euler (1707-1783). 
If /=0 and 0=0 have a common root c, then/s(x— t)a, — g = (x— c)/3, identically in 
x, where a and are polynomials in x of degrees ra— 1 and n— 1, respectively. Give 
them the notations (6). In the identity 0/+a0=O, the coefficient of each power of x 
is zero. Hence 



CLqBq 

aiBo-\-doBi 



-f&oAo 
+biA +boAi 








<h»Bn- 2+<hn - \B n - 1 
OmBn-i 



+&*A»-2+&„_ iil m _ i = 
+Mm-1=0. 



Since these m+n linear homogeneous equations in the unknowns £<>, ...,£»- i, A o, ... . 
Am- 1 have a set of solutions not all zero, the determinant of the coefficients is zero, 
By interchanging the rows and columns, we obtain the determinant (5) 



EXERCISES 

1. For m=n=2, show that the resultant is 

do a,\ at 



R= 



do cti <h, 

bo bi &2 

&o 6i bi 
Interchange the second and third rows, apply Laplace's development, and prove that 

R = (ao&s) * — (aofci) (aiW , 
where (aJh) denotes ao&2— oj&o, etc. 



148 



ELIMINATION, RESULTANTS, DISCRIMINANTS 



[Ch.X 



2. For m-n-3, write down the resultant R and, by interchanges of rows, derm ji 
the second determinant in 



R= 



OQ 


Ol 


Ot 


Ot 










OQ 


Ol 


Ol 


Ot 











Oo 


Ol 


Ol 


Ot 







bo 


6i 


> 


6i 














oo 


ai 


a* 


Ot 







Oo 


ai 


at 


at 





&o 


6i 


fc 


bt 













bo 


6i 


fc 


bt 








bo 


6i 


bt 


6i 













do 


ai 


at 


Ot 








&o 


6i 


fc 


bt 










*0 


6i 


bt 


•a. 



To the second determinant apply Laplace's development, selecting minors from the 
first two rows, and to the complementary minors apply a similar development. This 
may be done by inspection and the following value oi —R will be obtained: 

(aoW { (aAXoiW -(aiW'+tai&sXao&i) } 
-(aoW | {(aobtKaM -(aoWfafc) } 
+(aoW {(aA)(a«6,)-(aA)«} • 

The third term of the first line and the first term of the last line are alike. Hence, 
changing the signs, 

-B=(o 6i) , -2 (aoh)(aob^(aJbi)-(aobt)(aobt)(a x bt) 

+(ao6,) 1 (a^ l )H-(aa6i)(a 1 6,)«-(ao6 l )(aiW(<H6 t ). 

Other methods of simplifying Sylvester's determinant (5) are given in § 113. 

113. Bezout's Method of Elimination. When the two equations are 
of the same degree, the method published by B6zout in 1764 will be clear 
from the example 



/sooo?+oia; 2 +02X+03=0, g=boa?+bia?+b2Z+bz=0. 



Then 



aog—bof> 

(7) (flox+ai)g- (box+bi)f, 

(aox 2 +oix+02)flf — Qwc?+bix+b2)f 

are equal respectively to 

(ao&Ox 2 +(ao& 2 )s +(0063) =0, 

(8) (ao^J^+KaobsJ+Coi^J^+Ca^^O, 

(oofe)^ +(ait>3)x +(0263) =0, 



\ 



§1131 



BEZOUTS METHOD OF ELIMINATION 



149 



where (oo&i)=oo6i— aibo, etc. The determinant of the coefficients is the 
negative of the resultant R(f, g). Indeed, the negative of the determi- 
nant is easily verified to have the expansion given at the end of Ex. 2 
just above. 

To give a more instructive proof of the last fact, note that, by (7), equations (8) 
are linear combinations of 

x*/=0, x/=0, /=0, x*g=0, xp=0, f-0, 

the latter being the equations used in Sylvester's method of elimination. The deter- 
minant of the coefficients in these six equations is the first determinant R in Ex. 2 just 
above. The operations carried out to obtain equations (8) are seen to correspond 
step for step to the following operations on determinants. To the products of the ele- 
ments of the fourth row by a add the products of the elements of the 1st, 2nd, 3rd, 
5th, 6th rows by — 6 , — &i, — &i, ai, <h respectively [corresponding to the formation 
of the third function (7)]. To the products of the elements of the fifth row by a* add 
the products of the elements of the 2nd, 3rd, 6th rows by — 6 , — &i, <h respectively [cor-, 
responding to the second function (7)]. Finally, to the products of the elements of the 
sixth row by oq add the products of the elements of the third row by —b [corresponding 
to a*0—b<J]. Hence 

do a\ a% a% 



Oq*R = 



a a\ 



<h 



a t 









a* 


ai 


at at 











(ao6j) 


(ai&t) (otftt) 











(«<&) 


(ao&t)+(ai6t) (aA) 











(flo&i) 


(aaW (ao6j) 



so that R is equal to the 3-rowed minor enclosed by the dots. The method of Bezout 
therefore suggests a definite process for the reduction of Sylvester's determinant of 
order 2n (when m—n) to one of order n. 

Next, for equations of different degrees, consider the example 



Then 



/saoX 4 +a 1 a5*H-a»a; 2 -}-a»xH-a4, g&b&i+bxz+bt. 

aoxtg-bof, (aoX+ai)xV — (&o3+W/ 
are equal respectively to 

(ao&i)x* + (aoWx* — <hboX — 0460, 

(ajh)x*+ { (aJh) -atfco } x 1 — { a»6i +a 4 6o } x —aJh. 

The determinant of the coefficients of x 8 , x*, x, 1 in these two functions and xg, g, after 
the first and second rows are interchanged, is the determinant of order 4 enclosed by 
Hots in the second determinant below. Hence it is the resultant R(f, g) . 



150 



ELIMINATION, RESULTANTS, DISCRIMINANTS 



[Ch.X 



As in the former example, we shall indicate the corresponding operations on Syl- 
vester's determinant 

do Q\ 0* €L% CLi 



R 






ao 


ai 


Ol 


Ot 


<u 


bo 


6i 


bt 














bo 


bi 


z* 














bo 


5i 


*» 














&o 


lh 


bt 



': 



Multiply the elements of the third and fourth rows by a . In the resulting determinant 
oo*/2, add to the elements of the third row the products of the elements of the first, 
second and fourth rows by —b , — 6i, ai/oo respectively. Add to the elements of the 
fourth row the products of those of the second by — b . We get 



a *R 



do 


ai 


<h 


Ot 


04 








ao 


ai 


at 


Ot 


0« 








. (aoW 


(aib*) —a t b 


— ajbi— ajbo 


—0461 

1 








(ao&i) 


(ao&i) 


— os&o 


1 

— O460 








bo 


bi 


ft* 











\ o 


&o 


6i 


6, 



Hence A is equal to the minor enclosed by dots. 



EXERCISES 



1. For m=3, n=2, apply to Sylvester's determinant R exactly the same operations 
as used in the last case in § 113 and obtain 



R 



(aobt) (aibt) —<hbo — aJh 

(aM (0061) — O|6o 

bo bi bi 



2. For ro— n— 4, reduce Sylvester's # (as in the first case in § 113) to 

(ao&i) (aoW (flobi) (ajbi) 

(aobt) (a 6s)+(aiW (ao&O+foi&j) (aM 

(ao6i) (ao60+ (ai&») (aibO+fat&i) (0164) 

(ao& 4 ) (ai&O (aA) (oth) 



I 



1114) 



GENERAL THEOREM ON ELIMINATION 



151 



114. General Theorem on Elimination* If any method of eliminating 
x between two equations in x leads to a relation F=0, where F is a polynomial 
in the coefficients, then F has as a factor the true resultant of the equations. 

Some of the preceding proofs become simpler if this theorem is applied. 
For example, determinant (3) is divisible by the resultant R. Since the 
diagonal term of (3) is a term ao 2 &2 3 of R (Ex. 6, § 111), F is identical 
with R. 

The preceding general theorem is proved in Jhe author's Elementary 
Theory of Equations, pp. 152-4. We shall here merely verify the theorem 
in an instructive special case. Let 

f=ao3?+aia?+a2X+a3=0, g=bo3?+b\x*+b2X+ba=0 
have a common root x?*0. Then 

-bof+aog = (ao&i)a?+(ao6 2 )x+(ao&3), 
(63/- a 3 fif) /x = (0063)2* + (aibz)x+ (a 2 6 3 ) . 

By Ex. 1 of § 112, the resultant of these two quadratic functions is 



F= 



(aibz) (0062) 
(0263) (ao&3) 



(0063) (aobi) 2 _ (0063) (aobi) 
(0263) (0063) (aibz) (0062) 

This is, however, not the resultant R of the cubic functions /, g. To show 
that (0063) is an extraneous factor, note that the terms of F not having 
this factor explicitly are 

(aobi) (a 2 fo) { (ao&i) (0263) - (0062) (ai6 3 ) } . 

The quantity in brackets is equal to —(0063) (0162), since, as in Ex. 2 
of § 101, 



o=4 



ao 


Ol 


02 


03 


bo 


fei 


62 


63 


ao 


ai 


<Z2 


03 


60 


61 


h 


h 



= lao&i) (a 2 bz) — (ao& 2 ) (aife) + (0063) (0162) . 



We now see that F=(aobz)R f where R is given in Ex. 2 of § 112. This 
method of elimination therefore introduces an extraneous factor (a bz). 
The student should employ only methods of elimination (such as those 
due to Sylvester, Euler, and BSzout) which have been proved to lead 
to the true resultant. 



152 ELIMINATION, RESULTANTS AND DISCRIMINANTS [Ch. X 

EXERCISES 

Find the result of eliminating x and hence find all sets of common solutions of 
1. rc'-y'-Q, xy«5y. 2. s*+y*«25, rc»+3(c-l);c+c(y l -25)«0. 

3. When x^+ax+b^O has a double root, what 3-rowed determinant is zero? 

4. Find the roots of x«+3x«+32x*+67x«+32x+65«0 fc y 5 79 

115. Discriminants. Let ai, . . ., a* be the roots of 

(9) f(x)=<wr+aiz!»- 1 + . . . +O--0 (ao^O), 
so that 

(10) /(«)=ao(x-ai)(rr-a2) . . . (s-a*). 
As in § 44, we define the discriminant of (9) to be 

D = ao 2m - 2 (ai-a 2 ) 2 (ai-as) 2 . . . (ai-a m ) 2 (a 2 -as) 2 . . . (o^-i-o,,,) 2 . 

Evidently D is unaltered by the interchange of any two roots. Since the 
degree in any root is 2(m— 1), the symmetric function D is equal to a 
polynomial in ao> . . . , a*. Indeed, ao 2 "*"* 2 is the lowest power of ao 
sufficient to cancel the denominators introduced by replacing Xai by 
— ai/ao, . . . , cl\cl2 . . . oc m by dbam/ao. By differentiating (10), we see 
that 

/'(ai) = ao(«i— «2)(ai— «s) . . . («i— On), 

/ / («2)=ao(a2— ai)(«2— «3) • • •' («2— On), 

/'(a3)=ao(a3— ai)(«3— «2)(a3— 04) . . . (0:3 — a*), 

etc. Hence 

ao—V^i) • • •/'(«») =ao*»- 1 (-D ,+a+ • • • + "-'(«i-« 2 ) 2 . . . (a._ I -a«)» 

m(m-l) 

==(-1) 2 aoD. 
By (2), the left member is the resultant of /(x), /'Or). Hencfe 

m(m-l) -1 

(ID D=(-D * ^«(/,/0- 

00 



§ 115] 



DISCRIMINANTS 



153 



EXERCISES 

1. Show that the discriminant of /=y 3 +py+g=0 is — 4p*— 27o* by evaluating 
the determinant of order five for i2(/, /') . 

2. Prove that the discriminant of the product of two functions is equal to the prod- 
uct of their discriminants multiplied by the square of their resultant. Hint: use 
the expressions in terms of the differences of the roots. 

3. For a = l, show that the discriminant is equal to 



1 
1 



«i at 1 



at a t 2 



ai 



m-1 



a% 



m-1 



m-1 



2 



«0 




*i 


St 


m • • 


*»- 


1 


*1 




St 


St 


• • • 


Sm 




*m- 


-1 


Sm 


*m-\ 


hi • • • 


S2m 


-2 



- -4p»-27g*. 



1 <*m «m 2 • • • «m' 

where ««=«!*+ . . . W. See Ex. 4, § 88; Ex. 2, § 102. 

4. Hence verify that the discriminant of x*+px+q=0 is equal to 

3 -2p 

-2p -3g 

-2p -3o 2p* 

5. By means of Ex. 1, § 113, show that the discriminant of oox'+o^+ojx+oa^O is 

2a</h aiOs+3aoas 2oia* 

d\ 2at 3o» 

3ao 2ai aj 

= 18aoai02aj — 40001* —4oi^aj +01^01* — 27o W. 



MISCELLANEOUS EXERCISES 

1. Find the equation whose roots are the abscissas of the points of intersection 
of two general conies. 

2. Find a necessary and sufficient condition that 

/(x) 33£ 4 +px'-f qx*+ rx+8 =0 

shall have one root the negative of another root. When this condition is satisfied, 
what are the quadratic factors of /(x)? Apply to Ex. 4, § 74. Hint: add and subtract 
/(x) and/(— x). 

3. Solve /(x)sx 4 -6x 3 +13x 2 -14x+6=0, given that two roots a and are such 
that 2a+/5 = 5. Hint: /(x) and/(5— 2x) have a common factor. 

4. Solve x*-f px+g =0 by eliminating x between it and x 2 -f vx+w = y by the greatest 
common divisor process, and choosing v and w so that in the resulting cubic equation 
for y the coefficients of y and y 1 are zero. The next to the last step of the elimination 



154 ELIMINATION, RESULTANTS AND DISCRIMINANTS [Ch. X 

gives x as a rational function of y. (Tschirnhausen, Ada Erudit., Lipsiae, II, 1683, 
p. 204.) 

5. Find the preceding y-cubic as follows. Multiply x*-\-vx+w=y by x and' replace 
x* by — px — q; then multiply the resulting quadratic equation in x by x and replace 
x* by its value. The determinant of the coefficients of x*, x, 1 must vanish. 

6. Eliminate y between y lss v, x=ry+sy l t and get 

x*-3r8vx-(r*v+8*v*) =0. 

Take « = 1 and chose r and v so that this equation shall be identical with x*+px+q=0, 
and hence solve the latter. (Euler, 1764.) 

7. Eliminate y between y* — v, x ^f+ey -f y % and get 



1 e f-x 
e /— x v 
f—x v ev 



0. 



x — 



1+V 



This cubic equation in x may be identified with the general cubic equation by choice 
of c, /, v, Hence solve the latter. 

8. Determine r, 8 and v so that the resultant of 

x+r 

y+s 

shall be identical with x*+px+q=0. (B6zout, 1762.) 

9. Show that the reduction of a cubic equation in x to the form y*=v by the sub- i 
stitution 

r+sy 



is not essentially different from the method of Ex. 7. [Multiply the numerator and 
denominator of as by l—y+y*.\ 

10. Prove that the equation whose roots are the n(n— 1) differences xj—x* of the 
roots of f(x) =0 may be obtained by eliminating x between the latter and f(x+y) =0 _j 
and deleting from the eliminant the factor y* (arising from y=xj—xj=0). The I 
equation free of this factor may be obtained by eliminating x between f(x) =0 and 

i/(^!/)-/w|/i/=/'w+/"W r; +■ • • +/ (n) (*)rl -o. 

1* l l*z . . . n 

This eliminant involves only even powers of y, so that if we set y 2 —z we obtain an 
equation in z having as its roots the squares of the differences of the roots of /(x) «= 0. 
(Lagrange Resolution des Equations, 1798, § 8.) 

11. Compute by iJx. 10 the z-equation when/(x) =x»-r-px+g. 



APPENDIX 



THE FUNDAMENTAL THEOREM OF ALGEBRA 

Theorem. An equation of degree n with any complex coefficients 

f(z)= z »+ ai z*-*+ . . . ^an=0 

has a complex {real or imaginary) root. 

Write z=x+iy where x and y are real, and similarly ai=ci+id\, etc. 
By means of the binomial theorem, we may express any power of z in the 
form X+iY. Hence 

(1) K*)-4>(x,y)+Wz,y), 

where <t> and \p are polynomials with real coefficients. 

The first proof of the fundamental theorem was given by Gauss in 
1799 and simplified by him in 1849. This simplified proof consists in 
showing that the two curves represented by #(x, y)=0 and \f/(x, y)=0 
have at least one point (xi, y{) in common, so that z\ =x\+iy\ is a root 
of f(z) =0. This proof is given in Chapter V of the author's Elementary 
Theory of Equations. 

We here give a shorter proof, the initial idea of which was suggested, 
but not fully developed, by Cauchy. 1 

Lemma 1. aih+a2h 2 + . . . +aji n is less in absolute value than any 
assigned positive number p for all complex values of h sufficiently small in 
absolute value. 

The proof differs from that of the auxiliary theorem in § 62 only in 
reading " in absolute value " for " numerically." 

We shall employ the notation \z\ for the absolute value +Vx 2 +y 2 of 
z*=x+iy. 

1 For a history of the fundamental theorem, see Encycloptdie des sciences matht- 
matiques, tome I, vol. II, pp. 189-205. 

155 



156 APPENDIX 

Lei nma 2* Given any positive number P } we can find a positive number 
Rsuckthat\f(z) \>Pif\z\>R. 

The proof is analogous to that in § 64. We have 

/oo-^a+D), J>=ai@)+ • • • + a »(i)" 

Since (Ex. 5, § 8) the absolute value of a sum of two complex numbers 
is equal to or greater than the difference of their absolute values, we have 

|/(z)|>|*|»[l-|£|]. 

Let p be any assigned positive number <1. Applying Lemma 1 with 
h replaced by l/z, we see that | D \ <p if | 1/z | is sufficiently small, i.e., 
if p=| z | is sufficiently large. Then 

i/( 2 )i>p»d-p)>p 

if p"iP/(l— p), which is true if 



*$£* 



This proves Lemma 2. 

Lemma 3. Given a complex number a such that /(a) 5*0, toe can find 
a complex number z for which \ f(z) \ < | /(a) | . 

Write z=a+h. By Taylors theorem (8) of § 56, 

f(a+h)=f(a)+f'(a)h+ . . . +/<'>(<*)-+ . . . +f (n) (a)~ 

ri nl 

Not all of the values /'(a), /"(a), ... are zero since / (n) (a)=n!. Let 
/ (r) (a) be the first one of these values which is not zero. Then 

f(a+h) _. ,/"(a) V , , / w (a) h? 

f(a) 1+ /(a) Vr" • • • + /(a) 'nV 

Writing the second member in the simpler notation 

g (h) = l+bh r +ch r+1 + . . . +lh n , 6^0, 

we shall prove that a complex value of h may be found such that | g(h) | < 1. 
Then the absolrte value of f(z)/f(a) will be <1 and Lemma 3 proved. 
To find such a value of h, write h and b in their trigonometric forms (§ 4) 

A= p(cos 6+i sin 0), b = \ b \ (cos fi+i sin f$). 



FUNDAMENTAL THEOREM OF ALGEBRA 157 

Then by § 5, § 7, 

6A r =| b \p r { cos (fi+r6)+isin (fi+rd)}. 

Since h is at our choice, p and angle are at our choice. We choose $ 
so that 0+r0=18O°. Then the quantity in brackets reduces to —1, 

wll6IlC6 

0(A)- (1-1 b \p r )+h r (ch+ . . . +lh»-<). 

By Lemma 1, we may choose p so small that 

\ch+ ... +lh n ' r \<\b\. 

By taking p still smaller if necessary, we may assume at the same time 
that|6|p r <l. Then 

|0(/O|<(i-|Mp f )+P f IH lrf*)l<i. 

Minimum Value of a Continuous Function. Let F(x) be any poly- 
nomial with real coefficients. Among the real values of x for which 
2£x£3, there is at least one value x\ for which F(x) takes its minimum 
value F(xi), i.e., for which F(x\)±F{x) for all real values of x such that 
2£a:£3. This becomes intuitive geometrically. The portion of the 
graph of y=F(x) which extends from its point with the abscissa 2 to its 
point with the abscissa 3 either has a lowest point or else has several 
equally low points, each lower than all the remaining points. The arith- 
metic proof depends upon the fact that F(x) is continuous for each x 
between 2 and 3 inclusive (§62). The proof is rather delicate and is 
omitted since the theorem for functions of one variable x is mentioned 
here only by way of introduction to our case of functions of two variables. 

We are interested in the analogous question for 

G(x,y) = <t?(x,y)+Mx,y), 

which, by (1), is the square of | f(z) |. As in the elements of solid analytic 
geometry, consider the surface represented by Z=G(x, y) and the right 
circular cylinder x?+y 2 = R 2 . Of the points on the first surface and on 
or within their curve of intersection there is a lowest point or there are 
several equally low lowest points, possibly an infinite number of them. 
Expressed arithmetically, among all the pairs of real numbers x, y for 



158 APPENDIX 

which a^+y 2 ^^ 2 , there is 1 at least one pair z\, y\ for which the 
polynomial G(x, y) takes a minimum value 0(x\ 9 yi), i.e., for which 
G(xi, yi) S.G(x, y) for all pairs of real numbers x, y for which xP+y 2 S.B?. 

Proof of the Fundamental Theorem. Let z' denote any complex 
number for which f(z?) 5*0. Let P denote any positive number exceeding 
| /CO |. Determine R as in Lemma 2. In it the condition \z\^.R may 
be interpreted geometrically to imply that the point (x, y) representing 
z=x+iy is outside or on the circle C having the equation x?+y 2 =R?. 
Lemma 2 thus states that, if z is represented by any point outside or on 
the circle C, then \f(z) \>P. In other words, if \f{z) |£P, the point 
representing z is inside circle C. In particular, the point representing z' 
is inside circle C. 

In view of the preceding section on minimum value, we have 

G(x h yi) £ (?(x, y) 

for all pairs of real numbers x, y for which x l -\-y 2 S.R 2 ) where xi, yi is one 
such pair. Write z\ for xi+iyi. Since \f(z) | 2 =G(x, y), we have 

for all z's represented by points on or within circle C. Since & is repre- 
sented by such a point, 

(2) I /(*l) 1^1/(2') l<P. 

This number z\ is a root of f(z) =0. For, if f(z x ) ?*0, Lemma 3 shows 
that there would exist a complex number z for which 

(3) l/«KI/(*i)|. 

Then | f(z) \ <P by (2), so that the point representing z is inside circle C, 
as shown above. By the statement preceding (2), 

I/(*1)I^I/(Z).|. 

But this contradicts (3). Hence the fundamental theorem is proved. 

1 Harkness and Morley, Introduction to the Theory of Analytic Functions, p. 79, 
prove that a real function of two variables which is continuous thoroughout a closed 
region has a minimum value at some point of the region. 



ANSWERS 



Pages 2, 3 



i. 3t\ a. 2. 3. -20+20i. 4. -f 5- (8+2^3). 

6. i(6+V5)+i(2V5-3)t. 7 . ^+gt 8. ^+^ ">. Yes. 
13. 3, 4 and -3, -4. 14. ±(5+6i). 15. =fc(3-2i). 16. ±[c+d+(c-d)i]. 

Pages 6, 7 

2. —3, — 3«, — 3«*; i, «i; o)*i;_/2=cos 40°+t sin 40°, «#, o^JB. 

3. ±(l+i)/V2; ±(l T t)/^2; =fc «*. 

Page 9 

4, -1, cos A +i sin A (A =36°, 108°, 252°, 324°). 6. &, «•, fl». 

Page 10 

5. p(p-l). 6. (p-l)(g-l)(r-l) if n=pgr. 

Page 13 
z. 51. 2. 13. 

Page 15 

1. Rem. 11, quot. x*+5x+8. ' 2. -61, 2x<-4x*+7a;*-14a;+30. 

3. -0.050671, z'+6.09z +10.5481. 4. z'-x-6, z+2; 4,3, -2. 

5. z*-x-6=0, 3, -2. 6. 2±V5. 7. 2x*-z+2. 8. s*+l. 

Page 17 

1. s»-3x*+2x=0. 2. x 4 -5zH-4=0. 3. x 4 - 18s* +81=0. 

4. z 4 -5z»+9z 2 -7z+2=0. 5. 6*=4oc. 7. By theorem in §18. 

159 



160 ANSWERS 

Page 19 

i. x*-6x l +ll3-6=0. 2. x«-8x*+16=0. 

3. 1, 2. 5. 4, f, -f. 6. 1, 3, 5. 7. 1, 1, 1, 3. 

8. 2, -6, 18. 9. -3, 1, 5. 10. 5, 2, -1, -4. 

n. y i -(p t -2q)y+q t =0. 12. y , -(p 8 -3pg)y+g 8 =0. 

*3« (0 2/*-2/(P 3 -3pg)/g+g=0. 14. p*r=q*. 

(«) y'-^P'-^to+^-O. 15. 2,4, -6. 

(iii) 2/ , -(P+p/g)2/+2+«+l/«=0. 

Page 20 

I. 5, -1±V^3. 2. l±i, l=bV2. 

3. x 3 -7s»+19z-13=0. _ 4. 4, 1 - V-5, 3* -6x* +14* -24=0. 

6. ±1,2±V3. 7- V3, 2±i. 9. x'-$c'-{g+£-0. 
10. 2 + V3, x , +2x+2=0. 11,12. Not necessarily. 13. No. 

Page 23 
1. 19J, 3. 2. 6. 3- 2. 4. 3. 5. 0, -7, -$. 

Page 25 
1. -1, -1, -6. 2. -2, 3, 4. 3. 1, 3, 6. 4. -2, -4. 5. None. 

Page 27 
1. 2, -1, -4, 5. 2. 9. 3. 8, 9. 4. -12, -35. 5. 2, 2, -3. 

Page 28 

1. 1, 3, 9, J. 2. 1, J, J. 3. -J. 4. J, -J, -J. 5. J, -i, J. 6. -J, J, i. 

7. J. 8. J. 10. x»-12x-12=0. 11. z 3 -3x*-12z+54=0. 

Page 30 
j. 1,4. 2. -1, -4. 3. 0.7,-5.7. 4. -0.7,5.7. 5. 2,2. 6. Imaginary. 

Page 40 

5. z*+z<-4x 3 -3z*+3z+l=0. 6. -i(l±y/~^3) } J(7±V45J. 

10. See (11), §32. 

11. Edges roots of x i —7x 2 -\-l2x—v =0, all real (§45) and irra tional. 
14. A = area, c = hypotenuse, squares of legs i(c* ± V& — 16 A'). 



15. A area, a, b given sides, square third side is a 2 +6 2 ±2Va 2 6 1 — 4a*. 

16. y 4 -22/*+(2-0*)i/ a -22/+l=O, po S . roots 0.09125, 10.95862. 



ANSWERS 161 

Page 44 

3» g =*2, R+R*+R l *+R\ etc., z*+z*-4z+l =0. _ 

4* g=2, #+#«, Rt+RijJt'+RK 5- i(l± V-3), J(-5=fcV21). 

*• -1, 2±V3, J±iV-3. 7- 1, 1, 1, -1, i(l=fcV-15). 

8. -1, -2, -J,J(-5±V-11). 

Page 46 

1. -5, |(5dbV33). 2. -6, ±Vl3. 3# -2, l±i. 4. i+(- 2±vC3). 

Page 48 

1. A= -400, one. 2. A =4 -27 -121, three. 3. A=0, two. 4. A=*0, two. 

Page 49 

1. -4, 2± V3. 2. See Ex. 1, §47. 

3. 1.3569 4. -1.201639 5. 1.24698 6. 1.1642 
1.6920 1.330058 -1.80194 -1.7729 

-3.0489. -3.128419. -0.44504. -3.3914. 

Page 51 

1. 1,-1,_4±V6. 2. -1, -2,2,3. 3. ld=i, -ld=V2. 

4. l=fcV2, -l±V-2. 5. 4, -2, -l±i. 

Page 54 (bottom) 

1. (-3,9). 2. A= -250000, x= 3, -2, ±i. 

3. (3, 9), (-2, 4). 4. h =3. 5. 6.856, 7. 

Pages 59, 60 

2. 2.1. 3. (-0.845, 4.921), (-3.155, 11.079); between -4 and -5. 

4. 1.1, -1.3 5. Between and 1, and -1, 2.5 and 3, -2.5 and -3. 
9. 120(x*+x), 120x*-42. 

Page 62 

1. 3. 2. 2, -2. 3. —1. 4. Double roots, 1, 3. 5. None. 6. 3,3, —3,6. 

Pages 64, 65 

3. Use Ex. 3, p. 62, abscissas -1, 3. 4. Use Ex. 2, p. 62. 
6. i/=-15x-7, X*-15X+23=0. 



162 ANSWERS 

Page 66 

i. One real. _ 2. (± Vf, 7Ti^Vf), three real 

3. (±V$, -l=F|Vf), three. 4. (-2± V5, 23=F10V5), one. 

Pages 74, 75 
13. y*+2y*+5y*+Sy*-2y-9=0. 14. y*+152/ l +52y-36«0. 

* 

Page 78 
i. One, between —2 and —3. 2. One, between 1 and 2. 

Pages 79, 80 

1. (-4, -3), (-2, -I), (1, 2). 2. (-2,-1), (0, 1). 
3. (-2, -1.5), (-1.5, -1), (3, 4). 4. (-2, -1), (0, 1). 
S (-7, -6), (1,2). 6. (0, 1), (3, 4). 

Page 83 

2. 1, 1, 1, 2. 3. 1, 1, —2, -2. 4. 1, 1, two imaginary. 

Page 85 

1. (-2, -1), (0, 1), (1, 2). 2. (-4, -3), (-2, -1), (1, 2). 

Pages 89, 90 

1. Single, -2.46955. 2. -1.20164, 1.33006, -3.12842. 

3. 1.24698, -1.80194, -0.44504. 4. ±2.1213203, 2.1231056, -6.1231056 
5. 3.45592, 21.43067. 6. 2.15443. 

7. -1.7728656, 1.1642479, -3.3913823. 

8. 3.0489173, -1.3568958, -1.6920215. 

9. 2.24004099. 10. 1.997997997. 
n. 1.094551482. 12. 2.059, -1.228. 

13. 1.2261. 14. 0.6527= reciprocal of 2 cos 40°. 

15. 0.9397. 16. 1.3500. 17. 2.7138, 3.3840. 

18. 5.46%. 19. 5.57%. 20. 9.70%. 

Page 94 
1. 2. 2. 3. 



ANSWERS 163 

Page 96 
1. 2.24004099. 2. 2.3593041. 3, 1.097998. 

Page 98 

1 132° 20.7'. 2. 157° 12'. 3. 4.8425364. 4. 3.1668771. 

5, 7. 15° 16*', 85° 56*', 212° 49', 225° 57'. 

6. 72° 17'. 8. 5° 56*', 25° 18'. 9. 2.5541949. 10. 1.85718. 

Page 99 

1. -1.04727 ±1.13594 t. 2. -f±|V3i. 

3. -lit'. 4. lit, l±2i. 5. 2±i, ±2t. 

Page 100 

1. 217° 12' 27.4" =3.790988 radians. 2. 42° 20' 471" doubled. 

3. 133° 33.8'. 4. 108° 36' 14". 5. 21.468212. 

6. Angle at center 47° 39' 13". 7. 49° 17' 36.5". 

8. 1.4303 x, 2.4590 x, 3.4709 x; 257° 27' 12.225" more exact than first. 

9. x/tt =0.6625, 1.891, 2.930, 3.948, 4.959. 

10. (t) 0.327739, 0.339224, 1124.333037. 

(it) 0.250279, 0.894609, 1.127839. [Set x = 1 +y, y = l/z and solve by trigo- 
nometry.] 

11. 3.597285. 12. 10, 1.371288. 
13. 0. 326878, 12.267305. 14. 324° 16' 29.55". 

15. 10 yr. 4 mo. days. 16. 6.074674. 17. 6.13%. 

# Page 102 

1. x=5,y=6. 2. x=2,y = l. 3. a?=*a,y=0. 

Page 106 

Page 112 
3« —o. 4* ""8. 

Page 115 

1. s= -8, y=-7, z=26. 2. s=3, y=-5, z«2. 

3. 3=6, y=3, 2 = 12. 4. a? =5, y=4. z=3. 

S» s= — 5, y=3, z=2, t0 = l. 6. s = l, y=z=0, u> = — 1. 



164 ANSWERS 

Page 119 

1. Consistent: y- — 8/7— 2x, s-5/7 (common line). 

a. Inconsistent, case (0). 3. Inconsistent (two parallel planes). 

4. Consistent (single plane). 

5. (i) s« — x— y— 2; (u) inconsistent," (ui) x= — — -, y-z = — -. j 

o+J a-f-2 

* ,.x (A;-ft)(c-fc) ,... A;-c a-k \ 

6. (1) x-7 — -; (w) y« x, 2= if k=a or A; = c, but mcon- 

(a-b)(c-a) a-c a-c 

sistent if A; is different from a and c; (Hi) z — 1 — x — y if A; =0, inconsistent if fcs^a. 

Page 120 

1. r=2,x : y :s= — 4 : 1 : 1. 2. r=2, x :y :z = —10: 8 : 7. 

3. r — 1, two unknowns arbitrary. 4. r =3, x : y : z : w =6 : 3 : 12 :1. 

5. r-2, z=-±£x-*Py, w =-*£x-±JV- 

Page 121 

1. Ranks of A and 5 are 2; y= -8/7 -2x, «=5/7. 

—5 3 45 

2. Consistent only when a — — 225/61 and then x = -^7-, y = — , z = — . 

01 ol ol 

3. Rank of A is 2, rank of B is 3, inconsistent. 

4. A and £ of rank 2, x =3, y =2. 

Pages 126, 127 % 

fc(6-fc)(c-fc)(fc+6+c) ... " 

1. x=— - — w . . . m " <*, 6, c are distinct and not zero and their 

a(6-a)(c-a)(a+6+c) 

sum 5^0. If a =6r^c, acs^O, equations are inconsistent unless A; =0, o, c, or —a — c, j 

A4 . fc(c-fc) Jb(t-a) ... j 

and then y =— r -x, z = — r, x arbitrary. 

a(c-a) c(c—a) 

3. (o-6)(6-c)(c-o). 4. (x-y)(2/-z)(z-x)(a^+y2-far). 

6. (a+6+c+d) (a+6-c-d) (a-6-c+d) (a-6+c-d). 

7. (a+6+c+d) (a— 6+c— (0 (a+&i — c— di) (a—bi—c+di). 

n 

11. X/ = (fc— Oy) . . . (fc« — a/)-T-II (a s —aj). 

«=1 

•p*/ 

12. x(ab+ac+bc) = —ate. 



ANSWERS 



165 



Pages 133, 134 



pi-Sptq+lypr+q- 



i. 

r-pq 

5. 2p*-2q. 

3p *g* — 4p*r r-4g 8 — 2pqr — 9r» 
7 * (r-w)« ' 

10. y=q+r/x. 

4x*+px+ q 



2. 



(5p«-12g)(p«-4g) 
4(p» — 4pg+8r) 
6. 24r-p*. 

8. 27r*-9pgr+2g»=0. 



13 



11. s = 



1-PV 



13. 



2+22/' 
2g(p'+2pg-r) 
p*q—pr+8 



— 5p, see Ex. 17, 



Page 136 

3. 8 t =p*-2q, s 3 =p'-3pg, s 4 =p 4 -4p 2 g+2g*, «j=p*-5p«g+5pg*. 
4« «5n=5-3 n , «t=0 if A; is not divisible by 5. 5. All zero. 



Page 140 

2. See Ex. 2, p. 136. 

3. ^hc + VQ +e 6 -rf jc-VQ, Q=}c»-g*(i=0, 1,2,3,4). 

4. ^ic + VQ + e 7 -^ic-VQ, Q-Jc'-g' (i«0, 1, . . . , 6). 



1. Cj*— 2ciC 8 H-2c4. 
3. CiCj— 4c 4 . 



Page 141 

2. Ci*Cj-2c l *-CiC!+4c#. 
4. Cs'— 2dC4. 



Page 142 

1. Cic 8 -4c 4 ifn>3, CiC 8 ifn=3. 2. 3ciC4-C8d-5d. 

3. c*c 4 -4cic 6 +9c 6 . 4. Ci , -2cjC4+2ciC J -2c«. 

S y 3 -(P*-2g)y*+(g«-2pr)y-r»=0. 

6. y'-gj/'+pry-r'^O. 7. ry»+2gy»+4py+8=0. 

8. Eliminate x by y =«* -a*. 9. Use p* -g+pa? =y. 

10. -4+pr/s. 11. (r«-pr J +2pgs)/«*. 

12. (t) Sa«6«cSd-2s a S 6 « c+d +2SSa8ft+ c + < f + S8 a+ft 8 c+d -6«« +ft+c+ tf. 
(tt) ^ T (8a 4 -65 a 2 «2a+8«aSjaH-38|« , -6«4a).