GIFT OF
MATVU
STAT.
SEMICENTENNIAL PUBLICATIONS
OF THE
UNIVERSITY OF CALIFORNIA
THE THEORY OF
THE RELATIVITY OF MOTION
BY
RICHARD CHACE TOLMAN
UNIVERSITY OF CALIFORNIA PRESS
BERKELEY
1918
SEMICENTENNIAL PUBLICATIONS
OF THE
UNIVERSITY OF CALIFORNIA
1868-1918
THE THEORY OF
THE RELATIVITY OF MOTION
BY
RICHARD C. TOLMAN
UNIVERSITY OF CALIFORNIA PRESS
BERKELEY
1917
PRESS OF
THE NEW ERA PRINTING COMPANY
LANCASTER. PA
TO
H. E.
383389
THE THEORY OF THE RELATIVITY OF MOTION.
BY
RICHARD C. TOLMAN, PH.D.
TABLE OF CONTENTS.
PREFACE 1
CHAPTER I. Historical Development of Ideas as to the Nature of Space and
Time 5
Part I. The Space and Time of Galileo and Newton 5
Newtonian Time 6
Newtonian Space 7
The Galileo Transformation Equations 9
Part II. The Space and Time of the Ether Theory 10
Rise of the Ether Theory 10
Idea of a Stationary Ether 12
Ether in the Neighborhood of Moving Bodies 12
Ether Entrained in Dielectrics 13
The Lorentz Theory of a Stationary Ether 13
Part III. Rise of the Einstein Theory of Relativity 17
The Michelson-Morley Experiment 17
The Postulates of Einstein 18
CHAPTER II. The Two Postulates 01 the Einstein Theory of Relativity 20
The First Postulate of Relativity 20
The Second Postulate of the Einstein Theory of Relativity 21
Suggested Alternative to the Postulate of the Independence of the
Velocity of Light and the Velocity of the Source 23
Evidence against Emission Theories of Light 24
Different Forms of Emission Theory 25
Further Postulates of the Theory of Relativity 27
CHAPTER III. Some Elementary Deductions 28
Measurements of Time in a Moving System 28
Measurements of Length in a Moving System 30
The Setting of Clocks in a Moving System 33
The Composition of Velocities
The Mass of a Moving Body 37
The Relation between Mass and Energy 39
CHAPTER IV. The Einstein Transformation Equations for Space and Time . . 42
The Lorentz Transformation 42
Deduction of the Fundamental Transformation Equations 43
The Three Conditions to be Fulfilled 44
The Transformation Equations
Further Transformation Equations 47
Transformation Equations for Velocity
Transformation Equations for the Function — = 47
vi Table of Contents.
Transformation Equations for Acceleration 48
CHAPTER V. Kinematical Applications 49
The Kinematical Shape of a Rigid Body 49
The Kinematical Rate of a Clock 50
The Idea of Simultaneity 51
The Composition of Velocities 52
The Case of Parallel Velocities 52
Composition of Velocities in General 53
Velocities Greater than that of Light 54
Applications to Optical Problems 56
The Doppler Effect 57
The Aberration of Light 59
Velocity of Light in Moving Media 60
Group Velocity 61
CHAPTER VI. The Dynamics of a Particle 62
The Laws of Motion 62
Difference between Newtonian and Relativity Mechanics 62
The Mass of a Moving Particle 63
Transverse Collision 63
Mass the Same in all Directions 66
Longitudinal Collision : 67
Collision of any Type 68
Transformation Equations for Mass 72
The Force Acting on a Moving Particle 73
Transformation Equations for Force 73
The Relation between Force and Acceleration 74
Transverse and Longitudinal Acceleration 76
The Force Exerted by a Moving Charge 77
The Field around a Moving Charge 79
Application to a Specific Problem 80
Work 81
Kinetic Energy 81
Potential Energy 82
The Relation between Mass and Energy 83
Application to a Specific Problem * 85
CHAPTER VII. The Dynamics of a System of Particles 88
On the Nature of a System of Particles 88
The Conservation of Momentum 89
The Equation of Angular Momentum 90
The Function T 92
The Modified Lagrangian Function 93
The Principle of Least Action 93
Lagrange's Equations 95
Equations of Motion in the Hamiltonian Form 96
Value of the Function T' 97
The Principle of the Conservation of Energy 99
On the Location of Energy in Space 100
Table of Contents. vii
CHAPTER VIII. The Chaotic Motion of a System of Particles 102
The Equations of Motion 102
Representation in Generalized Space 103
Liouville's Theorem .103
A System of Particles • 104
Probability of a Given Statistical State 105
Equilibrium Relations 106
The Energy as a Function of the Momentum 108
The Distribution Law 109
Polar Coordinates
The Law of Equipartition
Criterion for Equality of Temperature 112
Pressure Exerted by a System of Particles . .
The Relativity Expression for Temperature
The Partition of Energy
Partition of Energy for Zero Mass
Approximate Partition for Particles of any Mass 118
CHAPTER IX. The Principle of Relativity and the Principle of Least Action . 121
The Principle of Least Action 121
The Equations of Motion in the Lagrangian Form 122
Introduction of the Principle of Relativity 124
Relation between f W'dt' and f Wdt .124
Relation between H' and H -127
CHAPTER X. The Dynamics of Elastic Bodies
On the Impossibility of Absolutely Rigid Bodies
Part I. Stress and Strain
Definition of Strain
Definition of Stress
Transformation Equations for Strain
Variation in the Strain
Part II. Introduction of the Principle of Least Action . .
The Kinetic Potential for an Elastic Body . .
Lagrange's Equations
Transformation Equations for Stress . .
Value of E°
The Equations of Motion in the Lagrangian Form
Density of Momentum
Density of Energy
Summary of Results from the Principle of Least Action
Part III. Some Mathematical Relations . .
The Unsymmetrical Stress Tensor t
The Symmetrical Tensor p
Relation between div t and t» . . .
The Equations of Motion in the Eulerian Form . . .
Part IV. Applications of the Results
Relation between Energy and Momentum
The Conservation of Momentum
viii Table of Contents.
The Conservation of Angular Momentum 150
Relation between Angular Momentum and the Unsymmetrical
Stress Tensor 151
The Right- Angled Lever 152
Isolated Systems in a Steady State 154
The Dynamics of a Particle 154
Conclusion 154
CHAPTER XI. The Dynamics of a Thermodynamic System '. 156
The Generalized Coordinates and Forces 156
Transformation Equation for Volume 156
Transformation Equation for Entropy 157
Introduction of the Principle of Least Action. The Kinetic
Potential 157
The Lagrangian Equations 158
Transformation Equation for Pressure 159
Transformation Equation for Temperature 159
The Equations of Motion for Quasistationary Adiabatic Accelera
tion 160
The Energy of a Moving Thermodynamic System 161
The Momentum of a Moving Thermodynamic System 161
The Dynamics of a Hohlraum 162
CHAPTER XII. Electromagnetic Theory 164
The Form of the Kinetic Potential 164
The Principle of Least Action : 165
The Partial Integrations 165
Derivation of the Fundamental Equations of Electromagnetic
Theory 166
The Transformation Equations for e, h and p 168
The Invariance of Electric Charge 170
The Relativity of Magnetic and Electric Fields 171
Nature of Electromotive Force 172
Derivation of the Fifth Fundamental Equation 172
Difference between the Ether and the Relativity Theories of Electro
magnetics 173
Applications to Electromagnetic Theory 176
The Electric and Magnetic Fields around a Moving Charge 176
The Energy of a Moving Electromagnetic System 178
Relation between Mass and Energy 180
The Theory of Moving Dielectrics 181
Relation between Field Equations for Material Media and
Electron Theory 182
Transformation Equations for Moving Media 183
Theory of the Wilson Experiment 186
CHAPTER XIII. Four-Dimensional Analysis 188
Idea of a Time Axis 188
Non-Euclidean Character of the Space - .... 189
Table of Contents. ix
Part I. Vector Analysis of the Non-Euclidean Four-Dimensional Mani
fold 191
Space, Time and Singular Vectors 192
Invariance of z2 + y2 + z2 — c-t2 192
Inner Product of One- Vectors 193
Non-Euclidean Angle 194
Kinematical Interpretation of Angle in Terms of Velocity 194
Vectors of Higher Dimensions 195
Outer Products 195
Inner Product of Vectors in General 198
The Complement of a Vector 198
The Vector Operator, O or Quad 199
Tensors 200
The Rotation of Axes 201
Interpretation of the Lorentz Transformation as a Rotation of
Axes 206
Graphical Representation 208
Part II. Applications of the Four-Dimensional Analysis 211
Kinematics 211
Extended Position 211
Extended Velocity 212
Extended Acceleration 213
The Velocity of Light .214
The Dynamics of a Particle 214
Extended Momentum 214
The Conservation Laws 215
The Dynamics of an Elastic Body 216
The Tensor of Extended Stress 216
The Equation of Motion 216
Electromagnetics 217
Extended Current 218
The Electromagnetic Vector M 217
The Field Equations 217
The Conservation of Electricity 218
The Product M-q
The Extended Tensor of Electromagnetic Stress 219
Combined Electrical and Mechanical Systems 221
Appendix I. Symbols for Quantities
Scalar Quantities
Vector Quantities 223
Appendix II. Vector Notation 224
Three Dimensional Space 224
Non-Euclidean Four Dimensional Space 225
PREFACE.
Thirty or forty years ago, in the field of physical science, there
was a widespread feeling that the days of adventurous discovery had
passed forever, and the conservative physicist was only too happy to
devote his life to the measurement to the sixth decimal place of
quantities whose significance for physical theory was already an -old
story. The passage of time, however, has completely upset such
bourgeois ideas as to the state of physical science, through the dis
covery of some most extraordinary experimental facts and the develop
ment of very fundamental theories for their explanation.
On the experimental side, the intervening years have seen the
discovery of radioactivity, the exhaustive study of the conduction of
electricity through gases, the accompanying discoveries of cathode,
canal and X-rays, the isolation of the electron, the study of the
distribution of energy in the hohlraum. and the final failure of all
attempts to detect the earth's motion through the supposititious
ether. During this same time, the theoretical physicist has been
working hand in hand with the experimenter endeavoring to correlate
the facts already discovered and to point the way to further research.
The theoretical achievements, which have been found particularly
helpful in performing these functions of explanation and prediction,
have been the development of the modern theory of electrons, the
application of thermodynamic and statistical reasoning to the phe
nomena of radiation, and the development of Einstein's brilliant
theory of the relativity of motion.
It has been the endeavor of the following book to present an
introduction to this theory of relativity, which in the decade since
the publication of Einstein's first paper in 1905 (Annalen der Physik)
has become a necessary part of the theoretical equipment of every
physicist. Even if we regard the Einstein theory of relativity merely
as a convenient tool for the prediction of electromagnetic and optical
phenomena, its importance to the physicist is very great, not only
because its introduction greatly simplifies the deduction of many
2 1
2 Preface.
theorems which were already familiar in the older theories based on a
stationary ether, but also because it leads simply and directly to cor
rect conclusions in the case of such experiments as those of Michelson
and Morley, Trouton and Noble, and Kaufman and Bucherer, which
can be made to agree with the idea of a stationary ether only by the
introduction of complicated and ad hoc assumptions. Regarded from
a more philosophical point of view, an acceptance of the Einstein
theory of relativity shows us the advisability of completely remodelling
some of our most fundamental ideas. In particular we shall now
do well to change our concepts of space and time in such a way as
to give up the old idea of their complete independence, a notion
which we have received as the inheritance of a long ancestral experience
with bodies moving with slow velocities, but which no longer proves
pragmatic when we deal with velocities approaching that of light.
The method of treatment adopted in the following chapters is
to a considerable extent original, partly appearing here for the first
time and partly already published elsewhere.* Chapter III follows
a method which was first developed by Lewis and Tolman,|and the
last chapter a method developed by Wilson and Lewis.J The writer
must also express his special obligations to the works of Einstein,
Planck, Poincare, Laue, Ishiwara and Laub.
It is hoped that the mode of presentation is one that will be found
well adapted not only to introduce the study of relativity theory to
those previously unfamiliar with the subject but also to provide the
necessary methodological equipment for those who wish to pursue
the theory into its more complicated applications.
After presenting, in the first chapter, a brief outline of the historical
development of ideas as to the nature of the space and time of science,
we consider, in Chapter II, the two main postulates upon which the
theory of relativity rests and discuss the direct experimental evidence
for their truth. The third chapter then presents an elementary and
* Philosophical Magazine, vol. 18, p. 510 (1909); Physical Review, vol. 31, p. 26
(1910); Phil. Mag., vol. 21, p. 296 (1911); ibid., vol. 22, p. 458 (1911); ibid., vol. 23,
p. 375 (1912); Phys. Rev., vol. 35, p. 136 (1912); Phil. Mag., vol. 25, p. 150 (1913);
ibid., vol. 28, p. 572 (1914); ibid., vol. 28, p. 583 (1914).
t Phil. Mag., vol. 18, p. 510 (1909).
J Proceedings of the American Academy of Arts and Sciences, vol. 48, p. 389
(1912).
Preface. 3
non-mathematical deduction of a number of the most important
consequences of the postulates of relativity, and it is hoped that this
chapter will prove especially valuable to readers without unusual
mathematical equipment, since they will there be able to obtain a
real grasp of such important new ideas as the change of mass with
velocity, the non-additivity of velocities, and the relation of mass
and energy, without encountering any mathematics beyond the
elements of analysis and geometry.
In Chapter IV we commence the more analytical treatment of
the theory of relativity by obtaining from the two postulates of
relativity Einstein's transformation equations for space and time as
well as transformation equations for velocities, accelerations, and
for an important function of the velocity. Chapter V presents
various kinematical applications of the theory of relativity following
quite closely Einstein's original method of development. In par
ticular we may call attention to the ease with which we may handle
the optics of moving media by the methods of the theory of relativity
as compared with the difficulty of treatment on the basis of the ether
theory.
In Chapters VI, VII and VIII we develop and apply a theory of
the dynamics of a particle which is based on the Einstein trans
formation equations for space and time, Newton's three laws of motion,
and the principle of the conservation of mass.
We then examine, in Chapter IX, the relation between the theory
of relativity and the principle of least action, and find it possible to
introduce the requirements of relativity theory at the very start into
this basic principle for physical science. We point out that we
might indeed have used this adapted form of the principle of least
action, for developing the dynamics of a particle, and then proceed
in Chapters X, XI and XII to develop the dynamics of an elastic
body, the dynamics of a thermodynamic system, and the dynamics
of an electromagnetic system, all on the basis of our adapted form
of the principle of least action.
Finally, in Chapter XIII, we consider a four-dimensional method
of expressing and treating the results of relativity theory. This
chapter contains, in Part I, an epitome of some of the more important
methods in four-dimensional vector analysis and it is hoped that it
4 Preface.
can also be used in connection with the earlier parts of the book as a
convenient reference for those who are not familiar with ordinary
three-dimensional vector analysis.
In the present book, the writer has confined his considerations to
cases in which there is a uniform relative velocity between systems of
coordinates. In the future it may be possible greatly to extend the
applications of the theory of relativity by considering accelerated
systems of coordinates, and in this connection Einstein's latest work
on the relation between gravity and acceleration is of great interest.
It does not seem wise, however, at the present time to include such
considerations in a book which intends to present a survey of accepted
theory.
The author will feel amply repaid for the work involved in the
preparation of the book if, through his efforts, some of the younger
American physicists can be helped to obtain a real knowledge of the
important work of Einstein. He is also glad to have this opportunity
to add his testimony to the growing conviction that the conceptual
space and time of science are not God-given and unalterable, but are
rather in the nature of human constructs devised for use in the de
scription and correlation of scientific phenomena, and that these
spatial and temporal concepts should be altered whenever the discovery
of new facts makes such a change pragmatic.
The writer wishes to express his indebtedness to Mr. William H.
Williams for assisting in the preparation of Chapter I.
CHAPTER I.
HISTORICAL DEVELOPMENT OF IDEAS AS TO THE NATURE OF
SPACE AND TIME.
1. Since the year 1905, which marked the publication of Einstein's
momentous article on the theory of relativity, the development of
scientific thought has led to a complete revolution in accepted ideas
as to the nature of space and time, and this revolution has in turn
profoundly modified those dependent sciences, in particular mechanics
and electromagnetics, which make use of these two fundamental
concepts in their considerations.
In the following pages it will be our endeavor to present a de
scription of these new notions as to the nature of space and time,*
and to give a partial account of the consequent modifications which
have been introduced into various fields of science. Before pro
ceeding to this task, however, we may well review those older ideas
as to space and time which until now appeared quite sufficient for
the correlation of scientific phenomena. We shall first consider the
space and time of Galileo and Newton which were employed in the
development of the classical mechanics, and then the space and time
of the ether theory of light.
PART I. THE SPACE AND TIME OF GALILEO AND NEWTON.
2. The publication in 1687 of Newton's Principia laid down so
satisfactory a foundation for further dynamical considerations, that
it seemed as though the ideas of Galileo and Newton as to the nature
of space and time, which were there employed, would certainly remain
forever suitable for the interpretation of natural phenomena. And
indeed upon this basis has been built the whole structure of classical
mechanics which, until our recent familiarity with very high velocities,
has been found completely satisfactory for an extremely large number
of very diverse dynamical considerations.
* Throughout this work by "space" and "time" we shall mean the conceptual
space and time of science.
5
6 Chapter One.
An examination of the fundamental laws of mechanics will show
how the concepts of space and time entered into the Newtonian
system of mechanics. Newton's laws of motion, from which the
whole of the classical mechanics could be derived, can best be stated
with the help of the equation
P-|(«u). (1)
This equation defines the force F acting on a particle as equal to the
rate of change in its momentum (i. e., the product of its mass m and
its velocity u), and the whole of Newton's laws of motion may be
summed up in the statement that in the case of two interacting par
ticles the forces which they mutually exert on each other are equal in
magnitude and opposite in direction.
Since in Newtonian mechanics the mass of a particle is assumed
constant, equation (1) may be more conveniently written
du d (dr\
F = - = - -
or
m ,
dt dt\dt J
d dx
'-4(1).
and this definition of force, together with the above-stated principle
of the equality of action and reaction, forms the starting-point for
the whole of classical mechanics.
The necessary dependence of this mechanics upon the concepts
of space and time becomes quite evident on an examination of this
fundamental equation (2), in which the expression for the force acting
on a particle is seen to contain both the variables x, y, and z, which
specify the position of the particle in space, and the variable tt which
specifies the time.
3. Newtonian Time. To attempt a definite statement as to the
Historical Development. 7
meaning of so fundamental and underlying a notion as that of time
is a task from which even philosophy may shrink. In a general
way, conceptual time may be thought of as a one-dimensional, uni
directional, one-valued continuum. This continuum is a sort of frame
work in which the instants at which actual occurrences take place
find an ordered position. Distances from point to point in the
continuum, that is intervals of time, are measured by the periods of
certain continually recurring cyclic processes such as the daily rota
tion of the earth. A unidirectional nature is imposed upon the time
continuum among other things by an acceptance of the second law
of thermodynamics, which requires that actual progression in time
shall be accompanied by an increase in the entropy of the material
world, and this same law requires that the continuum shall be one-
valued since it excludes the possibility that time ever returns upon
itself, either to commence a new cycle or to intersect its former path
even at a single point.
In addition to these characteristics of the time continuum, which
have been in no way modified by the theory of relativity, the New
tonian mechanics always assumed a complete independence of time and
the three-dimensional space continuum which exists along with it.
In dynamical equations time entered as an entirely independent vari
able in no way connected with the variables whose specification
determines position in space. In the following pages, however, we
shall find that the theory of relativity requires a very definite inter
relation between time and space, and in the Einstein transformation
equations we shall see the exact way in which measurements of time
depend upon the choice of a set of variables for measuring position
in space.
4. Newtonian Space. An exact description of the concept of
space is perhaps just as difficult as a description of the concept of time.
In a general way we think of space as a three-dimensional, homo
geneous, isotropic continuum, and these ideas are common to the
conceptual spaces of Newton, Einstein, and the ether theory of light.
The space of Newton, however, differs on the one hand from that of
Einstein because of a tacit assumption of the complete independence
of space and time measurements; and differs on the other hand from
that of the ether theory of light by the fact that " free " space was
8 Chapter One.
assumed completely empty instead of filled with an all-pervading
quasi-material medium — the ether. A more definite idea of the par
ticularly important characteristics of the Newtonian concept of space
may be obtained by considering somewhat in detail the actual methods
of space measurement.
Positions in space are in general measured with respect to some
arbitrarily fixed system of reference which must be threefold in
character corresponding to the three dimensions of space. In par
ticular we may make use of a set of Cartesian axes and determine,
for example, the position of a particle by specifying its three Cartesian
coordinates x, y and z.
In Newtonian mechanics the particular set of axes chosen for
specifying position in space has in general been determined in the
first instance by considerations of convenience. For example, it is
found by experience that, if we take as a reference system lines drawn
upon the surface of the earth, the equations of motion based on New
ton's laws give us a simple description of nearly all dynamical phe
nomena which are merely terrestrial. When, however, we try to
interpret with these same axes the motion of the heavenly bodies, we
meet difficulties, and the problem is simplified, so far as planetary
motions are concerned, by taking a new reference system determined
by the sun and the fixed stars. But this system, in its turn, becomes
somewhat unsatisfactory when we take account of the observed
motions of the stars themselves, and it is finally convenient to take a
reference system relative to which the sun is moving with a velocity
of twelve miles per second in the direction of the constellation Hercules.
This system of axes is so chosen that the great majority of stars have
on the average no motion with respect to it, and the actual motion
of any particular star with respect to these coordinates is called the
peculiar motion of the star.
Suppose, now, we have a number of such systems of axes in uni
form relative motion; we are confronted by the problem of finding
some method of transposing the description of a given kinematical
occurrence from the variables of one of these sets of axes to those of
another. For example, if we have chosen a system of axes S and
have found an equation in x, y, z, and t which accurately describes the
motion of a given point, what substitutions for the quantities involved
Historical Development. 9
can be made so that the new equation thereby obtained will again
correctly describe the same phenomena when we measure the dis
placements of the point relative to a new system of reference S'
which is in uniform motion with respect to £? The assumption of
Galileo and Newton that " free " space is entirely empty, and the
further tacit assumption of the complete independence of space and
time, led them to propose a very simple solution of the problem, and
the transformation equations which they used are generally called
the Galileo Transformation Equations to distinguish them from the
Einstein Transformation Equations which we shall later consider.
5. The Galileo Transformation Equations. Consider two systems
of right-angled coordinates, S and S', which are in relative motion in
the X direction with the velocity V; for convenience let the X axes,
OX and O'X', of the two systems coincide in direction, and for further
simplification let us take as our zero point for time measurements the
instant when the two origins 0 and 0' coincide. Consider now a
point which at the time t has the coordinates x, y and z measured in
system S. Then, according to the space and time considerations of
Galileo and Newton, the coordinates of the point with reference to
system S' are given by the following transformation equations :
x' = x - Vt, (3)
y' = y, (4)
(5)
(6)
These equations are fundamental for Newtonian mechanics, and may
appear to the casual observer to be self-evident and bound up with
necessary ideas as to the nature of space and time. Nevertheless,
the truth of the first and the last of these equations is absolutely
dependent on the unsupported assumption of the complete inde
pendence of space and time measurements, and since in the Einstein
theory we shall find a very definite relation between space and time
measurements we shall be led to quite a different set of transformation
equations. Relations (3), (4), (5) and (6) will be found, however, to
be the limiting form which the correct transformation equations as
sume when the velocity between the systems V becomes small com-
10 Chapter One.
pared with that of light. Since until very recent times the human
race in its entire past history has been familiar only with velocities
that are small compared with that of light, it need not cause surprise
that the above equations, which are true merely at the limit, should
appear so self-evident.
6. Before leaving the discussion of the space and time system of
Newton and Galileo we must call attention to an important charac
teristic which it has in common with the system of Einstein but
which is not a feature of that assumed by the ether theory. If we
have two systems of axes such as those we have just been considering,
we may with equal right consider either one of them at rest and the
other moving past it. All we can say is that the two systems are in
relative 'motion; it is meaningless to speak of either one as in any
sense " absolutely " at rest. The equation x' = x — Vt which we
use in transforming the description of a kinematical event from the
variables of system S to those of system S' is perfectly symmetrical
with the equation x = x' + Vt' which we should use for a trans
formation in the reverse direction. Of all possible systems no par
ticular set of axes holds a unique position among the others. We
shall later find that this important principle of the relativity of motion
is permanently incorporated into our system of physical science as
the first postulate of relativity. This principle, common both to the
space of Newton and to that of Einstein, is not characteristic of the
space assumed by the classical theory of light. The space of this
theory was supposed to be filled with a stationary medium, the
luminiferous ether, and a system of axes stationary with respect to
this ether would hold a unique position among the other systems
and be the one peculiarly adapted for use as the ultimate system of
reference for the measurement of motions.
We may now briefly sketch the rise of the ether theory of light and
point out the permanent contribution which it has made to physical
science, a contribution which is now codified as the second postulate
of relativity.
PART II. THE SPACE AND TIME OF THE ETHER THEORY.
7. Rise of the Ether Theory. Twelve years before the appearance
of the Principia, Homer, a Danish astronomer, observed that an
Historical Development. 11
eclipse of one of the satellites of Jupiter occurred some ten minutes
later than the time predicted for the event from the known period
of the satellite and the time of the preceding eclipse. He explained
this delay by the hypothesis that it took light twenty-two minutes
to travel across the earth's orbit. Previous to Romer's discovery,
light was generally supposed to travel with infinite velocity. Indeed
Galileo had endeavored to find the speed of light by direct experiments
over distances of a few miles and had failed to detect any lapse of
time between the emission of a light flash from a source and its ob
servation by a distant observer. Romer's hypothesis has been re
peatedly verified and the speed of light measured by different methods
with considerable exactness. The mean of the later determinations
is 2.9986 X 1010 cm. per second.
8. At the time of Romer's discovery there was much discussion
as to the nature of light. Newton's theory that it consisted of par
ticles or corpuscles thrown out by a luminous body was attacked by
Hooke and later by Huygens, who advanced the view that it was
something in the nature of wave motions in a supposed space-filling
medium or ether. By this theory Huygens was able to explain
reflection and refraction and the phenomena of color, but assuming
longitudinal vibrations he was unable to account for polarization.
Diffraction had not yet been observed and Newton contested the
Hooke-Huygens theory chiefly on the grounds that it was contra
dicted by the fact of rectilinear propagation and the formation of
shadows. The scientific prestige of Newton was so great that the
emission or corpuscular theory continued to hold its ground for a
hundred and fifty years. Even the masterly researches of Thomas
Young at the beginning of the nineteenth century were unable to
dislodge the old theory, and it was not until the French physicist,
Fresnel, about 1815, was independently led to an undulatory theory
and added to Young's arguments the weight of his more searching
mathematical analysis, that the balance began to turn. From this
time on the wave theory grew in power and for a period of eighty
years was not seriously questioned. This theory has for its essential
postulate the existence of an all-pervading medium, the ether, in
which wave disturbances can be set up and propagated. And the
physical properties of this medium became an enticing field of inquiry
and speculation.
12 Chapter One.
9. Idea of a Stationary Ether. Of all the various properties with
which the physicist found it necessary to endow the ether, for us the
most important is the fact that it must apparently remain stationary,
unaffected by the motion of matter through it. This conclusion was
finally reached through several lines of investigation. We may first
consider whether the ether would be dragged along by the motion of
nearby masses of matter, and, second, whether the ether enclosed in a
moving medium such as water or glass would partake in the latter's
motion.
10. Ether in the Neighborhood of Moving Bodies. About the
year 1725 the astronomer Bradley, in his efforts to measure the
parallax of certain fixed stars, discovered that the apparent position
of a star continually changes in such a way as to trace annually a
small ellipse in the sky, the apparent position always lying in the
plane determined by the line from the earth to the center of the
ellipse and by the direction of the earth's motion. On the corpuscular
theory of light this admits of ready explanation as Bradley himself
discovered, since we should expect the earth's motion to produce an
apparent change in the direction of the oncoming light, in just the
same way that the motion of a railway train makes the falling drops
of rain take a slanting path across the window pane. If c be the
velocity of a light particle and v the earth's velocity, the apparent or
relative velocity would be c — v and the tangent of the angle of
aberration would be - .
c
Upon the wave theory, it is obvious that we should also expect a
similar aberration of light, provided only that the ether shall be
quite stationary and unaffected by the motion of the earth through it,
and this is one of the important reasons that most ether theories have
assumed a stationary ether unaffected by the motion of neighboring
matter. *
In more recent years further experimental evidence for assuming
that the ether is not dragged along by the neighboring motion of
large masses of matter was found by Sir Oliver Lodge. His final
experiments were performed with a large rotating spheroid of iron
* The most notable exception is the theory of Stokes, which did assume that
the ether moved along with the earth and then tried to account for aberration with
the help of a velocity potential, but this led to difficulties, as was shown by Lorentz.
Historical Development. 13
with a narrow groove around its equator, which was made the path
for two rays of light, one travelling in the direction of rotation and
the other in the opposite direction. Since by interference methods
no difference could be detected in the velocities of the two rays, here
also the conclusion was reached that the ether was not appreciably
dragged along by the rotating metal.
11. Ether Entrained in Dielectrics. With regard to the action of
a moving medium on the ether which might be entrained within it,
experimental evidence and theoretical consideration here too finally
led to the supposition that the ether itself must remain perfectly
stationary. The earlier view first expressed by Fresnel, in a letter
written to Arago in 1818, was that the entrained ether did receive a
fraction of the total velocity of the moving medium. Fresnel gave
2 "1
to this fraction the value — , where AC is the index of refraction of
M2
the substance forming the medium. On this supposition, Fresnel
was able to account for the fact that Arago's experiments upon the
reflection and refraction of stellar rays show no influence whatever
of the earth's motion, and for the fact that Airy found the same angle
of aberration with a telescope filled with water as with air. More
over, the later work of Fizeau and the accurate determinations of
Michelson and Morley on the velocity of light in a moving stream
of water did show that the speed was changed by an amount corre
sponding to Fresnel's fraction. The fuller theoretical investigations
of Lorentz, however, did not lead scientists to look upon this increased
velocity of light in a moving medium as an evidence that the ether
is pulled along by the stream of water, and we may now briefly sketch
the developments which culminated in the Lorentz theory of a com
pletely stationary ether.
12. The Lorentz Theory of a Stationary Ether. The considera
tions of Lorentz as to the velocity of light in moving media became
possible only after it was evident that optics itself is a branch of the
wider science of electromagnetics, and it became possible to treat
transparent media as a special case of dielectrics in general. In 1873,
in his Treatise on Electricity and Magnetism, Maxwell first advanced
the theory that electromagnetic phenomena also have their seat in
the luminiferous ether and further that light itself is merely an electro-
14 Chapter One.
magnetic disturbance in that medium, and Maxwell's theory was
confirmed by the actual discovery of electromagnetic waves in 1888
by Hertz.
The attack upon the problem of the relative motion of matter and
ether was now renewed with great vigor both theoretically and experi
mentally from the electromagnetic side. Maxwell in his treatise had
confined himself to phenomena in stationar}^ media. Hertz, however,
extended Maxwell's considerations to moving matter on the assump
tion that the entrained ether is carried bodily along by it. It is evi
dent, however, that in the field of optical theory such an assumption
could not be expected to account for the Fizeau experiment, which
had already been explained on the assumption that the ether receives
only a fraction of the velocity of the moving medium; while in the
field of electromagnetic theory it was found that Hertz's assumptions
would lead us to expect no production of a magnetic field in the
neighborhood of a rotating electric condenser providing the plates of
the condenser and the dielectric move together with the same speed
and this was decisively disproved by the experiment of Eichenwald.
The conclusions of the Hertz theory were also out of agreement with
the important experiments of H. A. Wilson on moving dielectrics.
It remained for Lorentz to develop a general theory for moving
dielectrics which was consistent with the facts.
The theory of Lorentz developed from that of Maxwell by the
addition of the idea of the electron, as the atom of electricity, and his
treatment is often called the " electron theory." This atomistic
conception of electricity was foreshadowed by Faraday's discovery
of the quantitative relations between the amount of electricity asso
ciated with chemical reactions in electrolytes and the weight of
substance involved, a relation which indicates that the atoms act as
carriers of electricity and that the quantity of electricity carried by a
single particle, whatever its nature, is always some small multiple of a
definite quantum of electricity, the electron. Since Faraday's time,
the study of the phenomena accompanying the conduction of elec
tricity through gases, the study of radioactivity, and finally indeed
the isolation and exact measurement of these atoms of electrical
charge, have led us to a very definite knowledge of many of the
properties of the electron.
Historical Development. 15
While the experimental physicists were at work obtaining this
more or less first-hand acquaintance with the electron, the theoretical
physicists and in particular Lorentz were increasingly successful in
explaining the electrical and optical properties of matter in general
on the basis of the behavior of the electrons which it contains, the
properties of conductors being accounted for by the presence of mov
able electrons, either free as in the case of metals or combined with
atoms to form ions as in electrolytes, while the electrical and optical
properties of dielectrics were ascribed to the presence of electrons
more or less bound by quasi-elastic forces to positions of equilibrium.
This Lorentz electron theory of matter has been developed in great
mathematical detail by Lorentz and has been substantiated by nu
merous quantitative experiments. Perhaps the greatest significance
of the Lorentz theory is that such properties of matter as electrical
conductivity, magnetic permeability and dielectric inductivity, which
occupied the position of rather accidental experimental constants in
Maxwell's original theory, are now explainable as the statistical result
of the behavior of the individual electrons.
With regard now to our original question as to the behavior of
moving optical and dielectric media, the Lorentz theory was found
capable of accounting quantitatively for all known phenomena, in
cluding Airy's experiment on aberration, Arago's experiments on the
reflection and refraction of stellar rays, FresneFs coefficient for the
velocity of light in moving media, and the electromagnetic experi
ments upon moving dielectrics made by Rontgen, Eichenwald, H. A.
Wilson, and others. For us the particular significance of the Lorentz
method of explaining these phenomena is that he does not assume, as
did Fresnel, that the ether is partially dragged along by moving
matter. His investigations show rather that the ether must remain
perfectly stationary, and that such phenomena as the changed velocity
of light in moving media are to be accounted for by the modifying
influence which the electrons in the moving matter have upon the
propagation of electromagnetic disturbances, rather than by a dragging
along of the ether itself.
Although it would not be proper in this place to present the
mathematical details of Lorentz 's treatment of moving media, we
may obtain a clearer idea of what is meant in the Lorentz theory by a
16 Chapter One.
stationary ether if we look for a moment at the five fundamental
equations upon which the theory rests. These familiar equations, of
which the first four are merely Maxwell's four field equations, modified
by the introduction of the idea of the electron, may be written
1 de u
i ah
curle= -caT'
div e = p,
div h = 0,
in which the letters have their usual significance. (See Chapter XII.)
Now the whole of the Lorentz theory, including of course his treat
ment of moving media, is derivable from these five equations, and
the fact that the idea of a stationary ether does lie at the basis of
his theory is most clearly shown by the first and last of these equa
tions, which contain the velocity u with which the charge in question
is moving, and for Lorentz this velocity is to be measured with respect
to the assumed stationary ether.
We have devoted this space to the Lorentz theory, since his work
marks the culmination of the ether theory of light and electromag-
netism, and for us the particularly significant fact is that by this
line of attack science was inevitably led to the idea of an absolutely
immovable and stationary ether.
13. We have thus briefly traced the development of the ether
theory of light and electromagnetism. We have seen that the space
continuum assumed by this theory is not empty as was the space of
Newton and Galileo but is assumed filled with a stationary medium,
the ether, and in conclusion should further point out that the time
continuum assumed by the ether theory was apparently the same as
that of Newton and Galileo, and in particular that the old ideas as to
the absolute independence of space and time were all retained.
Historical Development. 17
PART III. RISE OF THE EINSTEIN THEORY OF RELATIVITY.
14. The Michelson-Morley Experiment. In spite of all the brilli
ant achievements of the theory of a stationary ether, we must now
call attention to an experiment, performed at the very time when
the success of the ether theory seemed most complete, whose result
was in direct contradiction to its predictions. This is the celebrated
Michelson-Morley experiment, and to the masterful interpretation of
its consequences at the hands of Einstein we owe the whole theory of
relativity, a theory -which will nevermore permit us to assume that
space and time are independent.
If the theory of a stationary ether were true we should find, con
trary to the expectations of Newton, that systems of coordinates in
relative motion are not symmetrical, a system of axes fixed relatively
to the ether would hold a unique position among all other systems
moving relative to it and would be peculiarly adapted for the measure
ment of displacements and velocities. Bodies at rest with respect
to this system of axes fixed in the ether would be spoken of as " ab
solutely " at rest and bodies in motion through the ether would be
said to have " absolute " motion. From the point of view of the
ether theory one of the most important physical problems would be
to determine the velocity of various bodies, for example that of the
earth, through the ether.
Now the Michelson-Morley experiment was devised for the very
purpose of determining the relative motion of the earth and the ether.
The experiment consists essentially in a comparison of the velocities
of light parallel and perpendicular to the earth's motion in its orbit.
A ray of light from the source S falls on the half silvered mirror A,
where it is divided into two rays, one of which travels to the mirror B
and the other to the mirror C, where they are totally reflected. The
rays are recombined and produce a set of interference fringes at 0.
(See figure 1.)
We may now think of the apparatus as set so that one of the
divided paths is parallel to the earth's motion and the other per
pendicular to it. On the basis of the stationary ether theory, the
velocity of the light with reference to the apparatus would evidently
be different over the two paths, and hence on rotating the apparatus
3
18
Chapter One.
through an angle of ninety degrees we should expect a shift in the
position of the fringes. Knowing the magnitude of the earth's
velocity in its orbit and the dimensions of the apparatus, it is quite
possible to calculate the magnitude of the expected shift, a quantity
o
FIG. 1.
entirely susceptible of experimental determination. Nevertheless the
most careful experiments made at different times of day and at
different seasons of the year entirely failed to show any such shift
at all.
This result is in direct contradiction to the theory of a stationary
ether and could be reconciled with that theory only by very arbitrary
assumptions. Instead of making such assumptions, the Einstein
theory of relativity finds it preferable to return in part to the older
ideas of Newton and Galileo.
15. The Postulates of Einstein. In fact, in accordance with the
results of this work of Michelson-Morley and other confirmatory
experiments, the Einstein theory takes as its first postulate the idea
familiar to Newton of the relativity of all motion. It states that
there is nothing out in space in the nature of an ether or of a fixed
set of coordinates with regard to which motion can be measured,
that there is no such thing as absolute motion, and that all we can
speak of is the relative motion of one body with respect to another.
Historical Development. 19
Although we thus see that the Einstein theory of relativity has
returned in part to the ideas of Newton and Galileo as to the nature
of space, it is not to be supposed that the ether theory of light and
electromagnetism has made no lasting contribution to physical science.
Quite on the contrary, not only must the ideas as to the periodic and
polarizable nature of the light disturbance, which were first appre
ciated and understood with the help of the ether theory, always
remain incorporated in every optical theory, but in particular the
Einstein theory of relativity takes as the basis for its second postulate
a principle that has long been familiar to the ether theory, namely
that the velocity of light is independent of the velocity of the source.
We shall see in following chapters that it is the combination of this
principle with the first postulate of relativity that leads to the whole
theory of relativity and to our new ideas as to the nature and inter
relation of space and titne.
CHAPTER II.
THE TWO POSTULATES OF THE EINSTEIN THEORY OF
RELATIVITY.
16. There are two general methods of evaluating the theoretical
development of any branch of science. One of these methods is to
test by direct experiment the fundamental postulates upon which
the theory rests. If these postulates are found to agree with the facts,
we may feel justified in assuming that the whole theoretical structure
is a valid one, providing false logic or unsuspected and incorrect
assumptions have not later crept in to vitiate the conclusions. The
other method of testing a theory is to develop its interlacing chain of
propositions and theorems and examine the results both for their
internal coherence and for their objective validity. If we find that
the conclusions drawn from the theory are neither self-contradictory
nor contradictory of each other, and furthermore that they agree
with the facts of the external world, we may again feel that our theory
has achieved a measure of success. In the present chapter we shall
present the two main postulates of the theory of relativity, and indicate
the direct experimental evidence in favor of their truth. In following
chapters we shall develop the consequences of these postulates, show
that the system of consequences stands the test of internal coherence,
and wherever possible compare the predictions of the theory with
experimental facts.
The First Postulate of Relativity.
17. The first postulate of relativity as originally stated by Newton
was that it is impossible to measure or detect absolute translatory
motion through space. No objections have ever been made to this
statement of the postulate in its original form. In the development
of the theory of relativity, the postulate has been modified to include
the impossibility of detecting translatory motion through any medium
or ether which might be assumed to pervade space.
In support of the principle is the general fact that no effects due
to the motion of the earth or other body through the supposed ether
20
The Two Postulates. 21
have ever been observed. Of the many unsuccessful attempts to
detect the earth's motion through the ether we may call attention to
the experiments on the refraction of light made by Arago, Respighi,
Hoek, Ketteler and Mascart, the interference experiments of Ketteler
and Mascart, the work of Klinkerfuess and Haga on the position of
the absorption bands of sodium, the experiment of Nordmeyer on the
intensity of radiation, the experiments of Fizeau, Brace and Strasser
on the rotation of the plane of polarized light by transmission through
glass plates, the experiments of Mascart and of Rayleigh on the
rotation of the plane of polarized light in naturally active substances,
the electromagnetic experiments of Rontgen, Des Coudres, J. Koenigs-
berger, Trouton, Trouton and Noble, and Trouton and Rankine, and
finally the Michelson and Morley experiment, with the further work
of Morley and Miller. For details as to the nature of these experi
ments the reader may refer to the original articles or to an excellent
discussion by Laub of the experimental basis of the theory of rela
tivity. *
In none of the above investigations was it possible to detect any
effect attributable to the earth's motion through the ether. Never
theless a number of these experiments are in accord with the final
form given to the ether theory by Lorentz, especially since his work
satisfactorily accounts for the Fresnel coefficient for the changed
velocity of light in moving media. Others of the experiments men
tioned, however, could be made to accord with the Lorentz theory
only by very arbitrary assumptions, in particular those of Michelson
and Morley, Mascart and Rayleigh, and Trouton and Noble. For
the purposes of our discussion we shall accept the principle of the
relativity of motion as an experimental fact.
The Second Postulate of the Einstein Theory of Relativity.
18. The second postulate of relativity states that the velocity of
light in free space appears the same to all observers regardless of the
relative motion of the source of light and the observer. This postulate
may be obtained by combining the first postulate of relativity with a
principle which has long been familiar to the ether theory of light.
This principle states that the velocity of light is unaffected by a
motion of the emitting source, in other words, that the velocity with
* Jahrbuch der Radioaktivitdt, vol. 7, p. 405 (1910).
22 Chapter Two.
which light travels past any observer is not increased by a motion
of the source of light towards the observer. The first postulate of
relativity adds the idea that a motion of the source of light towards
the observer is identical with a motion of the observer towards the
source. The second postulate of relativity is seen to be merely a
combination of these two principles, since it states that the velocity
of light in free space appears the same to all observers regardless both
of the motion of the source of light and of the observer.
19. It should be pointed out that the two principles whose com
bination thus leads to the second postulate of Einstein have come
from very different sources. The first postulate of relativity prac
tically denies the existence of any stationary ether through which
the earth, for instance, might be moving. On the other hand, the
principle that the velocity of light is unaffected by a motion of the
source was originally derived from the idea that light is transmitted
by a stationary medium which does not partake in the motion of the
source. This combination of two principles, which from a historical
point of view seem somewhat contradictory in nature, has given to
the second postulate of relativity a very extraordinary content.
Indeed it should be particularly emphasized that the remarkable
conclusions as to the nature of space and time forced upon science
by the theory of relativity are the special product of the second
postulate of relativity.
A simple example of the conclusions which can be drawn from
this postulate will make its extraordinary nature evident.
a'
b B V
FIG. 2.
S is a source of light and A and B two moving systems. A is
moving towards the source S, and B away from it. Observers on the
systems mark off equal distances aaf and W along the path of the light
and determine the time taken for light to pass from a to a' and b to b'
respectively. Contrary to what seem the simple conclusions of
common sense, the second postulate requires that the time taken
The Two Postulates. 23
for the light to pass from a to a' shall measure the same as the time
for the light to go from 6 to &'. Hence if the second postulate of
relativity is correct it is not surprising that science is forced in general
to new ideas as to the nature of space and time, ideas which are in
direct opposition to the requirements of so-called common sense.
Suggested Alternative to the Postulate of the Independence of the
Velocity of Light and the Velocity of the Source.
20. Because of the extraordinary conclusions derived by com
bining the principle of the relativity of motion with the postulate
that the velocity of light is independent of the velocity of its source,
a number of attempts have been made to develop so-called emission
theories of relativity based on the principle of the relativity of motion
and the further postulate that the velocity of light and the velocity
of its source are additive.
Before examining the available evidence for deciding between the
rival principles as to the velocity of light, we may point out that
this proposed postulate, of the additivity of the velocity of source
and light, would as a matter of fact lead to a very simple kind of
relativity theory without requiring any changes in our notions of
space and time. For if light or other electromagnetic disturbance
which is being emitted from a source did partake in the motion of
that source in such a way that the velocity of the source is added to
the velocity of emission, it is evident that a system consisting of the
source and its surrounding disturbances would act as a whole and
suffer no permanent change in configuration if the velocity of the
source were changed. This result would of course be in direct agree
ment with the idea of the relativity of motion which merely requires
that the physical properties of a system shall be independent of its
velocity through space.
As a particular example of the simplicity of emission theories we
may show, for instance, how easily they would account for the nega
tive result of the Michelson-Morley experiment. If 0, figure 3, is a
source of light and A and B are mirrors placed a meter away from 0, the
Michelson-Morley experiment shows that the time taken for light to
travel to A and back is the same as for the light to travel to B and
back, in spite of the fact that the whole apparatus is moving through
space in the direction 0 — B, due to the earth's motion around the sun.
24 Chapter Two.
The basic assumption of emission theories, however, would require
exactly this result, since it says that light travels out from 0 with a
constant velocity in all directions with
respect to 0. and not with respect to
some ether through which 0 is supposed
to be moving.
Direction of Earth's Mot ion The problem now before us is to
decide between the two rival principles
as to the velocity of light, and we shall
\B find that the bulk of the evidence is all
FlG 3 in favor of the principle which has led
to the Einstein theory of relativity with
its complete revolution in our ideas as to space and time, and against
the principle which has led to the superficially simple emission theo
ries of relativity.
21. Evidence Against Emission Theories of Light. All emission
theories agree in assuming that light from a moving source has a
velocity equal to the vector sum of the velocity of light from a sta
tionary source and the velocity of the source itself at the instant of
emission. And without first considering the special assumptions
which distinguish one emission theory from another we may first
present certain astronomical evidence which apparently stands in
contradiction to this basic assumption of all forms of emission
theory. This evidence was pointed out by Comstock* and later by
de Sitter, f
Consider the rotation of a binary star as it would appear to an
observer situated at a considerable distance from the star and in its
plane of rotation. (See figure 4.) If an emission theory of light
be true, the velocity of light from the star in position A will be c + u,
where u is the velocity of the star in its orbit, while in the position B
the velocity will be c — u. Hence the star will be observed to arrive
in position A, - - seconds after the event has actually occurred, and
c "T~ u
in position J5, _ seconds after the event has occurred. This will
* Phys. Rev., vol. 30, p. 291 (1910).
t Phys. Zeitschr., vol. 14, pp. 429, 1267 (1913).
The Two Postulates. 25
make the period of half rotation from A to B appear to be
where A2 is the actual time of a half rotation in the orbit, which for
.
/ Observer
Observer
FIG. 4.
simplicity may be taken as circular. On the other hand, the period
of the next half rotation from B back to A would appear to be
2ul
Now in the case of most spectroscopic binaries the quantity —^
C"
is not only of the same order of magnitude as At but oftentimes prob
ably even larger. Hence, if an emission theory of light were true,
we could hardly expect without correcting for the variable velocity
of light to find that these orbits obey Kepler's laws, as is actually
the case. This is certainly very strong evidence against any form
of emission theory. It may not be out of place, however, to state
briefly the different forms of emission theory which have been tried.
22. Different Forms of Emission Theory. As we have seen, emis
sion theories all agree in assuming that light from a moving source
26 Chapter Two.
has a velocity equal to the vector sum of the velocity of light from a
stationary source and the velocity of the source itself at the instant
of emission. Emission theories differ, however, in their assumptions
as to the velocity of light after its reflection from a mirror. The three
assumptions which up to this time have been particularly considered
are (1) that the excited portion of the reflecting mirror acts as a new
source of light and that the reflected light has the same velocity c
with respect to the mirror as has original light with respect to its source ;
(2) that light reflected from a mirror acquires a component of velocity
equal to the velocity of the mirror image of the original source, and
hence has the velocity c with respect to this mirror image; and (3) that
light retains throughout its whole path the component of velocity
which it obtained from its original moving source, and hence after
reflection spreads out with velocity c in a spherical form around a
center which moves with the same speed as the original source.
Of these possible assumptions as to the velocity of reflected light,
the first seems to be the most natural and was early considered by the
author but shown to be incompatible, not only with an experiment
which he performed on the velocity of light from the two limbs of
the sun,* but also with measurements of the Stark effect in canal
rays.f The second assumption as to the velocity of light was made
by Stewart,t but has also been shown f to be incompatible with
measurements of the Stark effect in canal rays. Making use of the
third assumption as to the velocity of reflected light, a somewhat
complete emission theory has been developed by Ritz,§ and un
fortunately optical experiments for deciding between the Einstein
and Ritz relativity theories have never been performed, although
such experiments are entirely possible of performance.! Against the
Ritz theory, however, we have of course the general astronomical
evidence of Comstock and de Sitter which we have already described
above.
For the present, the observations described above, comprise the
whole of the direct experimental evidence against emission theories
* Phys. Rev., vol. 31, p. 26 (1910).
f Phys. Rev., vol. 35, p. 136 (1912).
J Phys. Rev., vol. 32, p. 418 (1911).
§ Ann. de chim. et phys., vol. 13, p. 145 (1908); Arch, de Geneve, vol. 26, p. 232
(1908); Sdentia, vol. 5 (1909).
The Two Postulates. 27
of light and in favor of the principle which has led to the second
postulate of the Einstein theory. One of the consequences of the
Einstein theory, however, has been the deduction of an expression
for the mass of a moving body which has been closely verified by the
Kaufmann-Bucherer experiment. Now it is very interesting to note,
that starting with what has thus become an experimental expression
for the mass of a moving body, it is possible to work backwards to a
derivation of the second postulate of relativity. For the details of
the proof we must refer the reader to the original article.*
Further Postulates of the Theory of Relativity.
23. In the development of the theory of relativity to which we
shall now proceed we shall of course make use of many postulates.
The two which we have just considered, however, are the only ones,
so far as we are aware, which are essentially different from those
common to the usual theoretical developments of physical science.
In particular in our further work we shall assume without examination
all such general principles as the homogeneity and isotropism of the
space continuum, and the unidirectional, one-valued, one-dimensional
character of the time continuum. In our treatment of the dynamics
of a particle we shall also assume Newton's laws of motion, and the
principle of the conservation of mass, although we shall find, of course,
that the Einstein ideas as to the connection between space and time
will lead us to a non-Newtonian mechanics. We shall also make
extensive use of the principle of least action, which we shall find a
powerful principle in all the fields of dynamics.
* Phys. Rev., vol. 31, p. 26 (1910).
CHAPTER III.
SOME ELEMENTARY DEDUCTIONS.
24. In order gradually to familiarize the reader with the conse
quences of the theory of relativity we shall now develop by very
elementary methods a few of the more important relations. In this
preliminary consideration we shall lay no stress on mathematical
elegance or logical exactness. It is believed, however, that the
chapter will present a substantially correct account of some of the
more important conclusions of the theory of relativity, in a form
which can be understood even by readers without mathematical
equipment.
Measurements of Time in a Moving System.
25. We may first derive from the postulates of relativity a relation
connecting measurements of time intervals as made by observers in
systems moving with different velocities. Consider a system S
(Fig. 5) provided with a plane mirror a a, and an observer A, who
FIG. 5.
has a clock so that he can determine the time taken for a beam of
light to travel up to the mirror and back along the path Am A.
Consider also another similar system S', provided with a mirror b 6,
and an observer B, who also has a clock for measuring the time it
takes for light to go up to his mirror and back. System S' is moving
past S with the velocity V, the direction of motion being parallel
to the mirrors a a and b b, the two systems being arranged, more-
28
Some Elementary Deductions.
29
over, so that when they pass one another the two mirrors a a and
b b will coincide, and the two observers A and B will also come into
coincidence.
A, considering his system at rest and the other in motion, measures
the time taken for a beam of light to pass to his mirror and return,
over the path A m A, and compares the time interval thus obtained
with that necessary for the performance of a similar experiment
by B, in which the light has to pass over a longer path such as B n B',
shown in figure 6, where B B' is the distance through which the
— a
v — — i
1
[
\
1
\
1
\
1
\
1
\
\
\
1
\
\
\
\
\
T> _/
\
B - - -
0
FIG. 6.
A
observer B has moved during the time taken for the passage of the
light up to the mirror and back.
Since, in accordance with the second postulate of relativity, the
velocity of light is independent of the velocity of its source, it is
evident that the ratio of these two time intervals will be proportional
to the ratio of the two paths A m A and B n B', and this can easily
be calculated in terms of the velocity of light c and the velocity V
of the system S'.
From figure 6 we have
(A m)2 = (p n)2 = (B n)2 - (B p)2.
Dividing by (B n)2,
(Am)* _ (Pjtf
(BnY '' " IB n)2 '
But the distance B p is to B n as 7 is to c, giving us
A m
30 Chapter Three.
and hence A will find, either by calculation or by direct measurement
if he has arranged clocks at B and B', that it takes a longer time for
the performance of B's experiment than for the performance of his
/
: -\/ 1 —
own in the ratio 1
It is evident from the first postulate of relativity, however, that
B himself must find exactly the same length of time for the light to
pass up to his mirror and come back as did A in his experiment,
because the two systems are, as a matter of fact, entirely symmetrical
and we could with equal right consider B's system to be the one at
rest and A's the one in motion.
We thus find that two observers, A and B, who are in relative motion
will not in general agree in their measurements of the time interval neces
sary for a given event to take place, the event in this particular case,
for example, having been the performance of B's experiment; indeed,
making use of the ratio obtained in a preceding paragraph, we may
go further and make the quantitative statement that measurements of
time intervals made with a moving clock must be multiplied by the quantity
in order to agree with measurements made with a stationary
system of clocks.
It is sometimes more convenient to state this principle in the
form: A stationary observer using a set of stationary clocks will
/ y2
obtain a greater measurement in the ratio 1 : -v/1 — — for a given
time interval than an observer who uses a clock moving with the
velocity V.
Measurements of Length in a Moving System.
26. We may now extend our considerations, to obtain a relation
between measurements of length made in stationary and moving
systems.
As to measurements of length perpendicular to the line of motion
of the two systems S and Sf, a little consideration will make it at once
evident that both A and B must obtain identical results. This is
true because the possibility is always present of making a direct com-
Some Elementary Deductions. 31
parison of the meter sticks which A and B use for such measurements
by holding them perpendicular to the line of motion. When the
relative motion of the two systems brings such meter sticks into
juxtaposition, it is evident from the first postulate of relativity that
A's meter and B's meter must coincide in length. Any difference in
length could be due only to the different velocity of the two systems
through space, and such an occurrence is ruled out by our first postulate.
Hence measurements made with a moving meter stick held perpendicular
to its line of motion will agree with those made with stationary meter
sticks.
27. With regard to measurements of length parallel to the line of
motion of the systems, the affair is much more complicated. Any
direct comparison of the lengths of meter sticks in the two systems
would be very difficult to carry out; the consideration, however, of a
simple experiment on the velocity of light parallel to the motion of
the systems will lead to the desired relation.
Let us again consider two systems S and S' (fig. 7), S' moving
past S with the velocity V.
m
B
FIG. 7.
A and B are observers on these systems provided with clocks and
meter sticks. The two observers lay off, each on his own system,
paths for measuring the velocity of light. A lays off a distance of
one meter, A m, so that he can measure the time for light to travel
to the mirror m and return, and B, using a meter stick which has
the same length as A's when they are both at rest, lays off the dis
tance B n.
Each observer measures the length of time it takes for light to
travel to his mirror and return, and will evidently have to find the
same length of time, since the postulates of relativity require that the
velocity of light shall be the same for all observers.
32 Chapter Three.
Now the observer A, taking himself as at rest, finds that B's
light travels over a path B nf B' (fig. 8), where n n' is the distance
B B' n n'
FIG. 8.
through which the mirror n moves while the light is travelling up to
it, and B B' is the distance through which the source travels before
the light gets back. It is easy to calculate the length of this path.
We have
nn' _ V
B n' " c
and
BB' V
Bne B'=~~ c'
Also, from the figure,
B n' = B n + n n',
Bri B' = BnB + 2nri - BB'.
Combining, we obtain
Bri B' _ _l_
BnB '' V^'
c2
Thus A finds that the path traversed by B's light, instead of being
exactly two meters as was his own, will be longer in the ratio of
/ F2\
1 : ( 1 — — ) . For this reason A is rather surprised that B does
not report a longer time interval for the passage of the light than he
himself found. He remembers, however, that he has already found
that measurements of time made with a moving clock must be multi
plied by the quantity —. — in order to agree with his own, and
C'
sees that this will account for part of the discrepancy between the
expected and observed results. To account for the remaining dis
crepancy the further conclusion is now obtained that measurements of
Some Elementary Deductions. 33
length made with a moving meter sticky parallel to its motion, must be
I V2
multiplied by the quantity -y 1 in order to agree with those made
in a stationary system.
In accordance with this principle, a stationary observer will
obtain a smaller measurement for the length of a moving body than
will an observer moving along with the object. This has been called
the Lorentz shortening, the shortening occurring in the ratio
in the line of motion.
The Setting of Clocks in a Moving System.
28. It will be noticed that in our considerations up to this point
we have considered cases where only a single moving clock was needed
in performing the desired experiment, and this was done purposely,
since we shall find, not only that a given time interval measures
shorter on a moving clock than on a system of stationary clocks,
but that a system of moving clocks which have been set in synchronism
by an observer moving along with them will not be set in synchronism
for a stationary observer.
Consider again two systems S and S' in relative motion with the
velocity V. An observer A on system S places two carefully com
pared clocks, unit distance apart, in the line of motion, and has the
time on each clock read when a given point on the other system
passes it. An observer B on system S' performs a similar experiment.
The time interval obtained in the two sets of readings must be the
same, since the first postulate of relativity obviously requires that the
relative velocity of the two systems V shall have the same value for
both observers.
The observer A, however, taking himself as at rest, and familiar
with the change in the measurements of length and time in the moving
system which have already been deduced, expects that the velocity
as measured by B will be greater than the value that he himself
obtains in the ratio ^ , since any particular one of B's clocks
34 Chapter Three.
gives a shorter value for a given time interval than his own, while
B's measurements of the length of a moving object are greater than
his own, each by the factor *\jl — — •- . In order to explain the actual
result of B's experiment he now has to conclude that the clocks which
for B are set synchronously are not set in synchronism for himself.
From what has preceded it is easy to see that in the moving system,
from the point of view of the stationary observer, clocks must be set
further and further ahead as we proceed towards the rear of the
system, since otherwise B would not obtain a great enough difference
in the readings of the clocks as they come opposite the given point
on the other system. Indeed, if two clocks are situated in the moving
system, S', one in front of the other by the distance l't as measured
by B, then for A it will appear as though B had set his rear clock ahead
I'V
by the amount — .
c
29. We have now obtained all the information which we shall
need in this chapter as to measurements of time and length in systems
moving with different velocities. We may point out, however, before
proceeding to the application of these considerations, that our choice
of A' s system as the one which we should call stationary was of course
entirely arbitrary and immaterial. We can at any time equally well
take B's system as the one to which we shall ultimately refer all our
measurements, and indeed all that we shall mean when we call one of
our systems stationary is that for reasons of convenience we have
picked out that particular system as the one with reference to which
we particularly wish to make our measurements. We may also
point out that of course B has to subject A' 8 measurements of time
and length to just the same multiplications by the factor — p=
as did A in order to make them agree with his own.
These conclusions as to measurements of space and time are of course
very startling when first encountered. The mere fact, however, that
they appear strange to so-called " common sense " need cause us
no difficulty, since the older ideas of space and time were obtained
from an ancestral experience which never included experiments with
Some Elementary Deductions. 35
72
high relative velocities, and it is only when the ratio — becomes
c
•
appreciable that we obtain unexpected results. To those scientists
who do not wish to give up their " common sense " ideas of space
and time we can merely say that if they accept the two postulates
of relativity then they will also have to accept the consequences
which can be deduced therefrom. The remarkable nature of these
consequences merely indicates the very imperfect nature of our older
conceptions of space and time.
The Composition of Velocities.
30. Our conclusions as to the setting of clocks make it possible
to obtain an important expression for the composition of velocities.
Suppose we have a system S, which we shall take as stationary, and
on the system an observer A. Moving past S with the velocity V
is another system S' with an observer B, and finally moving past S'
in the same direction is a body whose velocity is u' as measured by
observer B. What will be the velocity u of this body as measured
by A?
Our older ideas led us to believe in the simple additivity of veloci
ties and we should have calculated u in accordance with the simple
expression
u = V + u'.
We must now allow, however, for the fact that u' is measured with
clocks which to A appear to be set in a peculiar fashion and running
at a different rate from his own, and with meter sticks which give
longer measurements than those used in the stationary system.
The determination of u' by observer B would be obtained by
measuring the time interval necessary for the body in question to
move a given distance I' along the system S'. If t' is the difference
in the respective clock readings when the body reaches the ends of
the line I', we have
I'V
We have already seen, however, that the two clocks are for A set —
units apart and hence for clocks set together the time interval would
36 Chapter Three.
I'V
have measured t' H 5- . Furthermore these moving clocks give
V2
time measurements which are shorter in the ratio \/ 1 — — : 1 than
\ c2
those obtained by A, so that for A the time interval for the body to
move from one end of V to the other would measure
V*'
'
furthermore, owing to the difference in measurements of length, this
I yj
line I' has for A the length I' \| 1 .
* * C
body is moving past S' with the velocity,
V2
line I' has for A the length I' \j 1 — — . Hence A finds that the
c
I'V I'V u'V
This makes the total velocity of the body past S equal to the sum
u = V +
or
u'F
u'V
This new expression for the composition of velocities is extremely
important. When the velocities u' and V are small compared with
the velocity of light c, we observe that the formula reduces to the simple
additivity principle which we know by common experience to be true
Some Elementary Deductions. 37
for all ordinary velocities. Until very recently the human race has
had practically no experience with high velocities and we now see
that for velocities in the neighborhood of that of light, the simple
additivity principle is nowhere near true.
In particular it should be noticed that by the composition of
velocities which are themselves less than that of light we can never
obtain any velocity greater than that of light. As an extreme case,
suppose for example that the system S' were moving past S itself
with the velocity of light (i. e., V = c) and that in the system S' a
particle should itself be given the velocity of light in the same direc
tion (i. e., u' = c); we find on substitution that the particle still has
only the velocity of light with respect to S. We have
c + c 2c
By the consideration of such conclusions as these the reader will
appreciate more and more the necessity of abandoning his older
naive ideas of space and time which are the inheritance of a long
human experience with physical systems in which only slow velocities
were encountered.
The Mass of a Moving Body.
31. We may now obtain an important relation for the mass of a
moving body. Consider again two similar systems, S at rest and S'
moving past with the velocity V. The observer A on system S has a
sphere made from some rigid elastic material, having a mass of m
grams, and the observer B on system Sf is also provided with a similar
sphere. The two spheres are made so that they are exactly alike
when both are at rest; thus B's sphere, since it is at rest with respect
to him, looks to him just the same as the other sphere does to A-
As the two systems pass each other (fig. 9) each of these clever experi
menters rolls his sphere towards the other system with a velocity of
u cm. per second, so that they will just collide and rebound in a line
perpendicular to the direction of motion. Now, from the first postu
late of relativity, system S' appears to B just the same as system S
appears to A, and jETs ball appears to him to go through the same
evolutions that A finds for his ball. A finds that his ball on collision
38
Chapter Three.
undergoes the algebraic change of velocity 2u, B finds the same change
in velocity 2u for his ball. B reports this fact to "A, and A knowing
that B's measurements of length agree with his own in this transverse
FIG. 9.
direction, but that his clock gives time intervals that are shorter than
/ 72
his own in the ratio \fl — — : 1, calculates that the change in veloc
ity of B's ball must be 2u
From the principle of the conservation of momentum, however,
A knows that the change in momentum of B's ball must be the same
as that of his own and hence can write the equation
mau =
where ma is the mass of A's ball and ra& is the mass of B's ball. . Solv
ing we have
ma
nib =
In other words, B's ball, which had the same mass ma as A's when
Some Elementary Deductions. 39
both were at rest, is found to have the larger mass — - — - -- when
placed in a system that is moving with the velocity V.*
The theory of relativity thus leads to the general expression
m
for the mass of a body moving with the velocity u and having the
mass ?tto when at rest.
Since we have very few velocities comparable with that of light
I ^2
it is obvious that the quantity \/l seldom differs much from
unity, which makes the experimental verification of this expression
difficult. In the case of electrons, however, which are shot off from
radioactive substances, or indeed in the case of cathode rays produced
with high potentials, we do have particles moving with velocities
comparable to that of light, and the experimental work of Kaufmann,
Bucherer, Hupka and others in this field provides one of the most
striking triumphs of the theory of relativity.
The Relation Between Mass and Energy.
32. The theory of relativity has led to very important conclusions
as to the nature of mass and energy. In fact, we shall see that matter
and energy are apparently different names for the same fundamental
entity.
When we set a body in motion it is evident from the previous
section that we increase both its mass as well as its energy. Now
we can show that there is a definite ratio between the amount of
energy that we give to the body and the amount of mass that we
give to it.
If the force / acts on a particle which is free to move, its increase in
kinetic energy is evidently
A# = ffdl.
But the force acting, is by definition, equal to the rate of increase in
* In carrying out this experiment the transverse velocities of the balls should
be made negligibly small in comparison with the relative velocity of the systems V.
40 Chapter Three.
the momentum of the particle
/=|(m«).
Substituting we have
We have, however, from the previous section,
m =
which, solved for u, gives us
Substituting this value of u in our equation for AE we obtain, after
simplification,
AE = fc*dm = c2Aw.
This says that the increase of the kinetic energy of the particle,
in ergs, is equal to the increase in mass, in grams, multiplied by the
square of the velocity of light. If now we bring the particle to rest
it will give up both its kinetic energy and its excess mass. Accepting
the principles of the conservation of mass and energy, we know, how
ever, that neither this energy nor the mass has been destroyed; they
have merely been passed on to other bodies. There is, moreover,
every reason to believe that this mass and energy, which were asso
ciated together when the body was in motion and left the body when
it was brought to rest, still remain always associated together. For
example, if the body should be brought to rest by setting another
body into motion, it is of course a necessary consequence of our con
siderations that the kinetic energy and the excess mass both pass
on together to the new body which is set in motion. A similar con
clusion would be true if the body is brought to rest by frictional forces,
since the heat produced by the friction means an increase in the kinetic
energies of ultimate particles.
Some Elementary Deductions. 41
In general we shall find it pragmatic to consider that matter and
energy are merely different names for the same fundamental entity.
One gram of matter is equal to 1021 ergs of energy.
c2 = (2.9986 X 1010)2 = approx. 1021.
This apparently extraordinary conclusion is in reality one which
produces the greatest simplification in science. Not to mention
numerous special applications where this principle is useful, we may
call attention to the fact that the great laws of the conservation of
mass and of energy have now become identical. We may also point
out that those opposing camps of philosophic materialists who defend
matter on the one hand or energy on the other as the fundamental
entity of the universe may now forever cease their unimportant bicker
ings.
CHAPTER IV.
THE EINSTEIN TRANSFORMATION EQUATIONS FOR SPACE
AND TIME.
The Lorentz Transformation.
33. We may now proceed to a systematic study of the consequences
of the theory of relativity.
The fundamental problem that first arises in considering
spatial and temporal measurements is that of transforming the
description of a given kinematical occurrence from the variables of
one system of coordinates to those of another system which is in
motion relative to the first.
Consider two systems of right-angled Cartesian coordinates S
and Sf (fig. 10) in relative motion in the X direction with the velocity V.
-x v) x'
'Z
FIG. 10.
The position of any given point in space can be determined by speci
fying its coordinates x, y, and z with respect to system S or its coordi
nates x', y' and z' with respect to system S'. Furthermore, for the
purpose of determining the time at which any event takes place, we
may think of each system of coordinates as provided with a whole
series of clocks placed at convenient intervals throughout the system,
the clocks of each series being set and regulated* by observers in the
* We may think of the clocks as being set in any of the ways that are usual
in practice. Perhaps the simplest is to consider the clocks as mechanisms which
have been found to "keep time" when they are all together where they can be
examined by one individual observer. The assumption can then be made, in ac-
42
Transformation Equations for Space and Time. 43
corresponding system. The time at which the event in question
takes place may be denoted by t if determined by the clocks belonging
to system S and by tr if determined by the clocks of system Sr.
For convenience the two systems S and S' are chosen so that the
axes OX and O'X' lie in the same line, and for further simplification
we choose, as our starting-point for time measurements, t and t' both
equal to zero when the two origins come into coincidence.
The specific problem now before us is as follows: If a given kine-
matical occurrence has been observed and described in terms of the
variables x', y', z' and t', what substitutions must we make for the
values of these variables in order to obtain a correct description of the
same kinematical event in terms of the variables x, y, z and J? In
other words, we want to obtain a set of transformation equations
from the variables of system Sf to those of system S. The equations
which we shall present were first obtained by Lorentz, and the process
of changing from one set of variables to the other has generally been
called the Lorentz transformation. The significance of these equa
tions from the point of view of the theory of relativity was first appre
ciated by Einstein.
Deduction of the Fundamental Transformation Equations.
34. It is evident that these transformation equations are going
to depend on the relative velocity V of the two systems, so that we
may write for them the expressions
x' = F^V, x, y, z, 0,
y' = F*(V, x, y, z, t),
z' = Fi(V, x, y, z, t),
t' = Ft(V, x, y, z} t),
where Fi, F2, etc., are the unknown functions whose form we wish
to determine.
It is possible at the outset, however, greatly to simplify these
relations. If we accept the idea of the homogeneity of space it is
evident that any other line parallel to OXX' might just as well have
been chosen as our line of X-axes, and hence our two equations for
x' and t' must be independent of y and z. Moreover, as to the equa-
cordance with our ideas of the homogeneity of space, that they will continue to
"keep time" after they have been distributed throughout the system.
44 Chapter Four.
tions for y' and z' it is at once evident that the only possible solutions
are y' = y and z' = z. This is obvious because a meter stick held
in the system S' perpendicular to the line of relative motion, OX',
of the system can be directly compared with meter sticks held similarly
in system S, and in accordance with the first postulate of relativity
they must agree in length in order that the systems may be entirely
symmetrical. We may now rewrite our transformation equations
in the simplified form
t' = F2(V, t, a),
and have only two functions, Fi and F2, whose form has to be de
termined.
To complete the solution of the problem we may make use of three
further conditions which must govern the transformation equations.
35. Three Conditions to be Fulfilled. In the first place, when the
velocity V between the systems is small, it is evident that the trans
formation equations must reduce to the form that they had in New
tonian mechanics, since we know both from measurements and from
everyday experience that the Newtonian concepts of space and time
are correct as long as we deal with slow velocities. Hence the limiting
form of the equations as V approaches zero will be (cf. Chapter I,
equations 3-4-5-6)
x' = x - Vt,
y' = y,
Zf = 2,
t' = t.
36. A second condition is imposed upon the form of the functions
Fi and Fz by the first postulate of relativity, which requires that the
two systems S and Sf shall be entirely symmetrical. Hence the
transformation equations for changing from the variables of system S
to those of system S' must be of exactly the same form as those used
in the reverse transformation, containing, however, — V wherever
H- V occurs in the latter equations. Expressing this requirement in
Transformation Equations for Space and Time. 45
mathematical form, we may write as true equations
where FI and Fz must have the same form as above.
37. A final condition is imposed upon the form of FI and F* by
the second postulate of relativity, which states that the velocity of a
beam of light appears the same to all observers regardless of the
motion of the source of light or of the observer. Hence our trans
formation equations must be of such a form that a given beam of
light has the same velocity, c, when measured in the variables of either
system. Let us suppose, for example, that at the instant t = t' = 0,
when the two origins come into coincidence, a light impulse is started
from the common point occupied by 0 and 0' '. Then, measured in
the coordinates of either system, the optical disturbance which is
generated must spread out from the origin in a spherical form with
the velocity c. Hence, using the variables of system S, the coordinates
of any point on the surface of the disturbance will be given by the
expression
x2 + 2/2 + z2 = c2/2, (7)
while using the variables of system Sf we should have the similar
expression
x'2 + 2/'2 + z'2 = cH>\ (8)
Thus we have a particular kinematical occurrence, the spreading out
of a light disturbance, whose description is known in the variables
of either system, and our transformation equations must be of such
a form that their substitution will change equation (8) to equation (7).
In other words, the expression x2 + y2 + z2 - c2*2 is to be an invariant
for the Lorentz transformation.
38. The Transformation Equations. The three sets of conditions
which, as we have seen in the last three paragraphs, are imposed upon
the form of FI and F2 are sufficient to determine the solution of the
problem. The natural method of solution is obviously that of trial,
46 Chapter Four.
and we may suggest the solution :
x' = -j=^= (x - Vt) = K(X - Vt), (9)
y' = y, (io)
z' = z, (11)
(12)
1 / V \ ( V \
''"lr^V~H-«('~H'
where we have placed K to represent the important and continually
1
recurring quantity
It will be found as a matter of fact by examination that these
solutions do fit all three requirements which we have stated. Thus,
when V becomes small compared with the velocity of light, c, the
equations do reduce to those of Galileo and Newton. Secondly, if
the equations are solved for the unprimed quantities in terms of the
primed, the resulting expressions have an unchanged form except for
the introduction of — V in place of + V, thus fulfilling the require
ments of symmetry imposed by the first postulate of relativity. And
finally, if we substitute the expressions for x', y', z' and t' in the poly
nomial a/2 + y'2 + z'2 = c2t'2} we shall obtain the expression x2 + 2/2
+ z2 — c2t2 and have thus secured the in variance of x2 + y2 + z2 — cH2
which is required by the second postulate of relativity.
We may further point out that the whole series of possible Lorentz
transformations form a group such that the result of two successive
transformations could itself be represented by a single transformation
provided we picked out suitable magnitudes and directions for the
velocities between the various systems.
It is also to be noted that the transformation becomes imaginary
for cases where V > c, and we shall find that this agrees with ideas
obtained in other ways as to the speed of light being an upper limit
for the magnitude of all velocities.
Transformation Equations for Space and Time. 47
Further Transformation Equations.
39. Before making any applications of our equations we shall find
it desirable to obtain by simple substitutions and differentiations a
series of further transformation equations which will be of great value
in our future work.
By the simple differentiation of equation (12) we can obtain
dx
where we have put x for -r." •
40. Transformation Equations for Velocity. By differentiation of
the equations for x' ', yr and z', nos. (9), (10) and (11), and substitution
of the value just found for -j- we may obtain the following transfor
mation equations for velocity:
x-V , ux-V
X ~ °r Ux~~ ~'
' c* c2
v-^
- xV
--?
where the placing of a dot has the familiar significance of differentiation
with respect to time, -r- being represented by x and ^7 by x'.
The significance of these equations for the transformation of
velocities is as follows: If for an observer in system S a point appears
to be moving with the uniform velocity (x, y, z) its velocity (x', ?/', z')t
as measured by an observer in system S'} is given by these expressions
(14), (15) and (16).
41. Transformation Equations for the Function — j= =. These
48 Chapter Four.
three transformation equations for the velocity components of a point,
permit us to obtain a further transformation equation for an important
function of the velocity which we shall find continually recurring in
our later work. This is the function — j -- , where we have indi-
cated the total velocity of the point by u, according to the expression
ui = ±2 + y2 + 22- By the substitution of equations (14), (15) and
(16) we obtain the transformation equation
(17)
42. Transformation Equations for Acceleration. By further dif
ferentiating equations (14), (15) and (16) and simplifying, we easily
obtain three new equations for transforming measurements of accel
eration from system S' to S, viz. :
x'=(i-~y*-*x, as)
$> = Ci _^y~2 ^ + 2/^(1 -^)~3/c~2*> (19)
z' = (l -^} * K-*z + z^2(l -~^J 3 K~*X, (20)
or
/ i/ T7\-3
K-*iix, (18)
V f 11 V
V ' U'V • (20)
CHAPTER V.
KINEMATICAL APPLICATIONS.
43. The various transformation equations for spatial and temporal
measurements which were derived in the previous chapter may now be
used for the treatment of a number of kinematical problems. In
particular it will be shown in the latter part of the chapter that a
number of optical problems can be handled with extraordinary facility
by the methods now at our disposal.
The Kinematical Shape of a Rigid Body.
44. We may first point out that the conclusions of relativity theory
lead us to quite new ideas as to what is meant by the shape of a rigid
body. We shall find that the shape of a rigid body will depend entirely
upon the relative motion of the body and the observer who is making
measurements on it.
Consider a rigid body which is at rest with respect to system Sf.
Let Xit 2//> z/ and x2', 2/2', £2' be the coordinates in system S' of two
points in the body. The coordinates of the same points as measured
in system S can be found from transformation equations (9), (10)
and (11), and by subtraction we can obtain the following expressions
V2
(2/2 - 2/0 = (2/2' - 2/i'), (22)
(*2 - 2/2) = (zj - */), (23)
connecting the distances between the pair of points as viewed in the
two systems. In making this subtraction terms containing t have
been cancelled out since we are interested in the simultaneous positions
of the points. In accordance with these equations we may distinguish
then between the geometrical shape of a body, which is the shape that
it has when measured on a system of axes which are at rest relative
to it, and its kinematical shape, which is given by the coordinates which
5 49
50 Chapter Five.
express the simultaneous positions of its various points when it is in
motion with respect to the axes of reference. We see that the kine-
matical shape of a rigid body differs from its geometrical shape by a
shortening of all its dimensions in the line of motion in the ratio
/ V2
A/1 — — : 1; thus a sphere, for example, becomes a Heaviside ellipsoid.
In order to avoid incorrectness of speech we must be very care
ful not to give the idea that the kinematical shape of a body is in
any sense either more or less real than its geometrical shape. We
must merely learn to realize that the shape of a body is entirely de
pendent on the particular set of coordinates chosen for the making
of space measurements.
The Kinematical Rate of a Clock.
45. Just as we have seen that the shape of a body depends upon
our choice of a system of coordinates, so we shall find that the rate of
a given clock depends upon the relative motion of the clock and its
observer. Consider a clock or any mechanism which is performing
a periodic action. Let the clock be at rest with respect to system
S' and let a given period commence at t\ and end at £2', the length of
the interval thus being At' = t-2' — t\.
From transformation equation (12) we may obtain
and by subtraction, since x% — Xi is obviously equal to Vt, we have
r
Vi-
At
Yl
c2
i
Kinematical Applications. 51
Thus an observer who is moving past a clock finds a longer period for
/ T2
the clock in the ratio 1 : \J 1 — — than an observer who is stationary
with respect to it. Suppose, for example, we have a particle which
is turning alternately red and blue. For an observer who is moving
past the particle the periods for which it remains a given color measure
/ 7*
longer in the ratio 1 : A/ 1 — — than they do to an observer who is
stationary with respect to the particle.
46. A possible opportunity for testing this interesting conclusion
of the theory of relativity is presented by the phenomena of canal
rays. We may regard the atoms which are moving in these rays as
little clocks, the frequency of the light which they emit corresponding
to the period of the clock. If now we should make spectroscopic
observations on canal rays of high velocity, the frequency of the
emitted light ought to be less than that of light from stationary atoms
of the same kind if our considerations are correct. It would of course
be necessary to view the canal rays at right angles to their direction
of motion, to prevent a confusion of the expected shift in the spectrum
with that produced by the ordinary Doppler effect (see Section 54).
The Idea of Simultaneity.
47. We may now also point out that the idea of the absolute simul
taneity of two events must henceforth be given up. Suppose, for
example, an observer in the system S is interested in two events
which take place simultaneously at the time t. Suppose one of these
events occurs at a point having the X coordinate x\ and the other
at a point having the coordinate z2; then by transformation equation
(12) it is evident that to an observer in system S', which is moving
relative to S with the velocity 7, the two events would take place
respectively at the times
and
1 / V \
T— V>?*J
v1-?
52 Chapter Five.
or the difference in time between the occurrence of the events would
appear to this other observer to be
-»,)• (25)
The Composition of Velocities.
48. The Case of Parallel Velocities. We may now present one of
the most important characteristics of Einstein's space and time,
which can be best appreciated by considering transformation equation
(14). or more simply its analogue for the transformation in the reverse
direction
c*
Consider now the significance of the above equation. If we
have a particle which is moving in the X direction with the velocity
Ux as measured in system S', its velocity ux with respect to system S
is to be obtained by adding the relative velocity of the two systems V
ux'V
and dividing the sum of the two velocities by 1 H -- ^— . Thus we see
c
that we must completely throw overboard our old naive ideas of the
direct additivity of velocities. Of course, in the case of very slow
velocities, when ux' and V are both small compared with the velocity
ux'V
of light, the quantity — ^-- is very nearly zero and the direct addition
of velocities is a close approximation to the truth. In the case of
velocities, however, which are in the neighborhood of the speed of
light, the direct addition of velocities would lead to extremely er
roneous results.
49. In particular it should be noticed that by the composition of
velocities which are themselves less than that of light we can never
obtain any velocity greater than that of light. Suppose, for example,
that the system S' were moving past S with the velocity of light
(i. e., V = c), and that in the system S' a particle should itself be
given the velocity of light in the X direction (i. e., ux' — c); we find
on substitution that the particle still has only the velocity of light
Kinematical Applications. 53
with respect to S. We have
c_+_c 2c
1 +
ux= — = - = c.
c
If the relative velocity between the systems should be one half
r
the velocity of light, - , and an experimenter on Sf should shoot off a
particle in the X direction with half the velocity of light, the total
velocity with respect to S would be
ic + \c 4
u* = --- TT2= *c.
•
50. Composition of Velocities in General. In the case of particles
which have components of velocity in other than the X direction it
is obvious that our transformation equations will here also provide
methods of calculation to supersede the simple addition of velocities.
If we place
U? = UX2 + Uy2 + USZ,
U" = «," + «," + tt.",
we may obtain by the substitution of equations (14), (15) and (16)
where a is the angle in the system S' between the X' axis and the
velocity of the particle uf. For the particular case that V and u'
are in the same direction, the equation obviously reduces to the
simpler form
u' + V
u=i + ^'
c2
which we have already considered.
51. We may also call attention at this point to an interesting char
acteristic of the equations for the transformation of velocities. It will
54 Chapter Five.
be noted from an examination of these equations that if to any ob
server a particle appears to have a constant velocity, i. e., to be
unacted on by any force, it will also appear to have a uniform although
of course different velocity to any observer who is himself in uniform
motion with respect to the first. An examination, however, of the
transformation equations for acceleration (18), (19), (20) will show
that here a different state of affairs is true, since it will be seen that a
point which has uniform acceleration (x, y, z) with respect to an ob
server in system S will not in general have a uniform acceleration in
another system Sf, since the acceleration in system S' depends not
only on the constant acceleration but also on the velocit3r in system S,
which is necessarily varying.
Velocities Greater than that of Light.
52. In the preceding section we have called attention to the fact
that the mere composition of velocities which are not themselves
greater than that of light will never lead to a speed that is greater
than that of light. The question naturally arises whether velocities
which are greater than that of light could ever possibly be obtained
in any way.
This problem can be attacked in an extremely interesting manner.
Consider two points A and B on the X axis of the system S, and
suppose that some impulse originates at A, travels to B with the
velocity u and at B produces some observable phenomenon, the start
ing of the impulse at A and the resulting phenomenon at B thus
being connected by the relation of cause and effect.
The time elapsing between the cause and its effect as measured
in the units of system S will evidently be
At = ts-tA = Xj±, (28)
where XA and XB are the coordinates of the two points A and B.
Now in another system £', which has the velocity V with respect
to S, the time elapsing between cause and effect would evidently be
^pv* ">**)"
Kinematical Applications. 55
where we have substituted for t'B and t'A in accordance with equation
(12). Simplifying and introducing equation (28) we obtain
uV
1 ~~ c2
(29)
Let us suppose now that there are no limits to the possible magni
tude of the velocities u and V, and in particular that the causal im
pulse can travel from A to B with a velocity u greater than that of
light. It is evident that we could then take a velocity u great enough
uV
so that - - would be greater than unity and At' would become nega-
c-
tive. In other words, for an observer in system S' the effect which
occurs at B would precede in time its cause which originates at A.
Such a condition of affairs might not be a logical impossibility; never
theless its extraordinary nature might incline us to believe that no
causal impulse can travel with a velocity greater than that of light.
We may point out in passing, however, that in the case of kine
matic occurrences in which there is no causal connection there is no
reason for supposing that the velocity must be less than that of light.
Consider, for example, a set of blocks arranged side by side in a long
row. For each block there could be an independent time mechanism
like an alarm clock which would go off at just the right instant so
that the blocks would fall down one after another along the line.
The velocity with which the phenomenon would travel along the
line of blocks could be arranged to have any value. In fact, the
blocks could evidently all be fixed to fall just at the same instant,
which would correspond to an infinite velocity. It is to be noticed
here, however, that there is no causal connection between the falling
of one block and that of the next, and no transfer of energy.
Application of the Principles of Kinematics to Certain Optical Prob
lems.
53. Let us now apply our kinematical considerations to some
problems in the field of optics. We may consider a beam of light
as a periodic electromagnetic disturbance which is propagated through
a vacuum with the velocity c. At any point in the path of a beam of
56 Chapter Five.
light the intensity of the electric and magnetic fields will be undergoing
periodic changes in magnitude. Since the intensities of both the
electric and the magnetic fields vary together, the statement of a
single vector is sufficient to determine the instantaneous condition
at any point in the path of a beam of light. It is customary to call
this vector (which might be either the strength of the electric or of
the magnetic field) the light vector.
For the case of a simple plane wave (i. e., a beam of monochromatic
light from a distant source) the light vector at any point in the path
of the light may be put proportional to
Ix + my + nz,
•
where x, y and z are the coordinates of the point under observation,
t is the time, Z, m and n are the cosines of the angles a, /5 and 7 which
determine the direction of the beam of light with reference to our
system, and w is a constant which determines the period of the light.
If now this same beam of light were examined by an observer in
system S' which is moving past the original system in the X direction
with the velocity V, we could write the light vector proportional to
,/, ZV + mV + nV\
CO I t — - I .
V c J
sin a/ 1*'-- ^f- -1. (31)
It is not difficult to show that the transformation equations which
we have already developed must lead to the following relations between
the measurements in the two systems*
* Methods for deriving the relation between the accented and unaccented
quantities will be obvious to the reader. For example, consider the relation between
co and to'. At the origin of coordinates x = y = z = Qin system S, we shall have
in accordance with expression (30) the light vector proportional to sin cot, and hence
similarly at the point 0', which is the origin of coordinates in system Sf, we shall
have the light vector proportional to sin coY. But the point 0' as observed from
system S moves with the velocity V along the X-axis and at any instant has the
position x = Vt; hence substituting in expression (30) we have the light vector at
the point 0' as measured in system S proportional to
sin wt ( 1 — Z - J , (36)
while as measured in system S' the intensity is proportional to
sin coT. (37)
Kinematical Applications. 57
(32)
(33)
(34)
F
1 - z-
c
(35)
(-1)'
With the help of these equations we may now treat some important
optical problems.
54. The Doppler Effect. At the origin of coordinates, x = y = z
= 0, in system S we shall evidently have from expression (30) the
light vector proportional to sin wt. That means that the vector
becomes zero whenever ut = 2N TT, where N is any integer; in other
2?r
words, the period of the light is p = — or the frequency
Similarly the frequency of the light as measured by an observer in
system S' would be
We have already obtained, however, a transformation equation for t', namely,
and further may place x = Vt. Making these substitutions and comparing ex
pressions (36) and (37) we see that we must have the relation
to' = «ic(l - lj)«
Methods of obtaining the relation between the cosines I, m and n and the corre
sponding cosines V, m', and n' as measured in system S' may be 1<
58 Chapter Five.
Combining these two equations and substituting the equation con
necting co and co' we have
v'
This is the relation between the frequencies of a given beam of light
as it appears to observers who are in relative motion.
If we consider a source of light at rest with respect to system S'
and at a considerable distance from the observer in system S, we
may substitute for v' the frequency of the source itself, VQ, and for I
we may write cos 0, where 0 is the angle between the line connecting
source and observer and the direction of motion of the source, leading
to the expression
Vn
(38)
(l-cos*^)
This is the most general equation for the Doppler effect. When
the source of light is moving directly in the line connecting source
and observer, we have cos 0 = 1, and the equation reduces to
(39)
K I 1 — —
which except for second order terms is identical with the older ex
pressions for the Doppler effect, and hence agrees with experimental
determinations.
We must also observe, however, that even when the source of
light moves at right angles to the line connecting source and observer
there still remains a second-order effect on the observed frequency,
in contradiction to the predictions of older theories. We have in this
case cos 0 = 0,
I yz
~ ~j - (40)
This is the change in frequency which we have already considered
when we discussed the rate of a moving clock. The possibilities of
Kinematical Applications. 59
direct experimental verification should not be overlooked (see sec
tion 46).
55. The Aberration of Light. Returning now to our transforma
tion equations, we see that equation (33) provides an expression for
calculating the aberration of light. Let us consider that the source
of light is stationary with respect to system S, and let there be an
observer situated at the origin of coordinates of system S' and thus
moving past the source with the velocity V in the X direction. Let <£
be the angle between the X-axis and the line connecting source of
light and observer and let <£' be the same angle as it appears to the
moving observer; then we can obviously substitute in equation (33),
cos <£ = /, cos 0' = lf, giving us
V
cos 0 - -
cos <f>f = - — y . (41)
1 — COS cj> —
c
This is a general equation for the aberration of light.
For the particular case that the direction of the beam of light is
perpendicular to the motion of the observer we have cos <£ = 0
cos *'---, (42)
C
which, except for second-order differences, is identical with the familiar
expression which makes the tangent of the angle of aberration nu
merically equal to V/c. The experimental verification of the formula
by astronomical measurements is familiar.
56. Velocity of Light in Moving Media. It is also possible to treat
very simply by kinematic methods the problem of the velocity of
light in moving media. We shall confine ourselves to the particular
case of a beam of light in a medium which is itself moving parallel
to the light.
Let the medium be moving with the velocity V in the X direction,
and let us consider the system of coordinates S' as stationary with
respect to the medium. Now since the medium appears to be sta
tionary with respect to observers in S' it is evident that the velocity
of the light with respect to S' will be c/V, where M is index of refraction
60 Chapter Five.
for the medium. If now we use our equation (26) for the addition of
velocities we shall obtain for the velocity of light, as measured by
observers in S,
c-+v
Carrying out the division and neglecting terms of higher order we
obtain
. (44)
The equation thus obtained is identical with that of Fresnel, the
/V - 1\
quantity ( — 1 being the well-known Fresnel coefficient. The
empirical verification of this equation by the experiments of Fizeau
and of Michelson and Morley is too well known to need further
mention.
For the case of a dispersive medium we should obviously have to
substitute in equation (44) the value of /z corresponding to the par
ticular frequency, *>', which the light has in system S'. It should be
noticed in this connection that the frequencies / and v which the
light has respectively in system S and system S', although nearly
enough the same for the practical use of equation (44), are in reality
connected by an expression which can easily be shown (see section 54)
to have the form
•K\1-7)'-
57. Group Velocity. In an entirely similar way we may treat the
problem of group velocity and obtain the equation
(46)
where Gr is the group velocity as it appears to an observer who is
Kinematical Applications. 61
stationary with respect to the medium. G' is, of course, an experi
mental quantity, connected with frequency and the properties of the
medium, in a way to be determined by experiments on the stationary
medium.
In conclusion we wish to call particular attention to the extra
ordinary simplicity of this method of handling the optics of moving
media as compared with those that had to be employed before the
introduction of the principle of relativity.
CHAPTER VI.
THE DYNAMICS OF A PARTICLE.
58. In this chapter and the two following, we shall present a
system of " relativity mechanics " based on Newton's three laws of
motion, the* Einstein transformation equations for space and time,
and the principle of the conservation of mass.
The Laws of Motion.
Newton's laws of motion may be stated in the following form:
I. Every particle continues in its state of rest or of uniform motion
in a straight line, unless it is acted upon by an external force.
II. The rate of change of the momentum of the particle is equal
to the force acting and is in the same direction.
III. For the action of every force there is an equal force acting
in the opposite direction.
Of these laws the first two merely serve to define the concept of
force, and their content may be expressed in mathematical form by
the following equation of definition
d , • du dm
F = dt(mu) = mdt+-dttt' (47)
where F is the force acting on a particle of mass m which has the
velocity u, and hence the momentum mu.
Quite different in its nature from the first two laws, which merely
give us a definition of force, the third law states a very definite physical
postulate, since it requires for every change in the momentum of a
body an equal and opposite change in the momentum of some other
body. The truth of this postulate will of course be tested by com
paring with experiment the results of the theory of mechanics which
we base upon its assumption.
Difference between Newtonian and Relativity Mechanics.
59. Before proceeding we may point out the particular difference
between the older Newtonian mechanics, which were based on the
laws of motion and the Galilean transformation equations for space
62
Dynamics of a Particle. 63
and time, and our new system of relativity mechanics based on
those same laws of motion and the Einstein transformation equations.
In the older mechanics there was no reason for supposing that the
mass of a body varied in any way with its velocity, and hence force
could be defined interchangeably as the rate of change of momentum
or as mass times acceleration, since the two were identical. In rela
tivity mechanics, however, we shall be forced to conclude that the
mass of a body increases in a perfectly definite way with its velocity,
and hence in our new mechanics we must define force as equal to the
total rate of change of momentum
d(mu) du dm
instead of merely as mass times acceleration m -7- . If we should try
to define force in " relativity mechanics " as merely equal to mass
times acceleration, we should find that the application of Newton's
third law of motion would then lead to very peculiar results, which
would make the mass of a body different in different directions and
force us to give up the idea of the conservation of mass.
The Mass of a Moving Particle.
60. In Section 31 we have already obtained in an elementary way
an expression for the mass of a moving particle, by considering a
collision between elastic particles and calculating how the resulting
changes in velocity would appear to different observers who are
themselves in relative motion. Since we now have at our command
general formula? for the transformation of velocities, we are now in
a position to handle this problem much more generally, and in particu
lar to show that the expression obtained for the mass of a moving particle
is entirely independent of the consideration of any particular type of
collision.
61. Transverse Collision. Let us first treat the case of a so-called
" transverse " collision. Consider a system of coordinates and two
exactly similar elastic particles, each having the mass mQ when at
rest, one moving in the X direction with the velocity -f u and the
other with the velocity - u. (See figure 11.) Besides the large
components of velocity + u and - u which they have in the X direc-
64 Chapter Six.
tion let them also have small components of velocity in the Y direc
tion, + v and — v. The experiment is so arranged that the particles
will just undergo a glancing collision as they pass each other and
rebound with components
> O _v of velocity in the Y direc-
y +v tion of the same magnitude,
9 **' ~u v, which they originally had,
but in the reverse direction.
(It is evident from the symmetry of the arrangement that the experi
ment would actually occur as we have stated.)
We shall now be interested in the way this experiment would appear
to an observer who is in motion in the X direction with the velocity V
relative to our original system of coordinates.
From equation (14) for the transformation of velocities, it can
be seen that this new observer would find for the X component velocities
of the two particles the values
u- V -u - V
*P — and u,= -- - (48)
and from equation (15) for the Y component velocities would find the
values
(49)
the signs depending on whether the velocities are measured before or
after the collision.
Now from Newton's third law of motion (i. e., the principle of
the equality of action and reaction) it is evident that on collision
the two particles must undergo the same numerical change in momen
tum.
For the experiment that we have chosen the only change in mo
mentum is in the Y direction, and the observer whose measurements
we are considering finds that one particle undergoes the total change
Dynamics of a Particle. 65
in velocity
V~2
and that the other particle undergoes the change in velocity
V2
Since these changes in the velocities of the particles are not equal,
it is evident that their masses must also be unequal if the principle
of the equality of action and reaction is true for all observers, as we
have assumed. This difference in the mass of the particles, each of
which has the mass ra0 when at rest, arises from the fact that the mass
of a particle is a function of its velocity and for the observer in question
the two particles are not moving with the same velocity.
Using the symbols mi and m* for the masses of the particles, we
may now write as a mathematical expression of the requirements of
the third law of motion
V2 V2
' c2 c2
Simplifying, we obtain by direct algebraic transformation
mi
l
!Z >/
1 / " M
_7y
I
\! +
uV 1
c2 /
c2
1
u7
I1" ( :
- r V
wF
66 Chapter Six.
which on the substitution of equations (48) gives us
(50)
ra2
This equation thus shows that the mass of a particle moving with
/ v?
the velocity u* is inversely proportional to -\/l — -- , and, denoting
the mass of the particle at rest by ra0, we may write as a general ex
pression for the mass of a moving particle
m = -=££= (51)
62. Mass the Same in All Directions. The method of derivation
that we have just used to obtain this expression for the mass of a
moving particle is based on the consideration of a so-called " trans
verse collision," and in fact the expression obtained has often been
spoken of as that for the transverse mass of a moving particle, while
a different expression, 7 — _2 X3/2 , has been used for the so-called
longitudinal mass of the particle. These expressions . and
m-,
I are, as a matter of fact, the values of the electric force
necessary to give a charged particle unit acceleration respectively
at right angles and in the same direction as its original velocity, and
hence such expressions would be proper for the mass of a moving par
ticle if we should define force as mass times acceleration. As already
* For simplicity of calculation we consider the case where the components of
velocity in the Y direction are small enough to be negligible in their effect on the
mass of the particles compared with the large components of velocity u\ and u2 in
the X direction. ., / f ,
Dynamics of a Particle. 67
stated, however, it has seemed preferable to retain, for force, Newton's
original definition which makes it equal to the rate of change of
momentum, and we shall presently see that this more suitable defini
tion is in perfect accord with the idea that the mass of a particle is
the same in all directions.
Aside from the unnecessary complexity which would be intro
duced, the particular reason making it unfortunate to have different
expressions for mass in different directions is that under such con
ditions it would be impossible to retain or interpret the principle of
the conservation of mass. And we shall now proceed to show that
by introducing the principle of the conservation of mass, the con
sideration of a " longitudinal collision " will also lead to exactly the
?fto .
same expression, — . , for the mass of a moving particle as we
c2
have already obtained from the consideration of a transverse collision.
63. Longitudinal Collision. Consider a system of coordinates and
two elastic particles moving in the X direction with the velocities
4- u and — u so that a " longitudinal " (i. e., head-on) collision will
occur. Let the particles be exactly alike, each of them having the
mass mQ when at rest. On collision the particles will evidently come
to rest, and then under the action of the elastic forces developed start
up and move back over their original paths with the respective veloci
ties — u and -f- u of the same magnitude as before.
Let us now consider how this collision would appear to an observer
who is moving past the original system of coordinates with the velocity
V in the X direction. Let Ui and w2 be the velocities of the particles
as they appear to this new observer before the collision has taken
place. Then, from our formula for the transformation of velocities
(14), it is evident that we shall have
+ -*=£' and V-'^f <<«>
1 -~$ 1+~?
Since these velocities u\ and u-2 are not of the same magnitude,
the two particles which have the same mass when at rest do not have
the same mass for this observer. Let us call the masses before col
lision mi and w2.
68 Chapter Six.
Now during the collision the velocities of the particles will all the
time be changing, but from the principle of the conservation of mass
the sum of the two masses must all the time be equal to m\ + m2.
When in the course of the collision the particles have come to relative
rest, they will be moving past our observer with the velocity — V,
and their momentum will be — (mi + m2)F. But, from the principle
of the equality of action and reaction, it is evident that this momen
tum must be equal to the original momentum before collision occurred.
This gives us the equation — (mi + m^V = m\u\ + m2w2. Substi
tuting our values (52) for u\ and uz we have
(-7)
and by direct algebraic transformation, as in the previous proof,
this can be shown to be identical with
Wl
m-2
/i u^
V1--?-
leading to the same expression that we obtained before for the mass
of a moving particle, viz.:
m
64. Collision of Any Type. We have derived this formula for the
mass of a moving particle first from the consideration of a transverse
and then of a longitudinal collision between particles which are elastic
and have the same mass when at rest. It seems to be desirable to
show, however, that the consideration of any type of collision between
particles of any mass leads to the same formula for the mass of a
moving particle.
For the mass m of a particle moving with the velocity u let us
write the equation m = m0/r(w2), where F( ) is the function whose
form we wish to determine. The mass is written as a function of
Dynamics of a Particle. 69
the square of the velocity, since from the homogeneity of space the
mass will be independent of the direction of the velocity, and the
mass is made proportional to the mass at rest, since a moving body
may evidently be thought of as divided into parts without change in
mass. It may be further remarked that the form of the function
F( ) must be such that its value approaches unity as the variable
approaches zero.
Let us now consider two particles having respectively the masses
mo and HQ when at rest, moving with the velocities u and w before
collision, and with the velocities U and W after a collision has taken
place.
From the principle of the conservation of mass we have
m0F(ux2 + uy2 + u?} + n<>F(wx* + wv2 + wz2)
= mQF(Ux* + U* + U*) + nQF(Wx* + W* + W*), (53)
and from the principle of the equality of action and reaction (i. e.,
Newton's third law of motion)
mQF(ux1 + u* + uf)uf + noF(w&2 + wy* + wflw*
= mQF(Ux2 + U* + Uf)U, + n,F(WI2 + Wf + W *)W *, (54)
mQF(ux2 + uf + u^u, + nQF(wx2 + wy2 + w*)wu
These velocities, uxj uy, uz, wx, wy, wz, Ux, etc., are measured, of
course, with respect to some definite system of " space-time " coordi
nates. An observer moving past this system of coordinates with the
velocity V in the X direction would find for the corresponding com
ponent velocities the values
/ y2 / v^
ux-V V1 ""? \ " c2 wx- V
X -. /,! pfp
~^V> u,VU» u,VU" W.T'
-jT ~*~ c2 *
as given by our transformation equations for velocity (14, 15, 16).
70
Chapter Six.
Since the law of the conservation of mass and Newton's third
law of motion must also hold for the measurements of the new ob
server; we may write the following new relations corresponding to
equations 53 to 56:
ux-V
72
uxV
*-:?
(53a)
m<)F{ux
c2
UX-V
1 -
u,v
c2
WX-V
(54a)
mQF{ux
yi
1 -
uxV
c2
nQF{w:
I V2
V1 - 72-
1 -
wxV
V2
c2
1 -
uxv
/- p
V1--?
(55a)
i
Dynamics of a Particle. 71
(56a)
It is evident that these equations (53a-56a) must be true no
matter what the velocity between the new observer and the original
system of coordinates, that is, true for all values of V. The velocities
ux, uy, ugf wx, etc., are, however, perfectly definite quantities, measured
with reference to a definite system of coordinates and entirely inde
pendent of V. If these equations are to be true for perfectly definite
values of ux, uy, uz, wx, etc., and for all values of V, it is evident that
the function F( ) must be of such a form that the equations are
identities in V. As a matter of fact, it is found by trial that V can
be cancelled from all the equations if we make F( ) of the form
— jp= ; and we see that the expected relation is a solution of the
/ "I
equations, although perhaps not necessarily a unique solution.
Before proceeding to use our formula for the mass of a moving
particle for • the further development of our system of mechanics,
we may call attention in passing to the fact that the experiments of
Kaufmann, Bucherer, and Hupka have in reality shown that the mass
of the electron increases with its velocity according to the formula
which we have just obtained. We shall consider the dynamics of the
electron more in detail in the chapter devoted to electromagnetic
theory. We wish to point out now, however, that in this derivation
we have made no reference to any electrical charge which might be
carried by the particle whose mass is to be determined. Hence we
may reject the possibility of explaining the Kaufmann experiment
by assuming that the charge of the electron decreases with its velocity,
since the increase in mass is alone sufficient to account for the results
of the measurement.
72 Chapter Six.
Transformation Equations for Mass.
65. Since the velocity of a particle depends on the particular
system of coordinates chosen for the measurement, it is evident that
the mass of the particle will also depend on our reference system of
coordinates. For the further development of our system of dynamics,
we shall find it desirable to obtain transformation equations for mass
similar to those already obtained for velocity, acceleration, etc.
We have
ra0
m =
where the velocity u is measured with respect to some definite system
of coordinates, S. Similarly with respect to a system of coordinates
S' which is moving relatively to S with the velocity V in the X direc
tion we shall have
We have already obtained, however, a transformation equation (17)
for the function of the velocity occurring in these equations and on
substitution we obtain the desired transformation equation
ra' = f 1 - /U~- } Km, (57)
where K has the customary significance
K =
Z!
" c2
By differentiation of (57) with respect to the time and simpli
fication, we obtain the following transformation equation for the
rate at which the mass of a particle is changing owing to change in
velocity
mV
Dynamics of a Particle.
73
Equation for the Force Acting on a Moving Particle.
06. We are now in a position to return to our development of the
dynamics of a particle. In the first place, the equation which we
have now obtained for the mass of a moving particle will permit
us to rewrite the original equation by which we defined force, in a
number of ways which will be useful for future reference.
We have our equation of definition (47)
d . du dm
F =-(„„) »„_ + _„,
which, on substitution of the expression for m, gives us
mo
m0 du d
^~dt + Jt
I u*
V1 -7J
u (59)
or, carrying out the indicated differentiation,
F =
u du
T -?
(60)
Transformation Equations for Force.
67. We are also in position to obtain transformation equations for
force. We have
d '
F = — (mu) = mu + mu
or
Fx = mux
Fv = muy
F2 = muz
mux,
mu-.
We have transformation equations, however, for all the quantities
on the right-hand side of these equations. For the velocities we
have equations (14), (15) and (16), for the accelerations (18), (19)
and (20), for mass, equation (57) and for rate of change of mass,
equation (58). Substituting above we obtain as our transformation
74
Chapter Six.
equations for force
Fx-mV
1 -
uxV
c2
= FX-
c2-uxV
c*-uxV
F
"
(61)
1 -
uxV
Fy,
(62)
1 -
uxV
F..
(63)
We may now consider a few applications of the principles governing
the dynamics of a particle.
The Relation between Force and Acceleration.
68. If we examine our equation (59) for the force acting on a
particle
du d
(59)
we see that the force is equal to the sum of two vectors, one of which
is in the direction of the acceleration — and the other in the direction
of the existing velocity u, so that in general force and the acceleration
it produces are not in the same di
rection. We shall find it interesting
to see, however, that if the force
which does produce acceleration in
a given direction be resolved per
pendicular and parallel to the accel
eration, the two components will
be connected by a definite relation.
Consider a particle (fig. 12) in
plane space moving with the ve
locity
0
FIG. 12. u = uxi + uy].
m <$
Dynamics of a Particle.
75
Let it be accelerated in the X direction by the action of the com
ponent forces Fx and Fy.
From our general equation (59) for the force acting on a particle
we have for these component forces
Ff =
I u*
V1-^
dui d
~dt ^~ di
mo
Fv ~~ '- ^2 dt + dt
Uy.
(64)
(65)
Introducing the condition that all the acceleration is to be in the Y
direction, which makes -~ = 0, and further noting that u? = ux2 + uv2,
by the division of equation (64) by (65), we obtain
F,
Fv
Fx =
uxu
C2 _ u^1
U£Uy
r «•
U.
(66)
Hence, in order to accelerate a particle in a given direction, we may
apply any force Fy in the desired direction, but must at the same time
apply at right angles another force Fx whose magnitude is given by
equation (66).
Although at first sight this state of affairs might seem rather
unexpected, a simple qualitative consideration will show the necessity
of a component of force perpendicular to the desired acceleration.
Refer again to figure 12; since the particle is being accelerated in the Y
direction, its total velocity and hence its mass are increasing. This
increasing mass is accompanied by increasing momentum in the X
direction even when the velocity in that direction remains constant.
The component force Fx is necessary for the production of this increase
in X-momentum.
In a later paragraph we shall show an application of equation (66)
in electrical theory.
76 Chapter Six.
Transverse and Longitudinal Acceleration.
69. An examination of equation (66) shows that there are two
special cases in which the component force Fx disappears and the
force and acceleration are in the same direction. Fx will disappear
when either ux or uy is equal to zero, so that force and acceleration
will be in the same direction when the force acts exactly at right
angles to the line of motion of the particle, or in the direction of the
motion (or of course also when ux and uy are both equal to zero and
the particle is at rest). It is instructive to obtain simplified ex
pressions for force for these two cases of transverse and longitudinal
acceleration.
Let us again examine our equation (60) for the force acting on a
particle
niQ du mo u du
(-!)'
For the case of a transverse acceleration there is no component of
force in the direction of the velocity u and the second term of the
equation is equal to zero, giving us
F -
For the case of longitudinal acceleration, the velocity u and the
acceleration -r are in the same direction, so that we may rewrite the
second term of (60), giving us
ra0 du mo u2 du
~==~
° u*
V ~ c*
and on simplification this becomes
m»
«*
V" <?)
Dynamics of a Particle. 77
An examination of this expression shows the reason why -j— ° 3/2
is sometimes spoken of as the expression for the longitudinal mass of a
particle.
The Force Exerted by a Moving Charge.
70. In a later chapter we shall present a consistent development
of the fundamentals of electromagnetic theory based on the Einstein
transformation equations for space and time and the four field equa
tions. At this point, however, it may not be amiss to point out that
the principles of mechanics themselves may sometimes be employed
to obtain a simple and direct solution of electrical problems.
Suppose, for example, we wish to calculate the force with which a
point charge in uniform motion acts on any other point charge. We
can solve this problem by considering a system of coordinates which
move with the same velocity as the charge itself. An observer
making use of the new system of coordinates could evidently calcu
late the force exerted by the charge in question by Coulomb's familiar
inverse square law for static charges, and the magnitude of the force
as measured in the original system of coordinates can then be deter
mined from our transformation equations for force. Let us proceed
to the specific solution of the problem.
Consider a system of coordinates S, and a charge e in uniform
motion along the X axis with the velocity V. We desire to know
the force acting at the time t on any other charge ei which has any
desired coordinates x, y, and z and any desired velocity ux, uy and ux.
Assume a system of coordinates, S', moving with the same velocity
as the charge e which is taken coincident with the origin. To an
observer moving with the system S', the charge e appears to be
always at rest and surrounded by a pure electrostatic field. Hence
in system S' the force with which e acts on e\ will be, in accordance
with Coulomb's law*
ee\r'
F' =
r'3
* It should be noted that in its original form Coulomb's law merely stated
that the force between two stationary charges was proportional to the product of
the charges and inversely to the distance between them. In the present derivation
78 Chapter Six.
or
)"• (69)
2'
where x', y', and z' are the coordinates of the charge e\ at the time tf*
For simplicity let us consider the force at the time t' = 0; then from
transformation equations (9), (10), (11), (12) we shall have
x' = K-^-X, y' = y, zf = z.
Substituting in (69), (70), (71) and also using our transformation
equations for force (61), (62), (63), we obtain the following equations
for the force acting on ei, as it appears to an observer in system S:
'- - (rt??+*)'*(X+ **<*"* t>*)' (?2)
_utV
c2
:z2 + ?/2 + z2)3/2'
_^F
c2
These equations give the force acting on e\ at the time t. From
V
transformation equation (12) we have t = — x, since t' = 0. At this
C
time the charge e, which is moving with the uniform velocity V along
we have extended this law to apply to the instantaneous force exerted by a stationary
charge upon any other charge.
The fact that a charge of electricity appears the same to observers in all systems
is obviously also necessary for the setting up of equations (69), (70), (71). That
such is the case, however, is an evident consequence of the atomic nature of elec
tricity. The charge e would appear of the same magnitude to observers both in
system S and system S't since they would both count the same number of electrons
on the charge. (See Section 157.)
Dynamics of a Particle. 79
the X axis, will evidently have the position
72
x< = ~# x> y* = °> z« = °-
For convenience we may now refer our results to a system of
coordinates whose origin coincides with the position of the charge e
at the instant under consideration. If X, Y and Z are the coordi
nates of ei with respect to this new system, we shall evidently have
the relations
72
X = x - ~x = /c-% Y = y, Z = z,
UX = UX, Uy = Uy, Ug = Ut.
Substituting into (72), (73), (74) we obtain
.
where for simplicity we have placed
5 =
These are the same equations which would be obtained by sub
stituting the well-known formula, for the strength of the electric and
magnetic field around a moving point charge into the fifth funda
mental equation of the Maxwell-Lorentz theory, f=p(e +-[uXh]*J.
They are really obtained in this way more easily, however, and are
seen to come directly from Coulomb's law.
The Field around a Moving Charge. Evidently we may also use
these considerations to obtain an expression for the electric field
produced by a moving charge e, if we consider the particular case
that the charge e\ is stationary (i. e., Ux = Uv = Ut = 0) and equal
80 Chapter Six.
to unity. Making these substitutions in (75), (76), (77) we obtain
the well-known expression for the electrical field in the neighborhood
of a moving point charge
where
r = Xi + Fj + Zk.
71. Application to a Specific Problem. Equations (75), (76), (77)
can also be applied in the solution of a
rather interesting specific problem.
Consider a charge e constrained to
move in the X direction with the ve
locity V and at the instant under con-
y sideration let it coincide with the origin
of a system of stationary coordinates
=v> YeX (fig. 13). Suppose now a second
charge e\, situated at the point X = 0,
Y = Y and moving in the X direction
with the same velocity V as the charge e,
and also having a component velocity
in the F direction Uy. Let us predict
the nature of its motion under the influ-
^ X
•p 13 ence of the charge e, it being otherwise
unconstrained.
From the simple qualitative considerations placed at our disposal
by the theory of relativity, it seems evident that the charge ei ought
merely to increase its component of velocity in the Y direction and
retain unchanged its component in the X direction, since from the
point of view of an observer moving along with e the phenomenon is
merely one of ordinary electrostatic repulsion.
Let us see whether our equations for the force exerted by a moving
charge actually lead to this result. By making the obvious sub
stitutions in equations (75) and (76) we obtain for the component
forces on e\
(79)
(80)
Dynamics of a Particle. 81
Now under the action of the component force Fx we might at
first sight expect the charge e\ to obtain an acceleration in the X
direction, in contradiction to the simple qualitative prediction that
we have just made on the basis of the theory of relativity. We
remember, however, that equation (66) prescribes a definite ratio
between the component forces Fx and Fy if the acceleration is to be
in the Y direction, and dividing (79) by (80) we actually "obtain the
necessary relation
F,_ VUV
Fy~ c2 -V2'
Other applications of the new principles of dynamics to electrical,
magnetic and gravitational problems will be evident to the reader.
Work.
72. Before proceeding with the further development of our theory
of dynamics we shall find it desirable to define the quantities work,
kinetic, and potential energy.
We have already obtained an expression for the force acting on a
particle and shall define the work done on the particle as the integral
of the force times the distance through which the particle is dis
placed. Thus
(81)
where r is the radius vector determining the position of the particle.
Kinetic Energy.
73. When a particle is brought from a state of rest to the velocity
u by the action of an unbalanced force F, we shall define its kinetic
energy as numerically equal to the work done in producing the velocity.
Thus
K = W =
Since, however, the kinetic energy of a particle turns out to be
entirely independent of the particular choice of forces used in pro
ducing the final velocity, it is much more useful to have an expression
for kinetic energy in terms of the mass and velocity of the particle.
We have
82 Chapter Six.
Substituting the value of F given by the equation of definition (47)
we obtain
du
f —
J dt U
= I mu-du -f- I u-udra
= I mudu -f- I u2dm.
Introducing the expression (51) for the mass of a moving particle
m,r*
— , we obtain
-du + I ^-7— ^ 0x3/2^
and on integrating and evaluating the constant of integration by
placing the kinetic energy equal to zero when the velocity is zero,
we easily obtain the desired expression for the kinetic energy of a
particle :
K = m0c2
1
- 1
(82)
= c2(m - w0). (83)
It should be noticed, as was stated above, that the kinetic energy
of a particle does depend merely on its mass and final velocity and is
entirely independent of the particular choice of forces which happened
to be used in producing the state of motion.
It will also be noticed, on expansion into a series, that our ex
pression (82) for the kinetic energy of a particle approaches at low
velocities the form familiar in the older Newtonian mechanics,
K =
Potential Energy.
74. When a moving particle is brought to rest by the action of a
Dynamics of a Particle. 83
conservative* force we say that its kinetic energy has been trans
formed into potential energy. The increase in the potential energy
of the particle is equal to the kinetic energy which has been destroyed
and hence equal to the work done by the particle against the force,
giving us the equation
= - W = --F-dr. (84)
The Relation between Mass and Energy.
75. We may now consider a very important relation between the
mass and energy of a particle, which was first pointed out in our
chapter on " Some Elementary Deductions."
When an isolated particle is set in motion, both its mass and
energy are increased. For the increase in mass we may write
Am = m — WQ,
and for the increase in energy we have the expression for kinetic energy
given in equation (83), giving us
A# = cz(m - W0),
or, combining with the previous equation,
AE = c2Aw. (85)
Thus the increase in the kinetic energy of a particle always bears
the same definite ratio (the square of the velocity of light) to its
increase in mass. Furthermore, when a moving particle is brought
to rest and thus loses both its kinetic energy and its extra (" kinetic ")
mass, there seems to be every reason for believing that this mass
and energy which are associated together when the particle is in
motion and leave the particle when it is brought to rest will still
remain always associated together. For example, if the particle is
brought to rest by collision with another particle, it is an evident
* A conservative force is one such that any work done by displacing a system
against it would be completely regained if the motion of the system should be re
versed.
Since we believe that the forces which act on the ultimate particles and con
stituents of matter are in reality all of them conservative, we shall accept the general
principle of the conservation of energy just as in Newtonian mechanics. (For a
logical deduction of the principle of the conservation of energy in a system of par
ticles, see the next chapter, section 89.)
84 Chapter Six.
consequence of our considerations that the energy and the mass
corresponding to it do remain associated together since they are both
passed on to the new particle. On the other hand, if the particle
is brought to rest by the action of a conservative force, say for example
that exerted by an elastic spring, the kinetic energy which leaves the
particle will be transformed into the potential energy of the stretched
spring, and since the mass which has undoubtedly left the particle
must still be in existence, we shall believe that this mass is now asso
ciated with the potential energy of the stretched spring.
76. Such considerations have led us to believe that matter and
energy may be best regarded as different names for the same funda
mental entity: matter, the name which has been applied when we
have been interested in the property of mass or inertia possessed
by the entity, and energy, the name applied when we have been
interested in the part taken by the entity in the production of motion
and other changes in the physical universe. We shall find these
ideas as to the relations between matter, energy and mass very fruit
ful in the simplification of physical reasoning, not only because it
identifies the two laws of the conservation of mass and the conser
vation of energy, but also for its frequent application in the solution
of specific problems.
77. We must call attention to the great difference in size between
the two units, the gram and the erg, both of which are used for the
measurement of the one fundamental entity, call it matter or energy
as we please. Equation (85) gives us the relation
E = czm, (86)
where E is expressed in ergs and m in grams; hence, taking the velocity
of light as 3 X 1010 centimeters per second, we shall have
1 gram = 9 X 1020 ergs. (87)
The enormous number of ergs necessary for increasing the mass of
a system to the amount of a single gram makes it evident that experi
mental proofs of the relation between mass and energy will be hard to
find, and outside of the experimental work on electrons at high veloci
ties, already mentioned in Section 64 and the well-known relations
Dynamics of a Particle. 85
between the energy and momentum of a beam of light, such evidence
has not yet been forthcoming.
As to the possibility of obtaining further direct experimental
evidence of the relation between mass and energy, we certainly can
not look towards thermal experiments with any degree of confidence,
since even on cooling a body down to the absolute zero of temperature
it loses but an inappreciable fraction of its mass at ordinary tempera
tures.* In the case of some radioactive processes, however, we may
find a transfer of energy large enough to bring about measurable
differences in mass. And making use of this point of view we might
account for the lack of exact relations between the atomic weights of
the successive products of radioactive decomposition.!
78. Application to a Specific Problem. We may show an inter
esting application of our ideas as to the relation between mass and
energy, in the treatment of a specific problem. Consider, just as in
Section 63, two elastic particles both of which have the mass m0 at
rest, one moving in the X direction with the velocity -f u and the
other with the velocity — uy in such a way that a head-on collision
between the particles will occur and they will rebound over their
original paths with the respective velocities — u and + u of the
same magnitude as before.
Let us now consider how this collision would appear to an observer
who is moving past the original system of coordinates with the velocity
V in the X direction. To this new observer the particles will be
moving before the collision with the respective velocities
u-V -u -V
Ul = ~— and u2 = —r,
as given by equation (14) for the transformation of velocities. Fur
thermore; when in the course of the collision the particles have come
to relative rest they will obviously be moving past our observer with
the velocity — V.
*It should be noticed that our theory points to the presence of enormous-.
stores of interatomic energy which are still left in substances cooled to the absolute
zero.
t See, for example, Comstock, Philosophical Magazine, vol. 15, p. 1 (1908).
86
Chapter Six.
Let us see what the masses of the particles will be both before and
during the collision. Before the collision, the mass of the first particle
will be
u- V
uV
c2
c2
and the mass of the second particle will be
ra0
F
Adding these two expressions, we obtain for the sum of the masses of
the two particles before collision,
Now during the collision, when the two particles have come to
relative rest, they will evidently both be moving past our observer
with the velocity — V and hence the sum of their masses at the
instant of relative rest would appear to be
2m0
a quantity which is smaller than that which we have just found for
the sum of the two masses before the collision occurred. This apparent
discrepancy between the total mass of the system before and during
the collision, is removed, however, if we realize that when the par-
Dynamics of a Particle. 87
tides have come to relative rest an amount of potential energy of
elastic deformation has been produced, which is just sufficient to re
store them to their original velocities, and the mass corresponding to
this potential energy will evidently be just sufficient to make the
total mass of the system the same as before collisipn.
In the following chapter on the dynamics of a system of particles
we shall make further use of our ideas as to the mass corresponding
to potential energy.
CHAPTER VII.
THE DYNAMICS OF A SYSTEM OF PARTICLES.
79. In the preceding chapter we discussed the laws of motion
of a particle. With the help of those laws we shall now derive some
useful general dynamical principles which describe the motions of a
system of particles, and in the following chapter shall consider an
application of some of these principles to the kinetic theory of gases.
The general dynamical principles which we shall present in this
chapter will be similar in form to principles which are already familiar
in the classical Newtonian mechanics. Thus we shall deduce princi
ples corresponding to the principles of the conservation of momentum,
of the conservation of moment of momentum, of least action and of
vis viva, as well as the equations of motion in the Lagrangian and
Hamiltonian (canonical) forms. For cases where the velocities of all
the particles involved are slow compared with that of light, we shall
find, moreover, that our principles become identical in content, as
well as in form, with the corresponding principles of the classical
mechanics. Where high velocities are involved, however, our new
principles will differ from those of Newtonian mechanics. In par
ticular we shall find among other differences that in the case of high
velocities it will no longer be possible to define the Lagrangian function
as the difference between the kinetic and potential energies of the
system, nor to define the generalized momenta used in the Hamil
tonian equations as the partial differential of the kinetic energy with
respect to the generalized velocity.
On the Nature of a System of Particles.
80. Our purpose in this chapter is to treat dynamical systems
consisting of a finite number of particles, each obeying the equation
of motion which we have already written in the forms,
-I
d_
dt
du dm
= m~di + ~dt^
m0 du d
(47)
u. (59)
88
Dynamics of a System of Particles. 89
It is not to be supposed, however, that the total mass of such a
system can be taken as located solely in these particles. It is evident
rather, since potential energy has mass, that there will in general be
mass distributed more or less continuously throughout the space in
the neighborhood of the particles. Indeed we have shown at the
end of the preceding chapter (Section 78) that unless we take account
of the mass corresponding to potential energy we can not maintain
the principle of the conservation of mass, and we should also find it
impossible to retain the principle of the conservation of momentum
unless we included the momentum corresponding to potential energy.
For a continuous distribution of mass we may write for the force
acting at any point on the material in a small volume, 5V,
*.
= ~dt
where f is the force per unit volume and g is the density of momentum.
This equation is of course merely an equation of definition for the
intensity of force at a point. We shall assume, however, that New
ton's third law, that is, the principle of the equality of action and
reaction, holds for forces of this type as well as for those acting on
particles. In later chapters we shall investigate the way in which g
depends on velocity, state of strain, etc., but for the purposes of this
chapter we shall not need any further information as to the nature
of the distributed momentum.
Let us proceed to the solution of our specific problems.
The Conservation of Momentum.
81. We may first show from Newton's third law of motion that
the momentum of an isolated system of particles remains constant.
Considering a system of particles of masses mi, mz, ms, etc., we
may write in accordance with equation 47,
f[
Fi+1,-1
etc.,
90 Chapter Seven.
where FI, F2, etc., are the external forces impressed on the individual
particles from outside the system and Ii, I2; etc., are the internal
forces arising from mutual reactions within the interior of the system.
Considering the distributed mass in the system, we may also write,
in accordance with (47 A) the further equation
(f 4- i)5V = jt(g8V), (90)
where f and i are respectively the external and internal forces acting
per unit volume of the distributed mass. Integrating throughout the
whole volume of the system V we have
= ^f' (91)
where G is the total distributed momentum in the system. Adding
this to our previous equations (89) for the forces acting on the indi
vidual particles, we have
VT7 L VT _L f^T7 I f 5x717 ^V I_ ^
2/Jc i -p 2^1i "J~ I 1$ r "p I Id V — — Z/miUi "j~ .
J J at at
But from Newton's third law of motion (i. e., the principle of the
equality of action and reaction) it is evident that the sum of the
internal forces, Sli -f J idV, which arise from mutual reactions within
the system must be equal to zero, which leads to the desired equation
of momentum
/j
fdv =- 37 (SwiUi -j- G). (92)
at
In words this equation states that at any given instant the vector
sum of the external forces acting on the system is equal to the rate
at which the total momentum of the system is changing.
For the particular case of an isolated system there are no external
forces and our equation becomes a statement of the principle of the
conservation of momentum.
The Equation of Angular Momentum.
82. We may next obtain an equation for the moment of momentum
of a system about a point.
Dynamics of a System of Particles. 91
Consider a particle of mass mi and velocity Ui. Let TI be the
radius vector from any given point of reference to the particle. Then
for the moment of momentum of the particle about the point we may
write
M! = TI X ?HiUi,
and summing up for all the particles of the system we may write
2Mi = Zfo X WiUi). (93)
Similarly, for the moment of momentum of the distributed mass we
may write
Mdis, = / (r X g)dF, (94)
where r is the radius vector from our chosen point of reference to a
point in space where the density of momentum is g and the inte
gration is to be taken throughout the whole volume, F, of the system.
Adding these two equations (93) and (94), we obtain for the total
amount of momentum of the system about our chosen point
M = ZOTiXrKiuO + / (r X g)dV;
and differentiating with respect to the time we have, for the rate of
change of the moment of momentum,
dM
dt
or, making the substitutions given by equations (89) and (90), and
dTi
writing — = Ui, etc. we have
~ = Z(rL X FO + Z(f! X Ii) + z(Ui X miUi)
at
+ / (r X f)dV + f (r X i)dV + / (u X g)dV.
To simplify this equation we may note that the third term is equal to
zero because it contains the outer product of a vector by itself. Fur
thermore, if we accept the principle of the equality of action and
92 Chapter Seven.
reaction, together with the further requirement that forces are not
only equal and opposite but that their points of application be in the
same straight line, we may put the moment of all the internal forces
equal to zero and thus eliminate the second and fifth terms. We
obtain as the equation of angular momentum
-—
l X FO + J (r X f)dV 4- J (u X g)dV. (95)
We may call attention to the inclusion in this equation of the
interesting term J (u X g)dV. If density of momentum and velocity
should always be in the same direction this term would vanish, since
the outer product of a vector by itself is equal to zero. In our con
sideration of the " Dynamics of Elastic Bodies/' however, we shall
find bodies with a component of momentum at right angles to their
direction of motion and hence must include this term in a general
treatment. For a completely isolated system it can be shown, how
ever, that this term vanishes along with the external forces and we
then have the principle of the conservation of moment of momentum.
The Function T.
83. We may now proceed to the definition of a function which
will be needed in our treatment of the principle of least action.
One of the most valuable properties of the Newtonian expression,
%mQu2, for kinetic energy was the fact that its derivative with respect
to velocity is evidently the Newtonian expression for momentum, mQu.
It is not true, however, that the derivative of our new expression
for kinetic energy (see Section 73), m0c
1
- 1
, with respect
to velocity is equal to momentum, and for that reason in our non-
Newtonian mechanics we shall find it desirable to define a new func
tion, !F, by the equation,
/ / 1^2 \
(96)
For slow velocities (i. e., small values of u) this reduces to the
Newtonian expression for kinetic energy and at all velocities we have
Dynamics of a System of Particles. 93
the relation,
dT
' = = mu, (97)
du du \ c2 / 1/2
c
/, ^
V1-?
showing that the differential of T with respect to velocity is momentum.
For a system of particles we shall define T as the summation of
the values for the individual particles:
(98)
The Modified Lagrangian Function.
84. In the older mechanics the Lagrangian function for a system
of particles was defined as the difference between the kinetic and
potential energies of the system. The value of the definition rested,
however, on the fact that the differential of the kinetic energy with
respect to velocity was equal to momentum, so that we shall now
find it advisable to define the Lagrangian function with the help of
our new function T in accordance with the equation
L = T - U. (99)
The Principle of Least Action.
85. We are now in a position to derive a principle corresponding
to that of least action in the older mechanics. Consider the path
by which our dynamical system actually moves from state (1) to
state (2). The motion of any 'particle in the system of mass m will
be governed by the equation
F=|(mu). (100)
Let us now compare the actual path by which the system moves
from state (1) to state (2) with a slightly displaced path in which the
laws of motion are not obeyed, and let the displacement of the particle
at the instant in question be 5r.
Let us take the inner product of both sides of equation (100) with
94 Chapter Seven.
5r; we have
F-6r = -r(mu)'5r
d . d8r
= _(mu.5r)._mu.__
d
= — (rau • 5r) — rau • 5u
(rau-5u + J?-dr)dt = d(wu-5r).
Summing up for all the particles of the system and integrating
between the limits ti and t2) we have
(Srau-Su + SF-5r)<i£ = [Srau-5r]'j.
Since ti and t2 are the times when the actual and displaced motions
coincide, we have at these times 5r = 0; furthermore we also have
u • 5u = udu, so that we may write
(2mu5u + f'8r)dt = 0.
With the help of equation (97), however, we see that 2mudu = dT,
giving us
f
•J ti
= 0. (101)
If the forces F are conservative, we may write F • 5r = — dU, where
dU is the difference between the potential energies of the displaced
and the actual configurations. This gives us
5 f 2 (T - U)dt = 0
J«,
or
r
«/«,
Ldt = 0, (102)
which is the modified principle of least action. The principle evi
dently requires that for the actual path by which the system goes
Dynamics of a System of Particles. 95
/•H
from state (1) to state (2), the quantity I Ldt shall be a minimum (or
Jtl
maximum).
Lagrange's Equations.
86. We may now derive the Lagrangian equations of motion from
the above principle of least action. Let us suppose that the position
of each particle of the system under consideration is completely deter
mined by n independent generalized coordinates 91, <£2, fa • • • </>n and
hence that L is some function of 91, 92, 9s • • • 9n, <£i, <fo, 4>z • - • 9n,
where for simplicity we have put <£i = —rr , fa = —rr , etc.
From equation (102) we have
/»<j /Vo / n *\T n r\T \
(5L)dt= (Evr-501 + ZvT-^i )dt = 0. (103)
»/<! J<! \ 1 "91 1 "91 /
But
591 = ^
which gives us
tn- ta- dL d
•£ a
or, since at times £1 and ^2, ^91 is zero, the first term in this expression
disappears and on substituting in equation (103) we obtain
Since, however, the limits Zi and tz are entirely at our disposal we must
have at every instant
Finally, moreover, since the 4>'s are independent parameters, we can
assign perfectly arbitrary values to $91, 692, etc., and hence must have
96 Chapter Seven.
the series of equations
<L( <**L\ _ dL
dt \ a<£i / d^i " 0>
dt\d<j>
etc.
These correspond to Lagrange's equations in the older mechanics,
differing only in the definition of L.
Equations of Motion in the Hamiltonian Form.
87. We shall also find it desirable to obtain equations of motion
in the Hamiltonian or canonical form.
Let us define the generalized momentum \f/i corresponding to the
coordinate </>i by the equation,
It should be noted that the generalized momentum is not as in
ordinary mechanics the derivative of the kinetic energy with respect
to the generalized velocity but approaches that value at low velocities.
Consider now a function Tr defined by the equation
T' - Vi0i + iW2 + • • • - T. (106)
Differentiating we have
dT' = i<i
dT _ dT .
^**~5£*i"
dT dT
-aT!^1"^^2"
and this, by the introduction of (105), becomes
dTf =
Dynamics of a System of Particles. 97
Examining this equation we have
'£--£• (108)
O(p\ C/<pi
¥r = 4i. (109)
In Lagrange's equations we have
But since U is independent of ^i we may write
d(T - U) dT
a ; ~ = VT = rlj
091 001
and furthermore by (108),
Substituting these two expressions in Lagrange's equations we obtain
d^l= _ d(T + U)
dt ''
or, writing T' + U = E, we have
--— , (110)
dt d(f>i
and since U is independent of fa we may rewrite equation (109) in
the form
^ = 4^. an)
The set of equations corresponding to (110) and (111) for all the
coordinates </>i, </>2, fa, • ' • <t>n and the momenta t£i, fa, fa, • - fa are
the desired equations of motion in the canonical form.
88. Value of the Function T'. We have given the symbol E to
the quantity T' + U, since T' actually turns out to be identical with
8
98 Chapter Seven.
the expression by which we defined kinetic energy, thus making
E = T' + U the sum of the kinetic and potential energies of the
system.
To show that T' is equal to K, the kinetic energy, we have by the
equation of definition (106)
r = <Wi + M* + . - . - T,
. dT . dT
But T by definition, equation (98), is
T =
which gives us
— ( _ul\~i'2 ^L
d(j>i \ c2 / d(j>i
du
and substituting we obtain
du . . . du
—- — j_ . . . j£
(112)
We can show, however, that the term in parenthesis is equal to u.
If the coordinates x, y, z determine the position of the particle in
question, we have,
and differentiating with respect to the <£'s, we obtain,
d &( dx dx dx dx dx
etc.,
Dynamics of a System of Particles. 99
Similarly
^L - ^M. dy_ _ dy^
dz dz dz dz
Let us write now
u = Az2 + y2 + z2,
du^ __ 1 / . dx . dy . dz \
dfa" Vx2 + if- + &\dfa ^i 2d*i/>
5x dy
or making the substitutions for —r , -r-r- , etc., given above, we have,
= -., -.j -
a^i ^V^^i ^i 2
Substituting now in (112) we shall obtain,
or, introducing the value of T given by equation (98), we have
— m0),
which is the expression (83) for kinetic energy.
Hence we see that the Hamiltonian function E = T -f U is the
sum of the kinetic and potential energies of the system as in Newtonian
mechanics.
The Principle of the Conservation of Energy.
89. We may now make use of our equations of motion in the
canonical form to show that the total energy of a system of interacting
100 Chapter Seven.
particles remains constant. If such were not the case it is obvious
that our definitions of potential and kinetic energy would not be
very useful.
Since E = Tr + U is a function of <£i, $2, fo, • • - ti, ^2, ^3, • • •, we
may write
dE d# . dE
dE . dE .
+ T-r<£l+TT-'/'2 + ••••
d\l/\ d\l/z
r)JF f)W
Substituting the values of - p , - p , etc., given by the canonical
0<pi 0^1
equations of motion (110) and (111), we have
dE
— = — 1//10! — 1//202 -
+ Wz + ' ' '
which gives us the desired proof that just as in the older Newtonian
mechanics the total energy of an isolated system of particles is a
conservative quantity.
On the Location of Energy in Space.
90. This proof of the conservation of energy in a system of inter
acting particles justifies us in the belief that the concept of energy
will not fail to retain in the newer mechanics the position of great
importance which it gradually acquired in the older systems of physical
theory. Indeed, our newer considerations have augmented the
important role of energy by adding, to its properties the attribute of
mass or inertia, and thus leading to the further belief that matter
and energy are in reality different names for the same fundamental
entity.
The importance of this entity, energy, makes it very interesting
to consider the possibility of ascribing a definite location in space to
any given quantity of energy. In the older mechanics we had a
hazy notion that the kinetic energy of a moving body was probably
located in some way in the moving body itself, and possibly a vague
Dynamics of a System of Particles. 101
idea that the potential energy of a raised weight might be located in
the space between the weight and the earth. Our discovery of the
relation between mass and energy has made it possible, however, to
give a much more definite, although not a complete, answer to inquiries
of this kind.
In our discussions of the dynamics of a particle (Chapter VI,
Section 61) we saw that an acceptance of Newton's principle of the
equality of action and reaction forced us to ascribe an increased mass
to a moving particle over that which it has at rest. This increase in
the mass of the moving particle is necessarily located either in the
particle itself or distributed in the surrounding space in such a way
that its center of mass always coincides with the position of the
particle, and since the kinetic energy of the particle is the energy
corresponding to this increased mass we may say that the kinetic energy
of a moving particle is so distributed in space that its center of mass
always coincides with the position of the particle.
If now we consider the transformation of kinetic energy into
potential energy we can also draw somewhat definite conclusions as to
the location of potential energy. By the principle of the conserva
tion of mass we shall be able to say that the mass of any potential
energy formed is just equal to the " kinetic " mass which has dis
appeared, and by the principle of the conservation of momentum we
can say that the velocity of this potential energy is just that necessary
to keep the total momentum of the system constant. Such con
siderations will often permit us to reach a good idea as to the location
of potential energy.
Consider, for example, a pair of similar attracting particles which
are moving apart from each other with the velocities + u and - u
and are gradually coming to rest under the action of their mutual
attraction, their kinetic energy thus being gradually changed into
potential energy. Since the total momentum of the system must
always remain zero, wre may think of the potential energy which is
formed as left stationary in the space between the two particles.
CHAPTER VIII.
THE CHAOTIC MOTION OF A SYSTEM OF PARTICLES.
The discussions of the previous chapter have placed at our disposal
generalized equations of motion for a system of particles similar in
form to those familiar in the classical mechanics, and differing only
in the definition of the Lagrangian function. With the help of these
equations it is possible to carry out investigations parallel to those
already developed in the classical mechanics, and in the present
chapter we shall discuss the chaotic motion of a system of particles.
This problem has received much attention in the classical mechanics
because of the close relations between the theoretical behavior of
such an ideal system of particles and the actual behavior of a mona-
tomic gas. We shall find no more difficulty in handling the problem
than was experienced in the older mechanics, and our results will of
course reduce to those of Newtonian mechanics in the case of slow
velocities. Thus we shall find a distribution law for momenta which
reduces to that of Maxwell for slow velocities, and an equipartition
law for the average value of a function which at low velocities becomes
identical with the kinetic energy of the particles.
91. The Equations of Motion. It has been shown that the Hamil-
tonian equations of motion
dE_ _
~ dt
dE _ dfr (113)
dS" dt :=01'
etc.,
will hold in relativity mechanics provided we define the generalized
momenta \j/i, fa, etc., not as the differential of the kinetic energy
with respect to the generalized velocities <£i, $2, etc., but as the dif
ferential with respect to 0i; <£2, etc., of a function
T =
102
Chaotic Motion of a System of Particles. 103
where 7n0 is the mass of a particle having the velocity u and the sum
mation 2 extends over all the particles of the system.
92. Representation in Generalized Space. Consider now a system
defined by the n generalized coordinates <£i, <£2> <fo, • • • , <t>n, and the
corresponding momenta \j/i, i/% fa, ' ' ' > tn- Employing the methods
so successfully used by Jeans,* we may think of the state of the
system at any instant as determined by the position of a point plotted
in a 2n-dimensional space. Suppose now we had a large number of
systems of the same structure but differing in state, then for each
system we should have at any instant a corresponding point in our
2n-dimensional space, and as the systems changed their state, in the
manner required by the laws of motion, the points would describe
stream lines in this space.
93. Liouville's Theorem. Suppose now that the points were
originally distributed in the generalized space with the uniform
density p. Then it can be shown by familiar methods that, just as
in the classical mechanics, the density of distribution remains uniform.
Take, for example, some particular cubical element of our gener
alized space dcfridfadfa - - • d\f/id\f/zdfa - - •. The density of dis
tribution will evidently remain uniform if the number of points
entering any such cube per second is equal to the number leaving.
Consider now the two parallel bounding surfaces of the cube which
are perpendicular to the $1 axis, one cutting the axis at the point 0i
and the other at the point 4>i + dfa. The area of each of these
surfaces is d^^dfa - • • d^id^zdfa • • - , and hence, if <j>i is the component
of velocity which the points have parallel to the 4>i axis, and T— is
001
the rate at which this component is changing as we move along the
axis, we may obviously write the following expression for the differ
ence between the number of points leaving and entering per second
through these two parallel surfaces
Finally, considering all the pairs of parallel bounding surfaces, we
* Jeans, The Dynamical Theory of Gases, Cambridge, 1916.
104 -Chapter Eight.
find for the total decrease per second in the contents of the element
I I I _|_ ,
d0i 302 30s
But the motions of the points are necessarily governed by the Hamil-
tonian equations (113) given above, and these obviously lead to the
relations
etc.
So that our expression for the change per second in the number of
points in the cube becomes equal to zero, the necessary requirement
for preserving uniform density.
This maintenance of a uniform distribution means that there is
no tendency for the points to crowd into any particular region of the
generalized space, and hence if we start some one system going and
plot its state in our generalized space, we may assume that, after an
indefinite lapse of time, the point is equally likely to be in any one of
the little elements dV. In other words, the different states of a system,
which we can specify by stating the region d<f>id4>zd(j>3 • • - d\l/idfadfa - - -
in which the values of the coordinates and momenta of the system fall,
are all equally likely to occur.*
94. A System of Particles. Consider now a system containing Na
particles which have the mass ma when at rest, Nb particles which
have the mass m^ Nc particles which have the mass mcj etc. If at
any given instant we specify the particular differential element
dx dy dz d\l/x d\j/y d\j/z which contains the coordinates x, y, z, and the
corresponding momenta \f/X) \f/v, \}/z for each particle, we shall thereby
completely determine what Planckf has well called the microscopic
state of the system, and by the previous paragraph any microscopic
* The criterion here used for determining whether or not the states are equally
liable to occur is obviously a necessary requirement, although it is not so evident
that it is a sufficient requirement for equal probability.
t Planck, Warmestrahlung, Leipzig, 1913.
Chaotic Motion of a System of Particles. 105
state of the system in which we thus specify the six-dimensional
position of each particle is just as likely to occur as any other micro
scopic state.
It must be noticed, however, that many of the possible micro
scopic states which are determined by specifying the six-dimensional
position of each individual particle are in reality completely identical,
since if all the particles having a given mass ma are alike among them
selves, it makes no difference which particular one of the various
available identical particles we pick out to put into a specified range
dx dy dz d^x d^y d\I/z.
For this reason we shall usually be interested in specifying the
statistical state* of the system, for which purpose we shall merely
state the number of particles of a given kind which have coordinates
falling in a given range dx dy dz d\l/x d\j/v d\f/z. We see that corre
sponding to any given statistical state there will be in general a
large number of microscopic states.
95. Probability of a Given Statistical State. We shall now be
particularly interested in the probability that the system of particles
will actually be in some specified statistical state, and since Liou-
ville's theorem has justified our belief that all microscopic states are
equally likely to occur, we see that the probability of a given statis
tical state will be proportional to the number of microscopic states
which correspond to it.
For the system under consideration let a particular statistical
state be specified by stating that Na', Na", Nar", • • •, AY, AY', AY",
•••, etc., are the number of particles of the corresponding masses
ma, mb, etc., which fall in the specified elementary regions dx dy dz
d$x dtv d^z, Nos. la, 2a, 3a, • • •, 16, 26, 36, • • •, etc. By familiar
methods of calculation it is evident that the number of arrangements
by which the particular distribution of particles can be effected,
that is, in other words, the number of microscopic states, W, which
correspond to the given statistical state, is given by the expression
Na\Nb\Nc...
W
Na'Na"N
* What we have here defined as the statistical state is what Planck calls the
macroscopic state of the system. The word macroscopic is unfortunate, however, in
implying a less minute observation as to the size of the elements dx dy dz d\f/x d\f/y d^»
in which the representative points are found.
106 Chapter Eight.
and this number W is proportional to the probability that the system
will be found in the particular statistical state considered.
If now we assume that each of the regions
dx dy dz d\f/x d^y d$z, Nos. la, 2a, 3a, • • •, 16, 26, 36, • • •, etc.
is great enough to contain a large number of particles,* we may
apply the Stirling formula
\N= J
for evaluating \Naj [N±, etc., and omitting negligible terms, shall
obtain for log W the result
'"
(N ' N ' N " N " N
-iV a , Jt" a . •*" a , -iV a , -tV a , a
.v: log ]v7 + -Nl log If: + -N7 log Iv7
Nb' Nb" Nb" Nb'" Nb'"
etc.
jya' j\rar/
For simplicity let us denote the ratios -rp- , -r^- , etc., by the
J\l a -i* a
symbols waf, wa", etc. These quantities wa', wa", etc., are evidently
the probabilities, in the case of this particular statistical state,
that any given particle ma will be found in the respective regions
Nos. la, 2a, etc.
We may now write
lOg W = — Na2Wa log Wa — Nb2Wb log Wb ~ , etc.,
where the summation extends over all the regions Nos. la, 2a, • • •,
16, 26, etc.
96. Equilibrium Relations. Let us now suppose that the system
of particles is contained in an enclosed space and has the definite
energy content E. Let us find the most probable distribution of the
particles. For this the necessary condition will be
d lOg W = - Na£(log Wa + l)BWa
- N6Z(log wb + l)dwb • • • =0. (114)
In carrying out our variation, however, the number of particles of
* The idea of successive orders of infinitesimals which permit the differential
region dx dy dz d^x d\f/y d\J/t to contain a large number of particles is a familiar one in
mathematics.
Chaotic Motion of a System of Particles. 107
each kind must remain constant so that we have the added relations
2dwa = 0, 25wb = 0, etc. (115)
Finally, since the energy is to have a definite value E, it must also
remain constant in the variation, which will provide still a further
relation. Since the energy of a particle will be a definite function of
its position and momentum,* let us write the energy of the system
in the form
E = NaZWaEa + N^w^Eb •+•••,
where Ea is the energy of a particle in the region la, etc.
Since in carrying out our variation the energy is to remain con
stant, we have the relation
E = Na2EadWa + Nb?EbdWb + '" = 0. (116)
Solving the simultaneous equations (114), (115), (116) by familiar
methods we obtain
log Wa + 1+ \Ea + M6 = 0,
log wb + 1 + \Eb + M6 = 0,
etc.,
where X, na, M&, etc., are undetermined constants. (It should be
specially noticed that X is the same constant in each of the series of
equations.)
Transforming we have
wa = aae~hEa,
Wb = abe-hEb, (117)
etc.,
as the expressions which determine the chance that a given particle
of mass ma, mb, etc., will fall in a given region dx dy dz d\f/x d\[/v d\l/f,
when we have the distribution of maximum probability. It should
be noticed that h, which corresponds to the X of the preceding equa
tions, is the same constant in all of the equations, while aa, ab, etc.,
are different constants, depending on the mass of the particles ma,
mb, etc.
* We thus exclude from our considerations systems in which the potential energy
depends appreciably on the relative positions of the independent particles.
108 Chapter Eight.
97. The Energy as a Function of the Momentum. Ea, Eb, etc.,
are of course functions of x, y, z, \[/x, \l/y, \f/e. Let us now obtain an
expression for Ea in terms of these quantities. If there is no external
field of force acting, the energy of a particle Ea will be independent
of x, y} and z, and will be determined entirely by its velocity and
mass. In accordance with the theory of relativity we shall have*
where ma is the mass of the particle at rest.
Let us now express Ea as a function of \px) \f/y, \j/t.
We have from our equations (105) and (98), which were used for
defining momentum
max
Constructing the similar expressions for \j/y and \f/z we may write the
relation
2 + f + Z2) mltf
1--
-*• o
which also defines
* This expression is that for the total energy of the particle, including that
internal energy m0c2 which, according to relativity theory, the particle has when
it is at rest. (See Section 75.) It would be just as correct to substitute for Ea in
equation (117) the value of the kinetic energy mac2 / — — 1 \ instead of the
total energy — a , since the two differ merely by a constant maci which would
11 -?
be taken care of by assigning a suitable value to aa.
Chaotic Motion of a System of Particles. 109
By simple transformations and the introduction of equation (118)
we obtain the desired relation
Ea = cV^2 + ma2c2. (120)
98. The Distribution Law. We may now rewrite equations (117)
in the form
Wa = aa<
,.— AC^^/Z+OTftSc* flO]}
Wb = &b6 i \*-£L)
etc.
These expressions determine the probability that a given particle
of mass ma, mb, etc. will fall in a given region dxdydzd\f/xd\t/yd\j/z, and
correspond to Maxwell's distribution law in ordinary mechanics. We
see that these probabilities are independent of the position x, y, z*
but dependent on the momentum.
aae~Ac^*2+ma2c2 is the probability that a given particle will fall in a
particular six-dimensional cube of volume dxdydzd\f/xd\f/yd\f/z. Let us
now introduce, for convenience, a new quantity aae~hc *1+m»2c2 which
will be the probability per unit volume that a given particle will have
the six dimensional location in question, the constants aa and aa
standing in the same ratio as the volumes dxdydzd\f/xd\[/yd\f/z and unity.
We may then write
wa =
Wb =
etc.
Since every particle must have components of momentum lying
between minus and plus infinity, and lie somewhere in the whole
volume V occupied by the mixture, we have the relation
£00 /~+oo /»+
I I
_j «/_ 00 «/ — oo
(122)
It is further evident that the average value of any quantity A
which depends on the momentum of the particles is given by the
* This is true only when, as assumed, no external field of force is acting.
110 Chapter Eight.
expression
[A]av. = V + ° r ' r°aae-hc^+^Adtxdtvdtz, (123)
— oo *—
where A is some function of \l/x, \j/y, and \J/Z.
99. Polar Coordinates. We may express relations corresponding
to (122) and (123) more simply if we make use of polar coordinates.
Consider instead of the elementary volume d\l/xd^yd\^z the volume
\f/z $meddd<j>d$ expressed in polar coordinates, where
The probability that a particle raa will fall in the region
dxdydz\l/z sm0d6d(j)d\f/ will be
dxdydz\f/2 sin
and since each particle must fall somewhere in the space x y z \l/x \f/v \pz
we shall have corresponding to (122) the relation
XTT /»2n- /»»
Jo Jo
(124)
Corresponding to equation (123), we also see that the average value
of any quantity A, which is dependent on the momentum of the
molecules of mass ma, will be given by the expression
[A]av. = 47rF {* aae-****"** Apd*. (125)
Jo
100. The Law of Equipartition. We may now obtain a law which
corresponds to that of the equipartition of vis viva in the classical
mechanics. Considering equation (124) let us integrate by parts, we
obtain
f*^a
Jo 3
- he)
Chaotic Motion of a System of Particles. Ill
Substituting the limits into the first term we find that it becomes
zero and may write
r
Jo
+ m02c2
But by equation (125) the left-hand side of this relation is the
\f/zc
average value of . =• for the particles of mass ma. We have
ma2c2
Introducing equation (119) which defines t2, we may transform this
expression into
= I • (126)
Since we have shown that h is independent of the mass of the
particles, we see that the average value of — .— is the same for particles
of all different masses. This is the principle in relativity mechanics
that corresponds to the law of the equipartition of vis viva in the
classical mechanics. Indeed, for low velocities the above expression
reduces to mQu2, the vis viva of Newtonian mechanics, a fact which
affords an illustration of the general principle that the laws of New
tonian mechanics are always the limiting form assumed at low veloci
ties by the more exact formulations of relativity mechanics.
We may now call attention in passing to the fact that this quantity
, whose value is the same for particles of different masses, is
2 A.
not the relativity expression for kinetic energy, which is given rather
by the formula c5
77*0
. So that in relativity mechanics
112 Chapter Eight.
the principle of the equipartition of energy is merely an approximation.
We shall later return to this subject.
101. Criterion for Equality of Temperature. For a system of par
ticles of masses ma, w&, etc., enclosed in the volume V, and having the
definite energy content E, we have shown that
and
are the respective probabilities that given particles of mass ma or
mass w& will have momenta between \f/ and \t/ + d\j/. Suppose now
we consider a differently arranged system in which we have Na par
ticles of mass ma by themselves in a space of volume Va and Nb
particles of mass ra& in a contiguous space of volume Vb, separated
from Va by a partition which permits a transfer of energy, and let
the total energy of the double system be, as before, a definite quantity
E (the energy content of the partition being taken as negligible).
Then, by reasoning entirely similar to that just employed, we can
obviously show that
and
are now the respective probabilities that given particles of mass ma
or mass mb will have momenta between \J/ and \J/ + d\j/, the only
changes in the expressions being the substitution of the volumes
Va and Vb in the place of the one volume V. Furthermore, this
distribution law will evidently lead as before to the equality of the
average values of
m,t*J
and
Since, however, the spaces containing the two kinds of particles are in
thermal contact, their temperature is the same. Hence we find that
the equality of the average values of — . w the necessary condition for
u2
v1-*
equality of temperature.
Chaotic Motion of a System of Particles. 113
The above distribution law also leads to the important corollary that
for any given system of particles at a definite temperature the momenta
and hence the total energy content is independent of the volume.
We may now . proceed to the derivation of relations which will
permit us to show that the important quantity . is directly
proportional to the temperature as measured on the absolute ther-
modynamic temperature scale.
102. Pressure Exerted by a System of Particles. We first need
to obtain an expression for the pressure exerted by a system of N
particles enclosed in the volume V. Consider an element of surface
dS perpendicular to the X axis, and let the pressure acting on it be p.
The total force which the element dS exerts on the particles that
impinge will be pdS, and this will be equal to the rate of change of
the momenta in the X direction of these particles.*
Now by equation (122) the total number of particles having
momenta between \f/x and \l/x + d^x in the positive direction is
/*\l/x+d\!ix /»+» /»
NV I
J\l/x J— oo J —
But xdS gives us the volume which contains the number of particles
having momenta between \I/X and \px + d\f/x which will reach dS in a
second. Hence the number of such particles which impinge per
second will be
xds r****** r+o° r+c
-—
V J^ J-ao J-ao
and their change in momentum, allowing for the effect of the rebound,
will be
2NdS
^x+d^z /»+» /»
J J-ao J-
Finally, the total change in momentum per second for all particles
can be found by integrating for all possible positive values of \f/x.
* The system is considered dilute enough for the mutual attractions of the
particles to be negligible in their effect on the external pressure.
9
114 Chapter Eight.
Equating this to the total force pdS we have
Xoo /*+oo /*+oo
I I ae-hc^^^
J—fO J— 00
Cancelling dS, multiplying both sides of the equation by the volume V,
changing the limits of integration and substituting m0x for \l/x,
we have
/*+°o /•*-}- oo /»-fc
t/ — oo *J—ao *J—ao
But this by equation (123) reduces to
p7 = N
1 -
1 ~c2
or, since
Wnli2 m0X2 ...^
=== + ~. —
we have from symmetry
N f -m^/2 1
(127)
Since at a given temperature we have seen that the term in parenthesis
is independent of the volume and the nature of the particles, we see
that the laws of Boyle and Avogadro hold also in relativity mechanics
for a system of particles.
For slow velocities equation (127) reduces to the familiar expression
N
pV = — (m0w2)av..
103. The Relativity Expression for Temperature. We are now in
a position to derive the relativity expression for temperature. The
thermodynamic scale of temperature may be defined in terms of the
efficiency of a heat engine. Consider a four-step cycle performed
with a working substance contained in a cylinder provided with a
piston. In the first step let the substance expand isothermally and
Chaotic Motion of a System of Particles. 115
reversibly, absorbing the heat Q2 from a reservoir at temperature T2;
in the second step cool the cylinder down at constant volume to TI',
in the third step compress to the original volume, giving out the
heat Qi at temperature TI, and in the fourth step heat to the original
temperature. Now if the working substance is of such a nature that
the heat given out in the second step could be used for the reversible
heating of the cylinder in the fourth step, we may define the absolute
Tz Q2
temperatures T2 and T\ by the relation — = 77- .*
1 1 Ui
Consider now such a cycle performed on a cylinder which con
tains one of our systems of particles. Since we have shown (Section
101) that at a definite temperature the energy content of such a
system is independent of the volume, it is evident that our working
substance fulfils the requirement that the heat given out in the second
step shall be sufficient for the reversible heating in the last step.
Hence, in accordance with the thermodynamic scale, we may measure
T 0
the temperatures of the two heat reservoirs by the relation — = 77-
1 1 vi
and may proceed to obtain expressions for Q2 and Qi.
In order to obtain these expressions we may again make use of the
principle that the energy content at a definite temperature is inde
pendent of the volume. This being true, we see that Q2 and Qi
must be equal to the work done in the changes of volume that take
place respectively at T2 and T7!, and we may write the relations
= f
Jv
Q! = pdV(zt rO-
«/F
But equation (127) provides an expression for p in terms of F, leading
on integration to the relations
N
* We have used this cycle for defining the thermodynamic temperature scale
instead of the familiar Carnot cycle, since it avoids the necessity of obtaining an
expression for the relation between pressure and volume in an adiabatic expansion.
116
Chapter Eight.
N
<*-¥
which gives us on division
^2
Qi
We see that the absolute temperature measured on the thermodynamic
scale is proportional to the average value of
We may finally express our temperature in the same units custom
arily employed by comparing equation (127)
PF=?
with the ordinary form of the gas law
pV = nRT,
where n is the number of mols of gas present.
We evidently obtain
(128)
T =
N
mQuz
3
W-?]
av.
N
" TOoM2 "
1
" m0w2
3nR
VT-f_
3/b
av.
U-
where the quantity — , which may be called the gas constant for a
single molecule, has been denoted, as is customary, by the letter k.
Chaotic Motion of a System of Particles. 117
Remembering the relation | — p^ = - , we have
C" J av.
fcr = r. (129)
104. The Partition of Energy. We have seen that our new equi-
partition law precludes the possibility of an exact equipartition of
energy. It becomes very important to see what the average energy
of a particle of a given mass does become at any temperature.
Equation (125) provides a general expression for the average value
of any property of the particles. For the average value of the energy
4- mQ2c2 of particles of mass mQ (see equation 120) we shall have
foe
ae-hcV**+mWc^^ 4-
_
The unknown constant a may be eliminated with the help of the
relation (124)
X00
and for h we may substitute the value given by (129), which gives us
the desired equation
f " J~5 ~z
[E]&v. = * ^ (130)
Jo
105. Partition of Energy for Zero Mass. Unfortunately, no gen
eral method for the evaluation of this expression seems to be available.
For the particular case that the mass m0 of the particles approaches
zero compared to the momentum, the expression reduces to
fV
Jo
118 Chapter Eight.
in terms of integrals whose values are known. Evaluating, we obtain
[£]„. = 3fcT.
For the total energy of N such particles we obtain
E =-- ZNkT,
nR
and introducing the relation k = — by which we defined k we have
E = ZnRT (131)
as the expression for the energy of n mols of particles if their value of
m0 is small compared with their momentum.
It is instructive to compare this with the ordinary expression of
Newtonian mechanics
E = ~nRTt
2t
which undoubtedly holds when the masses are so large and the veloci
ties so small that no appreciable deviations from the laws of New
tonian mechanics are to be expected. We see that for particles of
very small mass the average kinetic energy at any temperature is
twice as large as that for large particles at the same temperature.
It is also interesting to note that in accordance with equation (131)
a mol of particles which approach zero mass at the absolute zero,
would have a mass of
3 X 8.31 X 300
grams at room temperature (300° absolute). This suggests a field
of fascinating if profitless speculation.
106. Approximate Partition of Energy for Particles of any Desired
Mass. For particles of any desired mass we may obtain an approxi
mate idea of the relation between energy and temperature by ex
panding the expression for kinetic energy into a series. For the aver
age kinetic energy of a particle we have
- mo
Chaotic Motion of a System of Particles. 119
Expanding into a series we obtain for the total kinetic energy of N
particles
3 u4 15 u6 105 u8 \
+ -- + ~- + ~-+...), (132)
where u2, u4, etc., are the average values of u2, u4, etc., for the indi
vidual particles.
To determine approximately how the value of K varies with the
temperature we may also expand our expression (128) for temperature,
into a series; we obtain
Combining expressions (132) and (133) by subtraction and trans
position, we obtain
For the case of velocities low enough so that u4 and higher powers
can be neglected, this reduces to the familiar expression of Newtonian
3
mechanics, K = -nRT.
In case we neglect in expression (134) powers higher than u4 we
have the approximate relation
8c2 " 2Nm0c2 V 2
the left-hand term really being the larger, since the average square of a
quantity is greater than the square of its average. Since ( — ^— J
(3 V
- nRT ] , we may write the approxima-
120 Chapter Eight.
tion
or, noting that Nm0 = M, the total mass of the system at the abso
lute zero, we have
K
2 8
If we use the erg as our unit of energy, R will be 8.31 X 107; expressing
velocities in centimeters per second, c2 will be 1021, and M will be the
mass of the system in grams.
For one mol of a monatomic gas we should have in ergs
K = 12.4 X 107r + ^ 10-6?72.
M
In the case of the electron M may be taken as approximately
1/1800. At room temperature the second term of our equation would
be entirely negligible, being only 3.5 X 10~6 per cent of the first, and
still be only 3.5 X 10~4 per cent in a fixed star having a temperature of
30,000°. Hence at all ordinary temperatures we may expect the
law of the equipartition of energy to be substantially exact for par
ticles of mass as small as the electron.
Our purpose in carrying through the calculations of this chapter
has been to show that a very important and interesting problem in
the classical mechanics can be handled just as easily in the newer
mechanics, and also to point out the nature of the modifications in
existing theory which will have to be introduced if the later develop
ments of physics should force us to consider equilibrium relations for
particles of mass much smaller than that of the electron.
We may also call attention to the fact that we have here con
sidered a system whose equations of motion agree with the principles
of dynamics and yet do not lead to the equipartition of energy. This
is of particular interest at a time when many scientists have thought
that the failure of equipartition in the hohlraum stood in necessary
conflict with the principles of dynamics.
CHAPTER IX.
THE PRINCIPLE OF RELATIVITY AND THE PRINCIPLE OF
LEAST ACTION.
It has been shown by the work of Helmholtz, J. J. Thomson,
Planck and others that the principle of least action is applicable in
the most diverse fields of physical science, and is perhaps the most
general dynamical principle at our disposal. Indeed, for any system
whose future behavior is determined by the instantaneous values of a
number of coordinates and their time rate of change, it seems possible
to throw the equations describing the behavior of the system into
the form prescribed by the principle of least action. This generality
of the principle of least action makes it very desirable to develop the
relation between it and the principle of relativity, and we shall obtain
in this way the most important and most general method for deriving
the consequences of the theory of relativity. We have already
developed in Chapter VII the particular application of the principle
of least action in the case of a system of particles, and with the help
of the more general development which we are about to present, we
shall be able to apply the principle of relativity to the theories of
elasticity, of thermodynamics and of electricity and magnetism.
107. The Principle of Least Action. For our purposes the prin
ciple of least action may be most simply stated by the equation
" (BH + W)dt = 0. (135)
This equation applies to any system whose behavior is determined
by the values of a number of independent coordinates <Ai4>203 • • •
and their rate of change with the time <#>i02<^3 • • •, and the equation
describes the path by which the system travels from its configuration
at any time ti to its configuration at any subsequent time f2.
H is the so-called kinetic potential of the system and is a func
tion of the coordinates and their generalized velocities :
H = F(0i020s • • • Mifa • • •)• (136)
121
122 Chapter Nine.
dH is the variation of H at any instant corresponding to a slightly
displaced path by which the system might travel from the same
initial to the same final state in the same time interval, and W is the
external work corresponding to the variation 5 which would be done
on the system by the external forces if at the instant in question the
system should be displaced from its actual configuration to its con
figuration on the displaced path. Thus
W = $i«0i + $25<£2 + <M</>3 + • • •, (137)
where $1, <£2, etc., are the so-called generalized external forces which
act in such a direction as to increase the values of the corresponding
coordinates.
The form of the function which determines the kinetic potential
H depends on the particular nature of the system to which the principle
of least action is being applied, and it is one of the chief tasks of
general physics to discover the form of the function in the various
fields of mechanical, electrical and thermodynamic investigation.
As soon as we have found out experimentally what the form of H is
for any particular field of investigation, the principle of least action,
as expressed by equation (135), becomes the basic equation for the
mathematical development of the field in question, a development
which can then be carried out by well-known methods.
The special task for the theory of relativity will be to find a general
relation applicable to any kind of a system, which shall connect the
value of the kinetic potential H as measured with respect to a set of
coordinates S with its value Hf as measured with reference to another
set of coordinates Sf which is in motion relative to S. This relation
will of course be of such a nature as to agree with the principle of the
relativity of motion, and in this way we shall introduce the principle
of relativity at the very start into the fundamental equation for all
fields of dynamics.
Before proceeding to the solution of that problem we may put
the principle of least action into another form which is sometimes
more convenient, by obtaining the equations for the motion of a
system in the so-called Lagrangian form.
108. The Equations of Motion in the Lagrangian Form. To ob
tain the equations of motion in the Lagrangian form we may evidently
Relativity and the Principle of Least Action. 123
rewrite our fundamental equation (135) in the form
^(dH dH dH dH
( TT 5*i + ^T 5^2 + ' ' ' + TT d*1 + ^T 5^2 + ' ' '
Ji! \ofa 0^2 0^1 d02
+ $i5</>i + ^502 + • • • ) dt = 0.
We have now, however,
d d
60i = jt (60!), 6<£2 = ^ (602), etc.,
which gives us
or, since 60i, 602, etc., are by hypothesis zero at times £1 and Z2, we
obtain
etc.
On substituting these expressions in (138) we obtain
and since the variations of 0i, fa, etc., are entirely independent and
the limits of integration ti and tz are entirely at our disposal, this
equation will be true only when each of the following equations is
true. And these are the equations of motion in the desired Lagrangian
124 Chapter Nine.
form,
^d# _d#
at d(f)i d(f)\
d_m_BH_ = ^ (139>
dt d(f)2 C/02
etc.
In these equations H is the kinetic potential of a system whose
state is determined by the generalized coordinates </>i, $2, etc., and
their time derivatives <£i, 02, etc., where 3>i, <£2, etc., are the gener
alized external forces acting on the system in such a sense as to tend
to increase the values of the corresponding generalized coordinates.
109. Introduction of the Principle of Relativity. Let us now in
vestigate the relation between our dynamical principle and the prin
ciple of the relativity of motion. To do this we must derive an equa
tion for transforming the kinetic potential H for a given system
from one set of coordinates to another. In other words, if S and S'
are two sets of reference axes, Sf moving past S in the X-direction
with the velocity Y, what will be the relation between H and H',
the values for the kinetic potential of a given system as measured
with reference to S and S'f
It is evident from the theory of relativity that our fundamental
equation (135) must hold for the behavior of a given system using
either set of coordinates S or £', so that both of the equations
f 2 (dH + W)dt = 0 and f ' (BHf + W')dt' = 0 (140)
J t-i Jtif
or
f 2 (dH + W)dt = f 2 (dH' + W')dt' = 0
J t-i vti*
must hold for a given process, where it will be necessary, of course,
to choose the limits of integration ti and tz, t\ and t2' wide enough
apart so that for both sets of coordinates the varied motion will be
completed within the time interval. Since we shall find it possible
now to show that in general f Wdt = f Wdt', we shall be able to
obtain from the above equations a simple relation between H and H'.
110. Relation between f W'dt' and / Wdt. To obtain the desired
Relativity and the Principle of Least Action. 125
proof we must call attention in the first place to the fact that all
kinds of force which can act at a given point must be governed by
the same transformation equations when changing from system S to
system S'. This arises because when two forces of a different nature
are of such a magnitude as to exactly balance each other and produce
no acceleration for measurements made with one set of coordinates
they must evidently do so for any set of coordinates (see Chapter IV,
Section 42). Since we have already found transformation equations
for the force acting at a point, in our consideration of the dynamics
of a particle, we may now use these expressions in general for the
evaluation / W'dt'.
W is the work which would be done by the external forces if at
any instant t' we should displace our system from its actual con
figuration to the simultaneous configuration on the displaced path.
Hence it is evident that / W'dt will be equal to a sum of terms of the
type
(Fx'Sxf + Fy'6y' + Fz'8,')dt',
where Fx, Fy', Fz', is the force acting at a given point of the system
and 8xf, by', bz' are the displacements necessary to reach the corre
sponding point on the displaced path, all these quantities being
measured with respect to S'.
Into this expression we may substitute, however, in accordance
with equations (61), (62), (63) and (13), the values
c2
F K~I
' = ±V '
C~ (141)
dt' =
126 Chapter Nine.
We may also make substitutions for dx', dy' and dz' in terms of
dx, dy and dz, but to obtain transformation equations for these quanti
ties is somewhat complicated owing to the fact that positions on the
actual and displaced path, which are simultaneous when measured
with respect to S', will not be simultaneous with respect to S. We
have denoted by t' the time in system S' when the point on the actual
path has the position x', yf, z' and simultaneously the point on the
displaced path has the position (xr + dx'), (yf + 5yr), (z' + dz')9
when measured in system S', or by our fundamental transformation
equations (9), (10) and (11) the positions K(X' + Vt'), y', z' and
K([X' + dx'] + Vt'), (y' + dy'), (z' + dz') when measured in system S.
If now we denote by tA and tD the corresponding times in system S
we shall have, by our fundamental transformation equation (12),
and we see that in system S the point has reached the displaced
position at a time later than that of the actual position by the amount
KV
b -tA=--—f 8x >
and, since during this time-interval the displaced point would have
moved, neglecting higher-order terms, the distances
•jr*. v
KV . . .*V
these quantities must be subtracted from the coordinates of the
displaced point in order to obtain a position on the displaced path
which will be simultaneous with tA as measured in system S. We
obtain for the simultaneous position on the displaced path
xV vV
K([X' + dxf] + Vt') - K -5- 8x'9 y' + dyf - K y— xft
c c
Z' + dZf - K dXf,
Relativity and the Principle of Least Action. 127
and for the corresponding position on the actual path
K(x' + vf), y', *',
and obtain by subtraction
8x =
By = 5y' - K 5x'} (142)
zV
5z = dz' - K — &c'.
c2
Substituting now these equations, together with the other trans
formation equations (1.41), in our expression we obtain
i
" " 1 "
(143)
-F- te"
We thus see that we must always have the general equality
/ W'dt' = f Wdt. (144)
111. Relation between H' and H. Introducing this equation into
our earlier expression (140) we obtain as a general relation between
H' and H
f dH'dt' = f dHdt. (145)
Restricting ourselves to systems of such a nature that we can
128 Chapter Nine.
assign them a definite velocity u = xi + yj + ^k, we can rewrite
this expression in the following form, where by H& and HA we denote
the values of the kinetic potential respectively on the displaced and
actual paths
J SH'dt' - J HJdt' - f HA'dt' = J HA ( 1 - (*
+ dt
c
-* = HDdt- HAdt,
•and hence obtain for such systems the simple expression
zj/ _ H
( xV\
K(1-^)
I y/2 I ~tf
Noting the relation between \l 1 — — and \/ 1 — — given in equation
» c * c
V
(17), this can be rewritten
TTf TT
£1 £1
(146)
and this is the expression which we shall find most useful for our
future development of the consequences of the theory of relativity.
Expressing the requirement of the equation in words we may say
TT
that the theory of relativity requires an invariance of • . --- in the
Lorentz transformation.
112. As indicated above, the use of this equation is obviously
restricted to systems moving with some perfectly definite velocity u.
Systems satisfying this condition would include particles, infinitesimal
portions of continuous systems, and larger systems in a steady state.
113. Our general method of procedure in different fields of investi
gation will now be to examine the expression for- kinetic potential
which is known to hold for the field in question, provided the velocities
involved are low and by making slight alterations when necessary,
Relativity and the Principle of Least Action. 129
see if this expression can be made to agree with the requirements of
equation (146) without changing its value for low velocities. Thus
it is well known, for example, that, in the case of low velocities, for a
single particle acted on by external forces the kinetic potential may
be taken as the kinetic energy im0w2. For relativity mechanics, as
will be seen from the developments of Chapter VII, we may take for
the kinetic potential, — m^~ ^ 1 — — 2 , an expression which, except for
an additive constant, becomes identical with %m0y? at low velocities,
and which at all velocities agrees with equation (146).
10
CHAPTER X.
THE DYNAMICS OF ELASTIC BODIES.
We shall now treat with the help of the principle of least action
the rather complicated problem of the dynamics of continuous elastic
media. Our considerations will extend the appreciation of the inti
mate relation between mass and energy which we found in our treat
ment of the dynamics of a particle. We shall also be able to show
that the dynamics of a particle may be regarded as a special case
of the dynamics of a continuous elastic medium, and to apply our
considerations to a number of other important problems.
114. On the Impossibility of Absolutely Rigid Bodies. In the
older treatises on mechanics, after considering the dynamics of a
particle it was customary to proceed to a discussion of the dynamics
of rigid bodies. These rigid bodies were endowed with definite and
nuchangeable size and shape and hence were assigned five degrees
of freedom, since it was necessary to state the values of five variables
completely to specify their position in space. As pointed out by
Laue, however, our newer ideas as to the velocity of light as a limiting
value will no longer permit us to conceive of a continuous body as
having only a finite number of degrees of freedom. This is evident
since it is obvious that we could start disturbances simultaneously
at an indefinite number of points in a continuous body, and as these
disturbances cannot spread with infinite velocity it will be necessary
to give the values of an infinite number of variables in order com
pletely to specify the succeeding states of the system. For our newer
mechanics the nearest approach to an absolutely rigid body would
of course be one in which disturbances are transmitted with the
velocity of light. Since, then, the theory of relativity does not
permit rigid bo-dies we may proceed at once to the general theory of
deformable bodies.
PART I. STRESS AND STRAIN.
115. Definition of Strain. In the more familiar developments of
the theory of elasticity it is customary to limit the considerations to
130
Dynamics of Elastic Bodies. 131
the case of strains small enough so that higher powers of the dis
placements can be neglected, and this introduces considerable simpli
fication into a science which under any circumstances is necessarily
one of great complication. Unfortunately for our purposes, we
cannot in general introduce such a simplification if we wish to apply
the theory of relativity, since in consequence of the Lorentz shortening
a body which appears unstrained to one observer may appear tre
mendously compressed or elongated to an observer moving with a
different velocity. The best that we can do will be arbitrarily to
choose our state of zero deformation such that the strains will be
small when measured in the particular system of coordinates S in
which we are specially interested.
A theory of strains of any magnitude was first attempted by
Saint-Venant and has been amplified and excellently presented by
Love in his Treatise on the Theory of Elasticity, Appendix to Chapter I.
In accordance with this theory, the strain at any point in a body is
completely determined by six component strains which can be defined
by the following equations, wherein (u, v, w) is the displacement of a
point having the unstrained position (z, y, z} :
_ du
€yy=d-V + * <^V • (d"V ' (~
dw
(148)
dw dv du du dv dv dw dw
~dy + 'dz + dy to* dy d* 9y dz*
dw du du du dv_ dv_ dw dw
fo+fc+tote+te~d~zdx dz'
du du du du dv dv dw dw
It will be seen that these expressions for strain reduce to those
familiar in the theory of small strains if such second-order quantities as.
.or — — can be neglected.
dx J dy dz
132 Chapter Ten.
116. A physical significance for these strain components will be
obtained if we note that it can be shown from geometrical considera
tions that lines which are originally parallel to the axes have, when
strained, the elongations
ex = VI + 2exx - 1,
(149)
and that the angles between lines originally parallel to the axes are
given in the strained condition by the expressions
cos 0u =
cos 9XZ = **-=, (150)
COS VXy =
Geometrical considerations are also sufficient to show that in
case the strain is a simple elongation of amount e the following equa
tion will be true:
™ €yz €xz €xy — P i I,,* n ci \
~ ~ 2 '
I2 ~ m* ~ n* ~ 2mn 2ln ~ 2lm
where I, m, n are the cosines which determine the direction of the
elongation.
117. Definition of Stress. We have just considered the expres
sions for the strain at a given point in an elastic medium; we may
now define stress in terms of the work done in changing from one
.state of strain to another. Considering the material contained in
unit volume when the body is unstrained, we may write, for the work
done by this material on its surroundings when a change in strain
takes place,
Dynamics of Elastic Bodies. 133
dW = — 8E = txx8exx + tvyd€yy + £zz5e*2
(152)
and this equation serves to define the stresses trx, tyv, etc. In case
the strain varies from point to point we must consider of course the
work done per unit volume of the unstrained material. In case the
strains are small it will be noticed that the stresses thus defined are
identical with those used in the familiar theories of elasticity.
118. Transformation Equations for Strain. We must now prepare
for the introduction of the theory of relativity into our considerations,
by determining the way the strain at a given point P appears to ob
servers moving with different velocities. Let the point P in question
be moving with the velocity u = xi + 2/j + zk as measured in sys
tem S. , Since the state of zero deformation from which to measure
strains can be chosen perfectly arbitrarily, let us for convenience
take the strain as zero as measured in system S, giving us
€xi = fyy = *zz = tyz = *xz = *xy = 0. (153)
What now will be the strains as measured by an observer moving
along with the point P in question? Let us call the system of coordi
nates used by this observer S°. It is evident now from our considera
tions as to the shape of moving systems presented in Chapter V that
in system S° the material in the neighborhood of the point in question
will appear to have been elongated in the direction of motion in the
/ ~u*
ratio of 1 : -x/l — — . Hence in system S° the strain will be an elonga-
» c
tion
« = - 4= - 1 (154)
in the line determined by the direction cosines
m = — ,
u
(155)
We may now calculate from this elongation the components of
strain by using equation (151). We obtain
134
Chapter Ten.
vz
2c2
c2
(156)
xy
1 -
and these are the desired equations for the strains at the point P,
the accent ° indicating that they are measured with reference to a
system of coordinates S° moving along with the point itself.
119. Variation in the Strain. We shall be particularly interested
in the variation in the strain as measured in S° when the velocity
experiences a small variation Su, the strains remaining zero as mea
sured in S. For the sake of simplicity let us choose our coordinates
in such a way that the X-axis is parallel to the original velocity, so
that our change in velocity will be from u = xi to
u + <5u = (x + dx)i + dyj
Taking Su small enough so that higher orders can be neglected, and
noting that y = z = 0, we shall then have, from equations (156),
Dynamics of Elastic Bodies. 135
6e°xx = 7 o\, ~5 &c, 6e vu = 0,
5e%2 = 0, 5e%, = 0, (157)
5e°" = 7~ ~7^\7zd^ 8e°*v = 7 ^\~28V-
We shall also be interested in the variation in the strain as measured
in S° produced by a variation in the strain as measured in S. Con
sidering again for simplicity that the X-axis is parallel to the motion
of the point, we must calculate the variation produced in e°xx, e°vv,
etc., by changing the values of exx, eyy, etc., from zero to 5exx, 8eyy, etc.
The variation dexx will produce a variation in e°xx whose amount
can be calculated as follows: By equations (149) a line which has unit
length and is parallel to the X-axis in the unstrained condition will
have when strained the length A/1 + 2exx when measured in system S
and Vl + 2e°xx when measured in system S°. Since the strain in
system S is small, the line remains sensibly parallel to the X-axis,
which is also the direction of motion, and these quantities will be
connected in accordance with the Lorentz shortening by the equation
Vl + 2exx = A/1 - -J VT+ 2c°xx. (158)
* c
Carrying out now our variation 5exx, neglecting exx in comparison
with larger quantities and noting that except for second order quanti
ties,
1
tif VA^^;
^ c^"
we obtain
s o gc«'
(160)
Since the variations deyv, 5ezz, 8ey, affect only lines which are at
right angles to the direction of motion, we may evidently write
8e°yy = 8€VV) de°zz = 5ezz, 5e%2 = &„,. (161)
136 Chapter Ten.
To calculate 8e°xz we may note that in accordance with equations
(150) we must have
cos 6XZ =
Vl + 2«M Vl + 2e,~, '
C°S9°"=Vl + 2eCvi + 2e°,-
where 0X2 is the angle between lines which in the unstrained condition
are parallel to the X and Z axes respectively. In accordance with
the Lorentz shortening, however, we shall have
cos exz = - - cos X2.
Introducing this relation, remembering that exx = e22 = t°zz = 0, and
noting equation (159), we obtain
(162)
(>-*)
and similarly
('-!')'
We may now combine these equations (160), (161), (162) and
(163) with those for the variation in strain with velocity and obtain
the final set which we desire:
1 x 1
—„ 5x
de yy — deyy,
8e°zz = 5ezz,
dt°yZ = 5tyZ,
"o-?r (»-«
Dynamics of Elastic Bodies. 137
These equations give the variation in the strain measured in
system S° at a point P moving in the X direction with velocity u,
provided the strains are negligibly small as measured in S.
PART II. INTRODUCTION OF THE PRINCIPLE OF LEAST ACTION.
120. The Kinetic Potential for an Elastic Body. We are now in
a position to develop the mechanics of an elastic body with the help
of the principle of least action. In Newtonian mechanics, as is well
known, the kinetic potential for unit volume of material at a given
point P in an elastic body may be put equal to the density of kinetic
energy minus the density of potential energy, and it is obvious that
our choice for kinetic potential must reduce to that value at low
velocities. Our choice of an expression for kinetic potential is further
more limited by the fundamental transformation equation for kinetic
potential which we found in the last chapter
(146)
Taking these requirements into consideration, we may write for
the kinetic potential per unit volume of the material at a point P
moving with the velocity u the expression
H = -
where E° is the energy as measured in system S° of the amount of
material which in the unstrained condition (i. e., as measured in
system S) is contained in unit volume.
The above expression obviously satisfies our fundamental trans
formation equation (146) and at low velocities reduces in accordance
with the requirements of Newtonian mechanics to
H = JroV - E°,
provided we introduce the substitution made familiar by our previous
ElO
work, m° = — .
138
Chapter Ten.
121. Lagrange's Equations. Making use of this expression for the
kinetic potential in an elastic body, we may now obtain the equations
of motion and stress for an elastic body by substituting into Lagrange's
equations (139) Chapter IX.
Considering the material at the point P contained in unit volume
in the unstrained condition, we may choose as our generalized co
ordinates the six component strains exx, eyy, etc., with the corre
sponding stresses — txxj — tyy, etc., as generalized forces, and the
three coordinates x, y, z which give the position of the point with the
corresponding forces Fx, Fy and Fz.
It is evident that the kinetic potential will be independent of
the time derivatives of the strains, and if we consider cases in which
E° is independent of position, the kinetic potential will also be inde
pendent of the absolute magnitudes of the coordinates x, y and 2.
Substituting in Lagrange's equations (139), we then obtain
-£(-
«
(165)
dtdx\
dtd$\
d d
dt'dl
1 - -„ =
1 - —„ =
(166)
Dynamics of Elastic Bodies. 139
We may simplify these equations, however; by performing the
indicated differentiations and making suitable substitutions, we have
dE°xx dE°xxde
dexx de x£ dexx
But in accordance with equation (152) we may write
dE°
_ _ _ /
-j O l XX
0€ xx
and from equations (164) we may put
de°xx
dexx
2 '
Making the substitutions in the first of the Lagrangian equations we
obtain
-
V1 - *
122. Transformation Equations for Stress. Similar substitutions
can be made in all the equations of stress, and we obtain as our set
of transformation equations
(167)
123. Value of E°. With the help of these transformation equations
for stress we may calculate the value of E0, the energy content, as
measured in system S°, of material which in the unstrained condition
is contained in unit volume.
Consider unit volume of the material in the unstrained condition
and call its energy content w°°. Give it now the velocity u = x,
keeping its state of strain unchanged in system S. Since the strain
140
Chapter Ten.
is not changing in system S, the stresses txx, etc., will also be constant
in system S. In system S°, however, the component strain will
change in accordance with equations (156) from zero to
x* 1
xx 2c2/ _w_2V
V1 ~c2;
and the corresponding stress will be given at any instant by the
expression just derived,
txx being, as we have just seen, a constant. We may then write for
E° the expression
1 xz
~2c2
E
--^d
Noting that it = x we obtain on integration,
po _
(168)
as the desired expression for the energy as measured in system S°
contained in the material which in system S is unstrained and has
unit volume.
124. The Equations of Motion in the Lagrangian Form. We are
now in a position to simplify the three Lagrangian equations (166)
for FXJ Fy and Fz. Carrying out the indicated differentiation we have
u* dE°
c2 dx
and introducing the value of E° given by equation (168) we obtain
w°° + t,
(169)
Dynamics of Elastic Bodies.
141
Simple calculations will also give us values for Fv and Fg. We have
from (166)
_y_ I _^2aE
~ 2 c2 ' \ c2 a?>
But since we have adapted our considerations to cases in which the
direction of motion is along the X-axis, we have y = 0; furthermore
we may substitute, in accordance with equations (152), (157) and (167),
dE
dy
1
- t:
dy
We thus obtain as our three equations of motion
w°°-t-txxx
Fy=dt(txy^)>
(170)
In these equations the quantities Fx, Fy and F2 are the components
of force acting on a particular system, namely that quantity of material
which at the instant in question has unit volume. Since the volume
of this material will in general be changing, Fx, Fv and F, do not give
us the force per unit volume as usually defined. If we represent,
however, by fx, fv and fg the components of force per unit volume,
we may rewrite these equations in the form
V - -
^x ~ dt
(171)
142
Chapter Ten.
where by 67 we mean a small element of volume at the point in
question.
125. Density of Momentum. Since we customarily define force as
equal to the time rate of change of momentum, we may now write for
the density of momentum g at a point in an elastic body which is
moving in the X direction with the velocity u = x
txx X
(172)
It is interesting to point out that there are components of momen
tum in the Y and Z directions in spite of the fact that the material
at the point in question is moving in the X direction. We shall
later see the important significance of this discovery.
126. Density of Energy. It will be remembered that the forces
whose equations we have just obtained are those acting on unit
volume of the material as measured in system S, and hence we are
now in a position to calculate the energy density of our material.
Let us start out with unit volume of our material at rest, with the
energy content w°° and determine the work necessary to give it the
velocity u = x without change in stress or strain. Since the only
component of force which suffers displacement is Fx, we have
+ (w°° +
' ' W°° + txx X
j
xdt,
1 x '
1 u* c
IV1-*
•txx) r xd
Jo
v/i-p"1.
(173)
as an expression for the energy density of the elastic material.
127. Summary of Results Obtained from the Principle of Least
Action. We may now tabulate for future reference the results ob
tained from the principle of least action.
Dynamics of Elastic Bodies. 143
At a given point in an elastic medium which is moving in the X
direction with the velocity u = x, we have for the components of
stress
t\* i *
u- ,---2, *„-
V1-^
(167)
i° t°
<> XZ I XV
I I/'
V1-^
c* \ c2
For the density of energy at the point in question we have
w = W r-^-^f - tx*. (173)
For the density of momentum we have
g* = — 7== ~2» Sv = **v-2, g* = ^V' (172>
y1--?
PART III. SOME MATHEMATICAL RELATIONS.
Before proceeding to the applications of these results which we
have obtained from the principle of least action, we shall find it de
sirable to present a number of mathematical relations which will
later prove useful.
128. The Unsymmetrical Stress Tensor t. We have defined the
components of stress acting at a point by equation (152)
where 5W is the work which accompanies a change in strain and is
performed on the surroundings by the amount of material which was
contained in unit volume in the unstrained state. Since for con
venience we have taken as our state of zero strain the condition of
the body as measured in system S, it is evident that the components
^*x, tvv, etc., may be taken as the forces acting on the faces of a unit
cube of material at the point in question, the first letter of the sub-
144 . Chapter Ten.
script indicating the direction of the force and the second subscript
the direction of the normal to the face in question.
Interpreting the components of stress in this fashion, we may
now add three further components and obtain a complete tensor
i.
t = -\tyx tyy tyg, (174)
The three new components tyx, tzx, tzy are forces acting on the
unit cube, in the directions and on the faces indicated by the sub
scripts. A knowledge of their value was not necessary for our develop
ments of the consequences of the principle of least action, since it was
possible to obtain an expression for the work accompanying a change
in strain without their introduction. We shall find them quite im
portant for our later considerations, however, and may proceed to
determine their value.
tyx is the force acting in the Y direction tangentially to a face of
the cube perpendicular to the X-axis, and measured with a system
of coordinates S. Using a system of coordinates S° which is stationary
with respect to the point in question, we should obtain, for the measure
ment of this force,
.o
t yx -
in accordance with our transformation equation for force (62), Chapter
VI. Similarly we shall have the relation
t°Xy = txy.
In accordance with the elementary theory of elasticity, however, the
forces t°yx and t°xy which are measured by an observer moving with
the body will be connected by the relation
/° ^ vx
* xy •
t°xy being larger than t°yx in the ratio of the areas of face upon which
they act. Combining these three equations, and using similar methods
Dynamics of Elastic Bodies. 145
for the other quantities, we can obtain the desired relations
^*- I1 -?)*»" ^ = (l-^r)u U-<f» (175)
We see that t is an unsymmetrical tensor.
129. The Symmetrical Tensor p. Besides this unsymmetrical ten
sor t we shall find it desirable to define a further tensor p by the
equation
p = t + gu. (176)
We shall call gu the tensor product of g and u and may indicate
tensor products in general by a simple juxtaposition of vectors, gu is
itself a tensor with components as indicated below:
f gxux gxuy gxuz,
gu = J gvux gyuy gyuz, (177)
I gzux gzuy gzuz.
Unlike t, p will be a symmetrical tensor, since we may show, by
substitution of the values for g and u already obtained, that
Pyx = Pxy, Pzx = Pxz, Pzy = Pvz- (178)
Consider for example the value of pyx; we have from our definition
Pyx = tyx + QyUx,
and by equations (175) and (172) we have
Ux
Qy ~ xy '
and hence by substitution obtain
Pyx = *
We also have, however, by definition
and since for the case we are considering uy = 0, we arrive at the
equality
Pxy = Pyx-
The other equalities may be shown in a similar way.
11
146 Chapter Ten.
130. Relation between div t and tn. At a given point P in our
elastic body we shall define the divergence of the tensor t by the equa
tion
dtzx dtzy dtzz
^"~*T"~dF/ '
where i, j and k are unit vectors parallel to the axes, div t thus being
an ordinary vector. It will be seen that div t is the elastic force
acting per unit volume of material at the point P.
Considering an element of surface dS, we shall define a further
vector tn by the equation
tn = (txx COS a + txy COS ]8 + txz COS 7)!
+ (tyx cos a + tyy cos )8 + tyg cos 7)j (180)
+ (tzx cos a + tzy cos j8 + tzz cos 7)k,
where cos a, cos /3 and cos 7 are the direction cosines of the inward-
pointing normal to the element of surface dS.
Considering now a definite volume V enclosed by the surface S,
it is evident that div t and tn will be connected by the relation
• JdivW = ftndS, (181)
where the symbol 0 indicates that the integration is to be taken over
the whole surface which encloses the volume V. This equation is
of course merely a direct application of Gauss's formula, which states
in general the equality
= I (P cos a + Q cos |8 + R cos y)dS,
J%
where P, Q and R may be any functions of x, y and z.
Dynamics of Elastic Bodies. 147
We shall also find use for a further relation between div t and tn.
Consider a given point of reference 0, and let r be the radius vector
to any point P in the elastic body; we can then show with the help
of Gauss's Formula (182) that
- J (r X div t)dV = J" (r X tJdS
• f[(ty. - UJk + (**. - Uik + (txv - tyjc)ij]dV,
where X signifies as usual the outer product. Taking account of
equations (172) and (175) this can be rewritten
(183)
131. The Equations of Motion in the Eulerian Form. We saw in
sections (124) and (125) that the equations of motion in the Lagran-
gian form might be written
where f is the density of force acting at any point and g is the density
of momentum.
Provided that there are no external forces acting and f is pro
duced solely by the elastic forces, our definition of the divergence of a
tensor will now permit us to put
f = - div t,
and write for our equation of motion
dg
Expressing — in terms of partial differentials, and putting
d(5V)
— ,— = 57 div u
dt
we obtain
148 Chapter Ten.
Our symmetrical tensor p, however, was defined by the equation (176)
p = t + gu,
and hence we may now write our equations of motion in the very
beautiful Eulerian form
- div p - |5 . (184)
We shall find this simple form for the equations of motion very
interesting in connection with our considerations in the last chapter.
PART IV. APPLICATIONS OF THE RESULTS.
We may now use the results which we have obtained from the
principle of least action to elucidate various problems concerning
the behavior of elastic bodies.
132. Relation between Energy and Momentum. In our work on
the dynamics of a particle we found that the mass of a particle was
equal to its energy divided by the square of the velocity of light, and
hence have come to expect in general a necessary relation between
the existence of momentum in any particular direction and the trans
fer of energy in that same direction. We find, however, in the case
of elastically stressed bodies a somewhat more complicated state of
affairs than in the case of particles, since besides the energy which is
transported bodily by the motion of the medium an additional quan
tity of energy may be transferred through the medium by the action
of the forces which hold it in its state of strain. Thus, for example,
in the case of a longitudinally compressed rod moving parallel to its
length, the forces holding it in its state of longitudinal compression
will be doing work at the rear end of the rod and delivering an equal
quantity of energy at the front end, and this additional transfer of
energy must be included in the calculation of the momentum of the
bar.
As a matter of fact, an examination of the expressions for momen
tum which we obtained from the principle of least action will show
the justice of these considerations. For the density of momentum
in the X direction we obtained the expression
/v«
Qx = (w + txx) — ,
c
Dynamics of Elastic Bodies. 149
and we see that in order to calculate the momentum in the X direc
tion we must consider not merely the energy w which is being bodily
carried along in that direction with the velocity z, but also must take
into account the additional flow of energy which arises from the
stress txx. .As we have already seen in Section 128, this stress txx can
be thought of as resulting from forces which act on the front and
rear faces of a centimeter cube of our material. Since the cube is
moving with the velocity x, the force on the rear face will do the
work txxx per second and this will be given up at the forward face.
We thus have an additional density of energy-flow in the X direction
of the magnitude txxx and hence a corresponding density of momen-
txxx
turn — — .
c2
Similar considerations explain the interesting occurrence of com
ponents of momentum in the Y and Z directions,
in spite of the fact that the material involved is moving in the X
direction. The stress txy, for example, can be thought of as resulting
from forces which act tangentially in the X direction on the pair of
faces of our unit cube which are perpendicular to the Y axis. Since
the cube is moving in the X direction with the velocity x, we shall
have the work txyx done at one surface per second and transferred to
the other, and the resulting flow of energy in the X direction is ac-
txvx
companied by the corresponding momentum — — .
133. The Conservation of Momentum. It is evident from our
previous discussions that we may write the equation of motion for
an elastic medium in the form
fBV =
where g is the density of momentum at any given point and f is the
force acting per unit volume of material. We have already obtained ,
from the principle of least action, expressions (172) which permit
the calculation of g in terms of the energy density, stress and velocity
at the point in question, and our present problem is to discuss some
what further the nature of the force f .
150 Chapter Ten.
We shall find it convenient to analyze the total force per unit
volume of material f into those external forces iexl. like gravity, which
are produced by agencies outside of the elastic body and the internal
force tint- which arises from the elastic interaction of the parts of the
strained body itself. It is evident from the way in which we have
defined the divergence of a tensor (179) that for this latter we may
write
fin*. = - div t. (185)
Our equation of motion then becomes
(iext. -divt)67 = ~^, (186)
(MI
or, integrating over the total volume of the elastic body,
ft~..dV- fdivtdV = ~fgdV = ^j-, (187)
where G is the total momentum of the body. With the help of the
purely analytical relation (181) we may transform the above equation
into
s-f, (188)
where tn is defined in accordance with (180) so that the integral
XtndS becomes the force exerted by the surroundings on the sur
face of the elastic body.
In the case of an isolated system both fext. and tn would evidently
be equal to zero and we have the principle of the conservation of
momentum.
134. The Conservation of Angular Momentum. Consider the
radius vector r from a point of reference O to any point P in an elastic
body; then the angular momentum of the body about O will be
M = / (r X g)dV,
and its rate of change will be
Dynamics of Elastic Bodies. 151
Substituting equation (186), this may be written
X f«t.W -- (r X div t)dV + (u X
or, introducing the purely mathematical relation (183) we have,
C
X fea)dV + (r X t.)dS. (190)
r/TVT C
-# = J
We see from this equation that the rate of change of the angular
momentum of an elastic body is equal to the moment of the external
forces acting on the body plus the moment of the surface forces.
In the case of an isolated system this reduces to the important
principle of the conservation of angular momentum.
135. Relation between Angular Momentum and the Unsymmetrical
Stress Tensor. The fact that at a point in a strained elastic medium
there may be components of momentum at right angles to the motion
of the point itself, leads to the interesting conclusion that even in a
state of steady motion the angular momentum of a strained body
will in general be changing.
This is evident from equation (189), in the preceding section,
which may be written
In the older mechanics velocity u and momentum g were always in
the same direction so that the last term of this equation became zero.
In our newer mechanics, however, we have found (172) components
of momentum at right angles to the velocity and hence even for a body
moving in a straight line with unchanging stresses and velocity we find
that the angular momentum is increasing at the rate
(192)
and in order to maintain the body in its state of uniform motion we
must apply external forces with a turning moment of this same amount.
The presence of this increasing angular momentum in a strained
body arises from the unsymmetrical nature of the stress tensor, the inte
gral j (u X g)dV being as a matter of fact exactly equal to the integral
152
Chapter Ten.
over the same volume of the turning moments of the unsymmetrical
components of the stress. Thus, for example, if we have a body mov
ing in the X direction with the velocity u = xi we can easily see from
equations (172) and (175) the truth of the equality
(u X g) = [(tv, -
(txg -
(txy - tyt)ij].
136. The Right-Angled Lever. An interesting example of the
principle that in general a turning
_^ 7 *^
1 moment is needed for the uniform
translatory motion of a strained body
is seen in the apparently paradoxical
case of the right-angled lever.
Consider the right-angled lever
shown in figure 14. This lever is sta
tionary with respect to a system of
coordinates S°. Referred to S° the
two lever arms are equal in length :
FIG. 14.
and the lever is in equilibrium under the action of the equal forces
Let us now consider the equilibrium as it appears, using a system
of coordinates S with reference to which the lever is moving in X
direction with the velocity V. Referred to this new system of co
ordinates the length li of the arm which lies in the Y direction will be
the same as in system S°, giving us
Zi = Zi°.
But for the other arm which lies in the direction of motion we shall
have, in accordance with the Lorentz shortening,
For the forces Ft and F2 we shall have, in accordance with our equa-
Dynamics of Elastic Bodies. 153
tions for the transformation of force (61) and (62),
771 771
r 2 = r
We thus obtain for the moment of the forces around the pivot B
V*\ Po,ol/2_F,^
• <? ) ~Fltl c> Flh~*>
and are led to the remarkable conclusion that such a moving lever
will be in equilibrium only if the external forces have a definite turning
moment of the magnitude given above.
The explanation of this apparent paradox is obvious, however,
in the light of our previous discussion. In spite of the fact that the
lever is in uniform motion in a straight line, its angular momentum
is continually increasing owing to the fact that it is elastically strained,
and it can be shown by carrying out the integration indicated in
equation (192) that the rate of change of angular momentum is as a
V2
matter of fact just equal to the turning moment FA ~ •
0
v2
This necessity for a turning moment FA — can also be showi:
directly from a consideration of the energy flow in the lever. Since
the force FI is doing the work F\V per second at the point A, a stream
of energy of this amount is continually flowing through the lever
from A to the pivot B. In accordance with our ideas as to the rela
tion between energy and mass, this new energy which enters at A each
second has the mass — ^- , and hence each second the angular mo
mentum of the system around the point B is increased by the amount
FiV V2
~& l ~ rl c2 '
We have already found, however, exactly this same expression for
the moment of the forces around the pivot B and hence see that they
are of just the magnitude necessary to keep the lever from turning,
thus solving completely our apparent paradox.
154
Chapter Ten.
137. Isolated Systems in a Steady State. Our considerations have
shown that the density of momentum is equal to the density of energy
flow divided by the square of the velocity of light. If we have a
system which is in a steady internal state, and is. either isolated or
merely subjected to an external pressure with no components of force
tangential to the bounding surface, it is evident that the resultant
flow of energy for the whole body must be in the direction of motion,
and hence for these systems momentum and velocity will be in the
same direction without the complications introduced by a trans
verse energy flow.
Thus for an isolated system in a steady internal state we may
write,
(193)
'-5
138. The Dynamics of a Particle. It is important to note that
particles are interesting examples of systems in which there will
obviously be no transverse component of energy flow since their
infinitesimal size precludes the action of tangential surface forces.
We thus see that the dynamics of a particle may be regarded as a
special case of the more general dynamics which we have developed
in this chapter, the equation of motion for a particle being
m
dt
in agreement with the work of Chapter VI.
139. Conclusion. We may now point out in conclusion the chief
results of this chapter. With the help of Einstein's equations for
spatial and temporal considerations, we have developed a set of
transformation equations for the strain in an elastic body. Using the
components of strain and velocity as generalized coordinates, we then
introduced the principle of least action, choosing a form of function
Dynamics of Elastic Bodies. 155
for kinetic potential which agrees at low velocities with the choice
made in the older theories of elasticity and at all velocities agrees
with the requirements of the principle of relativity. Using the
Lagrangian equations, we were then able to develop all that is neces
sary for a complete theory of elasticity.
The most important consequence of these considerations is an
extension in our ideas as to the relation between momentum and
energy. We find that the density of momentum in any direction
must be placed equal to the total density of energy flow in that same
direction divided by the square of the velocity of light; and we find
that we must include in our density of energy flow that transferred
through the elastic body by the forces which hold it in its state of
strain and suffer displacement as the body moves. This involves in
general a flow of energy and hence momentum at right angles to the
motion of the body itself.
At present we have no experiments of sufficient accuracy so that
we can investigate the differences between this new theory of elasticity
and the older ones, and hence of course have found no experimental
contradiction to the new theory. It will be seen, however, from the
expressions for momentum that even at low velocities the conse
quences of this new theory will become important as soon as we
run across elastic systems in which very large stresses are involved.
It is also important to show that a theory of elasticity can be de
veloped which agrees with the requirements of the theory of relativity.
In fairness, it must, however, be pointed out in conclusion that since
our expression for kinetic potential was not absolutely uniquely deter
mined there may also be other theories of elasticity which will agree
with the principle of relativity and with all the facts as now known.
CHAPTER XI.
THE DYNAMICS OF A THERMODYNAMIC SYSTEM.
We may now use our conclusions as to the relation between the
principle of least action and the theory of relativity to obtain informa
tion as to the behavior of thermodynamic systems in motion.
140. The Generalized Coordinates and Forces. Let us consider a
thermodynamic system whose state is defined by the generalized
coordinates volume v, entropy S and the values of x, y and z which
determine its position. Corresponding to these coordinates we shall
have the generalized external forces, the negative of the pressure,
— p, temperature, T, and the components of force, Fx, Fy and Fz.
These generalized coordinates and forces are related to the energy
change 8E accompanying a small displacement 6, in accordance with
the equation
5E = - 5W = - pdv + TSS + Fxdx + FySy + F,5z. (194)
141. Transformation Equation for Volume. Before we can apply
the principle of least action we shall need to have transformation
equations for the generalized coordinates, volume and entropy.
In accordance with the Lorentz shortening, we may write the
following expression for the volume v of the system in terms of v° as
measured with a set of axes $° with respect to which the system is
stationary :
x2 + y2 + z2
c2
where u is the velocity of the system.
By differentiation we may obtain expressions which we shall find
useful,
5 = -^=, (195)
dV° V
~Q^ == 7 I7«r\l To = 7 r9~\ ~^> '
156
Dynamics of a Thermodynamic System. 157
142. Transformation Equation for Entropy. As for the entropy
of a thermodynamic system, this is a quantity which must appear
the same to all observers regardless of their motion. This invariance
of entropy is a direct consequence of the close relation between the
entropy of a system in a given state and the probability of that state.
Let us write, in accordance with the Boltzmann-Planck ideas as to
the interdependence of these quantities,
S = k log W,
where S is the entropy of the system in the state in question, fc is a
universal constant, and W the probability of having a microscopic
arrangement of molecules or other elementary constituent parts which
corresponds to the desired thermodynamic state. Since this prob
ability is evidently independent of the relative motion of the observer
and the system we see that the entropy of a system S must be an
invariant and may write
S = S°. '(197)
143. Introduction of the Principle of Least Action. The Kinetic
Potential. We are now in a position to introduce the principle of
least action into our considerations by choosing a form of function
for the kinetic potential which will agree at low velocities with the
familiar principles of thermodynamics and will agree at all velocities
with the requirements of the theory of relativity.
If we use volume and entropy as our generalized coordinates, these
conditions are met by taking for kinetic potential the expression
H = - E°\l --. (198)
This expression agrees with the requirements of the theory of
IT
relativity that . shall be an invariant (see Section 111) and
at low velocities reduces to H = — E, which with our choice of
coordinates is the familiar form for the kinetic potential of a thermo
dynamic system.
158 Chapter Eleven.
It should be noted that this expression for the kinetic potential
of a thermodynamic system applies of course only provided we pick
out volume v and entropy S as generalized coordinates." If, following
Helmholtz, we should think it more rational to take v as one coordinate
and a quantity 0 whose time derivative is equal to temperature,
0 = T7, as the other coordinate, we should obtain of course a different
expression for the kinetic potential; in fact should have under those
circumstances
H = (E° - T°S
Using this value of kinetic potential, however, with the corresponding
coordinates we should obtain results exactly the same as those which
we are now going to work out with the help of the other set of coordi
nates.
144. The Lagrangian Equations. Having chosen a form for the
kinetic potential we may now substitute into the Lagrangian equa
tions (139) and obtain the desired information with regard to the
behavior of thermodynamic systems.
Since we shall consider cases in which the energy of the system is
independent of the position in space, the kinetic potential will be
independent of the coordinates x, y and 2, depending only on their
time derivatives. Noting also that the kinetic potential is inde
pendent of the time derivatives of volume and entropy, we shall
obtain the Lagrangian equations in the simple form
Dynamics of a Thermodynamic System. 159
145. Transformation Equation for Pressure. We may use the first
of these equations to show that the pressure is a quantity which
appears the same to all observers regardless of their relative motion.
We have
U*\ Ji U23E° Ji u*
--? = - V1 -7«"ST := " V1 -7*
dv° dv '
dE°
But, in accordance with equation (194), p° = — —5- . and in ac-
ov
cordance with equation (195),
dv° 1
dv
which gives us the desired relation
p = P°. (200)
Defining pressure as force per unit area, this result will be seen
to be identical with that which is obtained from the transformation
equations for force and area which result from our earliest considera
tions.
146. Transformation Equation for Temperature. The second of
the Lagrangian equations (199) will provide us information as to
measurements of temperature made by observers moving with different
velocities. We have
~ dE° dS°
dS° dS '
But, in accordance with equation (194), — ^ = T° and in accordance
r) Si°
with (197) -r^r = 1. We obtain as our transformation equation,
oo
~7> (201)
T
and see that the quantity . is an invariant for the Lorentz
transformation
160
Chapter Eleven.
147. The Equations of Motion for Quasistationary Adiabatic
Acceleration. Let us now turn our attention to the last three of the
Lagrangian equations. These are the equations for the motion of a
thermodynamic system under the action of external force. It is
evident, however, that these equations will necessarily apply only
to cases of quasistationary acceleration, since our development of
the principle of least action gave us an equation for kinetic potential
which was true only for systems of infinitesimal extent or large systems
in a steady internal state. It is also evident that we must confine our
considerations to cases of adiabatic acceleration, since otherwise the
value of E° which occurs in the expression for kinetic potential might
be varying in a perfectly unknown manner.
The Lagrangian equations for force may be advantageously trans
formed. We have
= - — (-
1 ~ dtdx\~
E
x__ I _ it? 3EC
c2 V1 ~c2 dx
But by equations (194), (196) and (197) we have
8E
We obtain
dx
and
dx
= 0.
E
(202)
Similar equations may be obtained for the components of force in
the Y and Z directions and these combined to give the vector equation
(203)
Dynamics of a Thermodynamic System. 161
This is the fundamental equation of motion for the dynamics of a
thermodynamic system.
148. The Energy of a Moving Thermodynamic System. We may
use this equation to obtain an expression for the energy of a moving
thermodynamic system. If we adiabatically accelerate a thermo
dynamic system in the direction of its motion, its energy will increase
both because of the work done by the force
which produces the acceleration and because of the work done by the
pressure p = p° which acts on a volume which is continually dimin
ishing as the velocity u increases, in accordance with the expression
/ u2
v° A/I — — .
Hence we may write for the total energy
E = E° + -IT
E° °° ° °°
V1
pv
E = -== - P°v° I - - = — - P1'- (204)
149. The Momentum of a Moving Thermodynamic System. We
may compare this expression for the energy of a thermodynamic
system with the following expression for momentum which is evident
from the equation (203) for force:
c
We find again, as in our treatment of elastic bodies presented
in the last chapter, that the momentum of a moving system may be
calculated by taking the total flow of energy in the desired direction
12
162
Chapter Eleven.
and dividing by c2. Thus, comparing equations (204) and (205),
we have
E
pv
(206)
E
where the term -^ u takes care of the energy transported bodily along
c
pv
by the system and the term — u takes care of the energy transferred
c
in the u direction by the action of the external pressure on the rear
and front end of the moving system.
150. The Dynamics of a Hohlraum. As an application of our con
siderations we may consider the dynamics of a hohlraum, since a
hohlraum in thermodynamic equilibrium is of course merely a special
example of the general dynamics which we have just developed. The
simplicity of the hohlraum and its importance from a theoretical
point of view make it interesting to obtain by the present method the
same expression for momentum that can be obtained directly but
with less ease of calculation from electromagnetic considerations.
As is well known from the work of Stefan and Boltzmann, the
energy content E° and pressure p° of a hohlraum at rest and in thermo
dynamic equilibrium are completely determined by the temperature
T° and volume v° in accordance with the equations
where a is the so-called Stefan's constant.
Substituting these values of E° and p° in the equation for the
motion of a thermodynamic system (203), we obtain
dt
'4 av°T°* u'
d
"4 avT* u"
I U2C
{ V1-^ J
dt
°(>-$r.
(207)
as the equation for the quasistationary adiabatic acceleration of a
Dynamics of a Thermodynamic System. 163
hohlraum. In view of this equation we may write for the momentum
of a hohlraum the expression
It is a fact of significance that our dynamics leads to a result for
the momentum of a hohlraum which had been adopted on the ground
of electromagnetic considerations even without the express intro
duction of relativity theory.
CHAPTER XII.
ELECTROMAGNETIC THEORY.
The Einstein theory of relativity proves to be of the greatest
significance for electromagnetics. On the one hand, the new electro
magnetic theory based on the first postulate of relativity obviously
accounts in a direct and straightforward manner for the results of the
Michelson-Morley experiment and other unsuccessful attempts to
detect an ether drift, and on the other hand also accounts just as
simply for the phenomena of moving dielectrics as did the older
theory of a stationary ether. Furthermore, the theory of relativity
provides considerably simplified methods for deriving a great many
theorems which were already known on the basis of the ether theory,
and gives us in general a clarified insight into the nature of electro
magnetic action.
151. The Form of the Kinetic Potential. In Chapter IX we in
vestigated the general relation between the principle of least action
and the theory of the relativity of motion. We saw that the develop
ment of any branch of dynamics would agree with the requirements
of relativity provided only that the kinetic potential H has such a form
TT
that the quantity , is an invariant for the Lorentz transfor
mation. Making use of this discovery we have seen the possibility
of developing the dynamics of a particle, the dynamics of an elastic
body, and the dynamics of a thermodynamic system, all of them in
forms which agree with the theory of relativity by merely introducing
slight modifications into the older expressions for kinetic potential in
TT
such a way as to obtain the necessary invariance for
In the case of electrodynamics, however, on account of the closely
interwoven historical development of the theories of electricity and
relativity, we shall not find it necessary to introduce any modification
164
Electromagnetic Theory. 165
in the form of the kinetic potential, but may take for H the following
expression, which is known to lead to the familiar equations of the
Lorentz electron theory
(curl <t>)2 /e u\ 1
^_? _<)> . (?_|_PHU f (209)
where the integration is to extend over the whole volume of the
system 7, e is the intensity of the electric field at the point in question,
<(> is the value of the vector potential, p the density of charge and u its
velocity.*
Let us now show that the expression which we have chosen for
kinetic potential does lead to the familiar equations of the electron
theory.
152. The Principle of Least Action. If now we denote by f the
force per unit volume of material exerted by the electromagnetic
action it is evident that we may write in accordance with the principle
of least action (135)
= 0, (210)
where 5r is the variation in the radius vector to the particle under
consideration, and where the integration is to be taken over the
whole volume occupied by the system and between two instants of
time ti and tz at which the actual and displaced configurations of the
system coincide.
153. The Partial Integrations. In order to simplify this equation,
we shall need to make use of two results which can be obtained by
partial integrations with respect to time and space respectively.
Thus we may write
£* dt(abb} = f' ad(bb) = [a56]g - J dt\-^
* Strictly speaking this expression for kinetic potential is not quite correct,
since kinetic potential must have the dimensions of energy. To complete the equa
tion and give all the terms their correct dimensions, we could multiply the first term
by the dielectric inductivity of free space e, and the last two terms by the magnetic
permeability i*. Since, however, e and n have the numerical value unity with the
usual choice of units, we shall not be led into error in our particular considerations
if we omit these factors.
166 Chapter Twelve.
or, since the displaced and actual motions coincide at ti and £2,
fdt(adb) = ~
We may also write
fdV(a^) = f dydz(adb) =
or, since we are to carry out our integrations over the whole volume
occupied by the system, we shall take our functions as zero at the
limits of integration and may write
/"(•£ )--/"(•=)•
Since similar considerations apply to derivatives with respect to the
other variables y and z, we can also obtain
/ dV a div b = - / dV b - grad a, (213)
fdVa.- curl b = / dV b - curl a. (214)
154. Derivation of the Fundamental Equations of Electromagnetic
Theory. Making use of these purely mathematical relationships we
are now in a position to develop our fundamental equation (210).
Carrying out the indicated variation, noting that Su = , and
making use of (211) and (214) we easily obtain
+f
1
5r = 0.
In developing the consequences of this equation, it should be
noted, however, that the variations are not all of them independent;
thus, since we shall define the density of charge by the equation
p = div e, (216)
it is evident that it will be necessary to preserve the truth of this
equation in any variation that we carry out. This can evidently be
Electromagnetic Theory. 167
done if we add to our equation (215) the expression
/ dtdVifr[&p - div 5e] = 0,
where ^ is an undetermined scalar multiplier. We then obtain with
the help of (213)
I dtdV [ e + - ~ + grad ^ | • 5e + | curl curl <|>
(217)
= 0,
and may now treat the variations 5e and 6<(> as entirely independent
of the others; we must then have the following equations true
e= -i|£-grad*, (218)
curl curl <(>=- + — , (219)
c c
and have thus derived from the principle of least action the funda
mental equations of modern electron theory. We may put these in
their familiar form by denning the magnetic field strength h by the
equation
h = curl <|> (220)
We then obtain from (219)
and, noting the mathematical identity curl grad ^ = 0, we obtain
from (218)
curle= --§. (222)
C dt
We have furthermore by definition (216)
div e = p, (223)
and noting equation (220) may write the mathematical identity
div h = 0. (224)
168 Chapter Twelve.
These four equations (221—4) are the familiar expressions which
have been made the foundation of modern electron theory. They
differ from Maxwell's original four field equations only by the intro
duction in (221) and (223) of terms which arise from the density of
charge p of the electrons, and reduce to Maxwell's set in free space.
155. We have not yet made use of the last three terms in the
fundamental equation (217) which results from the principle of least
action. As a matter of fact, it can be shown that these terms can be
transformed into the expression
fdtdV [f * - f [uX curl <H* + p grad * + f J -Sr;
(225)
and hence lead to the familiar fifth fundamental equation of modern
electron theory,
(226)
f = p{e+ ["xh]*J.
The transformation of the last three terms of (217) into the form
given above (225) is a complicated one and it has not seemed neces
sary to present it here since in a later paragraph we shall show the
possibility of deriving the fifth fundamental equation of the electron
theory (226) by combining the four field equations (221-4) with the
transformation equations for force already obtained from the principle
of relativity. The reader may carry out the transformation himself,
however, if he makes use of the partial integrations which we have
already obtained, notes that in accordance with the principle of the
conservation of electricity we must have dp = — divp 6r and notes
that Su = — — , where the differentiation — indicates that we are
at at
following some particular particle in its motion, while the differentia-
d d<|>
tion — occurring in — indicates that we intend the rate of change
at some particular stationary point.
156. The Transformation Equations for e, h and p. We have thus
shown the possibility of deriving the fundamental equations of modern
Electromagnetic Theory. 169
electron theory from the principle of least action. We now wish to
introduce the theory of relativity into our discussions by presenting
a set of equations for transforming measurements of e, h and p from
one set of space-time coordinates S to another set S' moving past S
in the X-direction with the velocity V. This set of equations is as
follows :
ex = ex,
(227)
z' = K ( ez +— hy J ,
ez =
hx' = h
hy' = «
&•/ « i
. ' - (229)
where K has its customary significance
As a matter of fact, this set of transformation equations fulfills
all the requirements imposed by the theory of relativity. Thus, in
the first place, it will be seen, on development, that these equations
are themselves perfectly symmetrical with respect to the primed and
unprimed quantities except for the necessary change from + V to
— V. In the second place, it will be found that the substitution of
these equations into our five fundamental equations for electro
magnetic theory (221-2-3-4-6) will successfully transform them
into an entirely similar set with primed quantities replacing the
unprimed ones. And finally it can be shown that these equations
agree with the general requirement derived in Chapter IX that the
170 Chapter Twelve.
TT
quantity — . shall be an invariant for the Lorentz trans-
1 -
formation.
rr
To demonstrate this important invariance of — . we may
point out that by introducing equations (220), (221) and (214), our
original expression for kinetic potential
2 (curl «t»)2
+--Y- ->
can easily be shown equal to
and, noting that our fundamental equations for space and time pro
vide us with the relation
we can easily show that our transformation equations for e and h do
lead to the equality
H Hr
tf I u>*
1-^ \l—3
We thus know that our development of the fundamental equations
for electromagnetic theory from the principle of least action is indeed
in complete accordance with the theory of relativity, since it conforms
with the general requirement which was found in Chapter IX to be
imposed by the theory of relativity on all dynamical considerations.
157. The Invariance of Electric Charge. As to the significance of
the transformation equations which we have presented for e, h and p,
we may first show, in accordance with the last of these equations,
that a given electric charge will appear the same to all observers no
matter what their relative motion.
Electromagnetic Theory. 171
To demonstrate this we merely have to point out that, by intro
ducing equation (17), we may write our transformation equation
for p (229) in the form
which shows at once that the two measurements of density of charge
made by 0 and 0' are in exactly the same ratio as the corresponding
measurements for the Lorentz shortening of the charged body, so
that the total charge will evidently measure the same for the two
observers.
We might express this invariance of electric charge by writing the
equation
Q' = Q. (231)
It should be noted in passing that this result is in entire accord
with the whole modern development of electrical theory, which lays
increasing stress on the fundamentally and indivisibility of. the
electron as the natural unit quantity of electricity. On this basis
the most direct method of determining the charge on an electrified
body would be to count the number of electrons present and this
number must obviously appear the same both to observer 0 and
observer 0'.*
158. The Relativity of Magnetic and Electric Fields. As to the
significance of equations (227) and (228) for transforming the values
of the electric and magnetic field strengths from one system to another,
we see that at a given point in space we may distinguish between the
electric vector e = ex\ + evj + ezk as measured by our original
observer 0 and the vector e' = ex'i -f- evfj -f ez'k as measured in
units of his own system by an observer 0' who is moving past 0 with
the velocity V in the .XT-direction. Thus if 0 finds in an unvarying
electromagnetic field that Qe is the force on a small test charge Q
which is stationary with respect to his system, 0' will find experi-
* A similar invariance of electric charge has been made fundamental in the
author's development of the theory of similitude (i. e., the theory of the relativity
of size). See for example Phys. Rev., vol. 3, p. 244 (1914).
172 Chapter Twelve.
mentally for a similar test charge that moves along with him a value
for the force Qef, where e' can be calculated from with the help of
these equations (227). Similar remarks would apply to the forces
which would act on magnetic poles.
These considerations show us that we should now use caution in
speaking of a pure electrostatic or pure magnetic field, since the
description of an electromagnetic field is determined by the particular
choice of coordinates with reference to which the field is measured.
159. Nature of Electromotive Force. We also see that the " elec
tromotive " force which acts on a charge moving through a magnetic
field finds its interpretation as an " electric " force provided we make
use of a system of coordinates which are themselves stationary with
respect to the charge. Such considerations throw light on such ques
tions, for example, as to the seat of the " electromotive " forces in
" homopolar " electric dynamos where there is relative motion of a
conductor and a magnetic field.
Derivation of the Fifth Fundamental Equation.
160. We may now make use of this fact that the forces acting on
a moving charge of electricity may be treated as purely electrostatic,
by using a set of coordinates which are themselves moving along with
the charge, to derive the fifth fundamental equation of electromagnetic
theory.
Consider an electromagnetic field having the values e and h for
the electric and magnetic field strengths at some particular point.
What will be the value of the electromagnetic force f acting per
unit volume on a charge of density p which is passing through the
point in question with the velocity u?
To solve the problem take a system of coordinates Sf which itself
moves with the same velocity as the charge, for convenience letting
the X-axis coincide with the direction of the motion of the charge.
Since the charge of electricity is stationary with respect to this system,
the force acting on it as measured in units of this system will be by
definition equal to the product of the charge by the strength of the
electric field as it appears to an observer in this system, so that we may
write
F' = Q'e',
Electromagnetic Theory. 173
or
Fx' = Q'ef', Fyf = Q'evf, Fzf = Q'e,'.
For the components of the electrical field ex, ey', ez', we have just
obtained the transformation equations (227), while in our earlier
dynamical considerations in Chapter VI we obtained transformation
equations (61), (62), and (63) for the components of force. Sub
stituting above and bearing in mind that ux = V, uy = uz = 0, and
that Q' = Q, we obtain on simplification
Fx = Qex,
Fy = Q ( ey - y hz J ;
r. -«(•• + 7 *i)i
which in vectorial form gives us the equation
F =
or for the force per unit volume
f = pe + [uxh]*. (226)
This is the well-known fifth fundamental equation of the Maxwell-
Lorentz theory of electromagnetism. We have already indicated the
method by which it could be derived from the principle of least action.
This derivation, however, from the transformation equations, provided
by the theory of relativity, is particularly simple and attractive.
Difference between the Ether and the Relativity Theories of Electro-
magnetism.
161. In spite of the fact that we have now found five equations
which can be used as a basis for electromagnetic theory which agree
with the requirements of relativity and also have exactly the same
form as the five fundamental equations used by Lorentz in building
up the stationary ether theory, it must not be supposed that the
relativity and ether theories of electromagnetism are identical. Al
though the older equations have exactly the same form as the ones
which we shall henceforth use, they have a different interpretation,
since our equations are true for measurements made with the help
of any non-accelerated set of coordinates, while the equations of
174 Chapter Twelve.
Lorentz were, in the first instance, supposed to be true only for mea
surements which were referred to a set of coordinates which were
stationary with respect to the assumed luminiferous ether. Suppose,
for example, we desire to calculate with the help of equation (226),
the force acting on a charged body which is moving with the velocity u;
we must note that for the stationary ether theory, u must be the
velocity of the charged body through the ether, while for us u may be
taken as the velocity past any set of unaccelerated coordinates, pro
vided e and h are measured with reference to the same set of co
ordinates. It will be readily seen that such an extension in the mean
ing of the fundamental equations is an important simplification.
162. A word about the development from the theory of a stationary
ether to our present theory will not be out of place. When it was
found that the theory of a stationary ether led to incorrect con
clusions in the case of the Michelson-Morley experiment, the hypo
thesis was advanced by Lorentz and Fitzgerald that the failure of that
experiment to show any motion through the ether was due to a con
traction of the apparatus in the direction of its motion through the
/ 172
ether in the ratio 1 : A/I — — . Lorentz then showed that if all sys-
\ c2
terns should be thus contracted in the line of their motion through the
ether, and observers moving with such system make use of suitably
contracted meter sticks and clocks adjusted to give what Lorentz
called the " local time," their measurements of electromagnetic
phenomena could be described by a set of equations which have
nearly the same form as the original four field equations which would
be used by a stationary observer. It will be seen that Lorentz was
thus making important progress towards our present idea of the com
plete relativity of motion. The final step could not be taken, however,
without abandoning our older ideas of space and time and giving up
the Galilean transformation equations as the basis of kinematics.
It was Einstein who, with clearness and boldness of vision, pointed
out that the failure of the Michelson-Morley experiment, and all
other attempts to detect motion through the ether, is not due to a
Electromagnetic Theory. 175
fortuitous compensation of effects but is the expression of an important
general principle, and the new transformation equations for kinematics
to which he was led have not only provided the basis for an exact
transformation of the field equations but have so completely revo
lutionized our ideas of space and time that hardly a branch of science
remains unaffected.
163. With regard to the present status of the ether in scientific
theory, it must be definitely stated that this concept has certainly
lost both its fundamentality and the greater part of its usefulness,
and this has been brought about by a gradual process which has only
found its culmination in the work of Einstein. Since the earliest
days of the luminiferous ether, the attempts of science to increase the
substantiality of this medium have met with little success. Thus
we have had solid elastic ethers of most extreme tenuity, and ethers
with a density of a thousand tons per cubic millimeter; we have had
quasi-material tubes of force and lines of force ; we have had vibratory
gyrostatic others and perfect gases of zero atomic weight; but after
every debauch of model-making, science has recognized anew that a
correct mathematical description of the actual phenomena of light
propagation is superior to any of these sublimated material media.
Already for Lorentz the ether had been reduced to the bare function
of providing a stationary system of reference for the measurement of
positions and velocities, and now even this function has been taken
from it by the work of Einstein, which has shown that any unaccel-
erated system of reference is just as good as any other.
To give up the notion of an ether will be very hard for many
physicists, in particular since the phenomena of the interference and
polarization of light are so easily correlated with familiar experience
with wave motions in material elastic media. Consideration will
show us, however, that by giving up the ether we have done nothing
to destroy the periodic or polarizable nature of a light disturbance.
When a plane polarized beam of light is passing through a given
point in space we merely find that the electric and magnetic fields at
that point lie on perpendiculars to the direction of propagation and
undergo regular periodic changes in magnitude. There is no need of
going beyond these actual experimental facts and introducing any
hypothetical medium. It is just as simple, indeed simpler, to say
176 Chapter Twelve.
that the electric or magnetic field has a certain intensity at a given
point in space as to speak of a complicated sort of strain at a given
point in an assumed ether.
Applications to Electromagnetic Theory.
164. The significant fact that the fundamental equations of the
new electromagnetic theory have the same form as those of Lorentz
makes it of course possible to retain in the structure of modern elec
trical theory nearly all the results of his important researches, care
being taken to give his mathematical equations an interpretation in
accordance with the fundamental ideas of the theory of relativity. It
is, however, entirely beyond our present scope to make any presenta
tion of electromagnetic theory as a whole, and in the following para
graphs we shall confine ourselves to the proof of a few theorems which
can be handled with special ease and directness by the methods intro
duced by the theory of relativity.
165. The Electric and Magnetic Fields around a Moving Charge.
Our transformation equations for the electromagnetic field make it
very easy to derive expressions for the field around a point charge in
uniform motion. Consider a point charge Q moving with the velocity
V. For convenience consider a system of reference S such that Q is
moving along the X-axis and at the instant in question, t = 0, let the
charge coincide with the origin of coordinates 0. We desire now to
calculate the values of electric field e and the magnetic field h at any
point in space xt y, z.
Consider another system of reference, S', which moves along with
the same velocity as the charge Q, the origin of coordinates 0' ', and
the charge always coinciding in position. Since the charge is sta
tionary with respect to their new system of reference, we shall have
the electric field at any point x', y', z' in this system given by the
equations
Qx'
ex' =
e' =
(V2 + y'2 + z'1)31* '
Qy'
(x* + y'2 + z'2)"* '
Qz'
'**?*'
Electromagnetic Theory. 177
while the magnetic field will obviously be zero for measurements made
in system £', giving us
hx' = 0,
A/ = 0,
A,' = 0.
Introducing our transformation equations (9), (10) and (11) for xf,
y' and z' and our transformation equations (227) and (228) for the
electric and magnetic fields and substituting i = 0, we obtain for the
values of e and h in system S at the instant when the charge passes
through the point 0,
Or/
hx = 0,
V
v c
hz = - ey,
or, putting s for the important quantity ^ x2 + ( 1 - -^ j (?/2 + s2)
and writing the equations in the vectorial form where we put
r = (x\ + yj + zk),
we obtain the familiar equations for the field around a point charge
13
178 Chapter Twelve.
in uniform motion with the velocity u = V in the X-direction
(232)
h = - [u X e].* (233)
c
166. The Energy of a Moving Electromagnetic System. Our
transformation equations will permit us to obtain a very important
expression for the energy of an isolated electromagnetic system in
terms of the velocity of the system and the energy of the same system
as it appears to an observer who is moving along with it.
Consider a physical system surrounded by a shell which is im
permeable to electromagnetic radiation. This system is to be thought
of as consisting of the various mechanical parts, electric charges and
electromagnetic fields which are inside of the impermeable shell.
The system is free in space, except that it may be acted on by external
electromagnetic fields, and its energy content thus be changed.
Let us now equate the increase in the energy of the system to the
work done by the action of the external field on the electric charges
in the system. Since the force which a magnetic field exerts on a
charge is at right angles to the motion of the charge it does no work
and we need to consider only the work done by the external electric
field and may write for the increase in the energy of the system
A# = //// p(exux + eyuy + ezuz)dx dy dz dt, (234)
where the integration is to be taken over the total volume of the
system and over any time interval in which we may be interested.
Let us now transform this expression with the help of our trans
formation equations for the electric field (227) for electric charge
(229), and for velocities (14-15-16). Noting that our fundamental
equations for kinematic quantities give us dx dy dz dt = dxf dyf dz' dt' ,
we obtain
A# = K ffff P'(ex'uxf + ev'uv' + ez'us')dx' dy' dzf dt'
+ «V ffff P' (••' + ^ h,' - ^f hy' ) dx' dy' dz' dt'.
Electromagnetic Theory. 179
Consider now a system which both at the beginning and end of our
time interval is free from the action of external forces; we may then
rewrite the above equation for this special case in the form
A# = K&E' + KV f 2Fx'dt',
where, in accordance with our earlier equation (234), kE' is the increase
in the energy of the system as it appears to observer 0' and ZFZ'
is the total force acting on the system in X-direction as measured
byO'.
The restriction that the system shall be unacted on by external
forces both at the beginning and end of our time interval is necessary
because it is only under those circumstances that an integration
between two values of t can be considered as an integration between
two definite values of t', simultaneity in different parts of the system
not being the same for observers 0 and 0' '.
We may now apply this equation to a specially interesting case.
Let the system be of such a nature that we can speak of it as being
at rest with respect to S', meaning thereby that all the mechanical
parts have low velocities with respect to S' and that their center of
gravity moves permanently along with S'. Under these circum
stances we may evidently put j *LFt'dt' = 0 and may write the
above equation in the form
/—*
V1-?
or
dE
where u is the velocity of the system, and E° is its energy as measured
by an observer moving along with it. The energy of a system which
is unacted on by external forces is thus a function of two variables, its
energy E0 as measured by an observer moving along with the system
and its velocity u.
180 Chapter Twelve.
We may now write
E = EQ + (f>(u) + const.,
where <j>(u) represents the energy of the system which depends solely
on the velocity of the system and not on the changes in its EQ values.
<j>(u) will thus evidently be the kinetic energy of the mechanical masses
in the system which we have already found (82) to have the value
0 — m0c2 where ra0 is to be taken as the total mass of the
c2
mechanical part of our system when at rest. We may now write
E = — (m0c2 + EQ) — mQc2 + const.
u2
Or, assuming as before that the constant is equal to ra0c2, which will
be equivalent to making a system which has zero energy also have
zero mass, we obtain
=- (».* + *«). (235)
which is the desired expression for the energy of an isolated system
which may contain both electrical and mechanical parts.
167. Relation between Mass and Energy. This expression for the
energy of a system that contains electrical parts permits us to show
that the same relation which we found between mass and energy for
mechanical systems also holds in the case of electromagnetic energy.
Consider a system containing electromagnetic energy and enclosed
by a shell which is impermeable to radiation. Let us apply a force F
to the system in such a way as to change the velocity of the system
without changing its EQ value. We can then equate the work done
per second by the force to the rate of increase of the energy of the
system. We have
dE
F-u - ,. .
dt
Electromagnetic Theory.
181
But from equation (235) we can obtain a value for the rate of increase
dE . .
of energy —- , giving us
F-u = Fxux + Fyiiy + FzUg = ( m0 +-7 )
and solving this equation for F we obtain
E0
du
dt
IT
(236)
which for low velocities assumes the form
(237)
Examination of these expressions shows that our system which
contains electromagnetic energy behaves like an ordinary mechanical
Tjl
^ ) at low velocities or ra0 + —^
— =5*
system with the mass
any desired velocity u. To the energy of the system E0, part of which
is electromagnetic, we must ascribe the mass — just as we found in
the case of mechanical energy. We realize again that" matter and
energy are but different names for the same fundamental entity,
1021 ergs of energy having the mass 1 gram.
The Theory of Moving Dielectrics.
168. The principle of relativity proves to be very useful for the
development of the theory of moving dielectrics.
It was first shown by Maxwell that a theory of electromagnetic
phenomena in material media can be based on a set of field equations,
similar in form to those for free space, provided we introduce besides
the electric and magnetic field strengths, E and F, two new field vectors.
182 Chapter Twelve.
the dielectric displacement D and the magnetic induction B, and
also the density of electric current in the medium i. These quantities
are found to be connected by the four following equations similar in
form to the four field equations for free space:
(238)
curl E= -if, (239)
div D = p, (240)
div B = 0. (241)
For stationary homogeneous media, the dielectric displacement,
magnetic induction and electric current are connected with the
electric and magnetic field strengths by the following equations:
D = eE, (242)
B - MH, (243)
i = o-E, (244)
where e is the dielectric constant, fj, the magnetic permeability and a
the electrical conductivity of the medium in question.
169. Relation between Field Equations for Material Media and
Electron Theory. It must not be supposed that the four field equa
tions (238-241) for electromagnetic phenomena in material media are
in any sense contradictory to the four equations (221-224) for free
space which we took as the fundamental basis for our development of
electromagnetic theory. As a matter of fact, one of the main achieve
ments of modern electron theory has been to show that the electro
magnetic behavior of material media can be explained in terms of
the behavior of the individual electrons and ions which they contain,
these electrons and ions acting in accordance with the four fundamental
field equations for free space. Thus our new equations for material
media merely express from a macroscopic point of view the statistical
result of the behavior of the individual electrons in the material in
question. E and H in these new equations are to be looked upon as
the average values of e and h which arise from the action of the
individual electrons in the material, the process of averaging being so
Electromagnetic Theory. 183
carried out that the results give the values which a macroscopic ob
server would actually find for the electric and magnetic forces acting
respectively on a unit charge and a unit pole at the point in question.
These average values, E and H, will thus pay no attention to the
rapid fluctuations of e and h which arise from the action and motion
of the individual electrons, the macroscopic observer using in fact
differentials for time, dt, and space, dx, which would be large from a
microscopic or molecular viewpoint.
Since from a microscopic point of view E and H are not really
the instantaneous values of the field strength at an actual point in
space, it has been found necessary to introduce two new vectors,
electric displacement, D, and magnetic induction, B, whose time
rate of change will determine the curl of E and H respectively. It will
evidently be possible, however, to relate D and B to the actual electric
and magnetic fields e and h produced by the individual electrons,
and this relation has been one of the problems solved by modern
electron theory, and the field equations (238-241) for material media
have thus been shown to stand in complete agreement with the most
modern views as to the structure of matter and electricity. For
the purposes of the rest of our discussion we shall merely take these
equations as expressing the experimental facts in stationary or in
moving media.
170. Transformation Equations for Moving Media. Since equa
tions (238 to 241) are assumed to give a correct description of electro
magnetic phenomena in media whether stationary or moving with
respect to our reference system S, it is evident that the equations
must be unchanged in form if we refer our measurements to a new
system of coordinates Sf moving past S, say, with the velocity V in the
X-direction.
As a matter of fact, equations (238 to 241) can be transformed
into an entirely similar set
curl E = — — TT7 ;
div D' = P',
div B' = 0,
184 Chapter Twelve.
provided we substitute for x, y, z and t the values of x', y', z' and tf
given by the fundamental transformation equations for space and
time (9 to 12), and substitute for the other quantities in question the
relations
Ev' =
Ezf =
Dx' = Dx,
Dz' =
Hx' = Hx,
B,' = B
(246)
ix = K(IX - Vp), (247)
t,' = lz.
It will be noted that for free space these equations will reduce to
the same form as our earlier transformation equations (227 to 229)
since we shall have the simplifications D = E, B = H and i = pu.
We may also call attention at this point to the fact that our funda-
Electromagnetic Theory. 185
mental equations for electromagnetic phenomena (238-241) in di
electric media might have been derived from the principle of least
action, making use of an expression for kinetic potential which could
//E-D H-B\
dV( — - --- — 1, and it will be noticed
that our transformation equations for these quantities are such as to
IT
preserve that necessary in variance for —7= = which we found in
Chapter IX to be the general requirement for any dynamical develop
ment which agrees with the theory of relativity.
171. We are now in a position to handle the theory of moving
media. Consider a homogeneous medium moving past a system of
coordinates S in the Jf -direction with the velocity F; our problem is
to discover relations between the various electric and magnetic
vectors in this medium. To do this, consider a new system of co
ordinates Sf also moving past our original system with the velocity V.
Since the medium is stationary with respect to this new system S' we
may write for measurements referred to S' in accordance with equa
tions (242 to 244) the relations
D' = eE',
B' - jeH',
i' = <rE',
which, as we have already pointed out, are known experimentally to
be true in the case of stationary, homogeneous media. €, ^ and a- are
evidently the values of dielectric constant, permeability and con
ductivity of the material in question, which would be found by an
experimenter with respect to whom the medium is stationary.
Making use of our transformation equations (245 to 247) we can
obtain by obvious substitutions the following set of relations for
measurements made with respect to the original system of coordi
nates S:
D, = eEx,
(248)
186
Chapter Twelve.
>z +—Hy = e( E2 + — By }
c \ c V
Bx =
V
BV+-ES = v,
V
Bz --Ev -
(249)
(250)
172. Theory of the Wilson Experiment. The equations which we
have just developed for moving media are, as a matter of fact, in
complete accord with the celebrated experiment of H. A. Wilson on
moving dielectrics and indeed all other experiments that have been
performed on moving media.
Wilson's experiment consisted in the rotation of a hollow cylinder
of dielectric, in a magnetic field which was parallel to the axis of the
cylinder. The inner and outer surfaces of the cylinder were covered
with a thin metal coating, and arrangements made with the help of
wire brushes so that electrical contact could be made from these
coatings to the pairs of quadrants of an electrometer. By reversing
the magnetic field while the apparatus was in rotation it was possible
to measure with the electrometer the charge produced by the electrical
displacement in the dielectric. We may make use of our equations
to compute the quantitative size of the effect.
FIG. 15.
Electromagnetic Theory. 187
Let figure 15 represent a cross-section of the rotating cylinder.
Consider a section of the dielectric AA which is moving perpendicularly
to the plane of the paper in the X-direction with the velocity V. Let
the magnetic field be in the F-direction parallel to the axis of rotation.
The problem is to calculate dielectric displacement Dz in the Z-
direction.
Referring to equations (248) we have
and, substituting the value of Bv given by equations (249),
V „
we obtain
V
V
or, neglecting terms of orders higher than — , we have
c
Dz = eE,+- (efjL - l)Hv. (251)
For a substance whose permeability is practically unity such as
Wilson actually used the equation reduces to
D, = eEz + -j (e - !)#„,
and this was found to fit the experimental facts, since measurements
with the electrometer show the surface charge actually to have the
magnitude Dz per square centimeter in accordance with our equation
div D = p.
It would be a matter of great interest to repeat the Wilson experi
ment with a dielectric of high permeability so that we could test the
complete equation (251). This is of some importance since the
original Lorentz theory led to a different equation,
Dz = eE2 + (e -
CHAPTER XIII.
FOUR-DIMENSIONAL ANALYSIS.
173. In the present chapter we shall present a four-dimensional
method of expressing the results of the Einstein theory of relativity,
a method which was first introduced by Minkowski, and in the form
which we shall use, principally developed by Wilson and Lewis. The
point of view adopted, consists essentially in considering the properties
of an assumed four-dimensional space in which intervals of time are
thought of as plotted along an axis perpendicular to the three Car
tesian axes of ordinary space, the science of kinematics thus becoming
the geometry of this new four-dimensional space.
The method often has very great advantages not only because it
sometimes leads to considerable simplification of the mathematical
form in which the results of the theory of relativity are expressed,
but also because the analogies between ordinary geometry and the
geometry of this imaginary space often suggest valuable modes of
attack. On the other hand, in order to carry out actual numerical
calculations and often in order to appreciate the physical significance
of the conclusions arrived at, it is necessary to retranslate the results
obtained by this four-dimensional method into the language of ordinary
kinematics. It must further t>e noted, moreover, that many im
portant results of the theory of relativity can be more easily obtained
if we do not try to employ this four-dimensional geometry. The
reader should also be on his guard against the fallacy of thinking that
extension in time is of the same nature as extension in space merely
because intervals of space and time can both be represented by
plotting along axes drawn on the same piece of paper.
174. Idea of a Time Axis. In order to grasp the method let us
consider a particle constrained to move along a single axis, say OX,
and let us consider a time axis OT perpendicular to OX. Then the
position of the particle at any instant of time can be represented by a
point in the XT plane, and its motion as time progresses by a line in
the plane. If, for example, the particle were stationary, its behavior
188
Four Dimensional Analysis.
189
in time and space could be represented by a line parallel to the time
axis OT as shown for example by the line ab in figure 16. A particle
A
\ Ax
FIG. 16.
dx
moving with the uniform velocity u = -r- could be represented by a
straight line etc making an angle with the time axes, and the kine-
matical behavior of an accelerated particle could be represented by a
curved line.
By conceiving of a /tmr-dimensional space we can extend this
method which we have just outlined to include motion parallel to
all three space axes, and in accordance with the nomenclature of
Minkowski might call such a geometrical representation of the space-
time manifold " the world," and speak of the points and lines which
represent the instantaneous positions and the motions of particles as
" world-points " and " world-lines."
175. Non-Euclidean Character of the Space. It will be at once
evident that the graphical method of representing kinematical events
which is shown by Figure 16 still leaves something to be desired. One
of the most important conclusions drawn from the theory of relativity
was the fact that it is impossible for a particle to move with a velocity
greater than that of light, and it is evident that there is nothing in
our plot to indicate that fact, since we could draw a line making any
desired angle with the time axis, up to perpendicularity, and thus
190
Chapter Thirteen.
represent particles moving with any velocity up to infinity,
Ax
u
= co.
It is also evident that there is nothing in our plot to correspond to
that invariance in the velocity of light which is a cornerstone of the
theory of relativity. Suppose, for example, the line OC, in figure 17,
/Q
T'l
/
/
/
/
/
/ !
1 1
/ /
I /
1 1
1 1
1 1
1 1
1 1
i 1
FIG. 17.
represents the trajectory of a beam of light with the velocity —7 = c;
there is then nothing so far introduced into our method of plotting
to indicate the fact that we could not equally well make use of another
set of axes OX'T' ', inclined to the first and thus giving quite a different
Ax'
value, —7 , to the velocity of the beam of light.
There are a number of methods of meeting this difficulty and
obtaining the invariance for the four-dimensional expression x2 + y2
+ z2 — c2t2 (see Chapter IV) which must characterize our system of
kinematics. One of these is to conceive of a four-dimensional Eu-
Four Dimensional Analysis. 191
clidean space with an imaginary time axis, such that instead of plotting
real instants in time along this axis we should plot the quantity
I = ict where i = V — 1. In this way we should obtain invariance
for the quantity x2 + y2 + z2 -+- I2 = x2 + y2 + z2 — c2t2, since it may
be regarded as the square of the magnitude of an imaginary four-
dimensional radius vector. This method of treatment has been
especially developed by Minkowski, Laue, and Sommerfeld. Another
method of attack, which has been developed by Wilson and Lewis
and is the one which we shall adopt in this chapter, is to use a real
time axis, for plotting the real quantity ct, but to make use of a non-
Euclidean four-dimensional space in which the quantity (x2 + y2 + z2
— c2t2) is itself taken as the square of the magnitude of a radius vector.
This latter method has of course the disadvantages that come from
using a non-Euclidean space; we shall find, however, that these reduce
largely to the introduction of certain rules as to signs. The method
has the considerable advantage of retaining a real time axis which is
of some importance, if we wish to visualize the methods of attack and
to represent them graphically.
We may now proceed to develop an analysis for this non-Euclidean
space. We shall find this to be quite a lengthy process but at its
completion we shall have a very valuable instrument for expressing
in condensed language the results of the theory of relativity. Our
method of treatment will be almost wholly analytical, and the geo
metrical analogies may be regarded merely as furnishing convenient
names for useful analytical expressions. A more geometrical method
of attack will be found in the original work of Wilson and Lewis.
PART I. VECTOR ANALYSIS OF THE NON-EUCLIDEAN FOUR-
DIMENSIONAL MANIFOLD.
176. Consider a four-dimensional manifold in which the position
of a point is determined by a radius vector
r =
where ki, k2, k3 and k4 may be regarded as unit vectors along four
mutually perpendicular axes and Xi, xz, xs, and x^. as the magnitudes
of the four components of r along these four axes. We may identify
Xi, Xz, and #3 with the three spatial coordinates of a point x, y and z
192 Chapter Thirteen.
with reference to an ordinary set of space axes and consider x± as a
coordinate which specifies the time (multiplied by the velocity of
light) when the occurrence in question takes place at the point xyz.
We have
Xi = x, x2 = y, x3 = z, £4 = ct, (252)
and from time to time we shall make these substitutions when we
wish to interpret our results in the language of ordinary kinematics.
We shall retain the symbols Xi, x2, x3, and x.i throughout our develop
ment, however, for the sake of symmetry.
177. Space, Time and Singular Vectors. Our space will differ in
an important way from Euclidean space since we shall consider three
classes of one-vector, space, time and singular vectors. Considering
the coordinates Xi, xz, #3 and x± which determine the end of a radius
vector,
Space or y-vectors will have components such that
and we shall put for their magnitude
Time or d-vectors will have components such that
Z42 > (X? + X? + Z32),
and we shall put for their magnitude
s = z42 - x? - x22 - x32. . (254)
Singular or a-vectors will have components such that
and their magnitude will be zero.
178. Invariance of x2 + y2 + z2 — cH2. Since we shall naturally
consider the magnitude of a vector to be independent of any particular
choice of axes we have obtained at once by our definition of magnitude
for any rotation of axes that invariance for the expression
Four Dimensional Analysis. 193
which is characteristic of the Lorentz transformation, and have thus
evidently set up an imaginary space which will be suitable for plotting
kinematical events in accordance with the requirements of the theory
of the relativity of motion.
179. Inner Product of One-Vectors. We shall define the inner
product of two one-vectors with the help of the following rules for the
multiplication of unit vectors along the axes
ki-kj = k2-k2 - k3-k3 = 1, k4-k4 = -- 1, kn-km = 0. (255)
It should be noted, of course, that there is no particular sig
nificance in picking out the product k4-ki as the one which is nega
tive; it would be equally possible to develop a system in which the
products kL • ki, k2 • k2, and k3 • k3 should be negative and k4 • k4 positive.
The above rules for unit vectors are sufficient to define completely
the inner product provided we include the further requirements that
this product shall obey the associative law for a scalar factor and the
distributive and commutative laws, namely
= n(a-b) = (a-b)(n),
a-(b + c) = a-b + a-c, (256)
a-b = b-a.
For the inner product of a one-vector by itself we shall have, in
accordance with these rules,
r-r = (ziki + £2k2 + z3k3 + z4k4) • (ziki + z2k2 + z3k3 -f z4k4)
= (Zl2 + Z2* + X.2 - Z42)
and hence may use the following expressions for the magnitudes of
vectors in terms of inner product
s = A/r-r for ^-vectors, s = V— r-r for 5-vectors. (258)
For curved lines we shall define interval along the curve by the
equations
f ds = f Vdr • dr for ^-curves,
J _ (259)
J" ds = f V — dr - dr for 6-curves.
14
194 Chapter Thirteen.
Our rules further show us that we may obtain the space components
of any one vector by taking its inner product with a unit vector
along the desired axis and may obtain the time component by taking
the negative of the corresponding product. Thus
r-k2 = (ziki + z2k2 + £3k3 + Z4k4)-k2 = x2,
(260)
r-k3
r-k4 = (ziki + z2k2 + z3k3 + £4k4) -k4 = — z4.
We see finally moreover in general that the inner product of any
pair of vectors will be numerically equal to the product of the mag
nitude of either by the projection of the other upon it, the sign de
pending on the nature of the vectors involved.
180. Non-Euclidean Angle. We shall define the non-Euclidean
angle 6 between two vectors rx and r2 in terms of their magnitudes
Si and s2 by the expressions
db TI-TZ = (si X projection s2) = SiS2 cosh 0, (261)
the sign depending on the nature of the vectors in the way indicated
in the preceding section. We note the analogy between this equation
and those familiar in Euclidean vector-analysis, the hyperbolic
trigonometeric functions taking the place of the circular functions
used in the more familiar analysis.
For the angle between unit vectors k and k' we shall have
coshfl = ± k-k', (262)
where the sign must be chosen so as to make cosh 6 positive, the
plus sign holding if both are 7-vectors and the minus sign if both are
5- vectors.
181. Kinematical Interpretation of Angle in Terms of Velocity.
At this point we may temporarily interrupt the development of our
four-dimensional analysis to consider a kinematical interpretation of
non-Euclidean angles in terms of velocity. It will be evident from
our introduction that the behavior of a moving particle can be repre
sented in our four-dimensional space by a 5-curve,* each point on
* It is to be noted that the actual trajectories of particles are all of them repre
sented by 5-curves since as we shall see 7-curves would correspond to velocities
greater than that of light.
Four Dimensional Analysis. 195
this curve denoting the position of the particle at a given instant of
time, and it is evident that the velocity of the particle will be deter
mined by the angle which this curve makes with the axes.
Let r be the radius vector to a given point on the curve and con
sider the derivative of r with respect to the interval s along the curve;
we have
dr dxi dx2 dx3 dx4
and this may be regarded as a unit vector tangent to the curve at the
point in question.
If 0 is the angle between the k4 axis and the tangent to the curve
at the point in question, we have by equation (262)
cosh 4> = — w • k4 = ~ ;
as
making the substitutions for xlf x2, x3) and x4, in terms of x, y, z and t
we may write, however,
ds = Vdz42 - dx? - dxf - dxf ~ 1 ~~ cdt>
which gives us
cosh 0 = . - = (265)
2
and by the principles of hyperbolic trigonometry we may write the
further relations
u
sinh 0 = -7=^== , (266)
(267)
VECTORS OF HIGHER DIMENSIONS
182. Outer Products. We shall define the outer product of two
one-vectors so that it obeys the associative law for a scalar factor, the
196 Chapter Thirteen.
distributive law and the anti-commutative law, namely,
(na) X b = w(a X b) = a X (wb),
aX(b + c)=aXb+aXc (a + b)Xc = aXc + bXc; (268)
a X b = -bXa.
From a geometrical point of view, we shall consider the outer
product of two one-vectors to be itself a two-vector, namely the paral
lelogram, or more generally, the area which they determine. The
sign of the two-vector may be taken to indicate the direction of pro
gression clockwise or anti-clockwise around the periphery. In order
to accord with the requirement that the area of a parallelogram deter
mined by two lines becomes zero when they are rotated into the same
direction, we may complete our definition of outer product by adding
the requirement that the outer product of a vector by itself shall be
zero.
a X a = 0. (269)
We may represent the outer products of unit vectors along the
chosen axes as follows:
ki X ki = k2 X k2 = k3 X k3 - k4 X k4 = 0,
ki X k2 = - k2 X ki = kia = - k21, (270)
ki X k3 = — k3 X ki = k^ = — k3i, etc.,
where we may regard ki2, for example, as a unit parallelogram in the
plane XfiX*.
We shall continue to use small letters in Clarendon type for one-
vectors and shall use capital letters in Clarendon type for two-vectors.
The components of a two-vector along the six mutually perpendicular
planes XiOX2, XiOX3, etc., may be obtained by expressing the one-
vectors involved in terms of their components along the axes and
carrying out the indicated multiplication, thus:
A = a X b = (aiki + a2k2 -f a3k3 +
X (6iki + 62k2 + 63k3 + 64k4)
= (a i62 - a2&i)ki2 + (ai&3 - as&i)ki3 + (0164 - a46i)ki4
-f (a264 — a462)k24 + (a364 — a463)k34,
Four Dimensional Analysis. 197
or, calling the quantities (ai&2 — a2fri), etc., the component magni
tudes of A, A 12, etc., we may write
A = A12k12 + A13k13 + 4i4k,4 + A23k23 + A24k24 + A34k,4. (272)
The concept of outer product may be extended to include the
idea of vectors of higher number of dimensions than two. Thus the
outer product of three one-vectors, or of a one-vector and a two-vector
will be a three-vector which may be regarded as a directed parallelo-
piped in our four-dimensional space. The outer product of four one-
vectors will lead to a four-dimensional solid which would have direction
only in a space of more than four dimensions and hence in our case
will be called a pseudo-scalar. The outer product of vectors the
sum of whose dimensions is greater than that of the space considered
will vanish.
The results which may be obtained from different types of outer
multiplication are tabulated below, where one-vectors are denoted
by small Clarendon type, two-vectors by capital Clarendon type,
three-vectors by Tudor black capitals, and pseudo-scalars by bold face
Greek letters.
A = a X b = -bXa= (alb2 - «2&i)k12 + (a^ - a3&i)k13
+ (a 164 — a46i) ku + (a263 — a362)k23 + (a264 — a462)k2i
+ (a364 — a463)k34,
H = c X A = (ciA23 - c2Ai3 4- c3Ai2)k123
(273)
c4A23)k234,
a = A X B =
The signs in these expressions are determined by the general rule
that the sign of any unit vector knmo will be reversed by each transposition
of the order of a pair of adjacent subscripts, thus:
kabcd = — kbacd = kbcad, etc., ' ' ' . (274)
198 Chapter Thirteen.
183. Inner Product of Vectors in General. We have previously
defined the inner product for the special case of a pair of one-vectors,
in order to bring out some of the important characteristics of our
non-Euclidean space. We may now give a general rule for the inner
product of vectors of any number of dimensions.
The inner product of any pair of vectors follows the associative
law for scalar factors, and follows the distributive and commutative
laws.
Since we can express any vector in terms of its components, the
above rules will completely determine the inner product of any pair
of vectors provided that we also have a rule for obtaining the inner
products of the unit vectors determined by the mutually perpendicular
axes. This rule is as follows: Transpose the subscripts of the unit
vectors involved so that the common subscripts occur at the end and
in the same order and cancel these common subscripts. If both the
unit vectors still have subscripts the product is zero; if neither vector
has subscripts the product is unity, and if one of the vectors still has
subscripts that itself will be the product. The sign is to be taken
as that resulting from the transposition of the subscripts (see equa
tion (274)), unless the subscript 4 has been cancelled, when the sign
will be changed.
For example:
ki24-k34 = ki2-k3 = 0,
k132-ki23 = - ki23-k123 = - 1, (275)
ki24-k42 = - ki24-k24 = ki.
It is evident from these rules that we may obtain the magnitude
of any desired component of a vector by taking the inner product of
the vector by the corresponding unit vector, it being noticed, of course,
that when the unit vector involved contains the subscript 4 we obtain
the negative of the desired component. For example, we may obtain
the ki2 component of a two-vector as follows:
A]2 = A-ki2 = (Ai2ki2 + A13k13 + A]4ki4
(276)
+ A23k23 + Asiksi + A34k34) -ki2.
184. The Complement of a Vector. In an n-dimensional space
any m-dimensional vector will uniquely determine a ~new vector of
Four Dimensional Analysis. 199
dimensions (n— m) which may be called the complement of the
original vector. The complement of a vector may be exactly defined
as the inner product of the original vector with the unit pseudo-scalar
ki23 ... n. In general, we may denote the complement of a vector
by placing an asterisk * after the symbol. As an example we may
write as the complement of a two-vector A in our non-Euclidean four-
dimensional space:
A* = A-ki234 = (Ai2k12 + Ai3ki3 + Auku
+ A23k23 + A24k24 + Awkrf) • k1234 (277)
185. The Vector Operator, <0 or Quad. Analogous to the familiar
three-dimensional vector-operator del,
v-k'i + *£, + *£;• (278)
we may define the four-dimensional vector-operator quad,
o'**£+*£+*5-*5;- (279)
If we have a scalar or a vector field we may apply these operators
by regarding them formally as one-vectors and applying the rules
for inner and outer multiplication which we have already given.
Thus if we have a scalar function F which varies continuously
from point to point we can obtain a one-vector which we may call
the four-dimensional gradient of F at the point in question by simple
multiplication; we have
GradF=OF = k1g + k2g+k,g-k,g;. (280)
If we have a one-vector field, with a vector f whose value varies
from point to point we may obtain by inner multiplication a scalar
quantity which we may call the four-dimensional divergence of f
we have
^-f + + + -. (281)
OX\ OXz 0X3 0X4
Taking the outer product with quad we may obtain a two-vector, the
200 Chapter Thirteen.
four-dimensional curl of f,
k24 +
By similar methods we could apply quad to a two-vector function F
and obtain the one-vector function 0 • F and the three-vector func
tion 0 X F.
186. Still regarding 0 as a one-vector we may obtain a number of
important expressions containing 0 more than once; we have:
0 X (OF) = 0, (283) 0 X (0 X f) = 0, (286)
0-(0-F)=0, (284) OX(OXF)=0, (287)
O'(0-lf)=0, (285)
0:(0 xf) = O(O-f) - (O-O)f, (288)
0-(0 X F) = 0 X (0-F) + (O'O)F, (289)
0-(0 xf)= 0 x(O-JT)- (0-0)3f- (290)
The operator 0 ' 0 or O2 nas l°ng been known under the name
of the D'Alembertian,
a2 d2 a2 d2 a2
(291)
From the definition of the complement of a vector given in the
previous section it may be shown by carrying out the proper expansions
that
(0 X0)* = 0-0*, (292)
where 0 is a vector of any number of dimensions.
187. Tensors. In analogy to three-dimensional tensors we may
define a four-dimensional tensor as a quantity with sixteen components
as given in the following table:
rn /TT /TT rn
JL 11 J- 12 1 13 * 14,
rji rri rn /TT
-i 21 J 22 Ji 23 J 24j
r r r r (293)
J- 31 * 32 -L 33 * 34>
Four Dimensional Analysis. 201
with the additional requirement that the divergence of the tensor,
defined as follows, shall itself be a one-vector.
}
(294)
•MS*--
188. The Rotation of Axes. Before proceeding to the application
of our four-dimensional analysis to the actual problems of relativity
theory we may finally consider the changes in the components of a
vector which would be produced by a rotation of the axes. We have
already pointed out that the quantity (xi2 -f x22 + £s2 — #42) is an
invariant in our space for any set of rectangular coordinates having
the same origin since it is the square of the magnitude of a radius
vector, and have noted that in this way we have obtained for the
quantity (x2 + y2 + z2 — c2t2) the desired invariance which is charac
teristic of the Lorentz transformation. In fact we may look upon
the Lorentz transformation as a rotation from a given set of axes to a
new set, with a corresponding re-expression of quantities in terms of
the new components. The particular form of Lorentz transformation,
familiar in preceding chapters, in which the new set of spatial axes
has a velocity component relative to the original set, in the X-direction
alone, will be found to correspond to a rotation of the axes in which
only the directions of the Xi and X4 axes are changed, the X% and X3
axes remaining unchanged in direction.
Let us consider a one-vector
a = (a-iki + a2k2 + a3k3 + a4ki) = (a/k/ + a2'k2' + a/k3' + a/k/),
where ab a2, a3 and a4 are the component magnitudes, using a set of
axes which have kx, k2, k3 and k4 as unit vectors and a/, a2'> as' and a/
the corresponding magnitudes using another set of mutually per
pendicular axes with the unit vectors k/, k/, k/ and k/. Our problem,
202
Chapter Thirteen.
now, is to find relations between the magnitudes «i, a2, a3 and a4 and
a/, a2', as' and a/.
We have already seen sections, (179) and (183), that we may obtain
any desired component magnitude of a vector by taking its inner
product with a unit vector in the desired direction, reversing the
sign if the subscript 4 is involved. We may obtain in this way an
expression for a\ in terms of a/, a/, a/ and a/. We have
Ol = a-ka = (a/k/ + a2'k2' + a3'k3' + a/k^-kx
= ai'ki' • ki + a2'k2' • ki + a/k/ • ki + a/k/ • ki. (295)
By similar multiplications with k2, k3 and k4 we may obtain expres
sions for a2, a3, and — a4. The results can be tabulated in the con
venient form
a/
a,'
V
a*'
ai
k/'k!
k/.ki
k/'ki
fc'-k,
a2
k/.k2
k/-k2
ka'-k,
ki'vk,
a3
k/-k3
k2'-k3
ks'-k.
k/-k3
a4
- k/.k4
- k/.k4
- k/-k4
-k/-k4
(296)
Since the square of the magnitude of the vector, (ai2 + a22 + a23
— a42), is a quantity which is to be independent of the choice of axes,
we shall have certain relations holding between the quantities k/-ki,
kir-k2, etc. These relations, which are analogous to the familiar
Four Dimensional Analysis.
203
conditions of orthogonality in Euclidean space, can easily be shown
to be
and
+ (k/.k,)2 + (k/.k,)2 - (k/-k4)2 = 1,
+ (k2'-k2)2 + (k2'-k3)2 - (k/-k4)2 = 1,
+ (k3'-k2)2 + (k/-k3)2 - (k/-k4)2 - 1,
+ (k^-ko)2 + (k/-k3)2 - (k/-k4)2 = -
H (k/.k2)(k2'.k2) + (k/-k3)(k/-k3)
(297)
=0,
etc., for each of the six pairs of vertical columns in table (296).
Since we shall often be interested in a simple rotation in which
the directions of the X2 and Xs axes are not changed, we shall be able
to simplify this table for that particular case by writing
k2' = k2, k3' = k3,
and noting the simplifications thus introduced in the products of the
unit vectors, we shall obtain
ai1
a,'
a,'
04'
a\
k^-k,
0
0
ki'-k,
a2
as
0
1
0
0
0
0
1
0
0|
-ki'-ki
0
0
- k/-k4
(298)
204 Chapter Thirteen.
If now we call 0 the angle of rotation between the two time axes
' and OX4} we may write, in accordance with equation (262),
- k/-k4 = cosh </>.
Since we must preserve the orthogonal relations (297) and may
also make use of the well-known expression of hyperbolic trigonometry
cosh2 <f> — sinh2
1,
we may now rewrite our transformation table in the form
Oi'
a,'
a,'
a/
hi
cosh (f)
0
0
sinh 0
a2
0
1
0
0
«3
0
0
1
0
a4
sinh </>
0
0
cosh </>
(299)
By a similar process we may obtain transformation tables for the
components of a two-vector A. Expressing A in terms of the unit
vectors ki2', ki3', kH', etc., and taking successive inner products with
the unit vectors ki2, ki3, ki4, etc., we may obtain transformation
equations which can be expressed by the tabulation (300) shown on
the following page.
Four Dimensional Analysis.
205
A 12 ki2'-ki2
k12'-k13
A14
ki3'-k12 ki4'-k12
k13'-k13 k14'-k13
A24'
k24'-k12
k24'-k13
-k12'-ki4 ;-ki/-k14 -ki4'-ki4 -k22/-k14 -k24/-ki4
k13' • k23 k14'
13 -K23 KM 'K23
-k12'-ko4
w
-k13'-k24-k14'.k24! -k2/-k24 -k2/
k3/-k12
k3/-k13
-k3/-k14; (300)
/ • k34 — ki4' • k34 — k23' • k34 — k24' • k34 — k?4r • k24
For the particular case of a rotation in which the direction of the
X2 and X3 axes are not changed we shall have
k2' = k2, k3' = k3,
and very considerable simplification will be introduced. We shall
have, for example,
k12'-k12 = (k/ X k^-Ckx X k2) = (k/ X k2)-(k! X k2) = k/.k^
kw'-kn = (k/ X kaO-Cki X k2) = (k/ X ks)-(ki X k2) = 0,
etc.
Making these and similar substitutions and introducing, as before,
206
Chapter Thirteen.
the relation — k'4-k4 = cosh 0 where 0 is the non-Euclidean angle
between the two time axes, we may write our transformation table
in the form
A12'
Ar;
Au'
A,,'
A*'
Au'
A 12
cosh 0
0
0
0
sinh 0
0
Aa
0
cosh 0
0
0
0
sinh 0
Au
0
0
1
0
0
0
A 23
0
0
0
1
0
0
A^
— sinh 0
0
0
0
cosh 0
0
AM
0
— sinh 4>
0
0
0
cosh 0
(301)
189. Interpretation of the Lorentz Transformation as a Rotation
of Axes. We may now show that the Lorentz transformation may
be looked upon as a change from a given set of axes to a rotated set.
Since the angle 0 which occurs in our transformation tables is
that between the k4 axis and the new k/ axis, we may write, in ac
cordance with equations (265) and (266),
cosh 0 =
sinh 0 = — p=
where V is the velocity between the two sets of space axes which
correspond to the original and the rotated set of four-dimensional
axes. This will permit us to rewrite our transformation table for the
Four Dimensional Analysis.
207
components of a one-vector in the forms
Vic
a,
a,
a
F/c
72
_
oV
_Z!
0
0
V/c
72
(302)
Consider now any point P(XI, x2, x3, ar4). The radius vector from
the origin to this point will be r = (ziki + a:2k2 + x3k3 + x4k4), or,
making use of the relations between xi, x2, a:3, x4 and x, y, z, t given
by equations (252), we may write
r =
Applying our transformation table to the components of this one-
vector, we obtain the familiar equations for the Lorentz transformation
x - VI
208 Chapter Thirteen.
y = y,
z' = z,
V2
We thus see that the Lorentz transformation is to be interpreted
in our four-dimensional analysis as a rotation of axes.
190. Graphical Representation. Although we have purposely re
stricted ourselves in the foregoing treatment to methods of attack
which are almost purely analytical rather than geometrical in nature,
the importance of a graphical representation of our four-dimensional
manifold should not be neglected. The difficulty of representing all
four axes on a single piece of two-dimensional paper is not essentially
different from that encountered in the graphical representation of the
facts of ordinary three-dimensional solid geometry, and these diffi
culties can often be solved by considering only one pair of axes at a
time, say OXi and OX4) and plotting the occurrences in the XiOX^
plane. The fact that the geometry of this plane is a non-Euclidean
one presents a more serious complication since the figures that we
draw on our sheet of paper will obviously be Euclidean in nature,
but this difficulty also can be met if we make certain conventions as
to the significance of the lines we draw, conventions which are funda
mentally not so very unlike the conventions by which we interpret as
solid, a figure drawn in ordinary perspective.
Consider for example the diagram shown in figure 18, where we
have drawn a pair of perpendicular axes, OXi, and OX4, and the
two unit hyperbolae given by the equations
x? - x? = 1,
(303)
xf -x? = - 1,
together with their asymptotes, OA and OB, given by the equation
xf - x? = 0. (304)
This purely Euclidean figure permits, as a matter of fact, a fairly
satisfactory representation of the non-Euclidean properties of the
manifold with which we have been dealing.
Four Dimensional Analysis.
209
OX i and OX> may be considered as perpendicular axes in the
non-Euclidean XiOX^ plane. Radius vectors lying in the quadrant
A OB, will have a greater component along the X4 than along the Xi
axis and hence will be S-vectors with the magnitude s = Vx42 — £i2,
where x\ and x\ are the coordinates of the terminal of the vector.
D
FIG. 18.
7-radius-vectors will lie in the quadrant BOC and will have the mag
nitude s = V#i2 — £42. Radius vectors lying along the asymptotes
OA and OB will have zero magnitudes (s = Vzr — x42 = 0) and
hence will be singular vectors.
Since the two hyperbola? have the equations x-f — x,f = 1 and
Xi~ — #42 = — 1, rays such as Oa, Oaf, Ob, etc., starting from the
origin and terminating on the hyperbola?, will all have unit magnitude.
Hence we may consider the hyperbolae as representing unit pseudo-
circles in our non-Euclidean plane and consider the rays as repre
senting the radii of these pseudo-circles.
A non-Euclidean rotation of axes will then be represented by
changing from the axes OXi and OX± to OAY and 0AY, and taking
Oaf and Ob' as unit distances along the axes instead of Oa and Ob.
15
210 Chapter Thirteen.
It is easy to show, as a matter of fact, that such a change of axes
and units does correspond to the Lorentz transformation. Let Xi
and Xi be the coordinates of any point with respect to the original
axes OXi and OX4, and Xi" and x*" the coordinates of the same point
referred to the oblique axes OXi and OX/, no change having yet
been made in the actual lengths of the units of measurement. Then,
by familiar equations of analytical geometry, we shall have
Xi = Xi" cos 6 + #4" sin 6, (
(305)
£4 = Xi" sin 0 -f x" cos 0,
where 0 is the angle XiOX\.
We have, moreover, from the properties of the hyperbola,
Oa Ob " Vcos2 e - sin20 '
and hence if we represent by x\ and x<{ the coordinates of the point
with respect to the oblique axes and use Oa' and Ob' as unit distances
instead of Oa and Ob, we shall obtain
cos 0 sin B
Vcos2 0 — sin2 0 Vcos2 0 — sin2 0 '
, sin 0 , ; cos 0
Vcos2 0 - sin2 0 ! Vcos2 0 - sin2 0 '
It is evident, however, that we may write
sin 0 dxi V
—i = tan 0 = — - = — ,
cos 0 cte4 c
where V may be regarded as the relative velocity of our two sets of
space axes. Introducing this into the above equations and also
writing Xi = x, x4 = ct, x^ = x', x4' = ctf, we may obtain the familiar
equations
V1-.-?
Four Dimensional Analysis. 211
We thus see that our diagrammatic representation of non-Euclidean
rotation in the XiOX^ plane does as a matter of fact correspond to
the Lorentz transformation.
Diagrams of this kind can now be used to study various kine-
matical events. 5-curves can be drawn in the quadrant A OB to repre
sent the space-time trajectories of particles, their form can be in
vestigated using different sets of rotated axes, and the equations for
the transformation of velocities and accelerations thus studied.
7-lines perpendicular to the particular time axis used can be drawn to
correspond to the instantaneous positions of actual lines in ordinary
space and studies made of the Lorentz shortening. Singular vectors
along the asymptote OB can be used to represent the trajectory of a
ray of light and it can be shown that our rotation of axes is so devised
as to leave unaltered, the angle between such singular vectors and the
OX4 axis, corresponding to the fact that the velocity of light must
appear the same to all observers. Further development of the possi
bilities of graphical representation of the properties of our non-
Euclidean space may be left to the reader.
PART II. APPLICATIONS OF THE FOUR-DIMENSIONAL ANALYSIS.
191. We may now apply our four-dimensional methods to a
number of problems in the fields of kinematics, mechanics and electro
magnetics. Our general plan will be to express the laws of the par
ticular field in question in four-dimensional language, making use of
four-dimensional vector quantities of a kinematical, mechanical, or
electromagnetic nature. Since the components of these vectors
along the three spatial axes and the temporal axis will be closely
related to the ordinary quantities familiar in kinematical, mechanical,
and electrical discussions, there will always be an easy transition from
our four-dimensional language to that ordinarily used in such dis
cussions, and necessarily used when actual numerical computations
are to be made. We shall find, however, that our four-dimensional
language introduces an extraordinary brevity into the statement of a
number of important laws of physics.
KINEMATICS.
192. Extended Position. The position of a particle and the par
ticular instant at which it occupies that position can both be indi-
212 Chapter Thirteen.
cated by a point in our four-dimensional space. We can call this
the extended position of the particle and determine it by stating the
value of a four-dimensional radius vector
r = (xiki + z2k2 + z3k3 + x4k4). (306)
193. Extended Velocity. Since the velocity of a real particle can
never exceed that of light, its changing position in space and time
will be represented by a 6-curve.
The equation for a unit vector tangent to this 5-curve will be
dr dxi dxz dx3 dx*
(307)
where ds indicates interval along the 5-curve; and this important
vector w may be called the extended velocity of the particle.
Remembering that for a 5-curve
ds = Vcfo42 - dx? - dxz2 - dx32 = cdt-yl - — , (308)
we may rewrite our expression for extended velocity in the form
w- ,—- l-.+ kJ. (309)
where u is evidently the ordinary three-dimensional velocity of the
particle.
Since w is a four-dimensional vector in our imaginary space, we
may use our tables for transforming the components of w from one
set of axes to another. We shall find that we may thus obtain trans
formation equations for velocity identical with those already familiar
in Chapter IV.
The four components of w are
u,
k
V'-S
and with the help of table (302) we may easily obtain, by making
simple algebraic substitutions, the following familiar transformation
Four Dimensional Analysis.
213
equations :
1 -
c2
yj
' c2
L/
This is a good example of the ease with which we can derive our
familiar transformation equations with the help of the four-dimensional
method.
194. Extended Acceleration. We may define the extended accel
eration of a particle as the rate of curvature of the 6-line which deter
mines its four-dimensional position. We have
c =
d2! dw _ d
~ ds ~ ds
Or, introducing as before the relation ds = cdl-\J 1 — — , we may write
1 du 1 u du
(310)
u du
(311)
214 Chapter Thirteen.
where u is evidently the ordinary three-dimensional velocity, and -j-
CtL
the three-dimensional acceleration; and we might now use our trans
formation table to determine the transformation equations for accel
eration which we originally obtained in Chapter IV.
195. The Velocity of Light. As an interesting illustration of the
application to kinematics of our four-dimensional methods, we may
point out that the trajectory of a ray of light will be represented by a
singular line. Since the magnitude of all singular vectors is zero by
definition, we have for any singular line
or, since the magnitude will be independent of any particular choice
of axes, we may also write
Transforming the first of these equations we may write
dxj + dx22 + dx^ _ dx* + dy2 + dz2 _
dxt2 cW
or
dl
dt = C'
Similarly we could obtain from the second equation
dl'
dt = C'
We thus see that a singular line does as a matter of fact correspond
to the four-dimensional trajectory of a ray of light having the velocity c,
and that our four-dimensional analysis corresponds to the require
ments of the second postulate of relativity that a ray of light shall
have the same velocity for all reference systems.
THE DYNAMICS OF A PARTICLE.
196. Extended Momentum. We may define the extended momen
tum of a material particle as equal to the product ra0w of its mass mQ,
measured when at rest, and its extended velocity w. In accordance
Four Dimensional Analysis. 215
with equation (309) for extended velocity, we may write then, for
the extended momentum,
(312)
Or, if in accordance with our considerations of Chapter VI we put
for the mass of the particle at the velocity u
we may write
m =
u
m0w = m- + mk4. (313)
c
We note that the space component of this vector is ordinary momen
tum and the time component has the magnitude of mass, and by
applying our transformation table (302) we can derive very simply
the transformation equations for mass and momentum already
obtained in Chapter VI.
197. The Conservation Laws. We may now express the laws for
the dynamics of a system of particles in a very simple form by stating
the principle that the extended momentum of a system of particles is a
quantity which remains constant in all interactions of the particles,
we have then
/ wu \
;m0w = 2 I — + wk4 ) = a constant,
(314)
where the summation 2 extends over all the particles of the system.
It is evident that this one principle really includes the three
principles of the conservation of momentum, mass, and energy.
This is true because in order for the vector 2m Ow to be a constant
quantity, its components along each of the four axes must be con
stant, and as will be seen from the above equation this necessitates
the constancy of the momentum 2rau, of the total mass 2m, and of
m
the total energy 2 — .
216 Chapter Thirteen.
THE DYNAMICS OF AN ELASTIC BODY.
Our four-dimensional methods may also be used to present the
results of our theory of elasticity in a very compact form.
198. The Tensor of Extended Stress. In order to do this we shall
first need to define an expression which may be called the four-dimen
sional stress in the elastic medium. For this purpose we may take the
symmetrical tensor Tm defined by the following table:
xx Pxy Pxz CQX)
v^ Pvy Pyz
rp
"* Pzx Pzy Pzz CQZ, (315)
So o
X o y o z
w,
1C C C
where the spatial components of Tm are equal to the components of
the symmetrical tensor p which we have already defined in Chapter
X and the time components are related to the density of momentum g,
density of energy flow s and energy density w, as shown in the tabu
lation.
From the symmetry of this tensor we may infer at once the simple
relation between density of momentum and density of energy flow:
g = ^ i (316)
with which we have already become familiar in Section 132.
199. The Equation of Motion. We may, moreover, express the
equation of motion for an elastic medium unacted on by external
forces in the very simple form
div Tm = 0. (317)
It will be seen from our definition of the divergence of a four-
dimensional tensor, Section 187, that this one equation is in reality
equivalent to the two equations
div p + ~ = 0 (318)
and
div
div s + — = 0.
ot
Four Dimensional Analysis. 217
The first of these equations is identical with (184) of Chapter X,
which we found to be the equation for the motion of an elastic medium
in the absence of external forces, and the second of these equations
expresses the principle of the conservation of energy.
The elegance and simplicity of this four-dimensional method of
expressing the results of our laborious calculations in Chapter X can
not fail to be appreciated.
ELECTROMAGNETICS.
We also find it possible to express the laws of the electromagnetic
field very simply in our four-dimensional language.
200. Extended Current. We may first define the extended current,
a simple but important one-vector, whose value at any point will de
pend on the density and velocity of charge at that point. We shall
take as the equation of definition
q = Pow = p + k4 , (319)
where
Po
is the density of charge at the point in question.
201. The Electromagnetic Vector M. We may further define a
two-vector M which will be directly related to the familiar vectors
strength of electric field e and strength of magnetic field h by the
equation of definition
M = (fcikw + /?2k3i + h3kn - e&u - e2k24 - e3k34)
or (320)
M* = (eik23 + e2k3i -f eski2 + /hkH + /i2k24 + /?3k34),
where e\, e->, e3, and hi, hZj h3 are the components of e and h.
202. The Field Equations. We may now state the laws of the
electromagnetic field in the extremely simple form
0-M = q, (321)
0 X M = 0. (322)
218 Chapter Thirteen.
These two simple equations are, as a matter of fact, completely
equivalent to the four field equations which we made fundamental
for our treatment of electromagnetic theory in Chapter XII. Indeed
if we treat {> formally as a one-vector
/ d
( k! —
\ dxL
d d d
o - + k3 — - k4 --
dx2 dx3 dx4
and apply it to the electromagnetic vector M expressed in the extended
form given in the equation of definition (320) we shall obtain from
(321) the two equations
1 de u
div e = p,
and from (322)
div h = 0,
1 dh
where we have made the substitution #4 = ct. These are of course
the familiar field equations for the Maxwell-Lorentz theory of electro-
magnetism.
203. The Conservation of Electricity. We may also obtain very
easily an equation for the conservation of electric charge. In accord
ance with equation (284) we may write as a necessary mathematical
identity
0-(0'M)=0. (323)
Noting that 0 • M = q, this may be expanded to give us the equation
of continuity.
div Pu + ^ = 0. (324)
ot
204. The Product M • q. We have thus shown the form taken by
the four field equations when they are expressed in four dimensional
language. Let us now consider with the help of our four-dimensional
methods what can be said about the forces which determine the
motion of electricity under the action of the electromagnetic field.
Consider the inner product of the electromagnetic vector and
Four Dimensional Analysis. 219
the extended current:
+ k4
(325)
f [uXh]*\ e-u
= P(e+- — -j + P— k,
We see that the space component of this vector is equal to the ex
pression which we have already found in Chapter XII as the force
acting on the charge contained in unit volume, and the time com
ponent is proportional to the work done by this force on the moving
charge; hence we may write the equation
(326)
an expression which contains the same information as that given by
the so-called fifth fundamental equation of electromagnetic theory,
f being the force exerted by the electromagnetic field per unit volume
of charged material.
205. The Extended Tensor of Electromagnetic Stress. We may
now show the possibility of defining a four-dimensional tensor Te such
that the important quantity M-q shall be equal to — div Te. This
will be valuable since we shall then be able to express the equation
of motion for a combined mechanical and electrical system in a very
simple and beautiful form.
Consider the symmetrical tensor
Tfji rri
12 ^13 1 14,
31 -t 32 * 33 -t 34,
T rn rn rri
41 1 42 1 43 1 44,
defined by the expression
(328)
where j, k = 1, 2, 3, 4.
220 Chapter Thirteen.
It can then readily be shown by expansion that
- div Te = M-(O-M) + M*-(0-M*).
But, in accordance with equations (321), (326), (292) and (322), this
is equivalent to
Since in free space the value of the force f is zero, we may write
for free space the equation
div Te = 0. (330)
This one equation is equivalent, as a matter of fact, to two im
portant and well-known equations of electromagnetic theory. If we
develop the components Tu, T12, etc., of our tensor in accordance
with equations (328) and (320) we find that we can write
Yxx Yxy *//xz — ,
C
T, = 1
'vv tyz ~ ,
C
(331)
So o
x oy e>2
7 7 ~c w'
where we shall have
*« = " \(e?
txv = - (exev
etc.
sx = c(ejiz — ezhv),
etc.
- ej - e? + h* - hj - h*
(332)
^ thus being equivalent to the well-known Maxwell three-dimensional
stress tensor, sxj sv, etc., being the components of the Poynting vector
c[e X h]*, and w being the familiar expression for density of electro-
Four Dimensional Analysis. 221
magnetic energy — | — . We thus see that equation (330) is equiva
lent to the two equations
1 as
div* + -2~ = 0,
dw
div s + — = 0.
ot
The first of these is the so-called equation of electromagnetic momen
tum, and the second, Poynting's equation for the flow of electromag
netic energy.
206. Combined Electrical and Mechanical Systems. For a point
not in free space where mechanical and electrical systems are both
involved, taking into account our previous considerations, we may
now write the equation of motion for a combined electrical and
mechanical system in the very simple form
div Tm + div Te = 0.
And we may point out in closing that we may reasonably expect all
forces to be of such a nature that our most general equation of motion
for any continuous system can be written in the form
div T7! + div T2 + • • • = 0.
APPENDIX I.— SYMBOLS FOR QUANTITIES.
Scalar Quantities. (Indicated by Italic type.)
c speed of light.
e electric charge.
E energy.
H kinetic potential.
K kinetic energy.
I, m, n direction cosines.
L Lagrangian function.
p pressure.
Q quantity of electricity.
S entropy.
t time.
/ ; 2 \
T temperature, function 2m0c2 1 1 — -\! 1 -- - 1
U potential energy.
v volume.
V relative speed of coordinate systems, volume.
w energy density.
W work.
e dielectric constant.
V
ju index of refraction, magnetic permeability.
v frequency.
p density of charge.
a electrical conductivity.
</> non-Euclidean angle between time axes.
• • generalized coordinates.
\l/ scalar potential.
\J/i\f/z\l/3- - • generalized momenta.
222
Appendix I. 223
Vector Quantities. (Indicated by Clarendon type.)
B magnetic induction.
c extended acceleration.
D dielectric displacement,
e electric field strength in free space.
E electric field strength in a medium.
f force per unit volume.
F force acting on a particle.
g density of momentum.
h magnetic field strength in free space.
H magnetic field strength in a medium.
i density of electric current.
M angular momentum, electromagnetic vector.
p symmetrical elastic stress tensor.
q extended current.
r radius vector
s density of energy flow.
t unsymmetrical elastic stress tensor.
u velocity.
w extended velocity.
<(> vector potential.
APPENDIX II.— VECTOR NOTATION.
Three Dimensional Space.
Unit Vectors, i j k
Radius Vector, r = x\ + yj -f zk
Velocity,
dr
U = ~ = Xi + yj + 'Zk
= uxi + uvj + uzk
Acceleration,
= uxi + iyj + wzk
Inner Product,
a-b = axbx + atf6tf + aebz
Outer Product,
a X b = (axbv - aj>x)ij + (ay6, - a,6v)jk + (a A - az62)ki
Complement of Outer Product,
[a X b]* = (aybz - a,6y)i + (aJ>, - axbz)j + (axby - aybx)k
The Vector Operator Del or V,
d d d
dy dz
curia = [V X a]* = _
dy dz dz dx
224
/day _dOx\
\dx ' dy J
Appendix II. 225
Non-Euclidean Four Dimensional Space.
Unit Vectors, ki k2 k3 k4
Radius Vector,
r = ziki + z2k2 + z3k3 4-
= xi + yj + 2k + dk4
One Vector,
a = Qiki + a2k2 + a3k3 +
Two Vector,
A = ^i2k12 + Ai3k13 + Auku + A23k23 + A24k24 + A34k34
Three Vector,
H = 5Ii23ki23 + 5li24ki24 + Hi34ki34 + 5I234k234
Pseudo Scalar,
a = aki234
Transposition of Subscripts,
Inner Product of One Vectors,
(See Section 183).
Outer Product of One Vectors,
ka&... X knm... =
Complement of a Vector,
0* = <£-k
The Vector Operator Quad or 0>
7 DAY USE
RETURN TO DESK FROM WHICH BORROWED
ASTRONOMY, MATHE/'.ATICS-
STATISTICS LIBRARY
This publication is due on the LAST DATE
and HOUR stamped below.
•,- ; 7-
-Due-
joemjiQf aPRJN6($fflfjBst(
Sflbfeei to'recalf after-
JAN 0 4 1993
RB 17-60m-3,'63
(D5808slO)4188
General Library
University of California
Berkeley
U..C....?ERKELEY LIBRARIES
382.t
7
STAT,
UBRARY
UNIVERSITY OF CALIFORNIA LIBRARY