74 min read 15620 words
Table of Contents

(1887)

On the physical meaning of the principle of least action (By H. von Helmholtz)

When I speak of the principle of least action in this article, I would not like for that term to be understood in just the original form that

Maupertuis published in the principle of least action in 1744.

  • It received a precise determination for the variational condition and a complete proof much later by Lagrange.

His principle should subsume the various transformed forms of that theorem by Sir W. Rowan Hamilton.

Hamilton presented 2 differential equations that C. G. J. Jacobi later combined into a single one in which the common source of these transformations lay, along with many other possible ones, while the physical assumptions with which the calculations started were in no way changed as a result.

Jacobi first applied the principle of least action to only the mechanics of ponderable bodies and represented the motions of a system as being either freely mobile or rigidly coupled to another mass point in a chain.

The physical assumptions from which he started were then given essentially by Newton’s laws of motion and the manner by which one cared to define the phenomenon that would correspond to the action of rigidly-coupled mass points mechanically.

However, he further showed that once one first learns how to treat Maupertius’s integral correctly, the validity of the law of constancy of energy must be assumed.

At first, this seems a restriction on the validity of the principle of least action. But more recent physical investigations have established that the law of the constancy of energy is also valid in general, and that apparent restriction does not, in fact, restrict anything.

However, one must know completely all of the forms that the equivalence of energy can take for a process being examined in order to include them in calculations.

On the other hand, it might seem questionable whether other physical processes that might enter in, and which are not treated simply by means of the motions of ponderable masses and Newton’s laws of motion, and in which one operates with energy quanta, can also be grasped by the principle of least action.

I will choose one of Hamilton’s forms to be the most convenient form for the principle of least action in the investigations that will be carried out here, which is the one that allows external forces that depend upon time to act upon the mechanical system considered, whose internal forces are only conservative ones.

If we denote the potential energy of the system by F and the vis viva by L then the function (viz., Hamilton’s principal function) whose time integral will be a minimum for the normal motion between end points will be:

H = F – L,

while the energy of the system will be:

E = F + L.

In this, F depends upon only the coordinates, while L is a homogeneous function of second degree of the velocities.

The function H is the one in terms of whose differential quotients Lagrange expressed the forces that act upon the moving system from the outside. Since that function plays an important role in all of the problems that will be treated here, I would like to propose the name of kinetic potential for it, due to just that relationship with the forces. An entire series of corresponding names for different special chapters of physics has already been proposed.

Thus, they include:

  • F. E. Neumann’s potential of two electrical currents
  • R. Clausius’s () electrodynamical potential;

J. W. Gibbs called the same thermodynamic function that I called “free energy” the force function for constant temperature, while P. Duhem (), by contrast, called that function the thermodynamic potential. There thus exist sufficient examples for the new choice of terminology.

The principle of least action can then be expressed as follows:

The mean value of the kinetic potential that is calculated for equal time elements is a minimum for the actual path of the system (a limiting value for longer intervals, respectively), in comparison to all other neighboring paths that lead from the initial position to the final positions in the same amount of time.

Blank

For a state of rest, the kinetic potential goes to the values of the potential energy (the potential in the sense that was used up to now, respectively). We do not need to take the mean value for them, since the values that were different while in motion will all be equal to each other here. For a state of rest, our theorem then says simply that the potential energy must be a minimum for equilibrium.

Jacobi showed that the function H can also include time explicitly with making the construction of the variation and differential equations that follow from it impossible. I have employed it in order to add a sum [ ] P p α α

∑ ⋅ to H, in which the pα are coordinate

and the Pα mean the forces that act in the direction of the coordinates, while the latter are taken in a sense that will be discussed in more detail below. The Pα will be considered to be given functions of time that are, however, independent of the coordinates. In this form, the minimum principle will yield the Lagrange equations for the forces Pα , and in that way, an entire series of special investigations that are based upon Lagrange’s equations of motion will also be subsumed by the somewhat-modified principle of least action. Where it is necessary to distinguish that modified principle from the original one, I would like to call it the law of minimum kinetic potential. The form that Lagrange gave to the equations of motion is, in fact, important, due to the fact that we can apply it to cases in which processes are at work that are no longer rationally resolvable in various ways, such as friction, galvanic resistance, etc., and in which equilibrium must exist between the conservative forces that are included in Lagrange’s formula. Now, there are other work equivalents, besides the potential and actual energies of ponderable masses, namely, thermal, electrodynamical, and electromagnetic ones. Up till now, the motion of heat has generally only been considered as an especially complicated form of the motion of exclusively ponderable atoms. However, since ether waves simultaneously radiate from warm bodies, this restriction [which can, in fact, be derived from Carnot’s law under simpler assumptions, as Clausius (* ) and Boltzmann (**) showed] should be considered to be a hypothesis that is only adequate initially; the action of other forces (e.g., electrodynamical ones) cannot be excluded with any confidence. I have proved that, by contrast, the known laws of reversible heat processes can, in fact, be expressed in the form of Lagrange’s equation of motion, and thus also the law of minimum kinetic potential, in my papers on the statics of monocyclic motions ( ***). However, that shows that temperature, which measures the intensity of thermal motion, enters into the function to be integrated in a very complicated form, as do the velocities, when one defines the values of the vis viva in the ponderable system. In the cited papers, I showed that the same formulas could also be true, with certain restricting assumptions, for systems of ponderable masses through the elimination of certain coordinates, such that there is no contradiction between the appearance of such complicated forms and the application of Lagrange’s equations of motion. However, if one would like to learn the general properties of systems that are governed by the principle of least action then it would be necessary to drop the older, narrower assumption that the velocities enter into just the values of the vis viva, and indeed in the form of a homogeneous function of degree two, and to examine how things behave when H is a function of the coordinates and velocities of arbitrary form. The fact that chemical forces (whenever we can compel them to act in only a reversible way) follow Carnot’s law has indeed been confirmed experimentally in just a small number of cases, but they are all the more unacceptable since they exhibit measurable relations between processes of an apparently quite different nature († ).

(* ) Poggendorff Annalen, 142 (1871), 433-461. () Wiener Sitzungsberichte LIII (1866), Abt. II, pp. 195-220. (*) This Journal, Bd. 97, pp. 112-123. († ) Cf., my three papers on the thermodynamics of chemical processes. Sitzungsberichte der Berliner Akademie, 1882, 2 Feb., 27 July; 1883, 31 May. – A good survey of the materials that were used up to now is in the book of P. Duhem that was cited above. Helmholtz – On the physical meaning of the principle of least action. 4 Finally, observations of the distant electromagnetic and electrodynamic effects of closed, electric currents have led to expressions for the ponderomotive and electromotor forces that are closely linked with the ones that Lagrange gave for the mechanics of ponderable bodies. The first to give such a formulation of the laws of electrodynamics was F. E. Neumann, sr. ( * ). For him, the velocities of the electric currents – i.e., the set of all electricity that goes through a surface element in the conductor that is bounded by material particles in a unit time − appeared as velocities. Later, W. Weber and Clausius gave other forms in which relative or absolute velocities of electrical quanta in space entered in place of current velocities. The consequences of the various formulations agree completely for closed currents; for unclosed currents, they differ. To the extent that facts are known in this latter domain, they show that Neumann’s law is inadequate when one includes only the motions of electricity in conductors in one’s calculations while applying it. Moreover, one must also consider the motions of electricity in insulators that Faraday and Maxwell considered, which might occur under increasing or diminishing dielectric polarization. The previously-observed effects of unclosed currents also fit into Neumann’s law, thus-extended. However, a deviation in the form of the functions from the ones for ponderable masses also exists here. For electrodynamical phenomena, the velocities of electricity appear as functions of degree two whose coefficients, however, will refer to rectangular coordinates, not constants, like the masses in the values of the vis viva on ponderable systems. Secondly, linear functions of velocity will enter whenever permanent magnets act. It was precisely my investigations into the form of the kinetic potential that is required by Maxwell’s theory of electrodynamics that led me to the present preliminary investigations. The theory of light has also subsumed all of its main facts under the hypothesis that the ether is a medium with properties that are similar to the solid-elastic ponderable bodies. The known difficulties in the theory of reflection and refraction will be even easier to defeat by Maxwell’s hypothesis. However one might cleave to one or the other meaning, one would have to consider the principle of least work to be valid for the motion of light, at least to the extent that its phenomena can be explained by that theory. That already implies that the domain of validity of the principle of least action goes far beyond the limits of the mechanics of ponderable bodies, and that Maupertuis’s high hopes for its absolutely general validity seem to be approaching fulfillment, so tenuous was the mechanical proof and so contradictory were the metaphysical speculations that the author himself knew about at the time. Nowadays, it is regarded as highly probable that it is the general law for all reversible natural processes, and as far as the irreversible ones are concerned (e.g., the generation and conduction of heat), their irreversibility does not seem to be in the nature of things, but only rests upon the limitations of our tools that makes it impossible for us to arrange disorganized atomic motions or to make the motion of all atoms that take part in the motion of heat go precisely backwards.

(* ) Cf., my papers in the theory of electrodynamics. This Journal, Bd. 72, pp. 57; Bd. 75, pp. 35; Bd. 78, pp. 273. Helmholtz – On the physical meaning of the principle of least action. 5 In every case, it seems to me that the general validity of the principle of least action has been insured to the extent that it can occupy a high position as a heuristic principle and a guide for any attempts to formulate the laws of new classes of phenomena. In addition, it has the advantage of summarizing all conditions that are influential for the class of phenomena under consideration in the narrowest scope of a formula, and thus giving a complete overview of everything that is essential. By this state of affairs, I find it useful for the general principle to give an overview of the proof and the general consequences, which can be kept very brief whenever one applies only known methods to somewhat extended assumptions. I have therefore endeavored to highlight the consequences that are concerned with observable behavior, and which, when combined, will, in turn, serve as an indicator of the validity of the principle in the domain in question. In § 1, the law of minimum kinetic potential is developed with the greatest possible freedom in the nature of the function H, and Lagrange’s equations of motion are derived from it. The eliminations by which such general forms can also occur for systems of ponderable bodies are discussed. In § 2, the constancy of energy is derived from our form of the principle, and one learns how to calculate the value of energy from the value of kinetic potential. It is: E = H − H q q α α α   ∂ ⋅   ∂   ∑ , where qα are the velocities. That will show that the principle of least action is not always valid, conversely, in any case where the constancy of energy is true. The latter then expresses more than the former, and finding what it expresses in addition will be our problem. At the same time, some mechanical and physical processes will be specified in order to be able to refer to them as explanatory examples for the contents of the first two paragraphs, as well as the following ones, and to make the significance of the principle intuitive. § 3 then treats the opposite problem, namely, that of deriving H from E. That must involve integrating the aforementioned differential equation, and that will introduce arbitrary integration constants that must be homogeneous functions of degree one in the qα . This step is meaningful, insofar as it will then become possible to find the kinetic potential from a complete knowledge of the dependency of the energy upon the coordinates and velocities, and thus all of the laws of motion for the system, assuming that the principle of least action is valid. One will be able to find the terms that are linear in qα, which correspond to “hidden motions,” with no difficulties, for the most part. § 4 treats the interdependencies between the forces that are simultaneously exerted upon the system in different directions and its accelerations and velocities. That subsumes a series of interesting connections with physical phenomena, such as, e.g., the one between Ampère’s electromagnetic and electrodynamic laws, on the one hand, and the law of induction, on the other; a series of thermodynamic laws – e.g., the relationship between the raising of the pressure on a contained mass by raising its temperature and the raising of temperature by compression, and the corresponding behavior for thermoelectric and electro-chemical processes. Finally, one can also prove, in turn, that the Helmholtz – On the physical meaning of the principle of least action. 6 principle of least action is valid whenever the interdependencies between the forces that were detailed in § 4 exist. That proof is, however, deferred to a later communication. In § 5, Hamilton’s theorem, in its general form, will be recapitulated, and in § 6, the reciprocity theorem that flows from it for the changes in the forward and reverse motion that result from small impulses after a certain time has elapsed will be given. Some reciprocal relationships that I myself have verified for sound and light in previous papers, but only for systems at rest, fall within that scope. Finally, in § 7, the moment of motion shall be introduced in place of the velocities, which will yield another form of the variational problem, and along with the alreadyknown altered representations of the values of the force, another reciprocity law for forward and backward motion, as well. § 1. Formulation of the principle. I assume that the instantaneous state of the system of bodies in question is given completely by a sufficient number of mutually-independent coordinates pa ; I denote the velocities of evolution by: (1) qa

dp dt a . Furthermore, I let Pa denote the force with which the system of moving bodies acts upon the change in the coordinate pa , such that (− Pa ) will be the external force that must act upon the system in the direction of the coordinate pa in order for the assumed motion of the system to be able to happen in the assumed way. The forces Pa , which were introduced by Lagrange, are, in general, aggregates of force components that themselves can act upon different parts of the system, and thus their magnitudes and composition are defined in such a way that (Pa ⋅ dpa ) is the work that the force Pa does outwardly when the coordinate changes from pa to (pa + dpa ), whereas Pa will perform no work when pa remains unchanged while arbitrary variations of the remaining coordinates pa occur. In the sequel, we shall assume that the quantities Pa are functions of time, but they are independent of the coordinates during the given period from t = t0 to t = t1 . Let H be a function of the coordinates and velocities, of which, we will first demand only that it must have finite first and second differential quotients with respect to pa and qa at all positions of the path that evolves during the stated time period. Moreover, we define the integral: (1a ) Φ = 1 0 [ ] t t dt H P p   ⋅ + ⋅     ∫ ∑ a a a , Helmholtz – On the physical meaning of the principle of least action. 7 in which the pa are varied in such a way that their variations δpa will be zero for t = t0 and t = t1 , but they will be arbitrarily-differentiable functions of time at the intermediate times. It then follows from known methods of the variational calculus that one will have: (1b ) δΦ = 0 when one has: (1c ) 0 = Pa + H d H p dt q ∂ ∂  −   ∂ ∂ a a   during the duration of the motion. As is known, these are the equations of motion for the system in the form that was given by Lagrange. Elimination of coordinates. In the original applications of the principle to the motions of a free system of material points, as I already remarked in the introduction, H had the form: H = F – L, in which F shall be a function of only the pa , and L shall be a homogeneous function of degree two of the qa whose coefficients depend upon the pa . For a free system, the number of coordinates pa is three times as large as the number of mass points that are present. However, in many cases, a decrease in the number of coordinates can occur without changing the form of the representation that was given in equations (1a ), (1b ), and (1c ). Among these cases, the one that has been treated best up to now is then one in which the degrees of freedom of the system are restricted by so-called fixed constraints, which can be expressed mathematically as equations in the coordinates. The composition of the function H from F and L that was given above and the behavior of the last two functions will not change as a result of that, but the number of variable coordinates can be reduced substantially. Another noteworthy reduction in the number of coordinates occurs when individual ones of them, which we would like to denote by the index b, appear in the values of H only through their differential quotients qb , so the corresponding forces Pb need to be equal to zero. Under these circumstances, equations (1c ), which express the value of the Pb , will reduce to: (2) 0 = d H dt q   ∂   ∂  b or (2a ) H q ∂ ∂ b = − cb . Helmholtz – On the physical meaning of the principle of least action. 8 One can employ these equations, which are linear in the qa (qb , respectively), to express the qb in terms of the remaining velocities and the pa , and then eliminate them from the value of H. We denote the expression for the value of H that arises from this elimination by H. One will then have: p ∂ ∂ a H

H H q p q p ∂ ∂  ∂

  • ⋅   ∂ ∂ ∂   ∑ b a b a b . Thus, with consideration given to (2a ): H p ∂ ∂ a = [ ] c q p ∂     − ⋅ ∂   ∑ b b a b H . If we set: (2b ) H − ∑[ ] c q⋅ b b b = H′ then we will find that: H p ∂ ∂ a = H p ∂ ′ ∂ a , and likewise: H q ∂ ∂ a = H q ∂ ′ ∂ a , (2c ) pa = − H d H p dt q ∂ ∂ ′ ′  
  •   ∂ ∂ a a   . The function H′ thus enters in this case, which is free of the qb and pb , but also includes terms that are linear in the qa and originate in the values of the qb , and it completely replaces the original function in the definition of the equations of motions (2c ). Examples of this are the rotations of a top around its symmetry axis, when its direction, but not its angular velocity, can change around that axis, and furthermore, the motion of a system that is referred to a rectangular coordinate system that is rotating; e.g., the Earth. Corresponding to this analogy that is given by the mechanics of ponderable bodies, we would meanwhile also like to refer to other cases of physical processes in which the function H includes terms that are linear in the velocities as cases with hidden motion, although at the moment there are cases in which the existence of such a hidden motion is not confirmed beyond a doubt, such as for the interaction between magnets and electrical currents. It is known that it was adopted for the magnet by Ampère; it also showed its influence in the electromagnetic rotation of the plane of polarization of light, as Sir W. Thomson remarked, even when no perceptible electric current acts upon it. Helmholtz – On the physical meaning of the principle of least action. 9 This case is distinguished from the one in which H includes the velocities only in terms of degree two essentially by the fact that the motion cannot go backwards under the same conditions unless the hidden motions are simultaneously reversed. Other eliminations can bring about even more complicated forms for the function H, at least for restricted classes of motion; I have discussed such cases in my first treatise on the monocyclic motions (* ). We can choose the conditions of that elimination somewhat differently here. One assumes that a group of coordinates pc is present whose corresponding qc enter into the values of the vis viva only multiplied by each other, but not combined into products with the qb in this group, such that all 2H q q ∂ ∂ ∂c b = 0; moreover, one assumes that the forces Pc always remain equal to zero. Under these circumstances, motions of the system are possible for which the pc remain continually constant, so the qc = 0. The equations of motion for this class of motions simplify due to the fact that when all qc = 0, one will also have all:

H q ∂ ∂ c = 0. We thus get from (1c ): (3) 0 = H q ∂ ∂ c , (3a ) Pb = − H d H p dt q ∂ ∂ 

  •   ∂ ∂ b b   . Now, if equations (3), whose number is equal to that of the pc , succeed in expressing these quantities as functions of the pb and qb then one can eliminate the pc from H by means of the values thus-obtained, where, in general, H will be a complicated function of the qb that we would like to denote by H. From the principles of differential calculus, one will then have: p ∂ ∂ b H = H H p p p p ∂ ∂  ∂
  • ⋅   ∂ ∂ ∂   ∑ c b c b c , q ∂ ∂ b H = H H p q p q ∂ ∂  ∂
  • ⋅   ∂ ∂ ∂   ∑ c b c b c ; thus, due to equations (3), one will have, in turn: (3b ) H p ∂ ∂ b = p ∂ ∂ b H and H q ∂ ∂ b = q ∂ ∂ b H .

(* ) This Journal, Bd. 97, pp. 120-122. Helmholtz – On the physical meaning of the principle of least action. 10 The equations of motion (3a ) then reduce to: (3c ) Pb = − d p dt q ∂ ∂ 

  •   ∂ ∂ b b   H H , in which only the pb and qb appear, and generally H is no longer the sum of a function of the coordinates and a homogeneous function of degree two of the velocities. However, if the original H is such a function, and thus no hidden motions will have any influence, then equations (3) will be of degree two in the qb ; the value of the pc can then remain unchanged (even when it is multi-valued) when all of the qb simultaneously change their signs, from which, it will follow that the total motion can also go backwards in this case. In the mechanics of ponderable masses, problems in which the function H includes the velocities qa in terms of first or higher degree can be referred to as incomplete problems, insofar as a part of the possible motions is excluded, and a part of the coordinates that are necessary for the determination of the position of the system does not enter into the function H, and certain forces are constantly set equal to zero, so they can no longer be determined arbitrarily. Functional determinant of the moment of motion. For the sake of brevity, we would like to denote the quantities ∂H / ∂qa that enter into the previous derivatives by: (3d ) sa = − H q ∂ ∂ a , and call the sa moments of motion. For the motion of a free system that is referred to rectangular coordinates, they correspond to the product of the mass and velocity, whose differential quotient with respect to time is Newton’s measure of the corresponding force component: X = d dx m dt dt   ⋅    . In the cases that were summarized, the influence that the inertia of the moving masses exerts upon a well-defined type of motion was different from that of the position of the masses. Thus, e.g., for a rotational motion of a solid body, the moment of motion is equal to the moment of inertia, multiplied by the angular velocity. In that sense, the quantities sa now measure the influence that the inertia of the moving mass has, and its acceleration enlists a corresponding part of the force of motion, as equations (1c ) show. In the original, complete problems of the mechanics of ponderable bodies, the sa are linear, homogeneous functions of the qa whose coefficients are functions of the pa , in general, and one then has a system of linear equations: Helmholtz – On the physical meaning of the principle of least action. 11 (3e ) sa = s q q   ∂ ⋅   ∂  ∑ a b b , which will represent the qb as linear, homogeneous functions of the sa when they are solved for those variables. That representation would not be possible if the determinant of the quantities s q ∂ ∂ a b (the 2H q q ∂ ∂ ∂a b , resp.) were identically zero. However, the latter case cannot come about without the vis viva being zero for certain motions with finite velocities. Namely, since L is an essentially positive, homogeneous function of degree two of the qa , one will have: 2L = ∑[ ] q s⋅ a a a . If the aforementioned determinant were zero then all of the sa , and correspondingly L, as well, could be zero without the qa needing to be zero. The condition that the determinant of equations (3e ) is not identically zero can also be expressed as follows: No identity can exist between the quantities sa and pa , with the exception of the qa , and for that reason, the qa can always be represented as functions of the sa and the pa . This relationship will not be changed if we set individual sa equal to constants, as in the case of hidden motions, or also set them equal to zero, as in the case of the eliminated pa . The value of the remaining sa will not be changed by those variations. Since the same thing is also true for the electrical motions and reversible heat motions, to the extend that their physical laws have been ascertained up to now, there is, up to now, no physical motivation to consider the exceptional cases in which the determinant in equations (3e ) might be equal to zero, and for that reason the assumption will be made from now on that the determinant cannot be identically zero, except for at most special values of the pa . Once that condition has been established, the variational problem can be expressed in such a way that the equations that were singled out in the beginning of this paragraph, namely: (1) qa = dp dt a , will be assumed in it. As above, let H be a function of the pa and qa , and let the Pa be functions of time. One sets: (1d ) Φ1 = 1 0 t t dp H dt H q P p dt q         ∂   − − ⋅ + ⋅         ∂     ∫ ∑ a a a a a a and demands that: (1e ) δΦ1 = 0 Helmholtz – On the physical meaning of the principle of least action. 12 must be true for arbitrary variations of the pa and qa , which are both to be treated as independent variables. One should have δpa = 0 at the times t0 and t1, while the dqa also remain arbitrary. The variation of qb yields: (1f ) 0 = 2 a H dp q q q dt   ∂     ⋅ −     ∂ ∂   ∑ a a a b , which implies equations (1), since the determinant of the 2H q q ∂ ∂ ∂a b should not vanish identically. The variation of the pa is performed as above, and will yield the same result. If one denotes the function of the pa and qa that enters into (1d ) by: E = H − H q q   ∂ ⋅   ∂   ∑ a a a (this is the energy, as will be shown in the next paragraph) then one will get: (1g ) Φ1 = 1 0 t t dp dt E s P p dt       − ⋅ + ⋅       ∫ ∑ a a a a a . I cite this form here, since we will encounter an analogous form in the conclusion, and both of them can be regarded as very exotic when one meets up with them in physical investigations before one knows for sure which quantities are to be referred to as pa , qa , and sa . On the other hand, it is precisely these forms that are included in the complete statement of the problem. § 2. Relationship to the principle of the constancy of energy. If one multiplies the equations of motion (1c ), in sequence, by qa and adds them then one will get: ∑[ ] P q⋅ a a = − H d H H dq q q p dt q q dt       ∂ ∂ ∂ ⋅ + ⋅ − ⋅       ∂ ∂ ∂       ∑ ∑ ∑ a a a a a a a a = − d H H q dt q       ∂   − ⋅     ∂     ∑ a a a . If we set: Helmholtz – On the physical meaning of the principle of least action. 13 (4) E = H H q q   ∂ − ⋅   ∂   ∑ a a a , as we have done up to now, then we can write: (4a ) [ ] dE P q dt dt dt ∑ ⋅ ⋅ + ⋅ a a = 0. The sum that enters into this is the work that the force Pa does on the environment in the time interval dt, and that will then imply that the quantity E continually increases or decreases according to whether that force does positive or negative work, respectively. It will then follow from this that E denotes the energy supply of the system, expressed in terms of its coordinates pa and velocities qa . It emerges from this that the principle of least action, when it is taken in the form of § 1, always includes the principle of the constancy of energy. On the other hand, the principle of least action is not necessarily true in all conceivable cases that are subject to the law of the constancy of energy. One can make many supplements to the system of equations (1c ) that do not at all affect the derivation of equations (4a ), but they will probably cancel the summation in the variational formula. For example, one adds a term (ϕ ⋅ qb ) to those equations (1c ) that have the index a in their terms, and a term (− ϕ ⋅ qb ) to the ones that have the index b, in which ϕ is any function of the coordinates. If, in order to derive the energy equation, we then multiply the former equation by qa and the latter one by qb then the extra terms will drop away, and the constancy of energy will not be affected. By contrast, the corresponding variation: ϕ [qb ⋅ δpa – qa ⋅ δpb ] can be considered to be the complete variation of a function of the pa and qa under the integral only when ϕ depends upon the variables (qb ⋅ pb ) and (qb ⋅ pa ). If the function ϕ that enters into the supplementary terms is independent of the velocities then the corresponding motion will not be reversible. However, we can make ϕ into a linear function of the velocities; the entire motion can then go backwards, as well. Since one can install such terms in any arbitrarily-chosen pair of equations from the system (1c ), a great multitude of cases are conceivable in which the law of the constancy of energy is valid, but not that of least action. It then follows from this that when the latter principle is true, it will express a special character of the conservative natural forces that are present that is not already present as a result of their being defined to be conservative forces. Illuminating this idea more clearly will be the objective of the investigation that follows. Helmholtz – On the physical meaning of the principle of least action. 14 Explanatory example: Since it will be repeatedly desirable to cite examples in the sequel in which the significance of the theorems that are obtained becomes intuitive, I shall allow myself to cite some suitable spaces here to the extent that it is necessary to characterize them, and to which I can refer briefly, not only for the contents of this and the following paragraphs, but also later on. I. Example of top. Let the top be a rotating body on a gimbal mounting. The outer ring a might rotate around a vertical axis, and we let α denote the angle of rotation, when measured from a well-defined vertical plane in space. The second ring b rotates inside of the first one around a horizontal axis, and I shall let β denote the angle between the planes of the rings a and b. The rotational axis of the top lies in the ring b at right angles to the rotational axis between a and b. Let the angle between a reference meridian on the top and the plane of b be γ, let the moment of inertia of the top around its rotational axis be A, and let the moment of inertia around one of its equatorial axes by B; that of the ring itself will be neglected. The vis viva on the top will then be: (4) 2 2 2 1 1 2 2 2 cos sin , . d d d d L dt dt dt dt H L γ α α β β β           = ⋅ + ⋅ + ⋅ ⋅ +                    = − A B That will yield the forces A, B, C that tend to increase the angles α, β, γ, respectively: (4a ) A = − 2 cos cos sin d d d d dt dt dt dt γ α α β β β      ⋅ ⋅ + ⋅ + ⋅ ⋅        A B , (4b ) B = − 2 2 2 sin cos sin cos d d d d d dt dt dt dt dt γ α α α β β β β β     ⋅ ⋅ + ⋅ + ⋅ ⋅ ⋅ − ⋅         A B B , (4c ) C = − cos d d d dt dt dt γ α β       ⋅ + ⋅       A . The force A is simply a rotational moment that makes the circle a rotate, and likewise, B makes b rotate. However, C rotates the circle relative to b, so it must have its pivot at b. If the force C is absent then (4c ) will imply that: (4d ) − cos d d dt dt γ α β   ⋅ + ⋅     A = c. That will yield the value of H′, according to (2b ): Helmholtz – On the physical meaning of the principle of least action. 15 (4e ) H′ = + 2 2 2 1 1 2 2 2 sin cos c d d d c dt dt dt α β α β β       − ⋅ ⋅ + + ⋅ ⋅             B A , and the values of the forces, when derived in accordance with (4a ), (4b ), and (4c ): (4f ) 2 2 2 2 cos sin , sin sin cos . d d A c B dt dt d d d B c dt dt dt α β β α α β β β β    = ⋅ − ⋅ ⋅           = ⋅ ⋅ + ⋅ ⋅ ⋅ − ⋅       B B The first constant term in the value of H′ can be omitted, since it enters into just the arbitrary constant of the value of E; the last term the linear one that is absent from the value of L. II. Example: Electrodynamic effect of closed, circular currents on the potential law (

). We would like to understand Jb to mean the current intensity of the b th circular current and pa to mean the coordinates of the ponderable masses, whose vis viva we shall neglect. The function H has the form: (5) H = − 1 2 , ∑∑[ ] Q J J ⋅ ⋅ b c b c b c , in which Qb,c are functions of pa , and each of the two successive indices b and c refer to all current circles. The induced electromotive forces, which I would like to denote by Eb , are: (5a ) Eb = − , ( ) d Q J dt   ⋅     ∑ b c c c , (5b ) Pa = 1 , 2 a Q J J p   ∂ ⋅ ⋅   ∂   ∑∑ b c b c b c . If a permanent magnet whose position is given by the coordinate p0 is involved then a series of linear terms will be added to H that I would like to denote by h, and which will have the form: (5c ) h = ∑[ ] R J⋅ b b b , in which the Rb are functions of the coordinates pa and p0 . The calculation of the forces comes about here by the same methods. The total electrodynamic energy is:

(* ) See, among others, my paper in this Journal, Bd. 72, pp. 70-72. Helmholtz – On the physical meaning of the principle of least action. 16 E = H − H J J   ∂ ⋅   ∂   ∑ b b b = − H. The function h will vanish, since: h − h J J   ∂ ⋅   ∂   ∑ b b b = 0. I have shown that the E = − H that enters into this, like the vis viva on ponderable masses, is a necessarily positive quantity for closed currents (* ). In addition, E is a homogeneous function of degree two in the Ja , and the same considerations can be applied to it as the ones that were discussed at the end of § 1, in which the determinant of the quantities 2H J J ∂ ∂ ⋅∂ a b cannot be equal to zero identically. III. Example. Thermodynamics. For a suitable choice of coordinates, the laws of reversible thermal processes can be represented in the form (**): (6) [ ] , . d L P F L p dt q dQ d dt dt ϑ ϑ  ∂ ∂    = − − −    ∂ ∂      ∂ = − ⋅       ∂ a a a F (6a ) E = F – ϑ ⋅ ϑ ∂ ∂ F

  • L. In this, F is what I called the “free energy,” which is a function of the coordinates pa and the absolute temperature ϑ, and L is the vis viva of the visible motions of the heavy masses, and thus a function of the pa and the qa that is homogeneous of degree two in the latter and independent of ϑ. dQ is the amount of heat that enters the body in the element of time dt – i.e., the work that is done by the environment – and only forces that bring about the motion of heat will be exerted. We will get this form, with different variables, when we set: ϑ ∂ ∂ F = − S, then let s denote a function of S, and further set:

(* ) This Journal, Bd. 72, pp. 86, et seq. and pp. 125. (**) See this Journal, Bd. 97, pp. 112-117. The vis viva of the visible motions L is added in order to insure the completeness that is desirable here. Helmholtz – On the physical meaning of the principle of least action. 17 (6b ) , . S s H s ϑ η ϑ η ϑ  ∂ ⋅ =  ∂  ∂  = − ⋅ − ⋅  ∂ F F If we now express H and s as functions of the pa and η then the equations above can be written [loc. cit., Abh. I, eq. (1d )]: (6c ) dQ dt = + ϑ ⋅ dS dt = η ⋅ ds dt , (6c ) Pa = − [ ] d L H L p dt q ∂ ∂   − −   ∂ ∂ a a   , (6c ) E = H – η ⋅ H η ∂ ∂

  • L. These equations, like the first ones (6), (6a ), have entirely the same form (cf., loc. cit., §
  1. as the ones for the motion of a monocyclic system whose kinetic potential is (H – L), and for which η denotes the velocity and s, the moment of motion of the monocyclic motion. If we let P(η) denote the force that is exerted in the direction of the velocity η then we will have: (6f ) P(η) ⋅ η ⋅ dt = − dQ. The analogy with the Lagrange expressions thus remains true here, as well, and in that way that entropy S of the motion might depend upon the moment of motion s of the monocyclic motion. In this case, this possibility of the coupling of equally-warm systems of bodies to a larger system, and the kinetic theory of gases lead one to assume that: ϑ = s ⋅ q, S = − ϑ ∂ ∂ F = C ⋅ log s. Thus, for any given system of bodies, the temperature would be set to be proportional to the vis viva of the heat motion, as Clausius and Boltzmann apparently already sought to before me (* ). The later applications of our example are independent of the question of the relationship between the functions S and s.

(* ) In a communication to the Berlin Academy (Sitzungsberichte 8 December 1884), I have expressed this theorem more definitively, but then I recognized that a step in the proof could not be justified without demanding a further restricting condition whose physical meaning I still do not know how to interpret, although I hope this will come to pass. I must therefore accept that the objection that L. Boltzmann made against that paper (Wiener Sitzungsberichte (2), Abh. XCII, Bd. 8 October 1885) is justified in that regard. Helmholtz – On the physical meaning of the principle of least action. 18 The motion of heat can apparently be regarded as an especially far-reaching example of the elimination of the coordinates pc , and for that reason H can be a complicated function of ϑ or η. However, my investigations into the combined monocyclic systems have shown that many combined forms of motions that are already quite similar to the internal molecular motions of warm bodies can also lead to the same laws. § 3. The derivation of the kinetic potential from the value of energy. For the physical investigations, it is usually easier and more certain to recognize what the factors are that influence the energy supply of any system of bodies, and thus, to determine the value of the function E, than it is to find all of the laws of the variations and determine the kinetic potential from that. We thus come to the examination of the extent to which the latter can be determined from the value of the energy supply. We assume that the quantity E has been found as a function of the pa and qa . For the form of that function, equation (4) implies that: (4) E = H − H q q   ∂ ⋅   ∂   ∑ a a a . It follows from this that: (7) E q ∂ ∂ b = − 2H q q q   ∂ ⋅   ∂ ∂   ∑ a a a b . For the variation of the function Φ that is given in equation (1a ) that is necessary in order to construct the equations of motion, it must be assumed that the first and second differential quotients of H always remain finite along the path that is traversed by the system. Thus, it follows from equation (7) that when all qa = 0, all ∂E / ∂qa will also be equal to zero. Other restrictions of the function E that are implied by the physical interpretation might be mentioned only briefly here.

  1. For a free system, the coordinates that are involved refer to only the relative positions of the masses of the system, because the same process of motion must be capable of proceeding for the same relative positions of the masses everywhere in space.
  2. The value of E must have a minimum for finite separations of the masses and finite velocities; otherwise, the supply of work in the system would be infinitely large. Thus, the value of E must necessarily be a positive quantity for infinitely-increasing qa . I Helmholtz – On the physical meaning of the principle of least action. 19 have sought to show the inadmissible consequences that follow from the opposite assumption for the electrodynamic theory of W. Weber (* ). Equation (4) next easily implies that when H can be represented as a sum of homogeneous, entire functions of the qa of differing degrees, the same thing will be true for E. If we denote a homogeneous function of degree n of the qa by Pn , and if: (7a ) H = ∑[ ] Pn n then (7b ) E = ∑[(1 ) ] − ⋅Pn n n , or: E = P0 – P2 – 2P3 , etc. The terms P1 of first degree drop out from the value of E, P0 corresponds to the potential energy, which is independent of the motion, which we have denoted by F above, and P2, to (− L). Higher terms enter into the problems of the mechanics of ponderable bodies only in the altered cases in which one eliminated certain coordinates pc . Moreover, the problem that was posed can also be solved when E is an entirely arbitrary function of the velocities that satisfies only the condition that was posed above that, from equation (7), all ∂E / ∂qa will approach the value zero when the qa = 0. For our purposes, it will be sufficient to retain the determination that was made above, in which the coefficients in the system of equations (7) should be finite, although there will also be cases in which those integrable coefficients would become infinite. In order to solve our problem, we would like to set: (8) qa = x ⋅ qa , in place of the qa in the values of E and H, and understand x to mean a variable factor whose variation will indeed change the absolute values of the qa , but not their mutual ratios. After that substitution has been made, I will denote the functions H and E by H′ and E′. One then has: (8a ) E x ∂ ′ ∂ = E q   ∂ ⋅   ∂   ∑ a a a q . Since all ∂E / ∂qa = 0 when all qa = 0, from the convention that was made for equation (7), but, from equation (8), that will happen only when x = 0, that will imply that:

(* ) This Journal, Bd. 72, pp. 85, § 4 to § 7; Bd. 75, pp. 35-62. – Helmholtz, Gesammelte Wissenschaftliche Abhandlungen, Bd. I, pp. 684. Helmholtz – On the physical meaning of the principle of least action. 20 (8b ) E x ∂ ′ ∂ = 0, when x = 0, and indeed, from our assumptions, for very small x, ∂E / ∂x must become proportional to x itself, if not a higher power of x. On the other hand, we have:

H x ∂ ′ ∂

H q   ∂ ⋅   ∂   ∑ a a a q , so: (8c ) x ⋅ H x ∂ ′ ∂

H q q   ∂ ⋅   ∂   ∑ a a a = − E′ + H′, and therefore: (8d ) E x ∂ ′ ∂ = − x ⋅ 2 2 H x ∂ ′ ∂ . If yet a second solution exists for the differential equation (8c): (8c) E′ = H′ − x ⋅ H x ∂ ′ ∂ , which we would like to denote by H″, then: 0 = H′ – H″ – x ⋅ x ∂ ∂ [H′ – H″] ; i.e.: log (H′ – H″) = log x + log C, or H′ – H″ = x ⋅ C, in which C can be a function of the qa . However, if (H′ – H″) can also be represented as a function of the qa , which are free of x, then it can be only homogeneous of first degree. We now need only to find a particular integral of equation (8c ). We will obtain one when we first write equation (8c ) for x = 0: E0 = H0 , and deduce this from (8c ): (E′ – E0) = (H′ – H0) – x ⋅ x ∂ ∂ [H′ – H0]. Dividing by x 2 will give this: Helmholtz – On the physical meaning of the principle of least action. 21 − 2 1 x (E′ – E0) = H H0 x x ∂   ′ −   ∂   . From the discussion that was made for (8b ), the quantity on the left-hand side is also finite for x = 0, and we then obtain by integrating between the limits x = 0 and x = 1: (8f ) H′ – H0 = − 1 0 2 0 E E x ′− ∫ ⋅ dx + H1 , in which the integration constant H1 , as was remarked, can be any homogeneous function of degree one in the qa . E is then derived uniquely from H by means of equation (4), while in the derivation of H from E, the function H1, which corresponds to the “hidden” motions, will remain undetermined. Whether such terms of first degree mix with each other will be ascertained in special problems mostly from the conditions under which the motion can take proceed backwards. If one then knows which physical quantities in the value of E in the problem in question are to be treated as coordinates and which, as velocities then one can, as a rule, solve the problem that was posed here. However, the opposite cases can occur in which resolving the state question seems to be uncertain. § 4. The general characteristics of the forces on moving systems. It is known that the forces that act upon systems at rest from the outside, which satisfy the law of the constancy of energy, exhibit certain legitimate relations with each other that are expressed in the equations: P p ∂ ∂ a b

P p ∂ ∂ b a , and that when these equations are fulfilled, the value of the potential energy can be found. One likewise finds that similar relations that are implied immediately by Lagrange’s expressions for the forces can be presented for moving systems that are subject to the law of minimum kinetic energy. They are thus not merely to be regarded as functions of the coordinates pa , as in the systems at rest, but also as functions of the velocities qa and the accelerations: (9) q′ a

dq dt a . Equation (1c ): Helmholtz – On the physical meaning of the principle of least action. 22 (1c ) Pa = − H d H p dt q ∂ ∂ 

  •   ∂ ∂ a a   immediately yields: Pa = − 2 2 H H H q q p q q q q ∂ ∂ ∂    
  • ⋅ + ⋅ ′     ∂ ∂ ∂ ∂ ∂     ∑ ∑ b b a a b a b b b . A. Forces and accelerations. As one sees, when represented in this form, the forces are linear functions of the accelerations. The coefficient of the q′ b in the value of the force Pa can thus be written: (9a ) P q ∂ ∂ ′ a b = 2H q q ∂ ∂ ∂a b = P q ∂ ∂ ′ b a ; i.e.: If the acceleration q′ b makes the force Pa larger by a certain amount then the same increase in the acceleration q′ a will make the force Pb larger by the same amount. Whether such an influence is present in a given case or not will depend upon whether the quantities 2H q q ∂ ∂ ∂a b are non-zero or equal to zero, respectively. The stated quantities are zero, for example, for the motions of a completely-free system of ponderable masses when they are referred to rectangular coordinates. Every individual force component affects the acceleration only in the direction of the coordinate to which it is refers. For the top in example I of § 2, we have: A β ∂ ∂ ′′ = B α ∂ ∂ ′′ = 0, A γ ∂ ∂ ′′ = C α ∂ ∂ ′′ = − A ⋅ cos β, C β ∂ ∂ ′′ = B γ ∂ ∂ ′′ = 0, in which α″, β″, γ″ denote the accelerations of the angles α, β, γ, resp. In example II for the electrodynamic effects, one has: P J ∂ ∂ ′ a b = q ∂ ∂ ′ b a E = 0, J ∂ ∂ ′ b c E = J ∂ ∂ ′ c b E . Helmholtz – On the physical meaning of the principle of least action. 23 The former equation says: Since the ponderomotive force of the circular current does not depend upon the acceleration of the current, the induced electromotive force can also not depend upon the acceleration of the current conductor (but possibly upon the velocities, in both cases). The latter equation says that when for a given position and form of the circular currents b and c, a rise in the force Eb that acts upon b cause an increase in Jc by electromagnetic induction, the same rise in the force Ec will produce the same effect on Jb . This reciprocal relationship is not present in example III for the thermodynamic effects, since the vis viva L of heavy masses does not depend upon the temperature, and therefore the product ϑ ⋅ qa will not enter into the value of (F – L) = H. B. Relations between forces and velocities. It follows further from equations (7) that: P q ∂ ∂ a b = − 2 2 2 H H d H p q p q dt q q ∂ ∂ ∂  
    •   ∂ ∂ ∂ ∂ ∂ ∂ a b b a a b   . Thus: (9b ) 2 2 2 2 . P P d H q q dt q q d d P P dt q dt q  ∂ ∂   ∂  + = ⋅   ∂ ∂ ∂ ∂          ∂ ∂ = ⋅ = ⋅      ∂ ∂ ′ ′      a b b a a b a b b a In the very great number of cases, where: (9c ) P q ∂ ∂ ′ a b = P q ∂ ∂ ′ b a = 2H q q ∂ ∂ ∂a b = const., it will follow that: (9d ) P q ∂ ∂ a b = − P q ∂ ∂ b a ; i.e., when a rise in the velocity qb for the same position and acceleration makes the force Pa increase, a corresponding rise in qa will diminish the force Pb . The case in which the prerequisite (9c ) is fulfilled have already been remarked in the examples that were cited in A. They best show the extended meaning of this theorem, but also the fact that one must control the fulfillment of the prerequisite, before one applies the simpler theorem (9d ), instead of the generally correct one (9b ). Example I. If a force that increases the angle β – i.e., the axis of the top tends to move from the vertical – causes a greater precessional motion α then a force that causes the precessional motion to accelerate will bring the axis to the vertical line. Helmholtz – On the physical meaning of the principle of least action. 24 Example II. Electromagnetic induction, according to Lenz. The motion of two circular currents with respect to each other that is produced by ponderomotive, electrodynamical forces will bring about electromotive, induced forces that act against the currents. The corresponding relationship will be true for the motion of a magnet relative to a current conductor. Example III. Thermodynamics. When rises in temperature raise the pressure of a system of bodies, compression of them will raise the temperature. For this case, we can write equation (9d ), after multiplying both sides by η, using the notations and explanations of § 2 for this example: (9e ) ( ) [ ] , log or, from (6 ) : . log f P P q dQ P q dt η η η η  ∂ ∂ ⋅ = −  ∂ ∂     ∂   ∂  = +    ∂ ∂   a a a a Now, from (6c ), one has: dQ dt = η ⋅ ds dt = η ⋅ s s d q p dt η η η   ∂ ∂ ⋅ + ⋅ ⋅   ∂ ∂   ∑ a a a . Thus, one has: (9f ) dQ q dt ∂     ∂ a   = η ⋅ s p ∂ ∂ a . From (6d ), one had: Pa = − [ ] d L H L p dt q ∂ ∂   − −   ∂ ∂ a a   , and since L is independent of h, one will have: (9g ) P η ∂ ∂ a = − 2H p η ∂ ∂ ∂a = s p ∂ ∂ a . by which, in conjunction with (9 f), the validity of equation (9c ) will be confirmed, and thus, also the applicability of our general theorem. Thus, any of the functions η in equation (6b ) can be regarded as the velocity, except that dη / dt must then correspondingly mean the acceleration. Also, the temperature ϑ, in turn, belongs to the integrating denominator η such that one also has: dQ q dt ∂     ∂ a   = log P ϑ ∂ ∂ a . Helmholtz – On the physical meaning of the principle of least action. 25 Since one must have dϑ / dt = 0 in this application, dQ q dt ∂     ∂ a   will be the velocity with which the heat enters when the parameter pa increases with the velocity, while ϑ remains constant. This will give the formulation of the theorem that was given above. The same considerations can also be applied to the reversible parts of thermoelectric and electrochemical processes. Peltier’s phenomenon: If warming at one place in a closed conductor brings about an electrical current then the same current will produce cooling there (ignoring the formation of heat by the resistance of the conductor.) Electrochemistry: If warming of a constant galvanic element raises the electromotive force then the current in it will make the heat latent (

). However, the formulas above not only exhibit the sense of the change, but also, at the same time, they give one information about how one is to deal with the quantities. C. Relations between the forces and coordinates. Finally, it follows from equation (9) that: (9h ) 2 2 1 2 . P P d H H p p dt q p q p d P P dt q q  ∂ ∂   ∂ ∂  − = −   ∂ ∂ ∂ ∂ ∂ ∂        ∂ ∂  = −   ∂ ∂    a b b a a b b a a b b a For the case of rest, where the right-hand side will be zero, this will yield the general law of conservative forces: (9i ) P p ∂ ∂ a b

P p ∂ ∂ b a . However, the same condition is also fulfilled when the motion proceeds temporarily in such a way that the right-hand side of (9h ) is equal to zero. Thus, we can also apply the law (9i ) in order to define a force function for the forces of warm bodies (monocyclic systems, resp.), in the event that only one of the functions η in equation (6b ) remains constant during the motion. If we therefore neglect the vis viva L on the associated motions then from equation (6b ) then we will have simply: Pa = − H p ∂ ∂ a ,

(* ) See my treatises on the thermodynamics of chemical processes. Sitzungsberichte der Berliner Akademie, 1882, 2 Feb., pp. 24-26; 1882, 27 July, pp. 825-835. Helmholtz – On the physical meaning of the principle of least action. 26 so our equation (9i ) will be fulfilled. However, we will almost always be in this case when we are concerned with the mechanics of terrestrial bodies that contain more or less heat. Even when the bodies are in a state of violent internal motion, we can, e.g., define force functions for the molecular forces for their elastic effects by means of the law that was proved here, and apply them as if their state of equilibrium were one of stable equilibrium in absolute rest.


Here, I would like to remark that the reciprocal relationships that were expressed in the equations: (9a ) P q ∂ ∂ ′ a b

P q ∂ ∂ ′ b a , (9b ) P P q q ∂ ∂ + ∂ ∂ a b b a = 2 d P dt q   ∂   ∂ ′   b a , (9h ) P P p p ∂ ∂ − ∂ ∂ a b b a = 2 1 2 d P P dt q q   ∂ ∂   − ∂ ∂   a b b a , in conjunction with the fact that the Pa are linear functions of the q′ a , which one can write as: (9k ) 2P q q ∂ ∂ ∂′ ′ a b c = 0, and with the previously-given definitions: (1) qa = a dp dt , (9) q′ a = a dq dt , are sufficient to prove that a kinetic potential exists such that the force Pa could be expressed in terms of the differential quotients of it in the way that Lagrange gave, and that the equations of motion could be reduced to the principle of least action. The relationships between the forces that were summarized here thus include a way of completely characterizing the motions that that are subject to the principle of least action. The proof of this theorem can be given immediately with the previously-prepared tools of analysis for the case in which no more than three coordinates pa are present. Helmholtz – On the physical meaning of the principle of least action. 27 However, theorems from the theory of potentials functions in spaces of three dimensions will be used for that. If one would wish to go on to more coordinates pa then one would need the corresponding theorems for a larger number of coordinates. They can be defined to the extent that they are necessary for our proof. However, since that is something that is interesting in its own right, it seems to me that it would not be suitable to go into that peripherally, and for that reason I would prefer to give the stated proof on another occasion. Other general characteristics of the motions that take place under the principle of least action will be described in the next paragraphs.


“Ueber die physikalische Bedeutung des Princips der kleinsten Wirkung,” J. f. d. reine u. angew. Math. 100 (1887), 213-222. On the physical meaning of the principle of least action. (Continuation of the paper on page 137 of this volume) (By H. von Helmholtz) Translated by D. H. Delphenich


§ 5. Generalization of Hamilton’s differential equation. Hamilton has taught us how to represent the function Φ that he defined, under somewhat restricting assumptions, to be: Φ = 1 0 ( ) t t F L dt − ⋅ ∫ as a function of time t = t1 – t0 and the values of the coordinates at the time t1 and t0 . We would like to denote the moments of motion for time t1 by pa and sa and the ones for time t0 , by pa and sa , resp. It is assumed that the changes in the pa during the time interval (t1 – t0) result from the laws of motion. The value of the integral that is denoted by Φ can then be calculated as a function of the pa , pa , and t, and for this kind of representation, we will have: (10) , , s p E t  ∂Φ = −  ∂   ∂Φ  = ∂   ∂Φ  =  ∂ a a a a s p or (10a ) dΦ = E ⋅ dt − ∑ ∑ [ ] [ ] s dp d ⋅ + ⋅ a a a a a a s p . This entire conversion can also be performed under the extended assumptions that were made in § 1 (* ). For the present purposes, it will suffice to do this under that assumption

(* ) As C. G. J. Jacobi has already remarked in his lectures on dynamics. Lecture XIX. Helmholtz – On the physical meaning of the principle of least action. 29 that Pa = 0. Moreover, the function H might be an arbitrary function of the pa and qa , as long as it fulfills the continuity condition that were discussed above. As is known, the first two systems of equations (10) are obtained by carrying out the partial integrations that convert the variation of Φ by the δqa into a variation by δpa . The differential quotient with respect to time ∂Φ / ∂t that enters into the third of equations (10) will be obtained for unvaried values of the pa and pa when we look for the change in the value of Φ that occurs as a result of an actual motion for a lengthening of the time by dt. Thus, pa will increase by qa ⋅ dt, and on the other hand, equation (1a ) shows that the stated variation of Φ equals the final value of H ⋅ dt. Thus: H ⋅ dt = q t p     ∂Φ ∂Φ     + ⋅     ∂ ∂     ∑ a a ⋅ dt, or, from the first equations (10) and (4): t ∂Φ ∂ = E. The following relations between the quantities sa , sa , E now result from equations (10) when those quantities are represented as functions of the pa , pa , and t: (10b ) , , , s s p p s p  ∂ ∂  = ∂ ∂   ∂ ∂  = − ∂ ∂   ∂ ∂  =  ∂ ∂  a b b a a b b a a b b a s p s s p p (10c ) , . E s p t E t  ∂ ∂ = −   ∂ ∂  ∂ ∂  =  ∂ ∂  a a a a s p When these conditions are fulfilled: (10d ) E ⋅ dt − ∑ ∑ [ ] [ ] s dp d ⋅ + ⋅ a a a a a a s p = dΦ will be the complete differential of a function of the pa , pa , and t. Moreover, if the quantities E, sa , and sa that enter into the differential equation (10d ) are to correspond to the energy and moment of motion for a possible motion of the Helmholtz – On the physical meaning of the principle of least action. 30 system that is not acted upon by external forces then they will not be completely independent of each other. In fact, as Hamilton has shown already, the equations of motion of the system can be represented by the system of equations: (10e ) sa = const. Since the sa are functions of the pa , of t, and of the pa , in general, the pa will become functions of time from this, and the pa and sa , which represent integration constants, can be determined, so the position of the system will be given for later instants. Now, if one substitutes the value of pa that is thus obtained in the value of E for a conservative system then it would be converted into a function of the pa and sa that cannot, however, be dependent upon time for any longer. If we revert to equations (10) then this will say that: (10f ) t ∂Φ ∂ = E ,   ∂Φ   ∂   a a p p , or that a first-order differential equation must exist between the differential quotients ∂Φ / ∂t and ∂Φ / ∂pa of the function Φ whose coefficients depend upon only the pa . However, we can likewise traverse the path of a system backwards from a certain final position, in which we will have to treat the values of the pa and sa as if they were constant. The equations: sa = const. will then yield the quantities pa as functions of t and the fixed values of sa and pa . When these values of the pa are substituted into the function E, it will prove to be a function of the sa and pa , from which, t must be absent. It follows from this that there must be a second differential equation for the function Φ: (10g ) t ∂Φ ∂ = G p , p   ∂Φ   ∂   a a , between the differential quotients ∂Φ / ∂t and ∂Φ / ∂pa whose coefficients depend upon only the pa . Hamilton gave a specific form to these two differential equations for the function Φ, since he considered the two components components of the electrokinetic potential to be given, and indeed in the older, more restricted, form, while here we seek only the most general character of those equations that simultaneously corresponds to the principle of the conservation of energy and that of least action. This also comes down to saying that any pair of associated sa and sa should be values of the same function at the beginning and the end of the time interval t. If we apply the Helmholtz – On the physical meaning of the principle of least action. 31 differential equation (10d ) to very small time intervals t then for actual motions, the quantities will have to be set to: (10h ) pa – pa = qa ⋅ t, and these qa will approach the values of the velocity all the more closely as t becomes smaller. Likewise, however, the difference (sa – sa ) must also approach zero with decreasing t. If the differential equation (10d ) and these auxiliary conditions are fulfilled then the variational problem will also be fulfilled. To that end, one needs only to hold the pa constant, and vary the pa in the way that they would change when one varies the time interval dt for an unperturbed traversal of the motion; thus: dpa = 0 and dpa = qa ⋅ dt. From (10d ), this will imply that: dΦ = {E – ∑ [sa ⋅ qa ]} ⋅ dt, or (10i ) Φ = 0 { [ ]} t ∫ dt E s q ⋅ − ⋅ ∑ a a a in which, one must take E, sa , and qa under the integral sign to have the actual values that they have at time t, which is the start of the corresponding motion. That is the previous representation of the function Φ, and the differential equation (10d ) implies that this value of Φ must satisfy the minimum condition for the actual path of the system that is retraced. Namely, when we think of the path of the system from the position that is denoted by 0 to the one that is denoted by 2 as being divided by an intermediate, variable position, which we will denote by 1, then from (10i ): Φ0, 2 = Φ0, 1 + Φ1, 2 . If we now vary the coordinates of the intermediate position then, from (10d ), we will have: δΦ0, 2 = − δΦ1, 2 = −∑[ ] s p ⋅δ a a a , and as a result: δΦ0, 2 = 0. As is easily seen, this can be carried over to an arbitrary subdivision of the path into arbitrarily many pieces, and that will imply that the integral Φ0, 2 will not vary when makes any sort of small changes to the intermediate positions. The minimal theorem depends upon the fulfillment of the differential equation (10d ), and for the extended form of H, as well as for the original, more restricted form that Lagrange and Hamilton started with. Helmholtz – On the physical meaning of the principle of least action. 32 The conditions that exist between the quantities that enter into the differential (10d ) that were discussed in this paragraph reduce to one equation that gives E as a function of the pa and sa when one employs C. G. J. Jacobi’s ( * ) conversion. § 6. Reciprocity for the forward and reverse motions. I call the motion of a system reversible when the sequence of positions that it passed through during its forward motion can also be traversed by the reverse motion without the action of other forces, and with the same intermediate times for each pair of equal positions. The reverse motion will be possible when the values of the kinetic potential is not changed when one changes the signs of all qa . However, if products and powers of the qa of odd degree occur, which happen, e.g., for the interference of hidden motions (§ 1), then the motion will be reversible only when it is mechanically possible to also make some of the constants (viz., the velocities of the hidden motions) negative in such a way that the quantity H does not change in value under a simultaneous setting to negative values of these constants and all qa . This is easily obtained from a consideration of the equations of motion (1c ) when one considers that dt must also assume the opposite sign under reversal. Law of reciprocity. In my acoustic investigations (**), I proved a law of reciprocity that I easily extended to small oscillations around a stable equilibrium position of an arbitrary, oscillating, mechanical system in my lectures. However, it is more general, and true for any moving system that is subject to the law of least action and has a reversible motion. The original motion A will be unchanged when one keeps all initial positions at time t0 unchanged, but increases one of the moments of motion s1 by ds1. In that way, the coordinate p2 must increase by dp2 at time t. If one then changes the moment of motion s2 in the reversed motion when it goes through the value pa of the coordinates by the same amount that one changed s1 then the coordinate p1 will be changed by just as much as p2 after the time interval t = t1 – t0 . Since all dt and dpa must be zero, we will have: (11) dsa

s dp p   ∂ ⋅   ∂  ∑ a b b b .

(* ) Jacobi, loc. cit., Lecture XX. (**) “Theorie der Luftschwingungen in Röhren mit offenen Enden,” this Journal, Bd. 57, pp. 27-30. Helmholtz – On the physical meaning of the principle of least action. 33 Of these, only ds1 should be non-zero. For the sake of brevity in our notation, we would like to write: (11a ) σa, b

p ∂ ∂ a b s = − s p ∂ ∂ b a . From (10b ), the quantities σa, b are the same as the σb, a . We denote the determinant of the quantities σa, b by D(σ) . If these are not identically zero then, from equations (11), with the restriction that was made, we will have: (11b) dp2 = ( ) 1,2 log D σ σ ∂ ∂ ⋅ ds1 . By contrast, if we demand that all dpa = 0, and likewise all dsa = 0, with the exception of ds2 , then we will get, with consideration given to (11a ), the corresponding equation: dp1 = ( ) 1,2 log ( ) D σ σ ∂ − ∂ − ⋅ ds2 for the forward motion. For the reverse motion, the signs of the moments of motion are inverted, and thus, those of the σa, b , as well; thus, one will have: (11c ) dp1 = ( ) 1,2 log D σ σ ∂ ∂ ⋅ ds2 for them. It will then follow from the combination of equations (11b ) and (11c ) that: dp2 : ds1 = dp1 : ds2 , with which, the theorem that was expressed above will be proved. As far as the exceptional case is concerned, in which the determinant D(σ) is equal to zero identically, in that case, the dpb would not necessarily be equal to zero when all dsa are also equal to zero, without exception. Now, since the motion of the system must be determined completely, and therefore, the values of the pa would not be double-valued during the course of time t if the initial positions pa and the initial velocities were given at the start of the time interval t, this exceptional case could occur only if the qa were not determined completely by the values of the sa , which we excluded in the concluding remarks of § 1. It is therefore unnecessary to pay any attention to that exceptional case. The sudden changes in the values of the sa and the sa here, under which, the coordinates themselves should suffer no changes in their values, would come about mechanically in such a way that one lets forces Pa act during a very small time interval, Helmholtz – On the physical meaning of the principle of least action. 34 but with corresponding intensity. In that way, the various rising degrees of velocities can be traversed without also having to change the position of the time at which the greatest velocity is attained notably. For such an assumption, it follows from equation (1) that: − ∫ Pa ⋅ dt = s1 – s0 . Since Pa denoted the force that was exerted upon the moving system from the outside, in the notation that was used there, (− Pa ) will be the opposite external force that is required in order to bring about the desired change in motion. Following Sir W. Thomson, we would like to refer to such a force effect as a push in the direction of the coordinate pa . In that regard, one must remark that, in general, the forces Pa are aggregates of components that act upon different parts of the system, and are distributed in such a way that the aggregate of forces Pa performs no work under any variation of the remaining coordinates, except for pa . Moreover, since we have to distinguish between forward and reverse motion, it would be preferable to consider values of the dsa that increase the forward moment sa , as well as displacements dpa that increase the distance (pa – pa ), to be positive for the forward motion, while for the reverse motion, values of the dsa that increase the reverse moment (− sa ) and displacements (− dpa ) that increase the distance (pa – pa ) are considered to be negative, and to treat them as equivalent to the positive changes in dsa and dpa for the forward motion. The reciprocity theorem can then be expressed as: If a push that increases only the moment sa by dsa at the initial position of the forward motion has increased the coordinate of the final position pb by dpb after the time interval t then an equivalent reverse push that increases the reverse moment (− sb ) by the same amount at the earlier final position would provoke the equivalent reverse change in the coordinate pa after time t. § 7. Introducing the moment of motion as an independent variable, in place of velocity. The differential equation (10d ) gives many opportunities for the transformation of values by a different choice of independent variables, which Hamilton (* ) has already partially employed. However, since he assumed that the vis viva was a homogeneous function of second degree in the velocities there, here, I shall allow myself to carry out those of these conversions under which the action of external variable forces does not

(* ) Philosph. Transact., 1835, pt. I, pp. 98-100. – See also Jacobi, loc. cit., Lect. XIX. Helmholtz – On the physical meaning of the principle of least action. 35 need to be excluded for the general form of the problem. One will get them when one replaces the velocities qa in the values of H (E, resp.) with the moments of motion sa . We have considered the kinetic potential H to be a function of the pa and qa . Thus: (12) dH = H H dp dq p q   ∂ ∂ ⋅ + ⋅   ∂ ∂   ∑ a a a a . We have denoted: − H q ∂ ∂ a = sa . It then follows from this that: (12a ) dE = d [H + ∑( ) s q⋅ a a a ] = H dp q ds p   ∂ ⋅ + ⋅   ∂  ∑ a a a a . If the determinant of the quantities ∂sa / ∂qb is not equal to zero then we can introduce the pa and sa as variables, in place of the pa and qa , and, from (12a ), that will yield: H p ∂ ∂ a

E p ∂ ∂ a , qa

dp dt a

E s ∂ ∂ a . In this, H is assumed to be a function of the pa and the qa for the partial differentiations, but E is assumed to be a function of the pa and sa . That will imply the value of force that was given in (1c ): (12b ) , , E ds P p dt dp E dt s  ∂ = − −   ∂  ∂  =  ∂  a a a a a (12c ) H = E − E s s   ∂ ⋅   ∂   ∑ a a a . The corresponding variational problem will take on a somewhat different form from the one that Hamilton gave to it: (12d) Ψ = 1 0 t t ds dt E p P dt       ⋅ + +             ∫ ∑ a a a a . Helmholtz – On the physical meaning of the principle of least action. 36 In this, Pa are regarded as functions of only time, and E, as a function of the pa and sa . One varies the pa and the sa independently of each other and demands that the δsa = 0 at the limits of the time interval. The condition: δΨ = 0 will then give the two systems of equations of motion (12b ) with no other auxiliary equations. In this manner of representation, we do not at all generally need to know the kinetic energy, but we must know the quantities sa that we can derive from the qa for the general form of the E only by means of H. We will obtain the corresponding form of the differential equation (10d ) for the case in which the P are equal to zero when we add to both sides of that equation, namely: (10d ) dΦ = E ⋅ dt − ∑ (sa ⋅ dpa ) + ∑ (sa ⋅ dpa ), the term: d {∑ (sa ⋅ dpa ) − ∑ (sa ⋅ dpa )}. That will give: (12c ) [ ] [ ] [ ] [ ] . d s p E dt p ds d     Φ + ⋅ − ⋅ = ⋅ + ⋅ − ⋅       = Ψ ∑ ∑ ∑ ∑ a a a a a a a a a a a a s p p s Thus, if E, the pa , and the pa are represented as functions of time t and the sa and sa then: (12f ) , , , , . p p s s p s E p s t E t  ∂ ∂ =  ∂ ∂   ∂ ∂ =  ∂ ∂   ∂ ∂  = − ∂ ∂   ∂ ∂  = ∂ ∂   ∂ ∂  = −  ∂ ∂  a b b a a b b a a b b a b a a a p p s s p s p s The middle equation of this system can be employed, again, like (10b ), in order to define a second reciprocity theorem, by which, a displacement δp1 is performed at the start of the time interval t, while all other pa and all sa remain unchanged; let s2 be changed by δs2 after the time interval t. Under the reverse motion, only p2 will be changed by δp2, and Helmholtz – On the physical meaning of the principle of least action. 37 the moment s1 will be changed by δs1 after the time interval t. One will then have, once more: (12g ) δp1 : δs2 = δp2 : δs1 , assuming that the determinant of the equations: δpa

p d   ∂ ⋅   ∂  ∑ a b b b s s is not equal to zero. If it is then the two positions will be reciprocal foci of the motion. Berlin, April 1886.


Send us your comments!