Category: Physics

  • Galileo’s Two New Sciences Galileo’s Physics: Strength of Materials and Motion

    Galileo’s Two New Sciences Galileo’s Physics: Strength of Materials and Motion

    This document is a translation of Galileo’s “Two New Sciences” from 1914, including introductions and prefaces. The work covers various scientific concepts, including the strength of materials, discussing why larger structures might be proportionally weaker than smaller ones and exploring the force needed to cause breakage. It also examines motion, differentiating between uniform and accelerated motion, and investigates projectile trajectories, identifying their path as parabolic under certain assumptions. Furthermore, the text touches upon acoustics and the relationship between string vibrations and musical intervals, as well as debates philosophical explanations of physical phenomena, such as the cause of acceleration in freefall.

    Galileo’s Physics: Strength of Materials and Motion

    Based on the sources provided, Galileo’s physics, as presented in his Dialogues concerning Two New Sciences, covers fundamental areas including the strength of materials and the science of motion. This work is considered by Galileo himself to be among the most important of his studies and has been confirmed by posterity as his masterpiece and the foundation of modern physical science. The book is structured as a dialogue between three interlocutors: Salviati, Sagredo, and Simplicio.

    Key Areas of Physics Discussed:

    The Dialogues are divided into discussions over four days, focusing primarily on two new sciences:

    1. The science of the resistance of solids to fracture.
    2. The science of motion (local motion), which includes uniform motion, naturally accelerated motion, violent motions, and projectiles.

    The Science of Resistance of Solids to Fracture (First Day)

    This part of the Dialogues investigates the strength of materials. Galileo examines why larger structures seem disproportionately weaker than smaller, similar ones.

    • Strength and Scale: Observations are made that the strength and resistance against breaking do not increase in the same ratio as the amount of material; for example, a nail twice as big might support eight times the weight.
    • Causes of Cohesion: The discussion touches upon what holds the parts of a solid together, considering possibilities like a “gluey or viscous substance” and the role of vacuum. The interlocutors are puzzled by how such a binding force persists in materials exposed to high heat.
    • Resistance of Beams and Prisms: Galileo analyzes how the shape and orientation of beams affect their resistance to fracture. He demonstrates that a ruler or prism is stronger when standing on edge than when lying flat, in proportion to its width to thickness. For similar cylinders and prisms, the moments (stretching forces due to their own weight and length acting as a lever arm) bear a ratio that is the sesquiplicate ratio (i.e., power of 3/2) of their lengths. A problem discussed is determining the maximum length a prism can be increased without breaking under its own weight and a load. He also explores what shape should be given to a beam to have constant bending strength throughout its length, identifying a parabolic solid as having this property.
    • Strength of Hollow Solids: The discussion also includes the strength of hollow solids, which are used in nature and art to increase strength without adding weight, citing examples like bird bones and reeds.

    The Science of Motion (Second, Third, and Fourth Days)

    This section, particularly the third and fourth days, is where Galileo lays down the foundations of the science of motion.

    • Uniform Motion: Uniform motion is defined as motion where equal distances are traversed in equal time intervals. Key theorems are presented relating distance, time, and speed. For unequal speeds, the time intervals required to traverse a given space are inversely as the speeds. The distance traversed is the product of the speed and the time, or conversely, the speed is the ratio of distance to time.
    • Naturally Accelerated Motion: Galileo defines uniformly accelerated motion as starting from rest and acquiring equal increments of speed during equal time intervals. This definition leads to important conclusions about the relationship between distance and time for falling bodies.
    • The distance traversed by a body falling from rest is in the duplicate ratio of the time, meaning the distance is proportional to the square of the time.
    • The increments in the distances traversed during equal time intervals are to one another as the odd numbers beginning with unity.
    • Galileo presents arguments and experiments supporting the idea that falling bodies accelerate and that this acceleration is initially slow, increasing continuously.
    • Falling Bodies and Medium Resistance: Contrary to some older ideas (attributed to Aristotle), Galileo’s physics suggests that bodies of the same substance, regardless of weight, move with the same speed in the same medium. Differences in falling speed in a medium are attributed to the resistance of the medium, which affects bodies of different specific gravities or shapes differently. He notes that for heavy, dense bodies falling short distances, the difference in fall times is negligible. He uses pendulum experiments to study the times of descent for bodies of different weights traversing equal arcs, finding their speeds to be equal. The resistance of the medium eventually reduces speed to a constant value for any body falling from rest.
    • Motion on Inclined Planes: The time of descent for a body along an inclined plane is related to the time of fall along a vertical line. A significant result is that if a vertical line is the diameter of a semicircle, and an inclined line drawn from the top of the vertical line is a chord of that semicircle, the time of descent along the inclined line is equal to the time of fall along the vertical line. He also investigates the path that allows for the quickest descent between two points. The speed acquired by a body descending along an inclined plane is equal to the speed acquired by falling freely from the same vertical height.
    • Projectiles (Violent Motions): Projectile motion is described as a composition of two motions: a uniform horizontal motion and a naturally accelerated vertical motion. Galileo demonstrates that the path of a projectile is a semi-parabola. He discusses how the speed acquired by a body falling from a certain height can be used as a standard to determine the uniform horizontal velocity required to describe a specific parabola. The momentum or speed of the projectile at any point in its parabolic path can be determined. The effect of the medium’s resistance on projectiles is discussed and deemed negligible for practical purposes with fast, heavy projectiles.

    Concepts and Methods

    Throughout the Dialogues, Galileo employs a mathematical approach, heavily relying on geometry and proportions to define concepts and demonstrate theorems. He contrasts the power of sharp distinction belonging to geometry with logic, which he sees more as a tool for testing arguments rather than stimulating discovery. The text also touches upon abstract concepts like continuous quantities, indivisibles, and the infinite. Terminology, such as “moment,” “speed,” “force,” and “momentum,” is used, sometimes with discussion of their meanings and variations.

    Galileo’s Physics of Motion

    Based on the provided excerpts from Galileo’s Dialogues concerning Two New Sciences, the physics of motion is a central theme, constituting the “Second new science” discussed over the Third and Fourth Days. This science investigates what Galileo calls “local motion”. The discussion on motion is divided into three parts: uniform motion, naturally accelerated motion, and violent motions or projectiles.

    Uniform Motion

    Galileo begins by defining uniform motion as one in which the distances traversed by a moving particle during any equal intervals of time are themselves equal. He adds the word “any” to the older definition to emphasize that this equality must hold for all equal time intervals, not just specific ones.

    Key theorems regarding uniform motion are presented, establishing fundamental relationships between distance, time, and speed:

    • If a particle moves uniformly, the time intervals required to traverse two distances are to each other in the ratio of these distances. This means time is proportional to distance for constant speed.
    • If a particle traverses two distances in equal time intervals, these distances will bear to each other the same ratio as the speeds. Conversely, if the distances are as the speeds, the times are equal. This implies distance is proportional to speed for equal times.
    • The time required to traverse a given distance at different speeds is inversely proportional to those speeds.
    • If two particles move uniformly with different speeds over unequal times, the ratio of the distances covered bears the compound ratio of the speeds and time intervals. This is equivalent to stating that distance = speed × time.
    • Conversely, if two particles move uniformly with unequal speeds over unequal distances, the ratio of the time intervals occupied is the product of the ratio of the distances and the inverse ratio of the speeds. This is equivalent to stating that time = distance / speed.

    Naturally Accelerated Motion

    Following uniform motion, Galileo discusses naturally accelerated motion, such as that of heavy falling bodies. He aims to find a definition that best fits natural phenomena, emphasizing that while one can invent arbitrary motions, the focus here is on motions found in nature.

    Galileo proposes a definition for uniformly accelerated motion: it is a motion which, starting from rest, acquires equal increments of speed during equal time intervals. He contrasts this with the idea that speed might be proportional to the space traversed, arguing and demonstrating that this latter idea is incorrect.

    A crucial consequence of Galileo’s definition of uniformly accelerated motion is the relationship between distance and time for a body falling from rest:

    • The spaces described by a body falling from rest with a uniformly accelerated motion are to each other as the squares of the time-intervals employed in traversing these distances. This means distance is proportional to the square of time (d ∝ t²).
    • From this, it follows that if equal time intervals are considered from the beginning of the motion, the spaces traversed during these intervals are to one another as the series of odd numbers (1, 3, 5, 7, etc.). So, in the first time interval, it covers a certain distance; in the second equal time interval, it covers three times that distance; in the third, five times, and so on.

    Galileo investigates falling bodies and the effect of the medium’s resistance. He challenges the Aristotelian view that bodies of different weights move in the same medium with speeds proportional to their weights. Through arguments and observations, he suggests that in a medium devoid of resistance (a vacuum), all bodies would fall with the same speed. Differences observed in media like air are attributed to the medium’s resistance, which affects bodies of different specific gravities or shapes differently. Pendulum experiments are used to support the idea that bodies of different weights (but the same substance or specific gravity) traverse equal arcs in equal times, suggesting their speeds are equal when medium resistance is accounted for.

    Galileo also analyzes motion on inclined planes. He shows that the speed acquired by a body descending along an inclined plane is equal to the speed acquired by falling freely from the same vertical height. The time of descent along an inclined plane is related to the time of fall along the vertical height, bearing a specific geometrical relationship (if the vertical is the diameter of a semicircle and the inclined plane is a chord from the top, the time of descent along the chord equals the time of fall along the diameter). He discusses how the speed varies with the inclination of the plane, being maximum along a vertical direction and diminishing as the plane diverges from vertical.

    The concept of momentum, velocity, impetus, tendency to motion, ability, or energy is used throughout the discussion of motion, particularly in relation to acquired speed and the force of descent. Galileo defines a standard for measuring speed or momentum, often using the speed acquired after falling a certain distance (like the height of a spear) as a reference.

    A key idea related to motion on planes is that any velocity once imparted to a moving body will be rigidly maintained as long as external causes of acceleration or retardation are removed, such as on horizontal planes. This implies that motion along a horizontal plane is perpetual if the velocity is uniform, as it cannot be diminished or destroyed.

    Violent Motions and Projectiles

    The physics of motion culminates in the discussion of violent motions, specifically the motion of projectiles. Galileo describes this motion as a composition of two independent motions: one which is uniform and horizontal, and another which is vertical and naturally accelerated.

    By analyzing the combination of these two motions, Galileo demonstrates that the path of a projectile is a semi-parabola. He uses geometry and the previously established properties of uniform and naturally accelerated motion to show how the horizontal distance traveled is proportional to time (due to uniform horizontal velocity) and the vertical distance fallen is proportional to the square of time (due to natural acceleration downwards).

    The concept of the composition of momenta or velocities is important here. Galileo states that when the motion of a body is the resultant of two uniform motions (one horizontal, one vertical), the square of the resultant momentum is equal to the sum of the squares of the two component momenta. When one component is uniform horizontal and the other is naturally accelerated vertical, the resultant path is a parabola, and the momentum is always increasing because the vertical speed increases. The momentum at any point in the parabolic path is determined by combining the constant horizontal momentum and the vertical momentum acquired by falling from rest through the corresponding vertical height. The square of the resultant momentum is equal to the sum of the squares of the two components. The momentum acquired at the terminal point of a semi-parabola is equal to that acquired in falling through a vertical distance equal to the sum of the ‘sublimity’ (related to the initial horizontal speed) and the altitude of the semi-parabola.

    Galileo acknowledges that the resistance of the air affects projectile motion, altering the ideal parabolic path and causing motion to finally cease. Air resistance offers greater impedance to less dense bodies and increases with the speed of motion. However, for heavy, dense projectiles moving at speeds that are not excessively high, the effect of air resistance is considered small and negligible for practical purposes, allowing the parabolic trajectory to be observed very exactly.

    Overall, Galileo’s physics of motion, as presented in these excerpts, moves from precise definitions of uniform and accelerated motion to the analysis of compound motions like projectile motion, using mathematical reasoning and geometrical demonstrations to establish key theorems and properties while considering the influence of external factors like medium resistance.

    Galileo on Material Resistance and Strength of Solids

    Based on the provided excerpts, the “First new science” discussed in Galileo’s Dialogues concerning Two New Sciences deals with the resistance which solid bodies offer to fracture by external forces. This subject is considered of great utility, especially in the sciences and mechanical arts, and is said to abound in properties and theorems not previously observed or demonstrated.

    One of the central findings presented is that when machines and structures are built of the same material and maintain the same ratio between parts, larger ones will not be as strong or as resistant against violent treatment as smaller ones. This means that similar structures are not proportionately strong. Even if the material were absolutely perfect, the mere fact that it is matter means the larger machine will not correspond with exactness to the smaller in strength. There is a necessary limit for every structure, whether artificial or natural, beyond which neither art nor nature can pass, assuming the material and proportion are preserved. Examples supporting this include:

    • A long, thin rod that just supports itself will break if a hair’s breadth is added to its length, and a larger rod of the same proportion will also break under its own weight, while shorter ones will be strong enough to support more than their weight.
    • A large column might break under its own weight, even while preserving the ratio of length to thickness found in a smaller, intact column made of the same stone.
    • Conversely, when decreasing size, the strength of the body is not diminished in the same proportion; smaller bodies have greater relative strength. A small dog can carry multiple dogs of his own size, but a horse likely cannot carry one of its own size. Similarly, a small scantling or marble cylinder will not break when falling from a height that causes a large beam or column to go to pieces. This leads to the conclusion that similar solids do not exhibit a strength which is proportional to their size.

    The coherence of materials, which provides this resistance, is discussed as being produced by several causes. In materials like wood and rope, fibers run lengthwise and render the material strong. In materials like stone or metal, the coherence seems to be due to nature’s repugnance which she exhibits towards a vacuum, and potentially a gluey or viscous substance which binds parts firmly together. The force of the vacuum can be demonstrated by attempting to separate two highly polished plates of marble, metal, or glass placed face to face; they exhibit such a repugnance to separation that the upper one can lift the lower one. This resistance is present between the parts of a solid and contributes to their coherence. The force of the vacuum can be measured, for instance, by the weight of a column of water (about eighteen cubits high) that can be sustained by a pump. However, while the vacuum is a sufficient cause for holding two polished plates together, it alone is not sufficient to bind together the parts of a solid cylinder of marble or metal when pulled violently. This suggests the need for another cause. The combined resistance of an extraordinarily great number of exceedingly minute vacua between the smallest parts might, however, provide a significant resistance.

    The discussion moves from resistance to a direct longitudinal pull to resistance against bending forces. A solid capable of sustaining a very heavy weight longitudinally is easily broken by the transverse application of a weight. Fracture in a beam fixed at one end occurs at the point where the support acts as a fulcrum for a lever. The magnitude of the applied force at the end bears to the magnitude of the resistance in the thickness of the prism a ratio related to the length of the beam compared to half of its thickness (or the semidiameter for a cylinder).

    Several propositions quantify how bending strength varies with the dimensions of prisms and cylinders made of the same material:

    • A prism or ruler whose width is greater than its thickness offers more resistance to fracture when the force is applied in the direction of its breadth (standing on edge) than in the direction of its thickness (lying flat). The resistance is in the ratio of the width to the thickness.
    • Considering the effect of the prism’s own weight when fixed horizontally at one end, the bending moment due to its weight increases in proportion to the square of the length.
    • In prisms and cylinders of equal length but unequal thicknesses, the resistance to fracture (bending strength) increases in the same ratio as the cube of the diameter of the base. Longitudinal resistance depends on the base area (square of diameter), but transverse resistance involves the lever arm of the radius, leading to the cubic relationship.
    • For prisms and cylinders which differ in both length and thickness, the resistance to fracture (load they can support at their ends) is directly proportional to the cubes of the diameters of their bases and inversely proportional to their lengths.

    Regarding beams supported at both ends, fracture typically occurs at the middle under a central load. If a cylinder is supported at both ends and a force is applied at some point other than the middle, a different force is required to produce fracture. The resistance to fracture at any two points on the beam is in the ratio of the rectangles formed by the segments into which each point divides the total length. This means the prism grows constantly stronger and more resistant to pressure at points more removed from the middle. This suggests that in large heavy beams, a considerable portion near the ends could be cut away to lessen the weight without significantly diminishing strength.

    Galileo also discusses the problem of designing a solid that is equally resistant at every point. He demonstrates that if a prism is cut along a diagonal line, one resulting shape grows weaker as it is shortened, while another grows stronger. This leads to the idea that there must be a shape that offers the same resistance at all points. Cutting a prism along a parabola achieves this. This process removes one-third of the volume, reducing the weight by thirty-three percent without diminishing strength, a fact of utility in the construction of large vessels where lightness is important. It is noted in the source that this curve is actually a catenary.

    Finally, the strength of hollow solids is examined. These are frequently employed in art and nature (e.g., bones of birds, reeds) to greatly increase strength without adding weight. A hollow lance or tube is much stronger than a solid one of the same length and weight. For two cylinders, one hollow and one solid, having equal volumes and lengths, their bending strengths are to each other in the ratio of their diameters. The hollow tube’s strength exceeds the solid cylinder’s strength in the proportion that its diameter exceeds the solid cylinder’s diameter.

    Galileo’s Mathematical Physics

    Drawing on the provided excerpts, Galileo’s Two New Sciences deeply explores several mathematical concepts, which are foundational to the physical principles being discussed. The author, referred to as “our Academician,” demonstrates his findings through geometrical methods, suggesting that while some conclusions might have been previously reached, they had not been proven “in a rigid manner from fundamental principles”. Geometry is presented as a powerful tool for “sharp distinction” and stimulation to discovery, potentially surpassing logic in this regard. Euclid’s elements are assumed to be familiar to the reader.

    Several key mathematical concepts are woven throughout the text:

    • Geometry as the Method of Proof: The entire framework of the discussion relies heavily on geometrical demonstrations. Propositions regarding the strength of materials and the motion of projectiles are proved using figures, lines, ratios, and areas. The discussion even includes specific geometrical problems, such as describing a circle where lines drawn from two points to any point on the circumference maintain a constant ratio.
    • The Nature of the Infinite and Indivisible: A significant philosophical and mathematical debate arises regarding how continuous quantities (like lines or solids) can be composed of indivisible quantities (like points). This concept is described as “incomprehensible to us”. An objection is raised that adding indivisibles cannot create a divisible quantity. Galileo addresses this using the idea that while finite numbers of points are limited, lines of different lengths contain an infinite number of points, and one line does not contain more points than another. He draws a parallel between the infinite number of points in a line and the infinite number of “finite parts” that can be assigned to it. The analogy of comparing the number of all integers to the number of squared integers highlights the counterintuitive nature of dealing with infinity. The discussion extends this to argue that bending a straight line into a circle (a polygon with infinite sides) can be seen as reducing the infinite number of points to actuality, similar to how bending into a square actualizes four parts. This concept of indivisibles and the infinite is also used to explain phenomena like contraction without interpenetration of finite parts.
    • Ratio and Proportion: These are central to quantifying physical relationships. The strengths of beams, cylinders, and prisms are expressed as ratios of their dimensions, often involving squares and cubes. The times of descent along inclined planes are compared using ratios, including the inverse ratio of the square roots of their heights. Mean proportionals are frequently used in constructions and proofs involving ratios and squares. The concept of sesquialteral ratio (3:2 power) is applied to the relationship between the volume and surface area of similar solids, and the moments of similar cylinders and prisms.
    • Powers (Squares, Cubes, Roots): Squares and cubes are used to relate the dimensions of objects to their strength or resistance. The moment of a beam’s weight increases with the square of its length. The resistance of cylinders and prisms to fracture is proportional to the cube of their diameter. The concept of squares and roots also appears in the discussion of infinite numbers.
    • Areas and Volumes: Calculations of areas and volumes are used in various geometric proofs, such as demonstrating the equality of a cone and a “bowl” (cylinder minus hemisphere) sliced at different heights. The volume calculation of a parabolic solid is used in the context of optimizing beam shape. The concept of area is also relevant when discussing the surface area of cylinders.
    • Conic Sections (Parabola): The parabola plays a crucial role in the discussion of projectile motion. Galileo demonstrates that a projectile carried by uniform horizontal motion and naturally accelerated vertical motion describes a semi-parabola. Properties of the parabola necessary for this demonstration are explicitly proven. A parabolic shape is also suggested for beams to achieve uniform resistance at all points, reducing weight without diminishing strength.
    • Trigonometric Concepts and Tables: In the section on projectile motion, concepts related to angles of elevation and their tangents are used to determine the “altitude” and “sublimity” of parabolas describing projectile paths. A table giving altitudes and sublimities based on the angle of elevation for a constant amplitude is provided, demonstrating the practical application of these concepts. The tangent of an angle is explicitly related to the altitude and sublimity in a constant amplitude scenario.
    • Mathematical Definitions: The text notes that mathematical definitions are simply the “imposition of names” or abbreviations to simplify communication.

    Overall, the sources demonstrate a deep reliance on mathematical principles, particularly geometry and the concepts of ratio, proportion, and powers, to rigorously describe and prove physical phenomena in the emerging science of mechanics and strength of materials. The discussion also touches upon complex foundational issues concerning the nature of continuous quantities and the infinite, highlighting the mathematical challenges inherent in describing the physical world.

    The Physics of Sound and Harmony

    Drawing on the provided excerpts, the dialogue in Two New Sciences delves into the realm of acoustics and sound, presenting explanations for musical phenomena based on the properties of vibrations and ratios. This discussion is treated as a “splendid subject”, and the “Author” (referred to as “our Academician”) has thought much upon it and demonstrated his findings.

    Here are the key points discussed regarding acoustics and sound:

    • Vibrations as the Source of Sound: The fundamental idea is that sound is produced by vibrating bodies. These vibrations cause the surrounding air to vibrate and quiver, creating ripples or pulses that spread through space and strike the tympanum of the ear. The mind then translates this stimulus into sound.
    • Experimental Evidence for Waves: Experiments are described to illustrate these vibrations and waves. Scraping a chisel quickly produces a whistling sound, and simultaneous marks left on the surface are closer together for higher pitches and farther apart for lower pitches. Sounding a glass by rubbing the rim produces vibrations and a tone, and if the glass contains water or sits in a vessel of water, regular ripples are seen spreading from it. These ripples visually represent the waves produced by the sound. Feeling the chisel tremble or a shiver in the hand when it hisses, and feeling motion in the throat when speaking (especially low, strong tones), also provides tactile evidence of these vibrations.
    • Factors Affecting Pitch: Traditionally, the pitch of a string was related to its length. However, the dialogue points out that while shortening a string to half its length (with constant tension and size) produces an octave (2:1 ratio), changing tension or size yields different relationships. To produce an octave by stretching, the required weight must be quadrupled (ratio of 4:1). To produce an octave by changing size (weight per unit length), the size must be reduced to one-fourth (ratio of 4:1). Similarly, the fifth (3:2 ratio by length) requires a squared ratio (9:4) for tension or size (weight). This leads to questioning why philosophers focused solely on the length ratio.
    • Frequency as the True Determinant of Musical Intervals: The crucial insight presented is that the ratio of a musical interval is not immediately determined by the string’s length, size, or tension, but rather by the ratio of their frequencies. This means the ratio of the number of pulses of air waves that strike the ear in a given time.
    • Explanation of Consonance and Dissonance: The different sensations produced by combinations of notes (pleasing or offensive) are explained by the regularity or irregularity of the pulses striking the ear drum.
    • Dissonance arises from discordant or incommensurable vibrations that strike the ear “out of time”. This keeps the ear drum “in perpetual torment, bending in two different directions”. An example given is two strings with a frequency ratio based on the side and diagonal of a square (an incommensurable ratio), producing a harsh dissonance similar to an augmented fourth or diminished fifth.
    • Consonance occurs when pairs of tones strike the ear with a certain regularity. This regularity is due to the pulses delivered by the two tones in the same time interval being commensurable in number.
    • Specific Consonances:Unison (1:1 frequency ratio): Vibrations always coincide, acting like a single string, and thus is not strictly called a consonance.
    • Octave (2:1 frequency ratio): For every pulse from the lower string, the higher string delivers two. This means at every other vibration of the upper string, both pulses arrive simultaneously. Half of the upper string’s pulses are delivered in unison with the lower string’s pulses. It is described as the principal harmony, very like unison, but perhaps “too much softened and lacks fire” due to the regularity and frequency of coinciding pulses.
    • Fifth (3:2 frequency ratio): For every two vibrations of the lower string, the upper one gives three. One-third of the pulses from the upper string strike in unison with the lower. Between each pair of concordant vibrations, two solitary vibrations of the upper string and one solitary vibration of the lower string intervene, separated by equal time intervals. This is described as producing a “tickling of the ear drum” with a mix of softness and sprightliness.
    • Fourth (4:3 frequency ratio): Three solitary vibrations intervene between simultaneous pulses.
    • Second (9:8 frequency ratio): Only every ninth vibration of the upper string coincides with one of the lower string. The numerous discordant pulses in between produce a harsh effect.
    • Pendulum Analogy: The motion of pendulums is used to provide a visual illustration of these concepts. Each pendulum has a definite time of vibration. Suspending pendulums with lengths corresponding to agreeable musical intervals (like 16, 9, and 4 units for vibrations in the ratio 2:3:4, which includes octaves and fifths) shows that their threads cross and align at the same point after a definite number of vibrations, repeating a cycle. If the vibration ratios are incommensurable or return only after a long time/many vibrations, the eye is confused by the disorderly crossing, just as the ear is pained by irregular air waves. Blowing on a pendulum at its natural frequency demonstrates how repeated impulses at the correct timing can build up motion, analogous to how the air waves from one string cause a sympathetically vibrating string to oscillate.

    In summary, the discussion in the sources presents a theory of sound and music grounded in the physics of vibrations, emphasizing the critical role of frequency ratios in determining musical intervals and the regularity of air pulses hitting the ear drum as the basis for consonance and dissonance. This is supported by experimental observations and geometric reasoning, moving beyond explanations based solely on string length.

    Two New Sciences Quiz and Study Guide

    Dialogue Concerning the Two New Sciences Study Guide

    Quiz

    1. According to Salviati, how can one determine the maximum length a wire of a given material can sustain itself without breaking under its own weight?
    2. What experiment does Salviati propose to measure the resistance of a vacuum?
    3. How does Salviati explain the ability of a rope made of short fibers to sustain great force?
    4. What unexpected observation does Salviati make about the relationship between the number of squares and the number of numbers?
    5. How does Salviati suggest one can distinguish between actual and potential division of a line?
    6. What does Salviati propose as the reason why hard substances, when reduced to a fine powder, become fluid-like when suspended in a liquid?
    7. What experiment with a vibrating glass and water does Salviati describe to illustrate the concept of waves produced by a sonorous body?
    8. According to the source material, how is the ratio of a musical octave explained in terms of string length?
    9. When considering the bending strength of prisms and cylinders, what type of resistance is the focus of the discussion?
    10. How does Salviati propose to determine the weight of compressed air compared to water?

    Quiz Answer Key

    1. One can determine the maximum length by fixing one end of a wire and attaching loads to the other until it breaks. By knowing the maximum load supported and the wire’s own weight, one can calculate the length of wire of that same size whose weight equals the breaking load plus the wire’s weight.
    2. Salviati proposes a device involving a hollow cylinder filled with water and a tightly fitting stopper with an attached wire. By hanging a vessel filled with heavy material onto a hook on the wire until the stopper separates from the water’s surface, the weight of the stopper, wire, and vessel with contents represents the force of the vacuum.
    3. Salviati explains that while a single fiber can be easily broken by pulling, when many fibers are tightly bound together, as in a rope, they are grasped by the surrounding medium along their entire length, requiring great force to separate them.
    4. Salviati observes that there are as many squares as there are numbers because every square has a unique root and every root has a unique square, and every number is the root of some square.
    5. Salviati suggests that bending a straight line into a polygon with a finite number of sides brings those parts into actuality. Similarly, bending the line into a polygon with an infinite number of sides (a circle) makes the infinite number of points actual.
    6. Salviati suggests that when a hard substance is reduced to a fine powder, it is resolved into infinitely small, indivisible components, which he believes accounts for why they become fluid when suspended in a liquid.
    7. Salviati describes bowing a viola base string near a goblet of thin glass with the same tone, causing the goblet to vibrate. He also mentions observing ripples spreading in water when a glass containing water is sounded by rubbing the rim.
    8. The source material explains that an octave corresponds to a ratio of two, meaning that sounding half the length of a string after sounding the full length produces the octave.
    9. The discussion focuses on the resistance to fracture (bending strength) when a rod is fastened at right angles into a vertical wall, as opposed to its resistance to longitudinal pull.
    10. Salviati describes two methods, one involving a vessel with compressed air pushing water out, and another involving weighing a vessel with compressed air before and after release, to determine the weight of a known volume of compressed air and compare it to the weight of an equal volume of water.

    Essay Format Questions

    1. Analyze the discussion of finite and infinite division within the text. What are the arguments presented by the characters, and what conclusions does Salviati ultimately draw regarding the nature of continuous quantities?
    2. Discuss the experiments and reasoning presented in the text regarding the weight and specific gravity of air. How does this discussion challenge or support existing ideas (specifically Aristotle’s), and what methods are proposed for empirical investigation?
    3. Explain Salviati’s analysis of musical intervals in terms of vibrating strings. How are ratios applied to string length, tension, and size to produce different intervals, and what does this reveal about the relationship between physical properties and perceived sound?
    4. Examine the concepts of “moment,” “resistance,” and “bending strength” as discussed in the context of the strength of materials. How do these concepts relate to the shape, size, and material of objects, and what principles are derived concerning their ability to withstand fracture?
    5. Describe the experiments and explanations offered by Salviati concerning the motion of falling bodies and projectiles. How are concepts like momentum, velocity, and trajectory discussed, and what mathematical principles, such as the properties of parabolas, are introduced to describe these motions?

    Glossary of Key Terms

    • Momenta / Momentum: Used in several senses throughout the text, including force, speed, impetus, ability, energy, and the quantity of motion or effect acquired by a body. Often associated with the capacity to produce an effect.
    • Forza: Force, also used in the sense of mechanical advantage or the power of percussion.
    • Violenza: Great force, particularly in the context of breaking a rope.
    • Vacuum: A space devoid of matter. The text explores the resistance attributed to a vacuum as a binding force.
    • Cubits: A unit of length.
    • Scantling (corrente): A piece of lumber, smaller than a beam.
    • Beam (trove): A larger piece of lumber.
    • Resistance (resistenza): The capacity of a material or object to withstand forces, particularly breaking strength or bending strength.
    • Resistenza allo strapparsi: Resistance to tearing or pulling apart.
    • Corpulenza: Density.
    • Levity: The quality of being light; the opposite of gravity.
    • Quantita: Quantity, also used to refer to volume.
    • Mole e quantita: Volume and quantity.
    • Monochord: A musical instrument with a single string, used to demonstrate musical intervals.
    • Diapason: The musical interval of an octave.
    • Diapente: The musical interval of a fifth.
    • Sesquialtera ratio: A ratio of 3:2.
    • Dupla sesquiquarta: A ratio of 9:4, which is the square of 3:2.
    • Intervals musici: Musical intervals.
    • Tuono: Tone or pitch.
    • Indivisible: Something that cannot be divided into smaller parts, such as a point.
    • Divisible: Something that can be divided into smaller parts, such as a line or surface.
    • Infinite, in potenza; e finite, in atto: Potentially infinite, and actually finite. Refers to the state of division.
    • Amplitude: In the context of projectile motion, the horizontal distance covered by a projectile.
    • Altitude: In the context of projectile motion or parabolas, the vertical height.
    • Sublimity: A term used in the context of parabolas, related to the horizontal distance traveled for a certain vertical drop or rise.
    • Naturally accelerated motion: Motion in which the velocity increases uniformly with time, such as the motion of a freely falling body.
    • Uniform motion: Motion in which the velocity remains constant.
    • Parabola: A specific curved shape described by projectiles under the influence of gravity.
    • Tangente: Tangent, a line that touches a curve at a single point.
    • Mean proportional: A term from geometry where, in a sequence of three numbers (a, b, c), b is the mean proportional if a/b = b/c (or b² = ac).
    • Ex aequali in proportione perturbata: A term from geometry referring to a compound ratio derived from a sequence of ratios where the order of the terms is altered.
    • Quadrants: In geometry, a quarter of a circle.
    • Annulus: The region between two concentric circles.

    Briefing on Selected Science and Philosophy

    This briefing document summarizes key themes, ideas, and facts presented in the provided excerpts, drawing directly from the text where appropriate. The excerpts primarily focus on concepts in physics and mechanics, presented in a dialogue format between characters named Salviati, Simplicio, and Sagredo.

    I. Strength of Materials and Scaling

    A significant portion of the excerpts is dedicated to exploring the strength of materials and how this strength changes with scale.

    • Scaling and Breaking Strength: A crucial observation is made regarding the inability of larger objects, even if proportionally scaled, to support themselves or external loads in the same manner as smaller ones. This is illustrated with a hypothetical rod that is just able to support its own weight. Salviati states: “…if a hair’s breadth be added to its length it will break under its own weight and will be the only rod of the kind in the world. Thus if, for instance, its length be a hundred times its breadth, you will not be able to find another rod whose length is also a hundred times its breadth and which, like the former, is just able to sustain its own weight and no more: all the larger ones will break while all the shorter ones will be strong enough to support something more than their own weight.” This highlights a fundamental concept of scaling – properties that depend on surface area scale differently than those that depend on volume, leading to limitations as size increases.
    • Comparison of Material Strengths: The text explores how the breaking strength of materials can be quantified and compared. An experiment is described to determine the maximum load a wire can sustain before breaking. This allows for the calculation of the maximum length of a wire or rod of that material that can support its own weight. Salviati explains this with a copper wire example, concluding that “all copper wires, independent of size, can sustain themselves up to a length of 4801 cubits and no more.”
    • Types of Resistance: The discussion differentiates between different types of resistance to fracture. Two main types are mentioned: resistance to longitudinal pull and resistance to bending when supported at one end. Salviati notes that a rod can withstand a significantly larger force when pulled lengthwise compared to when it is subjected to a bending force. He highlights that the second type of resistance is the focus of their investigation, seeking to understand its proportionality in prisms and cylinders of varying dimensions.
    • Mathematical Relationships in Strength: The dialogue delves into the mathematical relationships governing the strength of cylinders and prisms. Several propositions are presented relating the resistance to fracture (bending strength) to the dimensions of the object.
    • “The resistance [strength] of a cylinder whose length remains constant varies as the third power of its diameter.”
    • “The resistance [strength] of a prism or cylinder of constant length varies in the sesquialteral ratio of its volume.”
    • “Prisms and cylinders which differ in both length and thickness offer resistances to fracture [i. e., can support at their ends loads] which are directly proportional to the cubes of the diameters of their bases and inversely proportional to their lengths.”
    • Largest Possible Size: The discussion addresses the concept of a maximum size for a prism or cylinder that can support its own weight. It is argued that for any given material and proportion, there exists a specific size that represents the boundary between breaking and not breaking under its own weight. “For there must be a prism of a certain size — in my opinion, it is unique and of a definite size — among all prisms — infinite in number — in occupying that boundary line between breaking and not breaking; so that every larger one will break under its own weight, and every smaller one will not break, but will be able to withstand some force in addition to its own weight.”
    • Hollow Solids: The benefit of hollow solids in construction and nature is discussed. It is noted that hollow structures, like the bones of birds or reeds, offer increased strength without proportional increases in weight. This design principle is recognized as being widely employed in both art and nature.

    II. The Nature of Matter and Vacuums

    The concept of a vacuum and its resistance to separation is explored, along with related ideas about the composition of matter.

    • Resistance of the Vacuum: The resistance to the separation of parts in a continuous substance, particularly water, is attributed to the resistance of the vacuum. An experiment is described to measure this force: a cylinder filled with water and a perfectly fitting stopper is inverted, and a weight is attached to the stopper. The weight required to separate the stopper from the water, breaking the continuous column, is presented as a measure of “the force of the vacuum [forza del vacuo].”
    • Limitations of Pumps: The observed limitation of pumps to raise water beyond a certain height (eighteen cubits) is linked to the resistance of the vacuum. This fixed elevation is seen as a constant value, independent of the pump’s size. “That is precisely the way it works; this fixed elevation of eighteen cubits is true for any quantity of water whatever, be the pump large or small or even as fine as a straw.”
    • Vacuum and Material Strength: The concept of the vacuum’s resistance is then applied to the breaking strength of solid materials. The resistance of a material is considered to have components, one of which is related to the vacuum. A method is outlined to determine the portion of a material’s breaking strength that is attributable to the vacuum resistance by comparing it to the weight of a column of water of the same diameter and eighteen cubits in length.
    • Microscopic Vacuums: The dialogue speculates on the possibility of extremely minute vacuums existing within the smallest particles of matter, contributing to the binding force between them. This idea is presented as a “passing thought, still immature and calling for more careful consideration.”
    • Infinite Number of Vacuums: The discussion touches on the philosophical paradox of whether an infinite number of vacuums can exist within a finite extent of metal, linking it to the concept of resolving a continuous quantity into infinitely many indivisible points.

    III. Motion and Velocity

    The excerpts explore various aspects of motion, including falling bodies, motion through different media, and the composition of velocities.

    • Motion in a Vacuum: The controversial idea that in a vacuum, bodies of different weights would fall with the same velocity is introduced, challenging the Aristotelian view. Salviati expresses confidence in this despite Simplicio’s disbelief: “I shall never believe that even in a vacuum, if motion in such a place were possible, a lock of wool and a bit of lead can fall with the same velocity.” Salviati assures Simplicio that he has a proper solution.
    • Resistance of the Medium: The resistance of the medium through which a body moves is acknowledged as a factor affecting its speed. This resistance is related to the density of the medium. The discussion proposes a method for determining the ratio of speeds of a body in different fluid media by considering the difference between the body’s specific gravity and that of the medium.
    • Weight of Air: The question of whether air has weight and how to measure its specific gravity is addressed. An experiment is described involving compressing air into a flask, weighing it, releasing the air, and weighing it again. The weight of the escaped air is then compared to the weight of a volume of water equal to the volume of the escaped air. This experiment is said to show that water is much heavier than air, contrary to some opinions.
    • Terminal Speed: The concept of a terminal speed is discussed, where the resistance of the medium eventually checks the acceleration of a falling body and reduces its motion to uniformity, even for very large or dense objects. “I can assert without hesitation that there is no sphere so large, or composed of material so dense but that the resistance of the medium, although very slight, would check its acceleration and would, in time reduce its motion to uniformity.”
    • Uniformly Accelerated Motion: The concept of uniformly accelerated motion is a key theme. The distance traversed by a freely falling body is stated to vary as the square of the time. This principle is fundamental to the analysis of projectile motion.
    • Composition of Velocities: The combination of horizontal and vertical motions is analyzed, particularly in the context of projectile motion. The concept of compounding two uniform momenta (velocities), one vertical and one horizontal, is presented. The magnitude of the resulting momentum is found by taking the square root of the sum of the squares of the individual momenta. This is presented as a “fixed and certain rule.” This principle is related to the parabolic trajectory of projectiles.

    IV. Sound and Music

    The nature of sound, its transmission, and the physical basis of musical harmony are explored.

    • Sound as Undulations: Sound is described as being produced by the vibrations of a sonorous body, which create undulations that spread through the air and stimulate the ear drum. The experiment with a vibrating glass causing ripples in water is used to illustrate these waves.
    • Musical Intervals and Ratios: The physical basis for musical intervals, such as the octave and the fifth, is discussed in terms of ratios of string lengths, tension, and size. The traditional explanation of intervals based on string lengths is presented (octave as 2:1, fifth as 3:2). However, it is noted that achieving the same intervals by changing tension or size requires squaring these ratios. For example, the fifth requires a tension ratio of (3/2)^2 = 9/4.
    • Synchronization of Pulses: The physical phenomenon of musical harmony is linked to the simultaneous arrival of pulses from vibrating sources at the ear drum. Consonant intervals like the octave and the fifth are explained by the regular synchronization of these pulses. The octave (2:1 ratio) results in every alternate pulse from one source coinciding with a pulse from the other. The fifth (3:2 ratio) involves a more complex pattern where pulses coincide less frequently, with solitary pulses interspersed between simultaneous ones.
    • Displaced Beats: The characteristic of the fifth is described by its “displaced beats” and the specific pattern of solitary pulses occurring between simultaneous pulses. This pattern is related to the tactile sensation experienced by the ear drum, described as a mix of “a gentle kiss and of a bite.”

    V. Other Scientific and Philosophical Concepts

    Beyond the main themes, the excerpts touch upon other related scientific and philosophical ideas.

    • Indivisibles: The philosophical concept of indivisible points and their role in forming continuous quantities is debated. The idea that a line can be composed of an infinite number of indivisible points is explored, leading to paradoxes related to the comparison of discrete and continuous quantities.
    • Potential vs. Actual Infinity: The distinction between potentially infinite and actually finite quantities is discussed in the context of dividing a continuous line.
    • Levers and Moments: The principle of levers and the concept of “moment” (referring to force or compound force) are introduced in the context of calculating the forces involved in lifting a heavy stone with a lever.
    • Equilibrium: The ability of fish to maintain equilibrium in water through the use of a bladder is described as an example of natural engineering that surpasses human capabilities in some operations.
    • Projectile Motion and Parabolic Trajectory: The trajectory of projectiles is analyzed, and it is mathematically demonstrated that this trajectory follows a parabolic path. The relationship between the horizontal and vertical components of motion is explored, and tables are provided showing the altitudes and sublimities of parabolas for different angles of elevation and initial speeds.
    • Perpetual Motion: The concept of a body maintaining acquired velocity is discussed, although the context of falling bodies and inclined planes indicates a focus on understanding the forces and motions involved rather than advocating for perpetual motion in the strict sense.
    • Measure of Momentum and Speed: Momentum and speed are treated as quantifiable concepts that can be measured and compared. Different scales and units are implicitly or explicitly used in the discussions and calculations.

    This briefing document provides a comprehensive overview of the main themes and important ideas present in the provided excerpts, highlighting key quotes and concepts discussed by the characters.

    Natural Science Explorations

    What are the limitations on the strength of materials based on their size?

    The sources indicate that the strength of a material does not scale directly with its size. A rod that is just strong enough to support its own weight will break if its length is increased by even a hair’s breadth. Furthermore, larger structures made of the same material with the same proportions as smaller ones will not be able to support a proportionally equal load. This is because the resistance to breaking changes based on the dimensions of the object, and this resistance is overcome by the increasing weight or force acting upon it as size increases.

    How is the force of a vacuum explained and measured?

    The force of a vacuum is described as the resistance to the separation of the parts of a continuous substance when there is no other resistance present. An experiment is proposed using a hollow cylinder (preferably of glass) with a tightly fitting stopper. The space between the stopper and the upper end of the cylinder is filled with water, and then the vessel is inverted. By attaching weights to the stopper until it separates from the water, the force of the vacuum can be measured. This resistance is likened to a rope being stretched until it breaks.

    How is the phenomenon of a column of water reaching a fixed maximum height in a pump explained?

    The sources attribute the fixed elevation of eighteen cubits that water reaches in a pump to the resistance of the vacuum. When the water is pulled up, it is stretched, and like a rope, it will break when its weight exceeds a certain limit. This maximum length is consistent regardless of the pump’s size or the amount of water, and the weight of the water column at this height represents the value of the vacuum’s resistance for that specific diameter. This concept allows for the determination of the maximum length various solid materials can be elongated without breaking under their own weight by comparing their breaking strength to the weight of a water column of equivalent diameter and 18 cubits in height.

    What is the discussion regarding the potential infinity of points within a finite extent?

    The sources delve into the paradox of potentially having an infinite number of indivisible points within a finite line or magnitude. This is explored through thought experiments involving the division of a line into increasingly smaller parts, suggesting that the ultimate division could result in an infinite number of indivisible components. The analogy of forming polygons with an increasing number of sides that eventually approximate a circle is used to suggest that a circle, as a polygon with an infinite number of sides, can be considered to have an infinite number of points. This challenges the idea that parts only exist actually after division, proposing that an infinite number of parts can exist potentially within a finite entity and be brought into actuality through transformation (like bending a line into a circle).

    How is sound and musical harmony related to wave phenomena?

    The sources connect sound to the vibration of bodies and the resulting propagation of waves through a medium, like air or water, which are then perceived by the ear. Experiments with vibrating strings and glasses of water are described, illustrating the creation of regular waves. Musical intervals are explained in terms of ratios, but the deeper understanding is found in the frequency of vibrations and the interaction of waves. For example, the octave corresponds to a 2:1 ratio in length but a 4:1 ratio in tension or size of a string. The fifth is linked to a 3:2 ratio in length, but a 9:4 ratio in tension or size. Harmonious sounds are associated with wave pulses that strike the ear drum simultaneously at regular intervals, while less consonant intervals, like the fifth, involve more complex and offset pulse patterns that create a different sensation.

    What is the relationship between the strength of solid figures (prisms and cylinders) and their dimensions?

    The sources analyze the resistance of prisms and cylinders to fracture and bending. It’s shown that the bending strength of a prism or cylinder with a constant length varies as the cube of its diameter. When considering figures that differ in both length and thickness, their resistance to fracture is directly proportional to the cube of their diameter and inversely proportional to their length. A key finding is that if the length and thickness increase in the same proportion, their strength does not remain constant; larger similar figures are weaker relative to their size and will break under their own weight while smaller ones can withstand additional force. There exists a maximum size for a similar figure that can just support its own weight.

    How does resistance of the medium affect the motion of falling bodies?

    The sources acknowledge that the resistance of the medium (like air or water) affects the motion of falling bodies. While a body’s acquired velocity tends to be maintained, the medium’s resistance acts as a cause of retardation, particularly at higher velocities. The density of the medium plays a role, with denser media causing greater resistance. The sources explore how to quantify this resistance and its effect on speed, noting that differences in speed are more pronounced for bodies of different substances moving through the same medium or for the same body moving through different media. The specific gravity of air is also discussed and experimentally determined relative to water to further understand its effect on motion.

    How are projectile trajectories analyzed, and what quantities are used to describe their motion?

    The sources demonstrate that the trajectory of a projectile can be described as a parabola. This motion is understood as a compound of a uniform horizontal velocity and a naturally accelerated vertical velocity (due to gravity). Quantities like space, time, and momentum are employed to analyze this motion. The vertical distance fallen is proportional to the square of the time, while the horizontal distance is proportional to the uniform horizontal speed and the time. The momentum acquired at a certain point is related to the time of fall and the initial velocity. The analysis also involves concepts of “altitude” and “sublimity” of the parabola, relating them to the initial speed and angle of projection to determine the amplitude and height of the trajectory. Tables are provided to relate angles of elevation to the altitudes and sublimities of parabolas for given initial speeds or amplitudes.

    The Original Text

    TWO NEW SCIENCES BY GALILEO

    THE MACMILLAN COMPANY KEW YORK • BOSTON • CHICAGO • DALLAS

    ATLANTA • SAN FRANCISCO

    MACMILLAN & CO., LIMITED LONDON • BOMBAY • CALCUTTA

    MELBOURNE

    THE MACMILLAN CO. OF CANADA, LTD. TORONTO

    GALILEO GALILEI.

    Subterman’s portrait, painted about 1640; now in the Galleria di Pitti at Florence.

    DIALOGUES

    CONCERNINC

    TWO NEW SCIENCES

    BY

    GALILEO GALILEI

    Translated from the Italian and Latin into English by

    HENRY CREW AND ALFONSO DE SALVIO

    of Northwestern University

    WITH AN INTRODUCTION BY

    ANTONIO FAVARO

    of the University of Padua.

    ” I think with vour friend that it has been of late too much the mode to slight the learning of the ancients.” Benjamin Franklin, Phil. Trans.

    6* 445- (I774-)

    Nrro f nrb

    THE MACMILLAN COMPANY 1914

    All rights reserved

    02>

    COPYRIGHT, 1914

    BY THE MACMILLAN COMPANY

    Set up and electrotyped. Published May, 1

    “La Dynamique est la science des forces accelera- trices or retardatrices, et des mouyemens varies qu’elles doivent produire. Cette science est due entierement aux modernes, et Galilee est celui qui en a jete les premiers fondemens.” Lagrange Mec. Anal. I. 221.

    TRANSLATORS’ PREFACE

    OR more than a century English speaking students have been placed in the anomalous position of hearing Galileo constantly re- ferred to as the founder of modern physical

    science, without having any chance to read, in their own language, what Galileo himself has to say. Archimedes has been made available by Heath ;Huygens’ Lighthas been turned into English by Thompson, while Motte has put the Principia of Newton back into the language in which it was conceived. To render the Physics of Galileo also accessible to English and American students is the purpose of the following translation.

    The last of the great creators of the Renaissance was not a prophet without honor in his own time; for it was only one group of his country-men that failed to appreciate him. Even during his life time, his Mechanics had been rendered into French by one of the leading physicists of the world, Mersenne.

    Within twenty-five years after the death of Galileo, his Dia- logues on Astronomy, and those on Two New Sciences, had been

    done into English by Thomas Salusbury and were worthily printed in two handsome quarto volumes. The Two New Sciences, which contains practically all that Galileo has to say on the subject of physics, issued from the English press in 1665.

    vi TRANSLATORS’ PREFACE It is supposed that most of the copies were destroyed in the great London fire which occurred in the year following. We are not aware of any copy in America : even that belonging to the British Museum is an imperfect one.

    Again in 1730 the Two New Sciences was done into English by Thomas Weston; but this book, now nearly two centuries old, is scarce and expensive. Moreover, the literalness with which this translation was made renders many passages either ambiguous or unintelligible to the modern reader. Other than these two, no English version has been made.

    Quite recently an eminent Italian scholar, after spending thirty of the best years of his life upon the subject, has brought to completion the great National Edition of the Works of Galileo. We refer to the twenty superb volumes in which Pro-

    fessor Antonio Favaro of Padua has given a definitive presenta- tion of the labors of the man who created the modern science of

    physics. The following rendition includes neither Le Mechaniche of Galileo nor his paper De Motu Accelerate, since the former of these contains little but the Statics which was current before the time of Galileo, and the latter is essentially included in the Dialogue of the Third Day. Dynamics was the one subject to which under various forms, such as Ballistics, Acoustics, As-

    tronomy, he consistently and persistently devoted his whole life. Into the one volume here translated he seems to have gathered, during his last years, practically all that is of value either to the engineer or the physicist. The historian, the philosopher, and the astronomer will find the other volumes replete with interesting material.

    It is hardly necessary to add that we have strictly followed the text of the National Edition — essentially the Elzevir edition of 1638. All comments and annotations have been omitted save here and there a foot-note intended to economize the reader’s time. To each of these footnotes has been attached the signa-

    ture [Trans.] in order to preserve the original as nearly intact as

    possible. Much of the value of any historical document lies in the lan-

    guage employed, and this is doubly true when one attempts to

    TRANSLATORS’ PREFACE vii trace the rise and growth of any set of concepts such as those employed in modern physics. We have therefore made this translation as literal as is consistent with clearness and modern-

    ity. In cases where there is any important deviation from this rule, and in the case of many technical terms where there is no deviation from it, we have given the original Italian or Latin phrase in italics enclosed in square brackets. The intention here is to illustrate the great variety of terms employed by the early physicists to describe a single definite idea, and conversely, to illustrate the numerous senses in which, then as now, a single word is used. For the few explanatory English words which are placed in square brackets without italics, the translators alone are responsible. The paging of the National Edition is indicated in square brackets inserted along the median line of the page. The imperfections of the following pages would have been

    many more but for the aid of three of our colleagues. Professor D. R. Curtiss was kind enough to assist in the translation of those pages which discuss the nature of Infinity: Professor O. H. Basquin gave valuable help in the rendition of the chapter on Strength of Materials; and Professor O. F. Long cleared up the meaning of a number of Latin phrases.

    To Professor A. Favaro of the University of Padua the trans- lators share, with every reader, a feeling of sincere obligation

    for his Introduction. H. C.

    A. DE S.

    EVANSTON, ILLINOIS, 15 February, 1914.

    INTRODUCTION

    RITING to his faithful friend Elia Diodati,

    Galileo speaks of the “New Sciences ” which he had in mind to print as being “superior to everything else of mine hitherto pub-

    lished”; elsewhere he says “they contain results which I consider the most important

    of all my studies”; and this opinion which he expressed concerning his own work has

    been confirmed by posterity: the “New Sciences” are, indeed, the masterpiece of Galileo who at the time when he made the above remarks had spent upon them more than thirty laborious years.

    One who wishes to trace the history of this remarkable work will find that the great philosopher laid its foundations during

    the eighteen best years of his life — those which he spent at Padua. As we learn from his last scholar, Vincenzio Vivian i, the numerous results at which Galileo had arrived while in this

    city, awakened intense admiration in the friends who had wit- nessed various experiments by means of which he was accus-

    tomed to investigate interesting questions in physics. Fra Paolo Sarpi exclaimed: To give us the Science of Motion, God and Nature have joined hands and created the intellect of Galileo.

    And when the “New Sciences” came from the press one of his foremost pupils, Paolo Aproino, wrote that the volume contained

    much which he had “already heard from his own lips” during student days at Padua.

    Limiting ourselves to only the more important documents which might be cited in support of our statement, it will suffice to mention the letter, written to Guidobaldo del Monte on the 29th of November, 1602, concerning the descent of heavy bodies

    x INTRODUCTION

    along the arcs of circles and the chords subtended by them; that to Sarpi, dated i6th of October, 1604, dealing with the free fall

    of heavy bodies; the letter to Antonio de’ Medici on the nth

    of February, 1609, in which he states that he has “completed all the theorems and demonstrations pertaining to forces and re-

    sistances of beams of various lengths, thicknesses and shapes, proving that they are weaker at the middle than near the ends, that they can carry a greater load when that load is distributed throughout the length of the beam than when concentrated at one point, demonstrating also what shape should be given to a beam in order that it may have the same bending strength at

    every point,” and that he was now engaged “upon some ques- tions dealing with the motion of projectiles”; and finally in the

    letter to Belisario Vinta, dated 7th of May, 1610, concerning his return from Padua to Florence, he enumerates various pieces of work which were still to be completed, mentioning explicitly three books on an entirely new science dealing with the theory of motion. Although at various times after the return to his native state he devoted considerable thought to the work which, even at that date, he had in mind as is shown by certain frag-

    ments which clearly belong to different periods of his life and which have, for the first time, been published in the National Edition; and although these studies were always uppermost in his thought it does not appear that he gave himself seriously to them until after the publication of the Dialogue and the com-

    pletion of that trial which was rightly described as the disgrace of the century. In fact as late as October, 1630, he barely men-

    tions to Aggiunti his discoveries in the theory of motion, and only two years later, in a letter to Marsili concerning the motion of projectiles, he hints at a book nearly ready for publication in which he will treat also of this subject; and only a year after this he writes to Arrighetti that he has in hand a treatise on the resistance of solids.

    But the work was given definite form by Galileo during his enforced residence at Siena: in these five months spent quietly with the Archbishop he himself writes that he has completed

    “a treatise on a new branch of mechanics full of interesting and useful ideas”; so that a few months later he was able to send

    INTRODUCTION xi

    word to Micanzio that the “work was ready”; as soon as his friends learned of this, they urged its publication. It was, how-

    ever, no easy matter to print the work of a man already con- demned by the Holy Office: and since Galileo could not hope to

    print it either in Florence or in Rome, he turned to the faithful Micanzio asking him to find out whether this would be possible in Venice, from whence he had received offers to print the Dia-

    logue on the Principal Systems, as soon as the news had reached there that he was encountering difficulties. At first everything went smoothly; so that Galileo commenced sending to Micanzio some of the manuscript which was received by the latter with an enthusiasm in which he was second to none of the warmest admirers of the great philosopher. But when Micanzio con-

    sulted the Inquisitor, he received the answer that there was an express order prohibiting the printing or reprinting of any work of Galileo, either in Venice or in any other place, nullo excepto.

    As soon as Galileo received this discouraging news he began to look with more favor upon offers which had come to him from Germany where his friend, and perhaps also his scholar, Gio-

    vanni Battista Pieroni, was in the service of the Emperor, as military engineer; consequently Galileo gave to Prince Mattia

    de’ Medici who was just leaving for Germany the first two Dia- logues to be handed to Pieroni who was undecided whether to

    publish them at Vienna or Prague or at some place in Moravia; in the meantime, however, he had obtained permission to print both at Vienna and at Olmiitz. But Galileo recognized danger at every point within reach of the long arm of the Court of Rome; hence, availing himself of the opportunity offered by the arrival of Louis Elzevir in Italy in 1636, also of the friendship between the latter and Micanzio, not to mention a visit at Arcetri, he decided to abandon all other plans and entrust to the Dutch publisher the printing of his new work the manu-

    script of which, although not complete, Elzevir took with him on his return home.

    In the course of the year 1637, the printing was finished, and at the beginning of the following year there was lacking only the index, the title-page and the dedication. This last had,

    xii INTRODUCTION

    through the good offices of Diodati, been offered to the Count of Noailles, a former scholar of Galileo at Padua, and since 1634 ambassador of France at Rome, a man who did much to alleviate the distressing consequences of the celebrated trial; and the offer was gratefully accepted. The phrasing of the dedication deserves brief comment. Since Galileo was aware, on the one hand, of the prohibition against the printing of his works and since, on the other hand, he did not wish to irritate the Court of Rome from whose hands he was always hoping for complete freedom, he pretends in the dedicatory letter (where, probably through excess of caution, he gives only main outlines) that he had nothing to do with the printing of his book, asserting that he will never again publish any of his researches, and will at most distribute here and there a manuscript copy. He even expresses great surprise that his new Dialogues have fallen into the hands of the Elzevirs and were soon to be published; so that, having been asked to write a dedication, he could think of no man more worthy who could also on this occasion defend him against his enemies. As to the title which reads: Discourses and Mathematical

    Demonstrations concerning Two New Sciences pertaining to Me- chanics and Local Motions, this only is known, namely, that the title is not the one which Galileo had devised and suggested; in fact he protested against the publishers taking the liberty of

    changing it and substituting “a low and common title for the noble and dignified one carried upon the title-page.”

    In reprinting this work in the National Edition, I have fol- lowed the Leyden text of 1638 faithfully but not slavishly, be- cause I wished to utilize the large amount of manuscript ma- terial which has come down to us, for the purpose of correcting

    a considerable number of errors in this first edition, and also for the sake of inserting certain additions desired by the author himself. In the Leyden Edition, the four Dialogues are followed

    by an “Appendix containing some theorems and their proofs, deal- ing with centers of gravity of solid bodies, written by the same

    Author at an earlier date” which has no immediate connection with the subjects treated in the Dialogues; these theorems were

    found by Galileo, as he tells us, “at the age of twenty-two and

    INTRODUCTION xiii

    after two years study of geometry” and were here inserted only to save them from oblivion.

    But it was not the intention of Galileo that the Dialogues on the New Sciences should contain only the four Days and the above-mentioned appendix which constitute the Leyden Edi-

    tion; while, on the one hand, the Elzevirs were hastening the printing and striving to complete it at the earliest possible date, Galileo, on the other hand, kept on speaking of another Day, besides the four, thus embarrassing and perplexing the printers. From the correspondence which went on between author and publisher, it appears that this Fifth Day was to have treated

    “of the force of percussion and the use of the catenary”; but as the typographical work approached completion, the printer became anxious for the book to issue from the press without further delay; and thus it came to pass that the Discorsi e Dimostrazioni appeared containing only the four Days and the Appendix, in spite of the fact that in April, 1638, Galileo had

    plunged more deeply than ever “into the profound question of percussion” and “had almost reached a complete solution.” The “New Sciences” now appear in an edition following the

    text which I, after the most careful and devoted study, deter- mined upon for the National Edition. It appears also in that

    language in which, above all others, I have desired to see it. In this translation, the last and ripest work of the great philosopher makes its first appearance in the New World: if toward this important result I may hope to have contributed in some meas-

    ure I shall feel amply rewarded for having given to this field of research the best years of my life.

    ANTONIO FAVARO. UNIVERSITY OF PADUA,

    2yth of October, 1913.

    D I SC O R S I

    DIMOSTRAZIONI

    MATEMATICHE,

    in tor no a due nttoue {cien^c

    Atcenenci alia MECANICX & i MOVIMENTI LOCALI,

    dclSignor

    GALILEO GALILEI LINCEO, Filofofo e Matematico primario del Sercniflimo

    Grand Duca di Tofcana.

    IN L E I D A,

    Apprcflb gli Elfcvirii. M. D. c. xxxvm.

    [43]

    TO THE MOST ILLUSTRIOUS LORD COUNT OF NOAILLES

    Counsellor of his Most Christian Majesty, Knight of the Order of the Holy Ghost, Field Marshal and Commander,

    Seneschal and Governor of Rouergue, and His

    Majesty’s Lieutenant in Auvergne, my Lord and Worshipful Patron

    OST ILLUSTRIOUS LORD:- In the pleasure which you derive from

    the possession of this work of mine I rec-

    ognize your Lordship’s magnanimity. The disappointment and discouragement I have felt over the ill-fortune which has followed my other books are already known to you. Indeed, I had decided not to publish any

    more of my work. And yet in order to save it from com- plete oblivion, it seemed to me wise to leave a manuscript

    copy in some place where it would be available at least to those who follow intelligently the subjects which I have treated.

    Accordingly I chose first to place my work in your Lordship’s hands, asking no more worthy depository, and believing that, on account of your affection for me, you would have at heart the preservation of my studies and labors. Therefore, when you were returning home from your mission to Rome, I came to pay my respects in person as I had already done many times before by letter. At this meeting I presented to your Lordship a copy of these two works which at that time I happened to have ready. In the gracious reception which you gave these I found assurance

    of

    xviii TO THE COUNT OF NOAILLES

    of their preservation. The fact of your carrying them to France and showing them to friends of yours who are skilled in these sciences gave evidence that my silence was not to be interpreted as complete idleness. A little later, just as I was on the point of

    [44] sending other copies to Germany, Flanders, England, Spain and possibly to some places in Italy, I was notified by the Elzevirs that they had these works of mine in press and that I ought to decide upon a dedication and send them a reply at once. This sudden and unexpected news led me to think that the eagerness of your Lordship to revive and spread my name by passing these works on to various friends was the real cause of their falling into the hands of printers who, because they had already published other works of mine, now wished to honor me with a beautiful and ornate edition of this work. But these writings of mine must have received additional value from the criticism of

    so excellent a judge as your Lordship, who by the union of many virtues has won the admiration of all. Your desire to enlarge the renown of my work shows your unparalleled generos-

    ity and your zeal for the public welfare which you thought would thus be promoted. Under these circumstances it is eminently fitting that I should, in unmistakable terms, grate-

    fully acknowledge this generosity on the part of your Lordship, who has given to my fame wings that have carried it into regions more distant than I had dared to hope. It is, therefore, proper that I dedicate to your Lordship this child of my brain. To this course I am constrained not only by the weight of obliga-

    tion under which you have placed me, but also, if I may so speak, by the interest which I have in securing your Lordship as the defender of my reputation against adversaries who may attack it while I remain under your protection. And now, advancing under your banner, I pay my respects

    to you by wishing that you may be rewarded for these kindnesses by the achievement of the highest happiness and greatness.

    I am your Lordship’s Most devoted Servant, GALILEO GALILEI.

    Arcetri, 6 March, 1638.

    THE PUBLISHER TO THE READER

    INCE society is held together by the mutual services which men render one to another, and since to this end the arts and sciences

    have largely contributed, investigations in these fields have always been held in great esteem and have been highly regarded by our wise forefathers. The larger the utility and excellence of the inventions, the greater has

    been the honor and praise bestowed upon the inventors. Indeed, men have even deified them and have united in the attempt to perpetuate the memory of their benefactors by the bestowal of this supreme honor.

    Praise and admiration are likewise due to those clever in- tellects who, confining their attention to the known, have

    discovered and corrected fallacies and errors in many and many a proposition enunciated by men of distinction and accepted for ages as fact. Although these men have only pointed out falsehood and have not replaced it by truth, they are never-

    theless worthy of commendation when we consider the well- known difficulty of discovering fact, a difficulty which led the prince of orators to exclaim: Utinam tarn facile possem vera

    reperire, quam falsa convincere* And indeed, these latest centuries merit this praise because it is during them that the arts and sciences, discovered by the ancients, have been reduced to so great and constantly increasing perfection through the investigations and experiments of clear-seeing minds. This development is particularly evident in the case of the mathe-

    matical sciences. Here, without mentioning various men who have achieved success, we must without hesitation and with the

    * Cicero, de Natura Deorum, I, 91. [Trans.]

    xx THE PUBLISHER TO THE READER

    unanimous approval of scholars assign the first place to Galileo Galilei, Member of the Academy of the Lincei. This he deserves not only because he has effectively demonstrated fallacies in many of our current conclusions, as is amply shown by his published works, but also because by means of the telescope (invented in this country but greatly perfected by him) he has discovered the four satellites of Jupiter, has shown us the true character of the Milky Way, and has made us acquainted with spots on the Sun, with the rough and cloudy portions of the lunar surface, with the threefold nature of Saturn, with the phases of Venus and with the physical character of comets. These matters were entirely unknown to the ancient astronomers and philosophers; so that we may truly say that he has restored to fhe world the science of astronomy and has presented it in a new light. Remembering that the wisdom and power and goodness of the

    Creator are nowhere exhibited so well as in the heavens and celestial bodies, we can easily recognize the great merit of him who has brought these bodies to our knowledge and has, in spite of their almost infinite distance, rendered them easily visible. For, according to the common saying, sight can teach more and with greater certainty in a single day than can precept even though repeated a thousand times; or, as another says, intuitive knowledge keeps pace with accurate definition.

    But the divine and natural gifts of this man are shown to best advantage in the present work where he is seen to have discovered, though not without many labors and long vigils, two entirely new sciences and to have demonstrated them in a rigid, that is, geometric, manner: and what is even more .re-

    markable in this work is the fact that one of the two sciences

    deals with a subject of never-ending interest, perhaps the most important in nature, one which has engaged the minds of all the great philosophers and one concerning which an extraordinary number of books have been written. I refer to motion [moto locale], a phenomenon exhibiting very many wonderful proper-

    ties, none of which has hitherto been discovered or demonstrated by any one. The other science which he has also developed from

    its

    THE PUBLISHER TO THE READER xxi

    its very foundations deals with the resistance which solid bodies offer to fracfture by external forces [per violenza], a subject of great utility, especially in the sciences and mechanical arts, and one also abounding in properties and theorems not hitherto observed.

    In this volume one finds the first treatment of these two

    sciences, full of propositions to which, as time goes on, able thinkers will add many more; also by means of a large number of clear demonstrations the author points the way to many other theorems as will be readily seen and understood by all in-

    telligent readers.

    TABLE OF CONTENTS

    1 Page

    First new science, treating of the resistance which solid bodies offer to fracture. First Day I

    II

    Concerning the cause of cohesion. Second Day 109

    III

    Second new science, treating of motion [movimenti locali[. Third Day 153

    Uniform motion 154 Naturally accelerated motion 160

    IV

    Violent motions. Projectiles. Fourth Day 244

    V

    Appendix; theorems and demonstrations concerning the centers of gravity of solids 295

    TWO NEW SCIENCES BY GALILEO

    FIRST DAY

    INTERLOCUTORS: SALVIATI, SA-

    GREDO AND SIMPLICIO

    ALV. The constant activity which you Vene- tians display in your famous arsenal suggests

    to the studious mind a large field for investi- gation, especially that part of the work

    which involves mechanics; for in this depart- ment all types of instruments and machines

    are constantly being cons tru (fled by many artisans, among whom there must be some

    who, partly by inherited experience and partly by their own ob- servations, have become highly expert and clever in explanation.

    SAGR. You are quite right. Indeed, I myself, being curious by nature, frequently visit this place for the mere pleasure of observing the work of those who, on account of their superiority

    over other artisans, we call “first rank men.” Conference with them has often helped me in the investigation of certain effects including not only those which are striking, but also those which are recondite and almost incredible. At times also I have been

    put to confusion and driven to despair of ever explaining some- thing for which I could not account, but which my senses told

    me to be true. And notwithstanding the fact that what the old man told us a little while ago is proverbial and commonly accepted, yet it seemed to me altogether false, like many another saying which is current among the ignorant; for I think they introduce these expressions in order to give the appearance of knowing something about matters which they do not understand.

    Salv.

    2 THE TWO NEW SCIENCES OF GALILEO

    [So] SALV. You refer, perhaps, to that last remark of his when we

    asked the reason why they employed stocks, scaffolding and bracing of larger dimensions for launching a big vessel than they do for a small one; and he answered that they did this in order to avoid the danger of the ship parting under its own heavy weight [vasta mole], a danger to which small boats are not subject?

    SAGR. Yes, that is what I mean; and I refer especially to his last assertion which I have always regarded as a false, though current, opinion; namely, that in speaking of these and other similar machines one cannot argue from the small to the large, because many devices which succeed on a small scale do not

    work on a large scale. ‘Now, since mechanics has its foundation in geometry, where mere size cuts no figure, I do not see that the properties of circles, triangles, cylinders, cones and other solid figures will change with their size. If, therefore, a large machine be constructed in such a way that its parts bear to one another the same ratio as in a smaller one, and if the smaller is sufficiently strong for the purpose for which it was designed, I do not see why the larger also should not be able to withstand any severe and destructive tests to which it may be subjected.

    SALV. The common opinion is here absolutely wrong. Indeed, it is so far wrong that precisely the opposite is true, namely, that many machines can be constructed even more perfectly on a large scale than on a small; thus, for instance, a clock which indi-

    cates and strikes the hour can be made more accurate on a large scale than on a small. There are some intelligent people who maintain this same opinion, but on more reasonable grounds, when they cut loose from geometry and argue that the better performance of the large machine is owing to the imperfections and variations of the material. Here I trust you will not charge

    me with arrogance if I say that imperfections in the material, even those which are great enough to invalidate the clearest mathematical proof, are not sufficient to explain the deviations observed between machines in the concrete and in the abstract.

    Yet I shall say it and will affirm that, even if the imperfections

    did

    FIRST DAY 3 did not exist and matter were absolutely perfect, unalterable and free from all accidental variations, still the mere fact that it is matter makes the larger machine, built of the same material and in the same proportion as the smaller, correspond with exactness to the smaller in every respect except that it will not be so strong or so resistant against violent treatment; the larger the machine, the greater its weakness. Since I assume * matter to be unchangeable and always the same, it is clear that we are no less able to treat this constant and invariable property in a rigid manner than if it belonged to simple and pure mathe-

    matics. Therefore, Sagredo, you would do well to change the opinion which you, and perhaps also many other students of mechanics, have entertained concerning the ability of machines and structures to resist external disturbances, thinking that when they are built of the same material and maintain the same

    ratio between parts, they are able equally, or rather propor- tionally, to resist or yield to such external disturbances and

    blows. For we can demonstrate by geometry that the large machine is not proportionately stronger than the small. Finally, we may say that, for every machine and structure, whether artificial or natural, there is set a necessary limit beyond which neither art nor nature can pass; it is here understood, of course, that the material is the same and the proportion preserved. .

    SAGR. My brain already reels. My mind, like a cloud momen- tarily illuminated by a lightning-flash, is for an instant filled

    with an unusual light, which now beckons to me and which.now suddenly mingles and obscures strange, crude ideas. From what you have said it appears to me impossible to build two similar structures of the same material, but of different sizes and have them proportionately strong; and if this were so, it would

    .[52]

    not be possible to find two single poles made of the same wood which shall be alike in strength and resistance but unlike in size.

    SALV. So it is, Sagredo. And to make sure that we understand each other, I say that if we take a wooden rod of a certain length and size, fitted, say, into a wall at right angles, i. e.,

    parallel

    4 THE TWO NEW SCIENCES OF GALILEO

    parallel to the horizon, it may be reduced to such a length that

    it will just support itself; so that if a hair’s breadth be added to its length it will break under its own weight and will be the only rod of the kind in the world.* Thus if, for instance, its length be a hundred times its breadth, you will not be able to find another rod whose length is also a hundred times its breadth and which, like the former, is just able to sustain its own weight and no more: all the larger ones will break while all the shorter ones will be strong enough to support something more than their own weight. And this which I have said about the ability to support itself must be understood to apply also to other tests; so that if a piece of scantling [corrente] will carry the weight of ten similar to itself, a beam [trove] having the same proportions will not be able to support ten similar beams.

    Please observe, gentlemen, how facts which at first seem improbable will, even on scant explanation, drop the cloak which has hidden them and stand forth in naked and simple beauty. Who does not know that a horse falling from a height of three or four cubits will break his bones, while a dog falling from the same height or a cat from a height of eight or ten cubits will suffer no injury? Equally harmless would be the fall of a grasshopper from a tower or the fall of an ant from the distance of the moon. Do not children fall with impunity from heights which would cost their elders a broken leg or perhaps a fractured skull? And just as smaller animals are proportionately stronger and more robust than the larger, so also smaller plants are able to stand up better than larger. I am certain you both know that an oak two hundred cubits [braccia] high would not be able to sustain its own branches if they were distributed as in a tree of ordinary size; and that nature cannot produce a horse as large as twenty ordinary horses or a giant ten times taller than an

    .[53]

    ordinary man unless by miracle or by greatly altering the proportions of his limbs and especially of his bones, which would have to be considerably enlarged over the ordinary. Likewise the current belief that, in the case of artificial machines the very

    * The author here apparently means that the solution is unique. [Trans.]

    FIRST DAY 5 large and the small are equally feasible and lasting is a manifest error. Thus, for example, a small obelisk or column or other solid figure can certainly be laid down or set up without danger of breaking, while the very large ones will go to pieces under die slightest provocation, and that purely on account of their own

    weight. And here I must relate a circumstance which is worthy”^ of your attention as indeed are all events which happen contrary to expectation, especially when a precautionary measure turns out to be a cause of disaster. A large marble column was laid out so that its two ends rested each upon a piece of beam; a little later it occurred to a mechanic that, in order to be doubly sure of its not breaking in the middle by its own weight, it would be wise to lay a third support midway; this seemed to all an excellent idea; but the sequel showed that it was quite the oppo-

    site, for not many months passed before the column was found cracked and broken exactly above the new middle support.

    SIMP. A very remarkable and thoroughly unexpected acci- dent, especially if caused by placing that new support in the

    middle.

    SALV. Surely this is the explanation, and the moment the cause is known our surprise vanishes; for when the two pieces of the column were placed on level ground it was observed that one of the end beams had, after a long while, become decayed and sunken, but that the middle one remained hard and strong, thus causing one half of the column to project in the air without any support. Under these circumstances the body therefore behaved differently from what it would have done if supported only upon the first beams; because no matter how much they might have sunken the column would have gone with them.

    This is an accident which could not possibly have happened to a ~ small column, even though made of the same stone and having a length corresponding to its thickness, i. e., preserving the ratio between thickness and length found in the large pillar.

    [54] SAGR. I am quite convinced of the facts of the case, but I do

    not understand why the strength and resistance are not multi- plied in the same proportion as the material; and I am the more

    puzzled

    6 THE TWO NEW SCIENCES OF GALILEO

    puzzled because, on the contrary, I have noticed in other cases that the strength and resistance against breaking increase in a larger ratio than the amount of material. Thus, for instance, if two nails be driven into a wall, the one which is twice as big as the other will support not only twice as much weight as the other, but three or four times as much.

    SALV. Indeed you will not be far wrong if you say eight times as much; nor does this phenomenon contradict the other even though in appearance they seem so different.

    SAGR. Will you not then, Salviati, remove these difficulties and clear away these obscurities if possible: for I imagine that this problem of resistance opens up a field of beautiful and useful

    ideas; and if you are pleased to make this the subject of to-day’s discourse you will place Simplicio and me under many obliga- tions.

    SALV. I am at your service if only I can call to mind what I

    learned from our Academician * who had thought much upon this subject and according to his custom had demonstrated everything by geometrical methods so that one might fairly call this a new science. For, although some of his conclusions had been reached by others, first of all by Aristotle, these are not the most beautiful and, what is more important, they had not been proven in a rigid manner from fundamental principles. Now, since I wish to convince you by demonstrative reasoning rather than to persuade you by mere probabilities, I shall sup-

    pose that you are familiar with present-day mechanics so far as it is needed in our discussion. First of all it is necessary to consider what happens when a piece of wood or any other solid which coheres firmly is broken; for this is the fundamental fact, involving the first and simple principle which we must take for granted as well known.

    To grasp this more clearly, imagine a cylinder or prism, AB, made of wood or other solid coherent material. Fasten the

    upper end, A, so that the cylinder hangs vertically. To the lower end, B, attach the weight C. It is clear that however great they may be, the tenacity and coherence [tenacita e

    * I. e. Galileo: The author frequently refers to himself under this name. [Trans.]

    FIRST DAY 7

    coerenza] between the parts of this solid, so long as they are not infinite, can be overcome by the pull of the weight C, a weight which can be increased indefinitely until finally the solid breaks like a rope. And as in the case of the rope whose strength we know to be derived from a multitude of hemp threads which compose it, so in the case of the wood, we observe its fibres and filaments run lengthwise and render it much stronger than a hemp rope of the same thickness. But in the case of a stone or metallic cylinder where thej coherence seems to be still greater the cement which holds the parts together must be some-

    thing other than filaments and fibres; and, yet even this can be broken by a strong pull.

    SIMP. If this matter be as you say I can well understand that the fibres of the wood, being as long as the piece of wood itself, render it strong and resistant against large forces tending to break it. But how can one make a rope one hundred cubits long out of hempen fibres which are not more than two or three cubits long, and still give it so much strength ? Besides, I should be glad to hear your opinion as to the manner in which the parts of metal, stone, and other ma-

    terials not showing a filamentous structure are Fig. i put together; for, if I mistake not, they exhibit even greater tenacity.

    SALV. To solve the problems which you raise it will be neces- sary to make a digression into subjects which have little bearing

    upon our present purpose. SAGR. But if, by digressions, we can reach new truth, what

    harm is there in making one now, so that we may not lose this knowledge, remembering that such an opportunity, once omitted, may not return; remembering also that we are not tied down to a fixed and brief method but that we meet solely for our own entertainment? Indeed, who knows but that we may thus

    [56] frequently

    8 THE TWO NEW SCIENCES OF GALILEO

    frequently discover something more interesting and beautiful than the solution originally sought? I beg of you, therefore, to grant the request of Simplicio, which is also mine; for I am no less curious and desirous than he to learn what is the binding material which holds together the parts of solids so that they can scarcely be separated. This information is also needed to understand the coherence of the parts of fibres themselves of which some solids are built up.

    SALV. I am at your service, since you desire it. The first question is, How are fibres, each not more than two or three cubits in length, so tightly bound together in the case of a rope one hundred cubits long that great force [violenza] is required to break it? Now tell me, Simplicio, can you not hold a hempen fibre so

    tightly between your fingers that I, pulling by the other end, would break it before drawing it away from you? Certainly you can. And now when the fibres of hemp are held not only at the ends, but are grasped by the surrounding medium through-

    out their entire length is it not manifestly more difficult to tear them loose from what holds them than to break them? But in

    the case of the rope the very act of twisting causes the threads to bind one another in such a way that when the rope is stretched with a great force the fibres break rather than separate from each other.

    At the point where a rope parts the fibres are, as everyone knows, very short, nothing like a cubit long, as they would be if the parting of the rope occurred, not by the breaking of the filaments, but by their slipping one over the other.

    SAGR. In confirmation of this it may be remarked that ropes sometimes break not by a lengthwise pull but by excessive twisting. This, it seems to me, is a conclusive argument because the threads bind one another so tightly that the compressing fibres do not permit those which are compressed to lengthen the spirals even that little bit by which it is necessary for them to lengthen in order to surround the rope which, on twisting, grows shorter and thicker.

    SALV. You are quite right. Now see how one fact suggests another

    FIRST DAY 9 another. The thread held between the fingers does not yield

    [S7l to one who wishes to draw it away even when pulled with con-

    siderable force, but resists because it is held back by a double compression, seeing that the upper finger presses against the lower as hard as the lower against the upper. Now, if we could retain only one of these pressures there is no doubt that only half the original resistance would remain; but since we are not able, by lifting, say, the upper finger, to remove one of these pressures without also removing the other, it becomes necessary to preserve one of them by means of a new device which causes the thread to press itself against the finger or against some other solid body upon which it rests; and thus it is brought about that the very force which pulls it in order to snatch it away compresses it more and more as the pull increases. This is accomplished by wrapping the thread around the solid in the manner of a spiral; and will be better understood by means of a

    figure. Let AB and CD be two cylinders be- tween which is stretched the thread EF: and

    for the sake of greater clearness we will im- agine it to be a small cord. If these two

    cylinders be pressed strongly together, the cord EF, when drawn by the end F, will un-

    doubtedly stand a considerable pull before it slips between the two compressing solids. But if we remove one of these cylinders the cord, though remaining in contact with the other, will not thereby be prevented from slipping freely. On the other hand, if one holds the cord loosely against the top of the cylinder A, winds it in the spiral form AFLOTR, and then pulls it by the end R, it is evident that the cord will begin to bind the cylinder; the greater the number of spirals the more tightly will the cord be pressed against the cylinder by any given pull. Thus as the number of turns increases, the line of

    contact

    Fig. 2

    io THE TWO NEW SCIENCES OF GALILEO

    contact becomes longer and in consequence more resistant; so that the cord slips and yields to the tractive force with increas-

    ing difficulty.

    [58] Is it not clear that this is precisely the kind of resistance which

    one meets in the case of a thick hemp rope where the fibres form thousands and thousands of similar spirals? And, indeed, the binding effect of these turns is so great that a few short rushes woven together into a few interlacing spirals form one of the strongest of ropes which I believe they call pack rope [jtuftz].

    SAGR. What you say has cleared up two points which I did not previously understand. One fact is how two, or at most three, turns of a rope around the axle of a windlass cannot only hold it fast, but can also prevent it from slipping when pulled by the immense force of the weight [forza del peso] which it sustains; and moreover how, by turning the windlass, this same axle, by mere friction of the rope around it, can wind up and

    lift huge stones while a mere boy is able to handle the slack of the rope. The other fact has to do with a simple but clever device, invented by a young kins-

    man of mine, for the purpose of descending from a window by means of a rope without lacerating the palms of his hands, as had happened to him shortly before and greatly to his discomfort. A small sketch will make this clear. He took a wooden cylinder, AB, about as thick as a walking stick and about one span long: on this he cut a spiral channel of about one turn and a half, and large enough to just receive the rope which he wished to use. Having introduced the rope at the end A and led it out again at the end B, he enclosed both the cylinder and the rope in a case of wood or tin, hinged along the side so that it

    Fig- 3 could be easily opened and closed. After he had fastened the rope to a firm support above, he could, on grasp-

    ing and squeezing the case with both hands, hang by his arms. The pressure on the rope, lying between the case and the cyl-

    inder, was such that he could, at will, either grasp the case

    more

    FIRST DAY ii

    more tightly and hold himself from slipping, or slacken his

    hold and descend as slowly as he wished. [5?]

    SALV. A truly ingenious device! I feel, however, that for a complete explanation other considerations might well enter; yet I must not now digress upon this particular topic since you are waiting to hear what I think about the breaking strength of other materials which, unlike ropes and most woods, do not show a filamentous structure. The coherence of these bodies

    is, in my estimation, produced by other causes which may be grouped under two heads. One is that much-talked-of repug-

    nance which nature exhibits towards a vacuum; but this horror of a vacuum not being sufficient, it is necessary to introduce

    another cause in the form of a gluey or viscous substance which j binds firmly together the component parts of the body.

    First I shall speak of the vacuum, demonstrating by definite experiment the quality and quantity of its force [wrfft]. If you take two highly polished and smooth plates of marble, metal, or glass and place them face to face, one will slide over the other with the greatest ease, showing conclusively that there is noth-

    ing of a viscous nature between them. But when you attempt to separate them and keep them at a constant distance apart, you find the plates exhibit such a repugnance to separation that the upper one will carry the lower one with it and keep it lifted indefinitely, even when the latter is big and heavy.

    This experiment shows the aversion of nature for empty space, even during the brief moment required for the outside air to rush in and fill up the region between the two plates. It is also observed that if two plates are not thoroughly polished, their contact is imperfect so that when you attempt to separate them slowly the only resistance offered is that of weight; if, however, the pull be sudden, then the lower plate rises, but quickly falls back, having followed the upper plate only for that very short interval of time required for the expansion of the small amount of air remaining between the plates, in conse-

    quence of their not fitting, and for the entrance of the surround- ing air. This resistance which is exhibited between the two

    plates

    12 THE TWO NEW SCIENCES OF GALILEO

    plates is doubtless likewise present between the parts of a solid, and enters, at least in part, as a concomitant cause of their coherence.

    SAGR. Allow me to interrupt you for a moment, please; for I want to speak of something which just occurs to me, namely, when I see how the lower plate follows the upper one and how rapidly it is lifted, I feel sure that, contrary to the opinion of many philosophers, including perhaps even Aristotle himself, motion in a vacuum is not instantaneous. If this were so the

    two plates mentioned above would separate without any re- sistance whatever, seeing that the same instant of time would

    suffice for their separation and for the surrounding medium to rush in and fill the vacuum between them. The fact that the lower plate follows the upper one allows us to infer, not only that motion in a vacuum is not instantaneous, but also that, between the two plates, a vacuum really exists, at least for a very short time, sufficient to allow the surrounding medium to rush in and fill the vacuum; for if there were no vacuum there would be no need of any motion in the medium. One must admit then that a vacuum is sometimes produced by violent motion [violenza] or contrary to the laws of nature, (although in my opinion nothing occurs contrary to nature except the impossible,

    . and that never occurs). But here another difficulty arises. While experiment con-

    vinces me of the correctness of this conclusion, my mind is not entirely satisfied as to the cause to which this effect is to be attributed. For the separation of the plates precedes the formation of the vacuum which is produced as a consequence of this separation; and since it appears to me that, in the order of nature, the cause must precede the effect, even though it ap-

    pears to follow in point of time, and since every positive effect must have a positive cause, I do not see how the adhesion of two plates and their resistance to separation — adlual facts — can be referred to a vacuum as cause when this vacuum is yet to follow. According to the infallible maxim of the Philosopher, the non-existent can produce no effect. Simp.

    FIRST DAY 13 SIMP. Seeing that you accept this axiom of Aristotle, I hardly

    think you will reject another excellent and reliable maxim of his, namely, Nature undertakes only that which happens without resistance; and in this saying, it appears to me, you will find the solution of your difficulty. Since nature abhors a vacuum, she prevents that from which a vacuum would follow as a necessary consequence. Thus it happens that nature prevents the separa-

    tion of the two plates. [61]

    SAGR. Now admitting that what Simplicio says is an adequate solution of my difficulty, it seems to me, if I may be allowed to resume my former argument, that this very resistance to a vacuum ought to be sufficient to hold together the parts either of stone or of metal or the parts of any other solid which is knit together more strongly and which is more resistant to separation. If for one effect there be only one cause, or if, more being as-

    signed, they can be reduced to one, then why is not this vacuum which really exists a sufficient cause for all kinds of resistance?

    SALV. I do not wish just now to enter this discussion as to whether the vacuum alone is sufficient to hold together the separate parts of a solid body; but I assure you that the vacuum which acts as a sufficient cause in the case of the two plates is not alone sufficient to bind together the parts of a solid cylinder of marble or metal which, when pulled violently, separates and divides. And now if I find a method of distinguishing this well known resistance, depending upon the vacuum, from every other kind which might increase the coherence, and if I show you that the aforesaid resistance alone is not nearly sufficient for such an effect, will you not grant that we are bound to introduce another cause? Help him, Simplicio, since he does not know what reply to make.

    SIMP. Surely, Sagredo’s hesitation must be owing to another reason, for there can be no doubt concerning a conclusion which is at once so clear and logical.

    SAGR. You have guessed rightly, Simplicio. I was wondering whether, if a million of gold each year from Spain were not sufficient to pay the army, it might not be necessary to

    make

    i4 THE TWO NEW SCIENCES OF GALILEO make provision other than small coin for the pay of the

    soldiers.* But go ahead, Salviati; assume that I admit your conclusion and show us your method of separating the action of the vacuum from other causes; and by measuring it show us how it is not sufficient to produce the effect in question.

    SALV. Your good angel assist you. I will tell you how to separate the force of the vacuum from the others, and after-

    wards how to measure it. For this purpose let us consider a continuous substance whose parts lack all resistance to separa-

    tion except that derived from a vacuum, such as is the case with water, a fact fully demonstrated by our Academician in one of his treatises. Whenever a cylinder of water is subjected to a pull and

    [62]

    offers a resistance to the separation of its parts this can be attrib-

    A R uted to no other cause than the resistance of the * vacuum. In order to try such an experiment

    I have invented a device which I can better explain by means of a sketch than by mere words. Let CABD represent the cross section of a cylinder either of metal or, preferably, of glass, hollow inside and accurately turned. Into this is introduced a perfectly fitting

    D cylinder of wood, represented in cross section by EGHF, and capable of up-and-down mo-

    tion. Through the middle of this cylinder is bored a hole to receive an iron wire, carrying a hook at the end K, while the upper end of the wire, I, is provided with a conical head. The wooden cylinder is countersunk

    Fig. 4 at the top so as to receive, with a perfect fit, the conical head I of the wire, IK, when pulled down by the end K.

    Now insert the wooden cylinder EH in the hollow cylinder AD, so as not to touch the upper end of the latter but to leave free a space of two or three finger-breadths; this space is to be filled

    * The bearing of this remark becomes clear on reading what Salviati says on p. 1 8 below. [Trans.]

    7—^-^, i

    < t

    G H

    J K 5V

    FIRST DAY 15 with water by holding the vessel with the mouth CD upwards, pushing down on the stopper EH, and at the same time keeping the conical head of the wire, I, away from the hollow portion of the wooden cylinder. The air is thus allowed to escape alongside the iron wire (which does not make a close fit) as soon as one presses down on the wooden stopper. The air having been allowed to escape and the iron wire having been drawn back so that it fits snugly against the conical depression in the wood, invert the vessel, bringing it mouth downwards, and hang on the hook K a vessel which can be filled with sand or any heavy material in quantity sufficient to finally separate the upper surface of the stopper, EF, from the lower surface of the water to which it was attached only by the resistance of the vacuum. Next weigh the stopper and wire together with the attached vessel and its contents; we shall then have the force of the vacuum [forza del vacuo]. If one attaches to a cylinder of marble [63]

    or glass a weight which, together with the weight of the marble or glass itself, is just equal to the sum of the weights before mentioned, and if breaking occurs we shall then be justified in saying that the vacuum alone holds the parts of the marble and glass together; but if this weight does not suffice and if breaking occurs only after adding, say, four times this weight, we shall then be compelled to say that the vacuum furnishes only one fifth of the total resistance [resistenza].

    SIMP. No one can doubt the cleverness of the device; yet it presents many difficulties which make me doubt its reliability. For who will assure us that the air does not creep in between the glass and stopper even if it is well packed with tow or other yielding material ? I question also whether oiling with wax or turpentine will suffice to make the cone, I, fit snugly on its seat. Besides, may not the parts of the water expand and dilate? Why may not the air or exhalations or some other more subtile substances penetrate the pores of the wood, or even of the glass itself?

    SALV. With great skill indeed has Simplicio laid before us the difficulties; and he has even partly suggested how to prevent the

    air

    16 THE TWO NEW SCIENCES OF GALILEO

    air from penetrating the wood or passing between the wood and the glass. But now let me point out that, as our experience in-

    creases, we shall learn whether or not these alleged difficulties really exist. For if, as is the case with air, water is by nature expansible, although only under severe treatment, we shall see the stopper descend; and if we put a small excavation in the upper part of the glass vessel, such as indicated by V, then the air or any other tenuous and gaseous substance, which might penetrate the pores of glass or wood, would pass through the water and collect in this receptacle V. But if these things do not happen we may rest assured that our experiment has been per-

    formed with proper caution; and we shall discover that water does not dilate and that glass does not allow any material, however tenuous, to penetrate it.

    SAGR. Thanks to this discussion, I have learned the cause of a certain effect which I have long wondered at and despaired of understanding. I once saw a cistern which had been provided with a pump under the mistaken impression that the water might thus be drawn with less effort or in greater quantity than by means of the ordinary bucket. The stock of the pump car-

    [64]

    ried its sucker and valve in the upper part so that the water was lifted by attraction and not by a push as is the case with pumps in which the sucker is placed lower down. This pump worked perfectly so long as the water in the cistern stood above a certain level; but below this level the pump failed to work. When I first noticed this phenomenon I thought the machine was out of order; but the workman whom I called in to repair it told me the defect was not in the pump but in the water which had fallen too low to be raised through such a height; and he added that it was not possible, either by a pump or by any other machine working on the principle of attraction, to lift water a

    hair’s breadth above eighteen cubits; whether the pump be large or small this is the extreme limit of the lift. Up to this time I had been so thoughtless that, although I knew a rope, or rod of wood, or of iron, if sufficiently long, would break by its own weight when held by the upper end, it never occurred to me

    that

    FIRST DAY 17

    that the same thing would happen, only much more easily, to a column of water. And really is not that thing which is at-

    tracted in the pump a column of water attached at the upper end and stretched more and more until finally a point is reached where it breaks, like a rope, on account of its excessive weight ?

    SALV. That is precisely the way it works; this fixed elevation of eighteen cubits is true for any quantity of water whatever, be the pump large or small or even as fine as a straw. We may therefore say that, on weighing the water contained in a tube eighteen cubits long, no matter what the diameter, we shall obtain the value of the resistance of the vacuum in a cylinder of any solid material having a bore of this same diameter. And having gone so far, let us see how easy it is to find to what length cylinders of metal, stone, wood, glass, etc., of any diam-

    eter can be elongated without breaking by their own weight. [65]

    Take for instance a copper wire of any length and thickness; fix the upper end and to the other end attach a greater and greater load until finally the wire breaks; let the maximum load be, say, fifty pounds. Then it is clear that if fifty pounds of copper, in addition to the weight of the wire itself which may be, say, Vs ounce, is drawn out into wire of this same size we shall have the greatest length of this kind of wire which can sus-

    tain its own weight. Suppose the wire which breaks to be one cubit in length and Vs ounce in weight; then since it supports 50 Ibs. in addition to its own weight, i. e., 4800 eighths-of-an- ounce, it follows that all copper wires, independent of size, can sustain themselves up to a length of 4801 cubits and no more. Since then a copper rod can sustain its own weight up to a length of 4801 cubits it follows that that part of the breaking strength [resistenza] which depends upon the vacuum, comparing it with the remaining facflors of resistance, is equal to the weight of a rod of water, eighteen cubits long and as thick as the copper rod. If, for example, copper is nine times as heavy as water, the breaking strength [resistenza allo strapparsi] of any copper rod, in so far as it depends upon the vacuum, is equal to the weight of two cubits of this same rod. By a similar method one can

    find

    i8 THE TWO NEW SCIENCES OF GALILEO

    find the maximum length of wire or rod of any material which will just sustain its own weight, and can at the same time dis-

    cover the part which the vacuum plays in its breaking strength. SAGR. It still remains for you to tell us upon what depends

    the resistance to breaking, other than that of the vacuum; what is the gluey or viscous substance which cements together the parts of the solid? For I cannot imagine a glue that will not burn up in a highly heated furnace in two or three months, or certainly within ten or a hundred. For if gold, silver and glass are kept for a long while in the molten state and are removed from the furnace, their parts, on cooling, immediately reunite and bind themselves together as before. Not only so, but whatever difficulty arises with respect to the cementation of the parts of the glass arises also with regard to the parts of the glue; in other words, what is that which holds these parts together so firmly?

    [66] SALV. A little while ago, I expressed the hope that your good

    angel might assist you. I now find myself in the same straits. Experiment leaves no doubt that the reason why two plates cannot be separated, except with violent effort, is that they are held together by the resistance of the vacuum; and the same can be said of two large pieces of a marble or bronze column. This being so, I do not see why this same cause may not explain the coherence of smaller parts and indeed of the very smallest particles of these materials. Now, since each effect must have one true and sufficient cause and since I find no other cement, am I not justified in trying to discover whether the vacuum is not a sufficient cause?

    SIMP. But seeing that you have already proved that the re- sistance which the large vacuum offers to the separation of

    two large parts of a solid is really very small in comparison with that cohesive force which binds together the most minute parts, why do you hesitate to regard this latter as something very different from the former?

    SALV. Sagredo has already [p. 13 above] answered this ques- tion when he remarked that each individual soldier was being

    FIRST DAY 19

    paid from coin collected by a general tax of pennies and farth- ings, while even a million of gold would not suffice to pay the

    entire army. And who knows but that there may be other extremely minute vacua which affect the smallest particles so that that which binds together the contiguous parts is through-

    out of the same mintage? Let me tell you something which has just occurred to me and which I do not offer as an absolute fact, but rather as a passing thought, still immature and calling for more careful consideration. You may take of it what you like; and judge the rest as you see fit. Sometimes when I have ob-

    served how fire winds its way in between the most minute particles of this or that metal and, even though these are solidly cemented together, tears them apart and separates them, and when I have observed that, on removing the fire, these particles reunite with the same tenacity as at first, without any loss of quantity in the case of gold and with little loss in the case of other metals, even though these parts have been separated for a long while, I have thought that the explanation might lie in the fact that the extremely fine particles of fire, penetrating the slender pores of the metal (too small to admit even the finest particles of air or of many other fluids), would fill the small intervening vacua and would set free these small particles from the attraction which these same vacua exert upon them and which prevents their separation. Thus the particles are able to [67]

    move freely so that the mass [massa] becomes fluid and remains so as long as the particles of fire remain inside; but if they depart and leave the former vacua then the original attraction [attraz- zione] returns and the parts are again cemented together.

    In reply to the question raised by Simplicio, one may say that although each particular vacuum is exceedingly minute and therefore easily overcome, yet their number is so extraordinarily great that their combined resistance is, so to speak, multipled almost without limit. The nature and the amount of force

    \Jorza] which results [risulta] from adding together an immense number of small forces [debolissimi momenti] is clearly illus-

    trated by the fact that a weight of millions of pounds, suspended

    by

    20 THE TWO NEW SCIENCES OF GALILEO

    by great cables, is overcome and lifted, when the south wind carries innumerable atoms of water, suspended in thin mist, which moving through the air penetrate between the fibres of the tense ropes in spite of the tremendous force of the hanging weight. When these particles enter the narrow pores they swell the ropes, thereby shorten them, and perforce lift the heavy mass [mole].

    SAGR. There can be no doubt that any resistance, so long as it is not infinite, may be overcome by a multitude of minute forces. Thus a vast number of ants might carry ashore a ship laden with grain. And since experience shows us daily that one ant can easily carry one grain, it is clear that the number of grains in the ship is not infinite, but falls below a certain limit. If you take another number four or six times as great, and if you set to work a corresponding number of ants they will carry the grain ashore and the boat also. It is true that this will call for a prodigious number of ants, but in my opinion this is pre-

    cisely the case with the vacua which bind together the least particles of a metal.

    SALV. But even if this demanded an infinite number would you still think it impossible?

    SAGR. Not if the mass [mole] of metal were infinite; other- wise. . . .

    [68] SALV. Otherwise what? Now since we have arrived at

    paradoxes let us see if we cannot prove that within a finite ex- tent it is possible to discover an infinite number of vacua. At the

    same time we shall at least reach a solution of the most remark- able of all that list of problems which Aristotle himself calls

    wonderful; I refer to his Questions in Mechanics. This solution may be no less clear and conclusive than that which he himself gives and quite different also from that so cleverly expounded by the most learned Monsignor di Guevara.*

    First it is necessary to consider a proposition, not treated by others, but upon which depends the solution of the problem and from which, if I mistake not, we shall derive other new and remarkable facts. For the sake of clearness let us draw an

    * Bishop of Teano; b. 1561, d. 1641. [Trans.]

    FIRST DAY 21

    accurate figure. About G as a center describe an equiangular and equilateral polygon of any number of sides, say the hexagon ABCDEF. Similar to this and concentric with it, describe another smaller one which we shall call HIKLMN. Prolong the

    Y Z

    F Fig. 5

    side AB, of the larger hexagon, indefinitely toward S; in like manner prolong the corresponding side HI of the smaller hex-

    agon, in the same direction, so that the line HT is parallel to AS; and through the center draw the line GV parallel to the other two. This done, imagine the larger polygon to roll upon [69]

    the line AS, carrying with it the smaller polygon. It is evident that, if the point B, the end of the side AB, remains fixed at the beginning of the rotation, the point A will rise and the point C will fall describing the arc CQ until the side BC coincides with the line BQ, equal to BC. But during this rotation the point I, on the smaller polygon, will rise above the line IT because IB is oblique to AS ; and it will not again return to the line IT until the point C shall have reached the position Q. The point I, having described the arc IO above the line HT, will reach the position

    Oat

    22 THE TWO NEW SCIENCES OF GALILEO

    O at the same time the side IK assumes the position OP; but in the meantime the center G has traversed a path above GV and does not return to it until it has completed the arc GC. This step having been taken, the larger polygon has been brought to rest with its side BC coinciding with the line BQ while the side IK of the smaller polygon has been made to coincide with the line OP, having passed over the portion IO without touching it; also the center G will have reached the position C after having traversed all its course above the parallel line GV. And finally the entire figure will assume a position similar to the first, so that if we continue the rotation and come to the next step, the side DC of the larger polygon will coincide with the portion QX and the side KL of the smaller polygon, having first skipped the arc PY, will fall on YZ, while the center still keeping above the line GV will return to it at R after having jumped the interval CR. At the end of one complete rotation the larger polygon will have traced upon the line AS, without break, six lines together equal to its perimeter; the lesser polygon will likewise have imprinted six lines equal to its perimeter, but separated by the interposition of five arcs, whose chords represent the parts of HT not touched by the polygon : the center G never reaches the line GV except at six points. From this it is clear that the space traversed by the smaller polygon is almost equal to that traversed by the larger, that is, the line HT approximates the line AS, differing from it only by the length of one chord of one of these arcs, provided we understand the line HT to include the five skipped arcs. Now this exposition which I have given in the case of these

    hexagons must be understood to be applicable to all other polygons, whatever the number of sides, provided only they are

    [70] similar, concentric, and rigidly connected, so that when the greater one rotates the lesser will also turn however small it may be. You must also understand that the lines described by these two are nearly equal provided we include in the space traversed by the smaller one the intervals which are not touched by any part of the perimeter of this smaller polygon.

    Let

    FIRST DAY 23 Let a large polygon of, say, one thousand sides make one

    complete rotation and thus lay off a line equal to its perimeter; at the same time the small one will pass over an approximately equal distance, made up of a thousand small portions each equal to one of its sides, but interrupted by a thousand spaces which, in contrast with the portions that coincide with the sides of the polygon, we may call empty. So far the matter is free from difficulty or doubt.

    But now suppose that about any center, say A, we describe two concentric and rigidly connected circles; and suppose that from the points C and B, on their radii, there are drawn the tangents CE and BF and that through the center A the line AD is drawn parallel to them, then if the large circle makes one complete rotation along the line BF, equal not only to its cir-

    cumference but also to the other two lines CE and AD, tell me what the smaller circle will do and also what the center will do. As to the center it will certainly traverse and touch the entire line AD while the circumference of the smaller circle will have measured off by its points of contact the entire line CE, just as was done by the above mentioned polygons. The only difference is that the line HT was not at every point in contact with the perimeter of the smaller polygon, but there were left untouched as many vacant spaces as there were spaces coinciding with the sides. But here in the case of the circles the circumference of the smaller one never leaves the line CE, so that no part of the latter is left untouched, nor is there ever a time when some point on the circle is not in contact with the straight line. How now can the smaller circle traverse a length greater than its circumference unless it go by jumps ?

    SAGR. It seems to me that one may say that just as the center of the circle, by itself, carried along the line AD is constantly in contact with it, although it is only a single point, so the points on the circumference of the smaller circle, carried along by the motion of the larger circle, would slide over some small parts of the line CE.

    SALV. There are two reasons why this cannot happen. First because

    24 THE TWO NEW SCIENCES OF GALILEO

    because there is no ground for thinking that one point of con- tact, such as that at C, rather than another, should slip over

    certain portions of the line CE. But if such slidings along CE did occur they would be infinite in number since the points of contact (being mere points) are infinite in number: an infinite number of finite slips will however make an infinitely long line, while as a matter of fact the line CE is finite. The other reason is that as the greater circle, in its rotation, changes its point of contact continuously the lesser circle must do the same because B is the only point from which a straight line can be drawn to A and pass through C. Accordingly the small circle must change its point of contact whenever the large one changes : no point of the small circle touches the straight line CE in more than one point. Not only so, but even in the rotation of the polygons there was no point on the perimeter of the smaller which coin-

    cided with more than one point on the line traversed by that perimeter; this is at once clear when you remember that the line IK is parallel to BC and that therefore IK will remain above IP until BC coincides with BQ, and that IK will not lie upon IP except at the very instant when BC occupies the position BQ; at this instant the entire line IK coincides with OP and immediately afterwards rises above it.

    SAGR. This is a very intricate matter. I see no solution. Pray explain it to us.

    SALV. Let us return to the consideration of the above men- tioned polygons whose behavior we already understand. Now

    in the case of polygons with 100000 sides, the line traversed by the perimeter of the greater, i. e., the line laid down by its 100000 sides one after another, is equal to the line traced out by the 100000 sides of the smaller, provided we include the 100000 vacant spaces interspersed. So in the case of the circles, poly-

    gons having an infinitude of sides, the line traversed by the continuously distributed [continuamente disposti] infinitude of sides is in the greater circle equal to the line laid down by the infinitude of sides in the smaller circle but with the exception that these latter alternate with empty spaces; and since the sides are not finite in number, but infinite, so also are the inter-

    v ning

    FIRST DAY 25

    vening empty spaces not finite but infinite. The line traversed by the larger circle consists then of an infinite number of points which completely fill it; while that which is traced by the smaller circle consists of an infinite number of points which leave empty spaces and only partly fill the line. And here I wish you to observe that after dividing and resolving a line into a finite

    number of parts, that is, into a number which can be counted, it [72] .

    is not possible to arrange them again into a greater length than that which they occupied when they formed a continuum [con- tinuate] and were connected without the interposition of as many empty spaces. But if we consider the line resolved into an infinite number of infinitely small and indivisible parts, we shall be able to conceive the line extended indefinitely by the interposition, not of a finite, but of an infinite number of in-

    finitely small indivisible empty spaces. Now this which has been said concerning simple lines must be

    understood to hold also in the case of surfaces and solid bodies, it being assumed that they are made up of an infinite, not a finite, number of atoms. Such a body once divided into a finite number of parts it is impossible to reassemble them so as to occupy more space than before unless we interpose a finite number of empty spaces, that is to say, spaces free from the substance of which the solid is made. But if we imagine the body, by some extreme and final analysis, resolved into its primary elements, infinite in number, then we shall be able to think of them as indefinitely extended in space, not by the interposition of a finite, but of an infinite number of empty spaces. Thus one can easily imagine a small ball of gold ex-

    panded into a very large space without the introduction of a finite number of empty spaces, always provided the gold is made up of an infinite number of indivisible parts.

    SIMP. It seems to me that you are travelling along toward those vacua advocated by a certain ancient philosopher.

    SALV. But you have failed to add, “who denied Divine Provi- dence,” an inapt remark made on a similar occasion by a cer-

    tain antagonist of our Academician. Simp.

    26 THE TWO NEW SCIENCES OF GALILEO SIMP. I noticed, and not without indignation, the rancor of

    this ill-natured opponent; further references to these affairs I omit, not only as a matter of good form, but also because I know how unpleasant they are to the good tempered and well ordered mind of one so religious and pious, so orthodox and God-fearing as you.

    But to return to our subject, your previous discourse leaves with me many difficulties which I am unable to solve. First among these is that, if the circumferences of the two circles are equal to the two straight lines, CE and BF, the latter con-

    sidered as a continuum, the former as interrupted with an in- finity of empty points, I do not see how it is possible to say that

    the line AD described by the center, and made up of an infinity of points, is equal to this center which is a single point. Besides, this building up of lines out of points, divisibles out of indivisi-

    bles, and finites out of infinites, offers me an obstacle difficult to avoid; and the necessity of introducing a vacuum, so conclu-

    sively refuted by Aristotle, presents the same difficulty. [73]

    SALV. These difficulties are real; and they are not the only ones. But let us remember that we are dealing with infinities and indivisibles, both of which transcend our finite under-

    standing, the former on account of their magnitude, the latter because of their smallness. In spite of this, men cannot refrain from discussing them, even though it must be done in a round-

    about way.

    Therefore I also should like to take the liberty to present some of my ideas which, though not necessarily convincing, would, on account of their novelty, at least, prove somewhat startling. But such a diversion might perhaps carry us too far away from the subject under discussion and might therefore appear to you inopportune and not very pleasing.

    SAGR. Pray let us enjoy the advantages and privileges which come from conversation between friends, especially upon sub-

    jects freely chosen and not forced upon us, a matter vastly different from dealing with dead books which give rise to many doubts but remove none. Share with us, therefore, the thoughts

    FIRST DAY 27

    which our discussion has suggested to you; for since we are free from urgent business there will be abundant time to pursue the topics already mentioned; and in particular the objections raised by Simplicio ought not in any wise to be neglected.

    SALV. Granted, since you so desire. The first question was, How can a single point be equal to a line? Since I cannot do more at present I shall attempt to remove, or at least diminish, one improbability by introducing a similar or a greater one,

    just as sometimes a wonder is diminished by a miracle.* And this I shall do by showing you two equal surfaces, to-

    gether with two equal solids located upon these same surfaces as bases, all four of which diminish continuously and uniformly in such a way that their remainders always preserve equality among themselves, and finally both the surfaces and the solids terminate their previous constant equality by degenerating, the one solid and the one surface into a very long line, the other solid and the other surface into a single point; that is, the latter to one point, the former to an infinite number of points.

    (74)

    SAGR. This proposition appears to me wonderful, indeed;

    but let us hear the explanation and demonstration. SALV. Since the proof is purely geometrical we shall need

    a figure. Let AFB be a semicircle with center at C; about it describe the rectangle ADEB and from the center draw the straight lines CD and CE to the points D and E. Imagine the radius CF to be drawn perpendicular to either of the lines AB or DE, and the entire figure to rotate about this radius as an axis. It is clear that the rectangle ADEB will thus describe a cylinder, the semicircle AFB a hemisphere, and the triangle CDE, a cone. Next let us remove the hemisphere but leave the cone and the rest of the cylinder, which, on account of its shape, we will call a

    “bowl.” First we shall prove that the bowl and the cone are equal; then we shall show that a plane drawn parallel to the circle which forms the base of the bowl and which has the line DE for

    diameter and F for a center — a plane whose trace is GN — cuts the bowl in the points G, I, O, N, and the cone in the points H, L, so that the part of the cone indicated by CHL is always equal to

    * C/. p. 30 below. [Trans.]

    28 THE TWO NEW SCIENCES OF GALILEO

    the part of the bowl whose profile is represented by the triangles GAI and BON. Besides this we shall prove that the base of the cone, i. e., the circle whose diameter is HL, is equal to the circular

    ^ £ B surface which forms the base of this portion of the bowl, or as one might say, equal to a ribbon

    N whose width is GI. (Note by

    the way the nature of mathe- matical definitions which con-

    sist merely in the imposition of

    names or, if you prefer, abbrevi- Fig. 6 ations of speech established and

    introduced in order to avoid the tedious drudgery which you and I now experience simply because we have not agreed

    to call this surface a “circular band” and that sharp solid

    portion of the bowl a “round razor.”) Now call them by [75]

    what name you please, it suffices to understand that the plane, drawn at any height whatever, so long as it is parallel to the base, i. e., to the circle whose diameter is DE, always cuts the two solids so that the portion CHL of the cone is equal to the upper portion of the bowl; likewise the two areas which are the bases of these solids, namely the band and the circle HL, are also equal. Here we have the miracle mentioned above; as the cut-

    ting plane approaches the line AB the portions of the solids cut off are always equal, so also the areas of their bases. And as the cutting plane comes near the top, the two solids (always equal) as well as their bases (areas which are also equal) finally vanish, one pair of them degenerating into the circumference of a circle, the other into a single point, namely, the upper edge of the bowl and the apex of the cone. Now, since as these solids diminish equality is maintained between them up to the very last, we are justified in saying that, at the extreme and final end of this diminution, they are still equal and that one is not infinitely greater than the other. It appears therefore that we may equate the circumference of a large circle to a single point. And this which is true of the solids is true also of the surfaces which

    form

    FIRST DAY 29

    form their bases; for these also preserve equality between them- selves throughout their diminution and in the end vanish, the

    one into the circumference of a circle, the other into a single point. Shall we not then call them equal seeing that they are the last traces and remnants of equal magnitudes? Note also that, even if these vessels were large enough to contain immense celestial hemispheres, both their upper edges and the apexes of the cones therein contained would always remain equal and would vanish, the former into circles having the dimensions of the largest celestial orbits, the latter into single points. Hence in conformity with the preceding we may say that all circum-

    ferences of circles, however different, are equal to each other, and are each equal to a single point.

    SAGR. This presentation strikes me as so clever and novel that, even if I were able, I would not be willing to oppose it; for to deface so beautiful a structure by a blunt pedantic attack would be nothing short of sinful. But for our complete satisfac-

    [76] tion pray give us this geometrical proof that there is always

    equality between these solids and between their bases; for it cannot, I think, fail to be very ingenious, seeing how subtle is the philosophical argument based upon this result.

    SALV. The demonstration is both short and easy. Referring to the preceding figure, since IPC is a right angle the square of the radius 1C is equal to the sum of the squares on the two sides IP, PC; but the radius 1C is equal to AC and also to GP, while CP is equal to PH. Hence the square of the line GP is equal to the sum of the squares of IP and PH, or multiplying through by 4, we have the square of the diameter GN equal to the sum of the squares on IO and HL. And, since the areas of circles are to each other as the squares of their diameters, it follows that the area of the circle whose diameter is GN is equal to the sum of the areas of circles having diameters IO and HL, so that if we remove the common area of the circle having IO for diameter the re-

    maining area of the circle GN will be equal to the area of the circle whose diameter is HL. So much for the first part. As for the other part, we leave its demonstration for the present, partly

    because

    3o THE TWO NEW SCIENCES OF GALILEO because those who wish to follow it will find it in the twelfth

    proposition of the second book of De centra gravitatis solidorum

    by the Archimedes of our age, Luca Valerio,* who made use of it for a different object, and partly because, for our purpose, it suffices to have seen that the above-mentioned surfaces are always equal and that, as they keep on diminishing uniformly, they degenerate, the one into a single point, the other into the circumference of a circle larger than any assignable; in this fa eft lies our miracle. t

    SAGR. The demonstration is ingenious and the inferences drawn from it are remarkable. And now let us hear something concerning the other difficulty raised by Simplicio, if you have anything special to say, which, however, seems to me hardly possible, since the matter has already been so thoroughly dis- cussed.

    SALV. But I do have something special to say, and will first of all repeat what I said a little while ago, namely, that in-

    finity and indivisibility are in their very nature incomprehensi- ble to us; imagine then what they are when combined. Yet if

    [77] we wish to build up a line out of indivisible points, we must take an infinite number of them, and are, therefore, bound to understand both the infinite and the indivisible at the same time. Many ideas have passed through my mind concerning this subject, some of which, possibly the more important, I may not be able to recall on the spur of the moment; but in the course of our discussion it may happen that I shall awaken in you, and especially in Simplicio, objections and difficulties which in turn will bring to memory that which, without such stimulus, would have lain dormant in my mind. Allow me therefore the customary liberty of introducing some of our human fancies, for indeed we may so call them in comparison with supernatural truth which furnishes the one true and safe recourse for deci-

    sion in our discussions and which is an infallible guide in the dark and dubious paths of thought.

    * Distinguished Italian mathematician; born at Ferrara about 1552; admitted to the Accademia dei Lincei 1612; died 1618. [Trans.} f C/. p. 27 above. [Trans.}

    FIRST DAY 31 One of the main objections urged against this building up

    of continuous quantities out of indivisible quantities [continue cT indwisibili\ is that the addition of one indivisible to an-

    other cannot produce a divisible, for if this were so it would render the indivisible divisible. Thus if two indivisibles, say two points, can be united to form a quantity, say a divisible line, then an even more divisible line might be formed by the union of three, five, seven, or any other odd number of points. Since however these lines can be cut into two equal parts, it becomes possible to cut the indivisible which lies exactly in the middle of the line. In answer to this and other objections of the

    same type we reply that a divisible magnitude cannot be con- structed out of two or ten or a hundred or a thousand indivisibles,

    but requires an infinite number of them. SIMP. Here a difficulty presents itself which appears to me

    insoluble. Since it is clear that we may have one line greater than another, each containing an infinite number of points, we are forced to admit that, within one and the same class,

    we may have something greater than infinity, because the in- finity of points in the long line is greater than the infinity of

    points in the short line. This assigning to an infinite quantity a value greater than infinity is quite beyond my comprehension.

    SALV. This is one of the difficulties which arise when we

    attempt, with our finite minds, to discuss the infinite, assigning to it those properties which we give to the finite and limited; but

    [78] this I think is wrong, for we cannot speak of infinite quantities

    as being the one greater or less than or equal to another. To prove this I have in mind an argument which, for the sake of clearness, I shall put in the form of questions to Simplicio who raised this difficulty.

    I take it for granted that you know which of the numbers are squares and which are not.

    SIMP. I am quite aware that a squared number is one which re- sults from the multiplication of another number by itself; thus

    4, 9, etc., are squared numbers which come from multiplying 2, 3, etc., by themselves.

    Salv.

    32 THE TWO NEW SCIENCES OF GALILEO SALV. Very well; and you also know that just as the products

    are called squares so the fadlors are called sides or roots; while on the other hand those numbers which do not consist of two equal fadlors are not squares. Therefore if I assert that all numbers, including both squares and non-squares, are more than the squares alone, I shall speak the truth, shall I not?

    SIMP. Most certainly. SALV. If I should ask further how many squares there are one

    might reply truly that there are as many as the corresponding number of roots, since every square has its own root and every root its own square, while no square has more than one root and no root more than one square.

    SIMP. Precisely so. SALV. But if I inquire how many roots there are, it cannot

    be denied that there are as many as there are numbers because every number is a root of some square. This being granted we must say that there are as many squares as there are num-

    bers because they are just as numerous as their roots, and all the numbers are roots. Yet at the outset we said there are many more numbers than squares, since the larger portion of them are not squares. Not only so, but the proportionate number of squares diminishes as we pass to larger numbers. Thus up to 100 we have 10 squares, that is, the squares constitute i/io part of all the numbers; up to 10000, we find only i/ioo

    [79] part to be squares; and up to a million only i/iooo part; on the other hand in an infinite number, if one could conceive of such a thing, he would be forced to admit that there are as many squares as there are numbers all taken together.

    SAGR. What then must one conclude under these circum- stances ?

    SALV. So far^as I see we can only infer that the totality of all numbers is infinite, that the number of squares is infinite, and that the number of their roots is infinite; neither is the number of squares less than the totality of all numbers, nor the latter greater than the former; and finally the attributes

    “equal,” “greater,” and “less,” are not applicable to infinite,

    but

    FIRST DAY 33

    but only to finite, quantities. When therefore Simplicio in- troduces several lines of different lengths and asks me how it

    is possible that the longer ones do not contain more points than the shorter, I answer him that one line does not contain more or less or just as many points as another, but that each line contains an infinite number. Or if I had replied to him that the points in one line were equal in number to the squares; in another, greater than the totality of numbers; and in the little one, as many as the number of cubes, might I not, indeed, have satisfied him by thus placing more points in one line than in another and yet maintaining an infinite number in each? So much for the first difficulty.

    SAGR. Pray stop a moment and let me add to what has al- ready been said an idea which just occurs to me. If the pre- ceding be true, it seems to me impossible to say either that one

    infinite number is greater than another or even that it is greater than a finite number, because if the infinite number were greater than, say, a million it would follow that on passing from the million to higher and higher numbers we would be approach-

    ing the infinite; but this is not so; on the contrary, the lar- ger the number to which we pass, the more we recede from

    [this property of] infinity, because the greater the numbers the fewer [relatively] are the squares contained in them; but the squares in infinity cannot be less than the totality of all the numbers, as we have just agreed; hence the approach to greater

    and greater numbers means a departure from infinity.* SALV. And thus from your ingenious argument we are led to

    [80]

    conclude that the attributes “larger,” “smaller,” and “equal” have no place either in comparing infinite quantities with each other or in comparing infinite with finite quantities.

    I pass now to another consideration. Since lines and all continuous quantities are divisible into parts which are them-

    selves divisible without end, I do not see how it is possible

    * A certain confusion of thought appears to be introduced here through a failure to distinguish between the number n and the class of the first n numbers; and likewise from a failure to distinguish infinity as a number from infinity as the class of all numbers. [Trans.]

    34 THE TWO NEW SCIENCES OF GALILEO to avoid the conclusion that these lines are built up of an in-

    finite number of indivisible quantities because a division and a subdivision which can be carried on indefinitely presupposes that the parts are infinite in number, otherwise the subdivision would reach an end; and if the parts are infinite in number, we must conclude that they are not finite in size, because an in-

    finite number of finite quantities would give an infinite magni- tude. And thus we have a continuous quantity built up of an

    infinite number of indivisibles. SIMP. But if we can carry on indefinitely the division into

    finite parts what necessity is there then for the introduction of non-finite parts?

    SALV. The very fact that one is able to continue, without end, the division into finite parts [in parti quante] makes it nec-

    essary to regard the quantity as composed of an infinite num- ber of immeasurably small elements [di infiniti non quanti}.

    Now in order to settle this matter I shall ask you to tell me whether, in your opinion, a continuum is made up of a finite or of an infinite number of finite parts [parti quante].

    SIMP. My answer is that their number is both infinite and

    finite; potentially infinite but actually finite [infinite, in po- tenza; e finite, in atto] ; that is to say, potentially infinite before division and actually finite after division; because parts cannot be said to exist in a body which is not yet divided or at least marked out; if this is not done we say that they exist potentially.

    SALV. So that a line which is, for instance, twenty spans long is not said to contain actually twenty lines each one span in length except after division into twenty equal parts; before division it is said to contain them only potentially. Suppose the facts are as you say; tell me then whether, when the division is once made, the size of the original quantity is thereby in-

    creased, diminished, or unaffected. SIMP. It neither increases nor diminishes.

    SALV. That is my. opinion also. Therefore the finite parts [parti quante] in a continuum, whether adlually or potentially present, do not make the quantity either larger or smaller; but it is perfectly clear that, if the number of finite parts actually

    contained

    FIRST DAY 35 contained in the whole is infinite in number, they will make the | magnitude infinite. Hence the number of finite parts, although existing only potentially, cannot be infinite unless the magnitude containing them be infinite; and conversely if the magnitude is ;

    PI] finite it cannot contain an infinite number of finite parts either

    actually or potentially.

    SAGR. How then is it possible to divide a continuum without limit into parts which are themselves always capable of subdivi-

    sion ?

    SALV. This distinction of yours between actual and potential appears to render easy by one method what would be impossible by another. But I shall endeavor to reconcile these mattersx

    in another way; and as to the query whether the finite parts * of a limited continuum [continuo terminate] are finite or in- finite in number I will, contrary to the opinion of Simplicio, ,

    answer that they are neither finite nor infinite. SIMP. This answer would never have occurred to me since I

    did not think that there existed any intermediate step between the finite and the infinite, so that the classification or distinc-

    tion which assumes that a thing must be either finite or infinite is faulty and defective.

    SALV. So it seems to me. And if we consider discrete quanti- ties I think there is, between finite and infinite quantities, a

    third intermediate term which corresponds to every assigned number; so that if asked, as in the present case, whether the finite parts of a continuum are finite or infinite in number the best reply is that they are neither finite nor infinite but corre-

    spond to every assigned number. In order that this may be possible, it is necessary that those parts should not be included within a limited number, for in that case they would not corre-

    spond to a number which is greater; nor can they be infinite in number since no assigned number is infinite; and thus at the pleasure of the questioner we may, to any given line, assign a hundred finite parts, a thousand, a hundred thousand, or indeed any number we may please so long as it be not infinite. I grant, therefore, to the philosophers, that the continuum contains as ‘

    many

    36 THE TWO NEW SCIENCES OF GALILEO

    many finite parts as they please and I concede also that it con- tains them, either actually or potentially, as they may like; but

    I must add that just as a line ten fathoms [canne] in length con- tains ten lines each of one fathom and forty lines each of one

    cubit [braccia] and eighty lines each of half a cubit, etc., so it contains an infinite number of points; call them actual or po-

    tential, as you like, for as to this detail, Simplicio, I defer to your opinion and to your judgment.

    [82] SIMP. I cannot help admiring your discussion; but I fear

    that this parallelism between the points and the finite parts contained in a line will not prove satisfactory, and that you will not find it so easy to divide a given line into an infinite num-

    ber of points as the philosophers do to cut it into ten fathoms or forty cubits; not only so, but such a division is quite impossible to realize in practice, so that this will be one of those poten-

    tialities which cannot be reduced to actuality. SALV. The fact that something can be done only with effort

    or diligence or with great expenditure of time does not render it impossible; for I think that you yourself could not easily divide a line into a thousand parts, and much less if the number of parts were 937 or any other large prime number. But if I were to accomplish this division which you deem impossible as readily as another person would divide the line into forty parts would you then be more willing, in our discussion, to concede the possibility of such a division ?

    SIMP. In general I enjoy greatly your method; and replying to your query, I answer that it would be more than sufficient if it prove not more difficult to resolve a line into points than to divide it into a thousand parts.

    SALV. I will now say something which may perhaps astonish you; it refers to the possibility of dividing a line into its in-

    finitely small elements by following the same order which one employs in dividing the same line into forty, sixty, or a hundred parts, that is, by dividing it into two, four, etc. He who thinks that, by following this method, he can reach an infinite number of points is greatly mistaken; for if this process were followed to eternity

    FIRST DAY 37

    eternity there would still remain finite parts which were un-| divided.

    Indeed by such a method one is very far from reaching the goal of indivisibility; on the contrary he recedes from it and while he thinks that, by continuing this division and by multi-

    plying the multitude of parts, he will approach infinity, he is, in my opinion, getting farther and farther away from it. My reason is this. In the preceding discussion we concluded that, in an infinite number, it is necessary that the squares and cubes should be as numerous as the totality of the natural numbers [tutti i numeri], because both of these are as numerous as their roots which constitute the totality of the natural numbers. Next we saw that the larger the numbers taken the more sparsely distributed were the squares, and still more sparsely the cubes; therefore it is clear that the larger the numbers to which we pass the farther we recede from the infinite number; hence it follows

    . [83] that, since this process carries us farther and farther from the a

    end sought, if on turning back we shall find that any number , can be said to be infinite, it must be unity. Here indeed are I satisfied all those conditions which are requisite for an infinite number; I mean that unity contains in itself as many squares as I there are cubes and natural numbers [tutti i numeri].

    SIMP. I do not quite grasp the meaning of this. SALV. There is no difficulty in the matter because unity is at

    once a square, a cube, a square of a square and all the other powers [dignita] ; nor is there any essential peculiarity in squares or cubes which does not belong to unity; as, for example, the property of two square numbers that they have between them a mean proportional; take any square number you please as the first term and unity for the other, then you will always find a number which is a mean proportional. Consider the two square numbers, 9 and 4; then 3 is the mean proportional between 9 and I ; while 2 is a mean proportional between 4 and I ; between 9 and 4 we have 6 as a mean proportional. A property of cubes is that they must have between them two mean proportional numbers; take 8 and 27; between them lie 12 and 18; while

    between

    38 THE TWO NEW SCIENCES OF GALILEO between I and 8 we have 2 and 4 intervening; and between I and 27 there lie 3 and 9. Therefore we conclude that unity is the only infinite number. These are some of the marvels which our imagination cannot grasp and which should warn us against the serious error of those who attempt to discuss the infinite by assigning to it the same properties which we employ for the finite, the natures of the two having nothing in common. With regard to this subject I must tell you of a remarkable

    property which just now occurs to me and which will explain the vast alteration and change of character which a finite quan-

    tity would undergo in passing to infinity. Let us draw the straight line AB of arbitrary length and let the point C divide it into two unequal parts; then I say that, if pairs of lines be drawn, one from each of the terminal points A and B, and if the ratio between the lengths of these lines is the same as that between AC and CB, their points of intersection will all lie upon

    the circumference of one and the same circle. Thus, for ex- [84]

    ample, AL and BL drawn from A and B, meeting at the point L, bearing to one another the same ratio as AC to BC, and the

    pair AK and BK meeting at K also

    bearing to one an- other the same ratio, and likewise the pairs

    |EAI,BI,AH,BH,AG, BG, AF, BF, AE, BE, have their points of intersection L, K,

    I,H,G, F,E, allly- Fig. 7 ing upon the circum-

    ference of one and the same circle. Accordingly if we imagine the point C to move continuously in such a manner that the lines drawn from it to the fixed terminal points, A and B, always main-

    tain the same ratio between their lengths as exists between the original parts, AC and CB, then the point C will, as I shall pres-

    ently prove, describe a circle. And the circle thus described will increase

    FIRST DAY 39

    increase in size without limit as the point C approaches the mid- dle point which we may call O; but it will diminish in size as C

    approaches the end B. So that the infinite number of points lo- cated in the line OB will, if the motion be as explained above, de- scribe circles of every size, some smaller than the pupil of the eye

    of a flea, others larger than the celestial equator. Now if we move any of the points lying between the two ends O and B they will all describe circles, those nearest O, immense circles; but if we move the point O itself, and continue to move it according to the aforesaid law, namely, that the lines drawn from O to the terminal points, A and B, maintain the same ratio as the original lines AO and OB, what kind of a line will be produced ? A circle will be drawn larger than the largest of the others, a circle which is therefore infinite. But from the point O a straight line will also be drawn perpendicular to BA and extending to infinity with-

    out ever turning, as did the others, to join its last end with its first; for the point C, with its limited motion, having described

    the upper semi-circle, CHE, proceeds to describe the lower semicircle EMC, thus returning to the starting point. But the point O having started to describe its circle, as did all the other points in the line AB, (for the points in the other portion OA describe their circles also, the largest being those nearest the point O) is unable to return to its starting point because the circle it describes, being the largest of all, is infinite; in fact, it describes an infinite straight line as circumference of its infinite circle. Think now what a difference there is between a finite and

    an infinite circle since the latter changes character in such a manner that it loses not only its existence but also its possibility of existence; indeed, we already clearly understand that there can be no such thing as an infinite circle; similarly there can be no infinite sphere, no infinite body, and no infinite surface of

    any shape. Now what shall we say concerning this metamorpho- sis in the transition from finite to infinite? And why should we

    feel greater repugnance, seeing that, in our search after the infinite among numbers we found it in unity? Having broken up a solid into many parts, having reduced it to the finest of

    powder

    4o THE TWO NEW SCIENCES OF GALILEO powder and having resolved it into its infinitely small indivisible atoms why may we not say that this solid has been reduced to a single continuum [un solo continuo] perhaps a fluid like water or mercury or even a liquified metal? And do we not see stones melt into glass and the glass itself under strong heat become more fluid than water?

    SAGR. Are we then to believe that substances become fluid in virtue of being resolved into their infinitely small indivisible components ?

    SALV. I am not able to find any better means of accounting for certain phenomena of which the following is one. When I take a hard substance such as stone or metal and when I reduce it by means of a hammer or fine file to the most minute and impalpable powder, it is clear that its finest particles, although when taken one by one are, on account of their smallness, im-

    perceptible to our sight and touch, are nevertheless finite in size, possess shape, and capability of being counted. It is also true that when once heaped up they remain in a heap ; and if an excavation be made within limits the cavity will remain and the surrounding particles will not rush in to fill it; if shaken the particles come to rest immediately after the external disturbing agent is removed; the same effects are observed in all piles of

    [86] larger and larger particles, of any shape, even if spherical, as is the case with piles of millet, wheat, lead shot, and every other material. But if we attempt to discover such properties in water we do not find them; for when once heaped up it imme-

    diately flattens out unless held up by some vessel or other exter- nal retaining body; when hollowed out it quickly rushes in to fill

    the cavity; and when disturbed it fluctuates for a long time and sends out its waves through great distances.

    Seeing that water has less firmness [consistenza] than the finest of powder, in fact has no consistence whatever, we may, it seems to me, very reasonably conclude that the smallest particles into which it can be resolved are quite different from finite and divisible particles; indeed the only difference I am able to discover is that the former are indivisible. The exquisite

    transparency

    FIRST DAY 41

    transparency of water also favors this view; for the most trans- parent crystal when broken and ground and reduced to powder

    loses its transparency; the finer the grinding the greater the loss; but in the case of water where the attrition is of the highest degree we have extreme transparency. Gold and silver when pulverized with acids [acque forti\ more finely than is possible

    with any file still remain powders,* and do not become fluids until the finest particles [gf indivisibili] of fire or of the rays of the sun dissolve them, as I think, into their ultimate, indivisible, and infinitely small components.

    SAGR. This phenomenon of light which you mention is one which I have many times remarked with astonishment. I have, for instance, seen lead melted instantly by means of a concave mirror only three hands \palmi\ in diameter. Hence I think that if the mirror were very large, well-polished and of a para-

    bolic figure, it would just as readily and quickly melt any other metal, seeing that the small mirror, which was not well polished and had only a spherical shape, was able so energetically to melt lead and burn every combustible substance. Such effects as these render credible to me the marvels accomplished by the mirrors of Archimedes.

    SALV. Speaking of the effects produced by the mirrors of Archimedes, it was his own books (which I had already read and studied with infinite astonishment) that rendered credible to me all the miracles described by various writers. And if any doubt

    had remained the book which Father Buenaventura Cavalierif [87]

    has recently published on the subject of the burning glass [specchio ustorio] and which I have read with admiration would have removed the last difficulty.

    SAGR. I also have seen this treatise and have read it with

    * It is not clear what Galileo here means by saying that gold and silver when treated with acids still remain powders. [Trans.]

    f One of the most active investigators among Galileo’s contemporaries; born at Milan 1598; died at Bologna 1647; a Jesuit father, first to intro-

    duce the use of logarithms into Italy and first to derive the expression for

    the focal length of a lens having unequal radii of curvature. His ” method of indivisibles” is to be reckoned as a precursor of the infinitesimal calculus. [Trans.]

    42 THE TWO NEW SCIENCES OF GALILEO

    pleasure and astonishment; and knowing the author I was con- firmed in the opinion which I had already formed of him that

    he was destined to become one of the leading mathematicians of our age. But now, with regard to the surprising effect of solar rays in melting metals, must we believe that such a furious adlion is devoid of motion or that it is accompanied by the most rapid of motions ?

    SALV. We observe that other combustions and resolutions are

    accompanied by motion, and that, the most rapid; note the ac- tion of lightning and of powder as used in mines and petards;

    note also how the charcoal flame, mixed as it is with heavy and impure vapors, increases its power to liquify metals whenever quickened by a pair of bellows. Hence I do not understand how the adlion of light, although very pure, can be devoid of motion and that of the swiftest type.

    SAGR. But of what kind and how great must we consider this speed of light to be ? Is it instantaneous or momentary or does it like other motions require time? Can we not decide this by experiment?

    SIMP. Everyday experience shows that the propagation of light is instantaneous; for when we see a piece of artillery fired, at great distance, the flash reaches our eyes without lapse of time; but the sound reaches the ear only after a noticeable interval.

    SAGR. Well, Simplicio, the only thing I am able to infer from this familiar bit of experience is that sound, in reaching our ear, travels more slowly than light; it does not inform me whether the coming of the light is instantaneous or whether, although extremely rapid, it still occupies time. An observation of this kind tells us nothing more than one in which it is claimed that

    “As soon as the sun reaches the horizon its light reaches our eyes”; but who will assure me that these rays had not reached this limit earlier than they reached our vision ?

    SALV. The small conclusiveness of these and other similar observations once led me to devise a method by which one might accurately ascertain whether illumination, i. e., the propagation of light, is really instantaneous. The fact that the speed of

    sound

    FIRST DAY 43 [88]

    sound is as high as it is, assures us that the motion of light cannot fail to be extraordinarily swift. The experiment which I devised was as follows :

    Let each of two persons take a light contained in a lantern, or other receptacle, such that by the interposition of the hand, the one can shut oft or admit the light to the vision of the other. Next let them stand opposite each other at a distance of a few cubits and practice until they acquire such skill in uncovering and occulting their lights that the instant one sees the light of his companion he will uncover his own. After a few trials the response will be so prompt that without sensible error [svario] the uncovering of one light is immediately followed by the un-

    covering of the other, so that as soon as one exposes his light he will instantly see that of the other. Having acquired skill at this short distance let the two experimenters, equipped as before, take up positions separated by a distance of two or three miles and let them perform the same experiment at night, noting care-

    fully whether the exposures and occultations occur in the same manner as at short distances; if they do, we may safely conclude that the propagation of light is instantaneous; but if time is required at a distance of three miles which, considering the going of one light and the coming of the other, really amounts to six, then the delay ought to be easily observable. If the experiment is to be made at still greater distances, say eight or ten miles, telescopes may be employed, each observer ad-

    justing one for himself at the place where he is to make the experiment at night; then although the lights are not large and are therefore invisible to the naked eye at so great a distance, they can readily be covered and uncovered since by aid of the telescopes, once adjusted and fixed, they will become easily visible.

    SAGR. This experiment strikes me as a clever and reliable in- vention. But tell us what you conclude from the results.

    SALV. In facl: I have tried the experiment only at a short distance, less than a mile, from which I have not been able to

    ascertain with certainty whether the appearance of the op-

    44 THE TWO NEW SCIENCES OF GALILEO

    posite light was instantaneous or not; but if not instantaneous it is extraordinarily rapid — I should call it momentary; and for the present I should compare it to motion which we see in the lightning flash between clouds eight or ten miles distant from us. We see the beginning of this light — I might say its head and [89]

    source — located at a particular place among the clouds; but it immediately spreads to the surrounding ones, which seems to be an argument that at least some time is required for propagation; for if the illumination were instantaneous and not gradual, we

    should not be able to distinguish its origin — its center, so to speak — from its outlying portions. What a sea we are grad-

    ually slipping into without knowing it! With vacua and in- finities and indivisibles and instantaneous motions, shall we

    ever be able, even by means of a thousand discussions, to reach dry land?

    SAGR. Really these matters lie far beyond our grasp. Just think; when we seek the infinite among numbers we find it in unity; that which is ever divisible is derived from indivisibles; the vacuum is found inseparably connected with the plenum; indeed the views commonly held concerning the nature of these matters are so reversed that even the circumference of a circle turns out to be an infinite straight line, a fact which, if my memory serves me correctly, you, Salviati, were intending to demonstrate geometrically. Please therefore proceed without further digression.

    SALV. I am at your service; but for the sake of greater clear- ness let me first demonstrate the following problem : Given a straight line divided into unequal parts which bear to each other any ratio whatever, to describe a circle such that two straight lines drawn from the ends of the given line to any point on the circumference will bear to each other the same ratio as the two parts of the given line, thus making those lines which are drawn from the same terminal points homologous.

    Let AB represent the given straight line divided into any two unequal parts by the point C; the problem is to describe a circle

    such

    FIRST DAY 45

    such that two straight lines drawn from the terminal points, A and B, to any point on the circumference will bear to each other the same ratio as the part AC bears to BC, so that lines drawn from the same terminal points are homologous. About C as center describe a circle having the shorter part CB of the given line, as radius. Through A draw a straight line AD which

    [90] shall be tangent to the circle at D and indefinitely prolonged

    toward E. Draw the radius CD which will be perpendicular to AE. At B erect a perpendicular to AB; this perpendicular will intersect AE at

    some point since the angle at A is acute; call this point of in-

    tersection E, and from it draw a per-

    pendicular t o AE which will intersect

    AB prolonged in F. Now I say the two straight lines FE and FC are equal. For if we join E and C,

    we shall have two FiS- 8 triangles, DEC and BEC, in which the two sides of the one, DE and EC, are equal to the two sides of the other, BE and EC, both DE and EB being tangents to the circle DB while the bases DC and CB are likewise equal; hence the two angles, DEC and BEC, will be equal. Now since the angle BCE differs from a right angle by the angle CEB, and the angle CEF also differs from a right angle by the angle CED, and since these differences are equal, it follows that the angle FCE

    is equal to CEF ;’ consequently the sides FE and FC are equal. If we describe a circle with F as center and FE as radius it will

    pass through the point C; let CEG be such a circle. This is the circle sought, for if we draw lines from the terminal points A and B to any point on its circumference they will bear to each other

    the

    46 THE TWO NEW SCIENCES OF GALILEO the same ratio as the two portions AC and BC which meet at the point C. This is manifest in the case of the two lines AE and BE, meeting at the point E, because the angle E of the triangle AEB is bisected by the line CE, and therefore AC : CB = AE : BE. The same may be proved of the two lines AG and BG terminat-

    ing in the point G. For since the triangles AFE and EFB are similar, we have AF:FE=EF:FB, or AF:FC=CF:FB, and dividendo AC : CF = CB : BF, or AC : FG = CB : BF ; also com- ponendo we have both AB : BG = CB : BF and AG : GB = CF : FB =AE:EB=AC:BC. Q. E. D.

    Take now any other point in the circumference, say H, where the two lines AH and BH intersect; in like manner we shall have AC: CB=AH: HB. Prolong HB until it meets the circumference at I and join IF; and since we have already found that AB:BG=CB:BF it follows that the rectangle AB.BF is equal to the rectangle CB.BG or IB.BH. Hence AB : BH=IB:BF. But the angles at B are equal and therefore AH:HB=IF:FB=EF:FB=AE:EB.

    Besides, I may add, that it is impossible for lines which main- tain this same ratio and which are drawn from the terminal

    points, A and B, to meet at any point either inside or outside the circle, CEG. For suppose this were possible; let AL and BL be two such lines intersecting at the point L outside the circle: prolong LB till it meets the circumference at M and join MF. If AL:BL=AC:BC=MF:FB, then we shall have two tri-

    angles ALB and MFB which have the sides about the two angles proportional, the angles at the vertex, B, equal, and the two remaining angles, FMB and LAB, less than right angles (because the right angle at M has for its base the entire diameter CG and not merely a part BF: and the other angle at the point A is acute because the line AL, the homologue of AC, is greater than BL, the homologue of BC). From this it follows that the triangles ABL and MBF are similar and therefore AB:BL= MB:BF, making the rectangle AB.BF =MB.BL; but it has been demonstrated that the rectangle AB.BF is equal to CB.BG; whence it would follow that the rectangle MB.BL is equal to the

    rectangle

    FIRST DAY 47 rectangle CB.BG which is impossible; therefore the intersection cannot fall outside the circle. And in like manner we can show that it cannot fall inside; hence all these intersections fall on the circumference.

    But now it is time for us to go back and grant the request of Simplicio by showing him that it is not only not impossible to resolve a line into an infinite number of points but that this is quite as easy as to divide it into its finite parts. This I will do under the following condition which I am sure, Simplicio, you will not deny me, namely, that you will not require me to sep-

    arate the points, one from the other, and show them to you,

    [92] one by one, on this paper; for I should be content that you, without separating the four or six parts of a line from one an-

    other, should show me the marked divisions or at most that you should fold them at angles forming a square or a hexagon: for, then, I am certain you would consider the division distinctly and actually accomplished.

    SIMP. I certainly should. SALV. If now the change which takes place when you bend a

    line at angles so as to form now a square, now an octagon, now a polygon of forty, a hundred or a thousand angles, is sufficient to bring into actuality the four, eight, forty, hundred, and thousand parts which, according to you, existed at first only potentially in the straight line, may I not say, with equal right, that, when I have bent the straight line into a polygon having an infinite number of sides, i. e., into a circle, I have reduced to actuality that infinite number of parts which you claimed, while it was straight, were contained in it only potentially? Nor can one deny that the division into an infinite number of points is just as truly accomplished as the one into four parts when the square is formed or into a thousand parts when the millagon is formed; for in such a division the same conditions are satisfied as in the case of a polygon of a thousand or a hundred thousand sides. Such a polygon laid upon a straight line touches it with one of its sides, i. e., with one of its hundred thousand parts; while the circle which is a polygon of an infinite number of sides

    touches

    48 THE TWO NEW SCIENCES OF GALILEO touches the same straight line with one of its sides which is a

    single point different from all its neighbors and therefore sep- arate and distinct in no less degree than is one side of a polygon

    from the other sides. And just as a polygon, when rolled along a plane, marks out upon this plane, by the successive contacts of its sides, a straight line equal to its perimeter, so the circle rolled upon such a plane also traces by its infinite succession of contacts a straight line equal in length to its own circumference. I am willing, Simplicio, at the outset, to grant to the Peripatetics the truth of their opinion that a continuous quantity [it con-

    tinue] is divisible only into parts which are still further divisible so that however far the division and subdivision be continued no end will be reached; but I am not so certain that they will concede to me that none of these divisions of theirs can be a final one, as is surely the fact, because there always remains

    “another”; the final and ultimate division is rather one which resolves a continuous quantity into an infinite number of in-

    divisible quantities, a result which I grant can never be reached by successive division into an ever-increasing number of parts. But if they employ the method which I propose for separating

    [93] and resolving the whole of infinity \tutta la infinita], at a single stroke (an artifice which surely ought not to be denied me), I think that they would be contented to admit that a continuous quantity is built up out of absolutely indivisible atoms, es-

    pecially since this method, perhaps better than any other, enables us to avoid many intricate labyrinths, such as cohesion in solids, already mentioned, and the question of expansion and contraction, without forcing upon us the objectionable admission of empty spaces [in solids] which carries with it the penetrability of bodies. Both of these objections, it appears to me, are avoided if we accept the above-mentioned view of indivisible con- stituents.

    SIMP. I hardly know what the Peripatetics would say since the views advanced by you would strike them as mostly new, and as such we must consider them. It is however not unlikely that they would find answers and solutions for these problems which

    I?

    FIRST DAY 49 I, for want of time and critical ability, am at present unable to solve. Leaving this to one side for the moment, I should like to hear how the introduction of these indivisible quantities helps us to understand contraction and expansion avoiding at the same time the vacuum and the penetrability of bodies.

    SAGR. I also shall listen with keen interest to this same matter

    which is far from clear in my mind; provided I am allowed to hear what, a moment ago, Simplicio suggested we omit, namely, the reasons which Aristotle offers against the existence of the vacuum and the arguments which you must advance in rebuttal.

    SALV. I will do both. And first, just as, for the production of expansion, we employ the line described by the small circle

    during one rotation of the large one — a line greater than the circumference of the small circle — so, in order to explain con-

    traction, we point out that, during each rotation of the smaller circle, the larger one describes a straight line which is shorter than its circumference.

    For the better understanding of this we proceed to the con- sideration of what happens in the case of polygons. Employing

    [94] a figure similar to the earlier one, construct the two hexagons, ABC and HIK, about the common center L, and let them roll along the parallel lines HOM and ABc. Now holding the vertex I fixed, allow the smaller polygon to rotate until the side IK lies upon the parallel, during which motion the point K will describe the arc KM, and the side KI will coincide with IM. Let us see what, in the meantime, the side CB of the larger polygon has been doing. Since the rotation is about the point I, the terminal point B, of the line IB, moving backwards, will describe the arc Eb underneath the parallel cA so that when the side KI coincides with the line MI, the side BC will coincide with be, having advanced only through the distance Br, but having retreated through a portion of the line BA which subtends the arc B&. If we allow the rotation of the smaller polygon to go on it will traverse and describe along its parallel a line equal to its perimeter; while the larger one will traverse and describe a line less than its perimeter by as many times the length bB as there

    are

    5o THE TWO NEW SCIENCES OF GALILEO

    are sides less one; this line is approximately equal to that de- scribed by the smaller polygon exceeding it only by the distance

    bE. Here now we see, without any difficulty, why the larger polygon, when carried by the smaller, does not measure off with its sides a line longer than that traversed by the smaller

    one; this is because a por- tion of each side is super-

    posed upon its immedi- ately preceding neighbor.

    Let us next consider

    two circles, having a com- mon center at A, and ly-

    ing upon their respective

    parallels, the smaller be- ing tangent to its parallel

    at the point B ; the larger, at the point C. Here when the small circle commen-

    ces to roll the point B [95]

    does not remain at rest for a while so as to allow

    Fig. 9 BC to move backward and carry with it the point C, as happened in the case of the polygons, where the point I remained fixed until the side KI coincided with MI and the line IB carried the terminal point B backward as far as b, so that the side BC fell upon be, thus super-

    posing upon the line BA, the portion B&, and advancing by an amount Be, equal to MI, that is, to one side of the smaller polygon. On account of these superpositions, which are the excesses of the sides of the larger over the smaller polygon, each net advance is equal to one side of the smaller polygon and, dur-

    ing one complete rotation, these amount to a straight line equal in length to the perimeter of the smaller polygon.

    But

    FIRST DAY 51 But now reasoning in the same way concerning the circles,

    we must observe that whereas the number of sides in any poly- gon is comprised within a certain limit, the number of sides in a

    circle is infinite; the former are finite and divisible; the latter infinite and indivisible. In the case of the polygon, the vertices remain at rest during an interval of time which bears to the period of one complete rotation the same ratio which one side bears to the perimeter; likewise, in the case of the circles, the delay of each of the infinite number of vertices is merely in-

    stantaneous, because an instant is such a fraction of a finite interval as a point is of a line which contains an infinite number of points. The retrogression of the sides of the larger polygon is not equal to the length of one of its sides but merely to the excess of such a side over one side of the smaller polygon, the net advance being equal to this smaller side; but in the circle, the point or side C, during the instantaneous rest of B, recedes by an amount equal to its excess over the side B, making a net progress equal to B itself. In short the infinite number of indivisible sides of the greater circle with their infinite number of indivisible retrogressions, made during the infinite number of instantaneous delays of the infinite number of vertices of the smaller circle, together with the infinite number of progressions, equal to the infinite number of sides in the smaller circle — all these, I say, add up to a line equal to that described by the smaller circle, a line which contains an infinite number of infinitely small superpositions, thus bringing about a thickening or contraction without any overlapping or interpenetration of finite parts. This result could not be obtained in the case of a line divided

    .

    into finite parts such as is the perimeter of any polygon, which when laid out in a straight line cannot be shortened except by the overlapping and interpenetration of its sides. This contrac-

    tion of an infinite number of infinitely small parts without the interpenetration or overlapping of finite parts and the previously mentioned [p. 70, Nat. Ed.] expansion of an infinite number of indivisible parts by the interposition of indivisible vacua is, in my opinion, the most that can be said concerning the contraction

    and

    52 THE TWO NEW SCIENCES OF GALILEO and rarefaction of bodies, unless we give up the impenetrability of matter and introduce empty spaces of finite size. If you find anything here that you consider worth while, pray use it; if not regard it, together with my remarks, as idle talk; but this remember, we are dealing with the infinite and the indivisible.

    SAGR. I frankly confess that your idea is subtle and that it impresses me as new and strange; but whether, as a matter of fact, nature actually behaves according to such a law I am unable to determine; however, until I find a more satisfactory explanation I shall hold fast to this one. Perhaps Simplicio can tell us something which I have not yet heard, namely, how to explain the explanation which the philosophers have given of this abstruse matter; for, indeed, all that I have hitherto read concerning contraction is so dense and that concerning ex-

    pansion so thin that my poor brain can neither penetrate the former nor grasp the latter.

    SIMP. I am all at sea and find difficulties in following either path, especially this new one; because according to this theory an ounce of gold might be rarefied and expanded until its size would exceed that of the earth, while the earth, in turn, might be condensed and reduced until it would become smaller than a walnut, something which I do not believe; nor do I believe that you believe it. The arguments and demonstrations which you have advanced are mathematical, abstract, and far removed from concrete matter; and I do not believe that when applied to the physical and natural world these laws will hold.

    SALV. I am not able to render the invisible visible, nor do I think that you will ask this. But now that you mention gold, do not our senses tell us that that metal can be immensely ex-

    panded ? I do not know whether you have observed the method [97]

    employed by those who are skilled in drawing gold wire, of which really only the surface is gold, the inside material being silver. The way they draw it is as follows: they take a cylinder or, if you please, a rod of silver, about half a cubit long and three or

    four times as wide as one’s thumb; this rod they cover with gold-leaf which is so thin that it almost floats in air, putting on

    not

    FIRST DAY 53

    not more than eight or ten thicknesses. Once gilded they begin to pull it, with great force, through the holes of a draw-plate; again and again it is made to pass through smaller and smaller holes, until, after very many passages, it is reduced to the

    fineness of a lady’s hair, or perhaps even finer; yet the surface remains gilded. Imagine now how the substance of this gold has been expanded and to what fineness it has been reduced.

    SIMP. I do not see that this process would produce, as a consequence, that marvellous thinning of the substance of the gold which you suggest: first, because the original gilding con-

    sisting of ten layers of gold-leaf has a sensible thickness; secondly, because in drawing out the silver it grows in length but at the same time diminishes proportionally in thickness; and, since one dimension thus compensates the other, the area will not be so increased as to make it necessary during the process of gilding to reduce the thinness of the gold beyond that of the original leaves.

    SALV. You are greatly mistaken, Simplicio, because the sur- face increases directly as the square root of the length, a fact

    which I can demonstrate geometrically. SAGR. Please give us the demonstration not only for my own

    sake but also for Simplicio provided you think we can under- stand it. ,

    SALV. I’ll see if I can recall it on the spur of the moment. At the outset, it is clear that the original thick rod of silver and the wire drawn out to an enormous length are two cylinders of the same volume, since they are the same body of silver. So

    [98] that, if I determine the ratio between the surfaces of cylinders of

    the same volume, the problem will be solved. I say then,

    The areas of cylinders of equal volumes, neglecting the bases, bear to each other a ratio which is the square root of the ratio of their lengths.

    Take two cylinders of equal volume having the altitudes AB and CD, between which the line E is a mean proportional. Then I claim that, omitting the bases of each cylinder, the surface of the cylinder AB is to that of the cylinder CD as the length AB

    is

    54 THE TWO NEW SCIENCES OF GALILEO is to the line E, that is, as the square root of AB is to the square root of CD. Now cut off the cylinder AB at F so that the alti-

    tude AF is equal to CD. Then since the bases of cylinders of equal volume bear to one another the inverse ratio of their heights, it follows that the area of the circular base of the cylinder CD will be to the area of the circular base of AB as the altitude BA is to DC: moreover, since circles are to one another as the squares of their diameters, the said squares will be to each other as BA is to CD. But BA is to CD as the square of » BA is to the square of E : and, therefore, these

    ‘”””””^ four squares will form a proportion; and like- wise their sides; so the line AB is to E as the

    diameter of circle C is to the diameter of the circle A. But the diameters are proportional to the circumferences and the circumferences are proportional to the areas of cylinders of equal height; hence the line AB is to E as the surface of the cylinder CD is to the surface of the cylinder AF. Now since the height AF is to AB as the surface of AF is to the surface of AB; and since the height AB is to the line E as the

    m surface CD is to AF, it follows, ex cequali in Fig. 10 proportione perturbata* that the height AF is

    to E as the surface CD is to the surface AB, and convertendo,

    the surface of the cylinder AB is to the surface of the cyl- inder CD as the line E is to AF, i. e., to CD, or as AB is to

    E which is the square root of the ratio of AB to CD. Q. E. D. If now we apply these results to the case in hand, and assume

    that the silver cylinder at the time of gilding had a length of only half a cubit and a thickness three or four times that of

    [99] one’s thumb, we shall find that, when the wire has been reduced to the fineness of a hair and has been drawn out to a length of twenty thousand cubits (and perhaps more), the area of its surface will have been increased not less than two hundred times. Consequently the ten leaves of gold which were laid on

    * See Euclid, Book V, Def. 20., Todhunter’s Ed., p. 137 (London, 1877.)

    [Trans]

    FIRST DAY 55

    have been extended over a surface two hundred times greater, assuring us that the thickness of the gold which now covers the surface of so many cubits of wire cannot be greater than one twentieth that of an ordinary leaf of beaten gold. Consider now what degree of fineness it must have and whether one could

    conceive it to happen in any other way than by enormous ex- pansion of parts; consider also whether this experiment does not

    suggest that physical bodies [materie fisiche] are composed of infinitely small indivisible particles, a view which is supported by other more striking and conclusive examples.

    SAGR. This demonstration is so beautiful that, even if it does

    not have the cogency originally intended, — although to my mind, it is very forceful — the short time devoted to it has nevertheless been most happily spent.

    SALV. Since you are so fond of these geometrical demonstra- tions, which carry with them distinct gain, I will give you a

    companion theorem which answers an extremely interesting query. We have seen above what relations hold between equal cylinders of different height or length; let us now see what holds when the cylinders are equal in area but unequal in height, understanding area to include the curved surface, but not the upper and lower bases. The theorem is :

    The volumes of right cylinders having equal curved sur- faces are inversely proportional to their altitudes.

    Let the surfaces of the two cylinders, AE and CF, be equal but let the height of the latter, CD, be greater than that of the former, AB : then I say that the volume of the cylinder AE is to that of the cylinder CF as the height CD is to AB. Now since the surface of CF is equal to the surface of AE, it fol-

    lows that the volume of CF is less than that of AE; for, if they

    were equal, the surface of CF would, by the preceding proposi- tion, exceed that of AE, and the excess would be so much the

    greater if the volume of the cylinder CF were greater than that

    [100] of AE. Let us now take a cylinder ID having a volume equal to that of AE; then, according to the preceding theorem, the sur-

    face of the cylinder ID is to the surface of AE as the altitude

    IF

    56 THE TWO NEW SCIENCES OF GALILEO

    IF is to the mean proportional between IF and AB. But since one datum of the problem is that the surface of AE is equal to that of CF, and since the surface ID is to the surface CF as the altitude IF is to the altitude CD, it follows that CD is a

    ^ — ̂ mean proportional between IF and AB. Not only so, but since the volume of the cylinder ID is equal to that of AE, each will bear the same ratio to the volume of

    the cylinder CF; but the volume ID is to the volume CF as the altitude IF is to the altitude CD; hence the volume of AE is to the volume of CF as the length IF is to the length CD, that is, as the length CD is to the length AB. Q. E. D.

    This explains a phenomenon upon which the common people always look with wonder, namely, if we have a piece of stuff which has one side longer than the other, we can make from it a cornsack, using the customary wooden base, which will hold more when the short side of the cloth is used for the height of the sack and the long side is wrapped around the wooden base,

    than with the alternative arrangement. So that, for instance, from a piece of cloth which is six cubits on one side and twelve on the other, a sack can be made which will hold more when the side of twelve cubits is wrapped around the wooden base, leav-

    ing the sack six cubits high than when the six cubit side is put around the base making the sack twelve cubits high. From what has been proven above we learn not only the general fact that one sack holds more than the other, but we also get specific and particular information as to how much more, namely, just in proportion as the altitude of the sack diminishes the contents increase and vice versa. Thus if we use the figures given which make the cloth twice as long as wide and if we use the long side for the seam, the volume of the sack will be just one-half as great as with the opposite arrangement. Likewise

    IJ

    FIRST DAY 57 [101]

    if we have a piece of matting which measures 7 x 25 cubits and make from it a basket, the contents of the basket will, when the

    seam is lengthwise, be seven as compared with twenty-five when the seam runs endwise.

    SAGR. It is with great pleasure that we continue thus to ac- quire new and useful information. But as regards the subject

    just discussed, I really believe that, among those who are not already familiar with geometry, you would scarcely find four per-

    sons in a hundred who would not, at first sight, make the mistake of believing that bodies having equal surfaces would be equal in other respects. Speaking of areas, the same error is made when one attempts, as often happens, to determine the sizes of various cities by measuring their boundary lines, forgetting that the circuit of one may be equal to the circuit of another while the area of the one is much greater than that of the other. And this is true not only in the case of irregular, but also of regular surfaces, where the polygon having the greater number of sides always contains a larger area than the one with the less number of sides, so that finally the circle which is a polygon of an in-

    finite number of sides contains the largest area of all polygons of equal perimeter. I remember with particular pleasure having seen this demonstration when I was studying the sphere of

    Sacrobosco * with the aid of a learned commentary. SALV. Very true! I too came across the same passage which

    suggested to me a method of showing how, by a single short demonstration, one can prove that the circle has the largest content of all regular isoperimetric figures; and that, of other

    [102] figures, the one which has the larger number of sides contains a greater area than that which has the smaller number.

    SAGR. Being exceedingly fond of choice and uncommon propo- sitions, I beseech you to let us have your demonstration.

    SALV. I can do this in a few words by proving the following theorem:

    The area of a circle is a mean proportional between any

    * See interesting biographical note on Sacrobosco [John Holywood] in Ency. Brit., I ith Ed. [Trans.]

    58 THE TWO NEW SCIENCES OF GALILEO

    two regular and similar polygons of which one circum- scribes it and the other is isoperimetric with it. In addition,

    the area of the circle is less than that of any circumscribed polygon and greater than that of any isoperimetric polygon. And further, of these circumscribed polygons, the one which has the greater number of sides is smaller than the one which has a less number; but, on the other hand, that isoperi-

    metric polygon which has the greater number of sides is the larger.

    Let A and B be two similar polygons of which A circumscribes the given circle and B is isoperimetric with it. The area of the circle will then be a mean proportional between the areas of the polygons. For if we indicate the radius of the circle by AC and if we remember that the area of the circle is equal to that of a right-angled triangle in which one of the sides about the right angle is equal to the radius, AC, and the other to the circum-

    ference; and if likewise we remember that the area of the poly- gon A is equal to the area of a right-angled triangle one of [103]

    whose sides about the right angle has the same length as AC and the other is equal to the perimeter of the polygon itself; it is then

    Fig. 12

    manifest^that the circumscribed polygon bears to the circle the same ratio which its perimeter bears to the circumference of the circle, or to the perimeter of the polygon B which is, by hypoth-

    esis, equal to the circumference of the circle. But since the polygons A and B are similar their areas are to each other as the squares of their perimeters; hence the area of the circle A is a

    mean

    FIRST DAY 59 mean proportional between the areas of the two polygons A and B. And since the area of the polygon A is greater than that of the circle A, it is clear that the area of the circle A is greater than that of the isoperimetric polygon B, and is therefore the greatest of all regular polygons having the same perimeter as the circle.

    We now demonstrate the remaining portion of the theorem, which is to prove that, in the case of polygons circumscribing a given circle, the one having the smaller number of sides has a larger area than one having a greater number of sides; but that on the other hand, in the case of isoperimetric polygons, the one having the more sides has a larger area than the one with less sides. To the circle which has O for center and OA

    for radius draw the tangent AD; and on this tangent lay off, say, AD which shall represent one-half of the side of a circum-

    scribed pentagon and AC which shall represent one-half of the side of a heptagon; draw the straight lines OGC and OFD; then with O as a center and OC as radius draw the arc ECI. Now

    since the triangle DOC is greater than the secftor EOC and since the secftor COI is greater than the triangle COA, it follows that the triangle DOC bears to the triangle COA a greater ratio than the secftor EOC bears to the secftor COI, that is, than the secftor

    FOG bears to the secftor GOA. Hence, componendo et per- mutando, the triangle DOA bears to the secftor FOA a greater ratio than that which the triangle COA bears to the secftor GOA, and also 10 such triangles DOA bear to 10 such secftors FOA a greater ratio than 14 such triangles COA bear to 14 such secftors GOA, that is to say, the circumscribed pentagon bears to the circle a greater ratio than does the heptagon. Hence the pentagon exceeds the heptagon in area.

    But now let us assume that both the heptagon and the penta- gon have the same perimeter as that of a given circle. Then I

    say the heptagon will contain a larger area than the pentagon. For since the area of the circle is a mean proportional between areas of the circumscribed and of the isoperimetric pentagons, [104]

    and since likewise it is a mean proportional between the cir- cumscribed

    6o THE TWO NEW SCIENCES OF GALILEO

    cumscribed and isoperimetric heptagons, and since also we have proved that the circumscribed pentagon is larger than the circumscribed heptagon, it follows that this circumscribed pen-

    tagon bears to the circle a larger ratio than does the heptagon, that is, the circle will bear to its isoperimetric pentagon a greater ratio than to its isoperimetric heptagon. Hence the pentagon is smaller than its isoperimetric heptagon. Q. E. D.

    SAGR. A very clever and elegant demonstration! But how did we come to plunge into geometry while discussing the objections urged by Simplicio, objections of great moment, especially that one referring to density which strikes me as particularly difficult?

    SALV. If contraction and expansion [condensazione e rare- fazzione] consist in contrary motions, one ought to find for each great expansion a correspondingly large contraction. But our surprise is increased when, every day, we see enormous expan-

    sions taking pkce almost instantaneously. Think what a tremendous expansion occurs when a small quantity of gun-

    powder flares up into a vast volume of fire! Think too of the almost limitless expansion of the light which it produces! Imagine the contraction which would take place if this fire and this light were to reunite, which, indeed, is not impossible since only a little while ago they were located together in this small

    space. You will find, upon observation, a thousand such expan- sions for they are more obvious than contractions since dense

    matter is more palpable and accessible to our senses. We can take wood and see it go up in fire and light, but we do not see [105]

    them recombine to form wood; we see fruits and flowers and a thousand other solid bodies dissolve largely into odors, but we do not observe these fragrant atoms coming together to form fragrant solids. But where the senses fail us reason must step in; for it will enable us to understand the motion involved in the condensation of extremely rarefied and tenuous substances just as clearly as that involved in the expansion and dissolution of solids. Moreover we are trying to find out how it is possible to produce expansion and contraction in bodies which are capable of such changes without introducing vacua and without giving

    up

    FIRST DAY 61

    up the impenetrability of matter; but this does not exclude the possibility of there being materials which possess no such prop-

    erties and do not, therefore, carry with them consequences which you call inconvenient and impossible. And finally, Simplicio, I have, for the sake of you philosophers, taken pains to find an explanation of how expansion and contraction can take place without our admitting the penetrability of matter and introducing vacua, properties which you deny and dislike; if you were to admit them, I should not oppose you so vigorously. Now either admit these difficulties or accept my views or sug-

    gest something better. SAGR. I quite agree with the peripatetic philosophers in

    denying the penetrability of matter. As to the vacua I should

    like to hear a thorough discussion of Aristotle’s demonstration in which he opposes them, and what you, Salviati, have to say in reply. I beg of you, Simplicio, that you give us the precise proof of the Philosopher and that you, Salviati, give us the reply.

    SIMP. So far as I remember, Aristotle inveighs against the ancient view that a vacuum is a necessary prerequisite for motion and that the latter could not occur without the former.

    In opposition to this view Aristotle shows that it is precisely the phenomenon of motion, as we shall see, which renders untenable the idea of a vacuum. His method is to divide the

    argument into two parts. He first supposes bodies of different weights to move in the same medium; then supposes, one and the same body to move in different media. In the first case, he

    [106] supposes bodies of different weight to move in one and the same medium with different speeds which stand to one another in the same ratio as the weights ; so that, for example, a body which is ten times as heavy as another will move ten times as rapidly as the other. In the second case he assumes that the speeds of one and the same body moving in different media are in inverse ratio to the densities of these media; thus, for instance, if the density of water were ten times that of air, the speed in air would be ten times greater than in water. From this second supposi-

    tion,

    62 THE TWO NEW SCIENCES OF GALILEO

    tion, he shows that, since the tenuity of a vacuum differs in- finitely from that of any medium filled with matter however

    rare, any body which moves in a plenum through a certain space in a certain time ought to move through a vacuum in-

    stantaneously; but instantaneous motion is an impossibility; it is therefore impossible that a vacuum should be produced by motion.

    SALV. The argument is, as you see, ad hominem, that is, it is directed against those who thought the vacuum a prerequisite for motion. Now if I admit the argument to be conclusive and concede also that motion cannot take place in a vacuum, the assumption of a vacuum considered absolutely and not with reference to motion, is not thereby invalidated. But to tell you what the ancients might possibly have replied and in order to

    better understand just how conclusive Aristotle’s demonstra- tion is, we may, in my opinion, deny both of his assumptions.

    And as to the first, I greatly doubt that Aristotle ever tested by experiment whether it be true that two stones, one weighing ten times as much as the other, if allowed to fall, at the same in-

    stant, from a height of, say, 100 cubits, would so differ in speed that when the heavier had reached the ground, the other would not have fallen more than 10 cubits.

    SIMP. His language would seem to indicate that he had tried the experiment, because he says: We see the heavier; now the word see shows that he had made the experiment.

    SAGR. But I, Simplicio, who have made the test can assure

    you that a cannon ball weighing one or two hundred pounds, or even more, will not reach the ground by as much as a span ahead of a musket ball weighing only half a pound, provided both are dropped from a height of 200 cubits.

    SALV. But, even without further experiment, it is possible to prove clearly, by means of a short and conclusive argument, that a heavier body does not move more rapidly than a lighter one provided both bodies are of the same material and in short such as those mentioned by Aristotle. But tell me, Simplicio, whether you admit that each falling body acquires a definite

    speed

    FIRST DAY 63 speed fixed by nature, a velocity which cannot be increased or diminished except by the use of force [violenza] or resistance.

    SIMP. There can be no doubt but that one and the same body moving in a single medium has a fixed velocity which is deter-

    mined by nature and which cannot be increased except by the addition of momentum [impeto] or diminished except by some resistance which retards it.

    SALV. If then we take two bodies whose natural speeds are different, it is clear that on uniting the two, the more rapid one will be partly retarded by the slower, and the slower will be somewhat hastened by the swifter. Do you not agree with me in this opinion ?

    SIMP. You are unquestionably right. SALV. But if this is true, and if a large stone moves with a

    speed of, say, eight while a smaller moves with a speed of four, then when they are united, the system will move with a speed less than eight; but the two stones when tied together make a stone larger than that which before moved with a speed of eight. Hence the heavier body moves with less speed than the lighter; an effect which is contrary to your supposition. Thus you see

    [108] how, from your assumption that the heavier body moves more rapidly than the lighter one, I infer that the heavier body moves more slowly.

    SIMP. I am all at sea because it appears to me that the smaller stone when added to the larger increases its weight and by adding weight I do not see how it can fail to increase its speed or, at least, not to diminish it.

    SALV. Here again you are in error, Simplicio, because it is not true that the smaller stone adds weight to the larger.

    SIMP. This is, indeed, quite beyond my comprehension. SALV. It will not be beyond you when I have once shown you

    the mistake under which you are laboring. Note that it is necessary to distinguish between heavy bodies in motion and the same bodies at rest. A large stone placed in a balance not only acquires additional weight by having another stone placed upon it, but even by the addition of a handful of hemp its weight is

    augmented

    64 THE TWO NEW SCIENCES OF GALILEO augmented six to ten ounces according to the quantity of hemp. But if you tie the hemp to the stone and allow them to fall freely from some height, do you believe that the hemp will press down upon the stone and thus accelerate its motion or do you think the motion will be retarded by a partial upward pressure? One always feels the pressure upon his shoulders when he prevents the motion of a load resting upon him ; but if one descends just as rapidly as the load would fall how can it gravitate or press upon him ? Do you not see that this would be the same as trying to strike a man with a lance when he is run-

    ning away from you with a speed which is equal to, or even greater, than that with which you are following him ? You must therefore conclude that, during free and natural fall, the small stone does not press upon the larger and consequently does not increase its weight as it does when at rest.

    SIMP. But what if we should place the larger stone upon

    the smaller? [109]

    SALV. Its weight would be increased if the larger stone moved more rapidly; but we have already concluded that when the small stone moves more slowly it retards to some extent the speed of the larger, so that the combination of the two, which is a heavier body than the larger of the two stones, would move less rapidly, a conclusion which is contrary to your hypothesis. We infer therefore that large and small bodies move with the same speed provided they are of the same specific gravity.

    SIMP. Your discussion is really admirable; yet I do not find it easy to believe that a bird-shot falls as swiftly as a cannon ball.

    SALV. Why not say a grain of sand as rapidly as a grindstone? But, Simplicio, I trust you will not follow the example of many others who divert the discussion from its main intent and fasten

    upon some statement of mine which lacks a hair’s-breadth of the truth and, under this hair, hide the fault of another which is as

    big as a ship’s cable. Aristotle says that “an iron ball of one hundred pounds falling from a height of one hundred cubits reaches the ground before a one-pound ball has fallen a single

    cubit.” I say that they arrive at the same time. You find, on making

    FIRST DAY 65

    making the experiment, that the larger outstrips the smaller by two finger-breadths, that is, when the larger has reached the ground, the other is short of it by two finger-breadths; now you would not hide behind these two fingers the ninety-nine cubits of Aristotle, nor would you mention my small error and at the same time pass over in silence his very large one. Aristotle declares that bodies of different weights, in the same medium, travel (in so far as their motion depends upon gravity) with speeds which are proportional to their weights ; this he illustrates by use of bodies in which it is possible to perceive the pure and un-

    adulterated effect of gravity, eliminating other considerations, for example, figure as being of small importance [minimi mo- menti\, influences which are greatly dependent upon the medium which modifies the single effect of gravity alone. Thus we ob-

    serve that gold, the densest of all substances, when beaten out into a very thin leaf, goes floating through the air; the same thing happens with stone when ground into a very fine powder. But if you wish to maintain the general proposition you will have to show that the same ratio of speeds is preserved in the

    [no] case of all heavy bodies, and that a stone of twenty pounds moves ten times as rapidly as one of two; but I claim that this is false and that, if they fall from a height of fifty or a hundred cubits, they will reach the earth at the same moment.

    SIMP. Perhaps the result would be different if the fall took place not from a few cubits but from some thousands of cubits.

    SALV. If this were what Aristotle meant you would burden him with another error which would amount to a falsehood; because, since there is no such sheer height available on earth, it is clear that Aristotle could not have made the experiment; yet he wishes to give us the impression of his having performed it when he speaks of such an effect as one which we see.

    SIMP. In fact, Aristotle does not employ this principle, but uses the other one which is not, I believe, subject to these same difficulties.

    SALV. But the one is as false as the other; and I am surprised that you yourself do not see the fallacy and that you do not

    perceive

    66 THE TWO NEW SCIENCES OF GALILEO

    perceive that if it were true that, in media of different densities and different resistances, such as water and air, one and the same body moved in air more rapidly than in water, in propor-

    tion as the density of water is greater than that of air, then it would follow that any body which falls through air ought also to fall through water. But this conclusion is false inasmuch as many bodies which descend in air not only do not descend in water, but acftually rise.

    SIMP. I do not understand the necessity of your inference; and in addition I will say that Aristotle discusses only those bodies which fall in both media, not those which fall in air but rise in water.

    SALV. The arguments which you advance for the Philos- opher are such as he himself would have certainly avoided so as

    not to aggravate his first mistake. But tell me now whether the density [corpulenza] of the water, or whatever it may be that

    retards the motion, bears a definite ratio to the density of air which is less retardative; and if so fix a value for it at your

    pleasure. SIMP. Such a ratio does exist; let us assume it to be ten; then,

    for a body which falls in both these media, the speed in water will be ten times slower than in air.

    SALV. I shall now take one of those bodies which fall in air but not in water, say a wooden ball, and I shall ask you to assign to it any speed you please for its descent through air.

    SIMP. Let us suppose it moves with a speed of twenty. SALV. Very well. Then it is clear that this speed bears to

    some smaller speed the same ratio as the density of water bears to that of air; and the value of this smaller speed is two. So that really if we follow exadlly the assumption of Aristotle we ought to infer that the wooden ball which falls in air, a sub-

    stance ten times less-resisting than water, with a speed of twenty would fall in water with a speed of two, instead of coming to the surface from the bottom as it does; unless perhaps you wish to reply, which I do not believe you will, that the rising of the wood through the water is the same as its falling with a speed of two.

    But

    FIRST DAY 67

    But since the wooden ball does not go to the bottom, I think you will agree with me that we can find a ball of another material, not wood, which does fall in water with a speed of two.

    SIMP. Undoubtedly we can; but it must be of a substance considerably heavier than wood.

    SALV. That is it exadtly. But if this second ball falls in water with a speed of two, what will be its speed of descent in air? If you hold to the rule of Aristotle you must reply that it will move at the rate of twenty; but twenty is the speed which you yourself have already assigned to the wooden ball; hence this and the other heavier ball will each move through air with the same speed. But now how does the Philosopher harmonize this result with his other, namely, that bodies of different weight

    move through the same medium with different speeds — speeds which are proportional to their weights? But without going into the matter more deeply, how have these common and

    [112]

    obvious properties escaped your notice ? Have you not observed that two bodies which fall in water, one with a speed a hundred times as great as that of the other, will fall in air with speeds so nearly equal that one will not surpass the other by as much as one hundredth part? Thus, for example, an egg made of marble will descend in water one hundred times more rapidly than a

    hen’s egg, while in air falling from a height of twenty cubits the one will fall short of the other by less than four finger-breadths. In short, a heavy body which sinks through ten cubits of water in three hours will traverse ten cubits of air in one or two pulse- beats; and if the heavy body be a ball of lead it will easily traverse the ten cubits of water in less than double the time

    required for ten cubits of air. And here, I am sure, Simplicio, you find no ground for difference or objection. We conclude, therefore, that the argument does not bear against the existence of a vacuum; but if it did, it would only do away with vacua of considerable size which neither I nor, in my opinion, the ancients ever believed to exist in nature, although they might possibly be produced by force [violenza] as may be gathered from various ex-

    periments whose description would here occupy too much time.

    Sagr.

    68 THE TWO NEW SCIENCES OF GALILEO

    SAGR. Seeing that Simplicio is silent, I will take the opportu- nity of saying something. Since you have clearly demonstrated

    that bodies of different weights do not move in one and the same medium with velocities proportional to their weights, but that they all move with the same speed, understanding of course that they are of the same substance or at least of the same specific gravity; certainly not of different specific gravities, for I hardly think you would have us believe a ball of cork moves

    with the same speed as one of lead; and again since you have clearly demonstrated that one and the same body moving through differently resisting media does not acquire speeds which are inversely proportional to the resistances, I am curious to learn what are the ratios actually observed in these cases.

    SALV. These are interesting questions and I have thought much concerning them. I will give you the method of approach and the result which I finally reached. Having once established the falsity of the proposition that one and the same body moving through differently resisting media acquires speeds which are inversely proportional to the resistances of these media, and having also disproved the statement that in the same medium bodies of different weight acquire velocities proportional to their weights (understanding that this applies also to bodies which differ merely in specific gravity), I then began to combine these two facts and to consider what would happen if bodies of differ-

    ent weight were placed in media of different resistances; and I found that the differences in speed were greater in those media which were more resistant, that is, less yielding. This difference was such that two bodies which differed scarcely at all in their speed through air would, in water, fall the one with a speed ten times as great as that of the other. Further, there are bodies which will fall rapidly in air, whereas if placed in water not only will not sink but will remain at rest or will even rise to the top: for it is possible to find some kinds of wood, such as knots and roots, which remain at rest in water but fall rapidly in air.

    SAGR. I have often tried with the utmost patience to add grains of sand to a ball of wax until it should acquire the same

    specific

    FIRST DAY 69 specific gravity as water and would therefore remain at rest in this medium. But with all my care I was never able to accom-

    plish this. Indeed, I do not know whether there is any solid substance whose specific gravity is, by nature, so nearly equal to that of water that if placed anywhere in water it will remain at rest.

    SALV. In this, as in a thousand other operations, men are surpassed by animals. In this problem of yours one may learn much from the fish which are very skillful in maintaining their equilibrium not only in one kind of water, but also in waters which are notably different either by their own nature or by

    some accidental muddiness or through salinity, each of which produces a marked change. So perfectly indeed can fish keep their equilibrium that they are able to remain motionless in any position. This they accomplish, I believe, by means of an apparatus especially provided by nature, namely, a bladder located in the body and communicating with the mouth by means of a narrow tube through which they are able, at will, to expel a portion of the air contained in the bladder: by rising to the surface they can take in more air; thus they make themselves heavier or lighter than water at will and maintain equilibrium.

    SAGR. By means of another device I was able to deceive some friends to whom I had boasted that I could make up a ball of wax that would be in equilibrium in water. In the bottom of a vessel I placed some salt water and upon this some fresh water; then I showed them that the ball stopped in the middle of the water, and that, when pushed to the bottom or lifted to the top, would not remain in either of these places but would return to the middle.

    SALV. This experiment is not without usefulness. For when physicians are testing the various qualities of waters, especially their specific gravities, they employ a ball of this kind so ad-

    justed that, in certain water, it will neither rise nor fall. Then in testing another water, differing ever so slightly in specific gravity \pcso\ the ball will sink if this water be lighter and rise if it be heavier. And so exact is this experiment that the addi-

    tion

    yo THE NEW TWO SCIENCES OF GALILEO tion of two grains of salt to six pounds of water is sufficient to make the ball rise to the surface from the bottom to which it had fallen. To illustrate the precision of this experiment and also to clearly demonstrate the non-resistance of water to division, I wish to add that this notable difference in specific gravity can be produced not only by solution of some heavier substance, but also by merely heating or cooling; and so sensitive is water to this process that by simply adding four drops of another water which is slightly warmer or cooler than the six pounds one can cause the ball to sink or rise; it will sink when the warm water is poured in and will rise upon the addition of cold water. Now you [“Si

    can see how mistaken are those philosophers who ascribe to water viscosity or some other coherence of parts which offers resistance to separation of parts and to penetration.

    SAGR. With regard to this question I have found many con- vincing arguments in a treatise by our Academician; but there is

    one great difficulty of which I have not been able to rid myself, namely, if there be no tenacity or coherence between the particles of water how is it possible for those large drops of water to stand out in relief upon cabbage leaves without scattering or spreading out?

    SALV. Although those who are in possession of the truth are able to solve all objections raised, I would not arrogate to myself such power; nevertheless my inability should not be allowed to becloud the truth. To begin with let me confess that I do not understand how these large globules of water stand out and hold themselves up, although I know for a certainty, that it is not owing to any internal tenacity acting between the particles of water; whence it must follow that the cause of this effect is external. Beside the experiments already shown to prove that the cause is not internal, I can offer another which is very con-

    vincing. If the particles of water which sustain themselves in a heap, while surrounded by air, did so in virtue of an internal cause then they would sustain themselves much more easily when surrounded by a medium in which they exhibit less tendency to fall than they do in air; such a medium would be any fluid

    heavier

    FIRST DAY 71 heavier than air, as, for instance, wine: and therefore if some wine be poured about such a drop of water, the wine might rise until the drop was entirely covered, without the particles of water, held together by this internal coherence, ever parting company. But this is not the fadl; for as soon as the wine touches the water, the latter without waiting to be covered scatters and spreads out underneath the wine if it be red. The cause of this effecl: is therefore external and is possibly to be found in the surrounding air. Indeed there appears to be a considerable antagonism between air and water as I have ob-

    served in the following experiment. Having taken a glass globe which had a mouth of about the same diameter as a straw, I filled it with water and turned it mouth downwards; neverthe-

    [116] less, the water, although quite heavy and prone to descend, and the air, which is very light and disposed to rise through the water, refused, the one to descend and the other to ascend through the opening, but both remained stubborn and defiant. On the other hand, as soon as I apply to this opening a glass of red wine, which is almost inappreciably lighter than water, red streaks are immediately observed to ascend slowly through the water while the water with equal slowness descends through the wine without mixing, until finally the globe is completely filled with wine and the water has all gone down into the vessel below. What then can we say except that there exists, between water and air, a certain incompatibility which I do not under-

    stand, but perhaps. . . . SIMP. I feel almost like laughing at the great antipathy which

    Salviati exhibits against the use of the word antipathy; and yet it is excellently adapted to explain the difficulty.

    SALV. Alright, if it please Simplicio, let this word antipathy be the solution of our difficulty. Returning from this digression, let us again take up our problem. We have already seen that the difference of speed between bodies of different specific gravities is most marked in those media which are the most resistant: thus, in a medium of quicksilver, gold not merely sinks to the bottom more rapidly than lead but it is the only

    substance

    72 THE TWO NEW SCIENCES OF GALILEO substance that will descend at all; all other metals and stones rise to the surface and float. On the other hand the variation of speed in air between balls of gold, lead, copper, porphyry, and other heavy materials is so slight that in a fall of 100 cubits a ball of gold would surely not outstrip one of copper by as much as four fingers. Having observed this I came to the conclusion that in a medium totally devoid of resistance all bodies would fall with the same speed.

    SIMP. This is a remarkable statement, Salviati. But I shall never believe that even in a vacuum, if motion in such a place were possible, a lock of wool and a bit of lead can fall with the same velocity.

    SALV. A little more slowly, Simplicio. Your difficulty is not so recondite nor am I so imprudent as to warrant you in believing that I have not already considered this matter and found the proper solution. Hence for my justification and

    for your enlightenment hear what I have to say. Our problem is to find out what happens to bodies of different weight moving in a medium devoid of resistance, so that the only difference in speed is that which arises from inequality of weight. Since no medium except one entirely free from air and other bodies, be it ever so tenuous and yielding, can furnish our senses with the evidence we are looking for, and since such a medium is not available, we shall observe what happens in the rarest and least resistant media as compared with what happens in denser and more resistant media. Because if we find as a fact that the variation of speed among bodies of different specific gravities is less and less according as the medium becomes more and more yielding, and if finally in a medium of extreme tenuity, though not a perfect vacuum, we find that, in spite of great diversity of specific gravity [p£H>], the difference in speed is very small and almost inappreciable, then we are justified in believing it highly probable that in a vacuum all bodies would fall with the same speed. Let us, in view of this, consider what takes place in air, where for the sake of a definite figure and light material imagine an inflated bladder. The air in this bladder when surrounded by

    air

    FIRST DAY 73

    air will weigh little or nothing, since it can be only slightly com- pressed; its weight then is small being merely that of the skin

    which does not amount to the thousandth part of a mass of lead having the same size as the inflated bladder. Now, Sim- plicio, if we allow these two bodies to fall from a height of four or six cubits, by what distance do you imagine the lead will anticipate the bladder? You may be sure that the lead will not travel three times, or even twice, as swiftly as the bladder, although vou would have made it move a thousand times as rapidly.

    SIMP. It may be as you say during the first four or six cubits of the fall; but after the motion has continued a long while, I believe that the lead will have left the bladder behind not only six out of twelve parts of the distance but even eight or ten.

    SALV. I quite agree with you and doubt not that, in very long distances, the lead might cover one hundred miles while the

    [118] bladder was traversing one; but, my dear Simplicio, this phenom-

    enon which you adduce against my proposition is precisely the one which confirms it. Let me once more explain that the

    variation of speed observed in bodies of different specific gravi- ties is not caused by the difference of specific gravity but de-

    pends upon external circumstances and, in particular, upon the resistance of the medium, so that if this is removed all bodies would fall with the same velocity; and this result I deduce mainly from the fact which you have just admitted and which is very true, namely, that, in the case of bodies which differ widely in weight, their velocities differ more and more as the spaces traversed increase, something which would not occur if the effect depended upon differences of specific gravity. For since these specific gravities remain constant, the ratio between the distances traversed ought to remain constant whereas the

    fact is that this ratio keeps on increasing as the motion con- tinues. Thus a very heavy body in a fall of one cubit will not

    anticipate a very light one by so much as the tenth part of this space; but in a fall of twelve cubits the heavy body would out-

    strip

    74 THE TWO NEW SCIENCES OF GALILEO

    strip the other by one-third, and in a fall of one hundred cubits

    by 90/100, etc. SIMP. Very well: but, following your own line of argument, if differences of weight in bodies of different specific gravities cannot produce a change in the ratio of their speeds, on the ground that their specific gravities do not change, how is it possible for the medium, which also we suppose to remain con-

    stant, to bring about any change in the ratio of these velocities ? SALV. This objection with which you oppose my statement

    is clever; and I must meet it. I begin by saying that a heavy body has an inherent tendency to move with a constantly and uniformly accelerated motion toward the common center of gravity, that is, toward the center of our earth, so that during equal intervals of time it receives equal increments of momentum and velocity. This, you must understand, holds whenever all external and accidental hindrances have been removed; but of these there is one which we can never remove, namely, the medium which must be penetrated and thrust aside by the falling body. This quiet, yielding, fluid medium opposes motion

    [119].

    through it with a resistance which is proportional to the rapidity with which the medium must give way to the passage of the body; which body, as I have said,. is by nature continuously accelerated so that it meets with more and more resistance in the medium and hence a diminution in its rate of gain of speed until finally the speed reaches such a point and the resistance of the medium becomes so great that, balancing each other, they prevent any further acceleration and reduce the motion of the body to one which is uniform and which will thereafter maintain a constant value. There is, therefore, an increase in the resist-

    ance of the medium, not on account of any change in its essential properties, but on account of the change in rapidity with which it must yield and give way laterally to the passage of the falling body which is being constantly accelerated. Now seeing how great is the resistance which the air offers to

    the slight momentum [momenta] of the bladder and how small that which it offers to the large weight [peso] of the lead, I

    am

    FIRST DAY 75

    am convinced that, if the medium were entirely removed, the advantage received by the bladder would be so great and that coming to the lead so small that their speeds would be equalized. Assuming this principle, that all falling bodies acquire equal speeds in a medium which, on account of a vacuum or something else, offers no resistance to the speed of the motion, we shall be able accordingly to determine the ratios of the speeds of both similar and dissimilar bodies moving either through one and the same medium or through different space-filling, and therefore resistant, media. This result we may obtain by observing how much the weight of the medium detracts from the weight of the moving body, which weight is the means employed by the falling body to open a path for itself and to push aside the parts of the medium, something which does not happen in a vacuum where, therefore, no difference [of speed] is to be expected from a difference of specific gravity. And since it is known that the effect of the medium is to diminish the weight of the body by the weight of the medium displaced, we may accomplish our purpose by diminishing in just this proportion the speeds of the falling bodies, which in a non-resisting medium we have assumed to be equal.

    Thus, for example, imagine lead to be ten thousand times as heavy as air while ebony is only one thousand times as heavy.

    [120] Here we have two substances whose speeds of fall in a medium devoid of resistance are equal: but, when air is the medium, it will subtract from the speed of the lead one part in ten thousand, and from the speed of the ebony one part in one thousand, i. e. ten parts in ten thousand. While therefore lead and ebony would fall from any given height in the same interval of time, provided the retarding effect of the air were removed, the lead will, in air, lose in speed one part in ten thousand; and the ebony, ten parts in ten thousand. In other words, if the elevation from which the bodies start be divided into ten thousand parts, the lead will reach the ground leaving the ebony behind by as much as ten, or at least nine, of these parts. Is it not clear then that a leaden ball allowed to fall from a tower two hundred cubits h gh

    76 THE TWO NEW SCIENCES OF GALILEO

    high will outstrip an ebony ball by less than four inches ? Now ebony weighs a thousand times as much as air but this inflated bladder only four times as much; therefore air diminishes the inherent and natural speed of ebony by one part in a thousand ; while that of the bladder which, if free from hindrance, would be the same, experiences a diminution in air amounting to one part in four. So that when the ebony ball, falling from the tower, has reached the earth, the bladder will have traversed only three-quarters of this distance. Lead is twelve times as heavy as water; but ivory is only twice as heavy. The speeds of these two substances which, when entirely unhindered, are equal will be diminished in water, that of lead by one part in twelve, that of ivory by half. Accordingly when the lead has fallen through eleven cubits of water the ivory will have fallen through only six. Employing this principle we shall, I believe, find a much closer agreement of experiment with our computation than with that of Aristotle.

    In a similar manner we may find the ratio of the speeds of one and the same body in different fluid media, not by comparing the different resistances of the media, but by considering the excess of the specific gravity of the body above those of the media. Thus, for example, tin is one thousand times heavier than air and ten times heavier than water; hence, if we divide its un-

    hindered speed into 1000 parts, air will rob it of one of these parts so that it will fall with a speed of 999, while in water its speed will be 900, seeing that water diminishes its weight by one part in ten while air by only one part in a thousand.

    Again take a solid a little heavier than water, such as oak, a ball of which will weigh let us say 1000 drachms; suppose an

    [121] equal volume of water to weigh 950, and an equal volume of air, 2; then it is clear that if the unhindered speed of the ball is 1000, its speed in air will be 998, but in water only 50, seeing that the water removes 950 of the 1000 parts which the body weighs, leaving only 50.

    Such a solid would therefore move almost twenty times as fast in air as in water, since its specific gravity exceeds that of

    water

    FIRST DAY 77 water by one part in twenty. And here we must consider the fact that only those substances which have a specific gravity

    greater than water can fall through it — substances which must, therefore, be hundreds of times heavier than air; hence when we try to obtain the ratio of the speed in air to that in water, we may, without appreciable error, assume that air does not, to any considerable extent, diminish the free weight [assoluta gravitd], and consequently the unhindered speed [assoluta velocitd] of such substances. Having thus easily found the excess of the weight of these substances over that of water, we can say that their speed in air is to their speed in water as their free weight [totale gravitd] is to the excess of this weight over that of water. For example, a ball of ivory weighs 20 ounces; an equal volume of water weighs 17 ounces; hence the speed of ivory in air bears to its speed in water the approximate ratio of 20:3.

    SAGR. I have made a great step forward in this truly interest- ing subject upon which I have long labored in vain. In order

    to put these theories into practice we need only discover a method of determining the specific gravity of air with reference to water and hence with reference to other heavy substances.

    SIMP. But if we find that air has levity instead of gravity what then shall we say of the foregoing discussion which, in other respects, is very clever?

    SALV. I should say that it was empty, vain, and trifling. But can you doubt that air has weight when you have the clear testimony of Aristotle affirming that all the elements have weight including air, and excepting only fire? As evidence of this he cites the fact that a leather bottle weighs more when inflated than when collapsed.

    [122] SIMP. I am inclined to believe that the increase of weight

    observed in the inflated leather bottle or bladder arises, not from the gravity of the air, but from the many thick vapors mingled with it in these lower regions. To this I would attribute the increase of weight in the leather bottle.

    SALV. I would not have you say this, and much less attribute it to Aristotle; because, if speaking of the elements, he wished to

    persuade

    78 THE TWO NEW SCIENCES OF GALILEO persuade me by experiment that air has weight and were to say to me: “Take a leather bottle, fill it with heavy vapors and ob-

    serve how its weight increases,” I would reply that the bottle would weigh still more if filled with bran; and would then add that this merely proves that bran and thick vapors are heavy, but in regard to air I should still remain in the same doubt as before. However, the experiment of Aristotle is good and the proposition is true. But I cannot say as much of a certain other consideration, taken at face value; this consideration was of-

    fered by a philosopher whose name slips me; but I know I have read his argument which is that air exhibits greater gravity than levity, because it carries heavy bodies downward more easily than it does light ones upward.

    SAGR. Fine indeed! So according to this theory air is much heavier than water, since all heavy bodies are carried downward more easily through air than through water, and all light bodies buoyed up more easily through water than through air; further there is an infinite number of heavy bodies which fall through air but ascend in water and there is an infinite number of sub-

    stances which rise in water and fall in air. But, Simplicio, the question as to whether the weight of the leather bottle is owing to thick vapors or to pure air does not affect our problem which is to discover how bodies move through this vapor-laden atmos-

    phere of ours. Returning now to the question which interests me more, I should like, for the sake of more complete and thorough knowledge of this matter, not only to be strengthened in my belief that air has weight but also to learn, if possible, how great its specific gravity is. Therefore, Salviati, if you can satisfy my curiosity on this point pray do so.

    SALV. The experiment with the inflated leather bottle of Aristotle proves conclusively that air possesses positive gravity and not, as some have believed, levity, a property possessed possibly by no substance whatever; for if air did possess this quality of absolute and positive levity, it should on compression [123]

    exhibit greater levity and, hence, a greater tendency to rise; but experiment shows precisely the opposite.

    As

    FIRST DAY 79 As to the other question, namely, how to determine the

    specific gravity of air, I have employed the following method. I took a rather large glass bottle with a narrow neck and at-

    tached to it a leather cover, binding it tightly about the neck of the bottle: in the top of this cover I inserted and firmly fastened the valve of a leather bottle, through which I forced into the glass bottle, by means of a syringe, a large quantity of air. And since air is easily condensed one can pump into the bottle two or three times its own volume of air. After this I took an accurate balance and weighed this bottle of compressed air with the utmost precision, adjusting the weight with fine sand. I next opened the valve and allowed the compressed air to escape ; then replaced the flask upon the balance and found it per-

    ceptibly lighter: from the sand which had been used as a counter- weight I now removed and laid aside as much as was necessary

    to again secure balance. Under these conditions there can be no doubt but that the weight of the sand thus laid aside represents the weight of the air which had been forced into the flask and had afterwards escaped. But after all this experiment tells me merely that the weight of the compressed air is the same as that of the sand removed from the balance; when however it comes to knowing certainly and definitely the weight of air as compared with that of water or any other heavy substance this I cannot hope to do without first measuring the volume [qicantiia] of compressed air; for this measurement I have devised the two following methods.

    According to the first method one takes a bottle with a narrow neck similar to the previous one; over the mouth of this bottle is slipped a leather tube which is bound tightly about the neck of the flask; the other end of this tube embraces the valve attached to the first flask and is tightly bound about it. This second flask is provided with a hole in the bottom through which an iron rod can be placed so as to open, at will, the valve above mentioned and thus permit the surplus air of the first to escape after it has once been weighed: but his second bottle must be filled with water. Having prepared everything in the manner [124]

    above

    8o THE TWO NEW SCIENCES OF GALILEO

    above described, open the valve with the rod; the air will rush into the flask containing the water and will drive it through the hole at the bottom, it being clear that the volume [quantita] of water thus displaced is equal to the volume [mole e quantita] of air escaped from the other vessel. Having set aside this dis-

    placed water, weigh the vessel from which the air has escaped (which is supposed to have been weighed previously while containing the compressed air), and remove the surplus of sand as described above; it is then manifest that the weight of this sand is precisely the weight of a volume [mole] of air equal to the volume of water displaced and set aside; this water we can weigh and find how many times its weight contains the weight of the removed sand, thus determining definitely how many times heavier water is than air; and we shall find, contrary to the opinion of Aristotle, that this is not 10 times, but, as our experi-

    ment shows, more nearly 400 times. The second method is more expeditious and can be carried

    out with a single vessel fitted up as the first was. Here no air is added to that which the vessel naturally contains but water is forced into it without allowing any air to escape; the water thus introduced necessarily compresses the air. Having forced into the vessel as much water as possible, filling it, say, three-fourths full, which does not require any extraordinary effort, place it upon the balance and weigh it accurately; next hold the vessel mouth up, open the valve, and allow the air to escape; the volume of the air thus escaping is precisely equal to the volume of water contained in the flask. Again weigh the vessel which will have diminished in weight on account of the escaped air; this loss in weight represents the weight of a volume of air equal to the volume of water contained in the vessel.

    SIMP. No one can deny the cleverness and ingenuity of your devices; but while they appear to give complete intellectual satisfaction they confuse me in another direction. For since it is undoubtedly true that the elements when in their proper places have neither weight nor levity, I cannot understand how it is possible for that portion of air, which appeared to weigh, say, 4 drachms of sand, should really have such a weight in air as the

    sand

    FIRST DAY 81

    sand which counterbalances it. It seems to me, therefore, that the experiment should be carried out, not in air, but in a medium

    .[125] in which the air could exhibit its property of weight if such it

    really has.

    SALV. The objection of Simplicio is certainly to the point and must therefore either be unanswerable or demand an equally clear solution. It is perfectly evident that that air which, under compression, weighed as much as the sand, loses this weight when once allowed to escape into its own element, while, indeed, the sand retains its weight. Hence for this experiment it be-

    comes necessary to select a place where air as well as sand can gravitate; because, as has been often remarked, the medium diminishes the weight of any substance immersed in it by an amount equal to the weight of the displaced medium; so that air in air loses all its weight. If therefore this experiment is to be made with accuracy it should be performed in a vacuum where every heavy body exhibits its momentum without the slightest diminution. If then, Simplicio, we were to weigh a portion of air in a vacuum would you then be satisfied and assured of the fact?

    SIMP. Yes truly: but this is to wish or ask the impossible. SALV. Your obligation will then be very great if, for your

    sake, I accomplish the impossible. But I do not want to sell you something which I have already given you; for in the previous experiment we weighed the air in vacuum and not in air or other medium. The fact that any fluid medium diminishes the

    weight of a mass immersed in it, is due, Simplicio, to the resist- ance which this medium offers to its being opened up, driven

    aside, and finally lifted up. The evidence for this is seen in the readiness with which the fluid rushes to fill up any space for-

    merly occupied by the mass; if the medium were not affected by such an immersion then it would not react against the immersed body. Tell me now, when you have a flask, in air, filled with its natural amount of air and then proceed to pump into the vessel more air, does this extra charge in any way separate or divide or change the circumambient air? Does the vessel perhaps expand

    82 THE TWO NEW SCIENCES OF GALILEO so that the surrounding medium is displaced in order to give

    more room? Certainly not. Therefore one is able to say that [126]

    this extra charge of air is not immersed in the surrounding medium for it occupies no space in it, but is, as it were, in a vacuum. Indeed, it is really in a vacuum; for it diffuses into the vacuities which are not completely filled by the original and uncondensed air. In facft I do not see any difference between the enclosed and the surrounding media: for the surrounding medium does not press upon the enclosed medium and, vice versa, the enclosed medium exerts no pressure against the surrounding one; this same relationship exists in the case of any matter in a vacuum, as well as in the case of the extra charge of air com-

    pressed into the flask. The weight of this condensed air is therefore the same as that which it would have if set free in a vacuum. It is true of course that the weight of the sand used as a counterpoise would be a little greater in vacuo than in free air. We must, then, say that the air is slightly lighter than the sand required to counterbalance it, that is to say, by an amount equal to the weight in vacuo of a volume of air equal to the volume of the sand.

    At this point in an annotated copy of the original edition the following note by Galileo is found.

    [SAGR. A very clever discussion, solving a wonderful problem, because it demonstrates briefly and concisely the manner in which one may find the weight of a body in vacuo by simply weighing it in air. The explanation is as follows: when a heavy body is immersed in air it loses in weight an amount equal to the weight of a volume [mole} of air equivalent to the volume [mole] of the body itself. Hence if one adds to a body, without expanding it, a quantity of air equal to that which it displaces and weighs it, he will obtain its absolute weight in vacuo, since, without increasing it in size, he has increased its weight by just the amount which it lost through immersion in air. When therefore we force a quantity of water into a vessel which al-

    ready contains its normal amount of air, without allowing any of this air to escape it is clear that this normal quantity of air will be compressed and condensed into a smaller space in order to make room for the water which is forced in: it is also clear that the volume of air thus com-

    pressed is equal to the volume of water added. If now the vessel be

    weighed

    FIRST DAY 83 weighed in air in this condition, it is manifest that the weight of the water will be increased by that of an equal volume of air; the total weight of water and air thus obtained is equal to the weight of the water alone in vacuo.

    Now record the weight of the entire vessel and then allow the com- pressed air to escape; weigh the remainder; the difference of these two

    weights will be the weight of the compressed air which, in volume, is equal to that of the water. Next find the weight of the water alone and add to it that of the compressed air; we shall then have the water alone in vacuo. To find the weight of the water we shall have to remove it from the vessel and weigh the vessel alone; subtrad this weight from that of the vessel and water together. It is clear that the remainder will

    be the weight of the water alone in air.] [127]

    SIMP. The previous experiments, in my opinion, left some- thing to be desired : but now I am fully satisfied.

    SALV. The facts set forth by me up to this point and, in particular, the one which shows that difference of weight, even when very great, is without effect in changing the speed of falling bodies, so that as far as weight is concerned they all fall with equal speed: this idea is, I say, so new, and at first glance so remote from fact, that if we do not have the means of making it just as clear as sunlight, it had better not be mentioned; but having once allowed it to pass my lips I must neglect no experi-

    ment or argument to establish it. SAGR. Not only this but also many other of your views are

    so far removed from the commonly accepted opinions and doctrines that if you were to publish them you would stir up a large number of antagonists; for human nature is such that men do not look with favor upon discoveries — either of truth or fallacy — in their own field, when made by others than them-

    selves. They call him an innovator of doctrine, an unpleasant title, by which they hope to cut those knots which they cannot untie, and by subterranean mines they seek to destroy struc-

    tures which patient artisans have built with customary tools. [128]

    But as for ourselves who have no such thoughts, the experi- ments and arguments which you have thus far adduced are

    fully satisfactory; however if you have any experiments’ which are

    84 THE TWO NEW SCIENCES OF GALILEO are more direct or any arguments which are more convincing we will hear them with pleasure.

    SALV. The experiment made to ascertain whether two bodies, differing greatly in weight will fall from a given height with the same speed offers some difficulty; because, if the height is con-

    siderable, the retarding effect of the medium, which must be penetrated and thrust aside by the falling body, will be greater in the case of the small momentum of the very light body than in the case of the great force [violenza] of the heavy body; so that, in a long distance, the light body will be left behind; if the height be small, one may well doubt whether there is any difference; and if there be a difference it will be inappreciable.

    It occurred to me therefore to repeat many times the fall through a small height in such a way that I might accumulate all those small intervals of time that elapse between the arrival of the heavy and light bodies respectively at their common terminus, so that this sum makes an interval of time which is not only observable, but easily observable. In order to employ the slowest speeds possible and thus reduce the change which the resisting medium produces upon the simple effect of gravity it occurred to me to allow the bodies to fall along a plane slightly inclined to the horizontal. For in such a plane, just as well as in a vertical plane, one may discover how bodies of different weight behave: and besides this, I also wished to rid myself of the resistance which might arise from contact of the moving body with the aforesaid inclined plane. Accordingly I took two balls, one of lead and one of cork, the former more than a hun-

    dred times heavier than the latter, and suspended them by means of two equal fine threads, each four or five cubits long. Pulling each ball aside from the perpendicular, I let them go at the same instant, and they, falling along the circumferences of circles having these equal strings for semi-diameters, passed beyond the perpendicular and returned along the same path. This free vibration [per lor medesime le andate e le tornate] repeated a hundred times showed clearly that the heavy body maintains so [129]

    nearly the period of the light body that neither in a hundred swings

    FIRST DAY 85 swings nor even in a thousand will the former anticipate the latter by as much as a single moment [minima momenta], so perfectly do they keep step. We can also observe the effect of the medium which, by the resistance which it offers to motion, diminishes the vibration of the cork more than that of the lead, but without altering the frequency of either; even when the arc traversed by the cork did not exceed five or six degrees while that of the lead was fifty or sixty, the swings were performed in equal times.

    SIMP. If this be so, why is not the speed of the lead greater than that of the cork, seeing that the former traverses sixty de-

    grees in the same interval in which the latter covers scarcely six? SALV. But what would you say, Simplicio, if both covered

    their paths in the same time when the cork, drawn aside through thirty degrees, traverses an arc of sixty, while the lead pulled aside only two degrees traverses an arc of four? Would not then the cork be proportionately swifter? And yet such is the experimental fact. But observe this: having pulled aside the pendulum of lead, say through an arc of fifty degrees, and set it free, it swings beyond the perpendicular almost fifty degrees, thus describing an arc of nearly one hundred degrees; on the return swing it describes a little smaller arc; and after a large number of such vibrations it finally comes to rest. Each vibra-

    tion, whether of ninety, fifty, twenty, ten, or four degrees occupies the same time: accordingly the speed of the moving body keeps on diminishing since in equal intervals of time, it traverses arcs which grow smaller and smaller.

    Precisely the same things happen with the pendulum of cork, suspended by a string of equal length, except that a smaller number of vibrations is required to bring it to rest, since on account of its lightness it is less able to overcome the resistance of the air; nevertheless the vibrations, whether large or small, are all performed in time-intervals which are not only equal among themselves, but also equal to the period of the lead pendulum. Hence it is true that, if while the lead is traversing an arc of fifty degrees the cork covers one of only ten, the cork moves more slowly than the lead; but on the other hand it is also true

    that

    86 THE TWO NEW SCIENCES OF GALILEO

    [130] that the cork may cover an arc of fifty while the lead passes over one of only ten or six; thus, at different times, we have now the cork, now the lead, moving more rapidly. But if these same bodies traverse equal arcs in equal times we may rest assured that their speeds are equal.

    SIMP. I hesitate to admit the conclusiveness of this argument because of the confusion which arises from your making both bodies move now rapidly, now slowly and now very slowly, which leaves me in doubt as to whether their velocities are always equal.

    SAGR. Allow me, if you please, Salviati, to say just a few words. Now tell me, Simplicio, whether you admit that one can say with certainty that the speeds of the cork and the lead are equal whenever both, starting from rest at the same moment and descending the same slopes, always traverse equal spaces in equal times?

    SIMP. This can neither be doubted nor gainsaid. SAGR. Now it happens, in the case of the pendulums, that each

    of them traverses now an arc of sixty degrees, now one of fifty, or thirty or ten or eight or four or two, etc. ; and when they both

    swing through an arc of sixty degrees they do so in equal inter- vals of time; the same thing happens when the arc is fifty degrees

    or thirty or ten or any other number; and therefore we conclude that the speed of the lead in an arc of sixty degrees is equal to the speed of the cork when the latter also swings through an arc of sixty degrees; in the case of a fifty-degree arc these speeds are also equal to each other; so also in the case of other arcs. But this is not saying that the speed which occurs in an arc of sixty is the same as that which occurs in an arc of fifty; nor is the speed in an arc of fifty equal to that in one of thirty, etc. ; but the smaller the arcs, the smaller the speeds; the facft observed is that one and the same moving body requires the same time for traversing a large arc of sixty degrees as for a small arc of fifty or even a very small arc of ten; all these arcs, indeed, are covered in the same interval of time. It is true therefore that the lead

    [131]

    and

    FIRST DAY 87 and the cork each diminish their speed [moto] in proportion as their arcs diminish; but this does not contradict the fact that they maintain equal speeds in equal arcs. My reason for saying these things has been rather because I

    wanted to learn whether I had correctly understood Salviati, than because I thought Simplicio had any need of a clearer ex-

    planation than that given by Salviati which like everything else of his is extremely lucid, so lucid, indeed, that when he solves questions which are difficult not merely in appearance, but in reality and in fact, he does so with reasons, observations and experiments which are common and familiar to everyone.

    In this manner he has, as I have learned from various sources, given occasion to a highly esteemed professor for undervaluing his discoveries on the ground that they are commonplace, and established upon a mean and vulgar basis; as if it were not a most admirable and praiseworthy feature of demonstrative science that it springs from and grows out of principles well- known, understood and conceded by all.

    But let us continue with this light diet; and if Simplicio is satisfied to understand and admit that the gravity inherent [interna graviia] in various falling bodies has nothing to do with the difference of speed observed among them, and that all bodies, in so far as their speeds depend upon it, would move with the same velocity, pray tell us, Salviati, how you explain the appreciable and evident inequality of motion; please reply also to the objection urged by Simplicio — an objection in which I .concur — namely, that a cannon ball falls more rapidly than a bird-shot. From my point of view, one might expect the differ-

    ence of speed to be small in the case of bodies of the same sub- stance moving through any single medium, whereas the larger

    ones will descend, during a single pulse-beat, a distance which the smaller ones will not traverse in an hour, or in four, or even in twenty hours; as for instance in the case of stones and fine sand and especially that very fine sand which produces muddy water and which in many hours will not fall through as much as two cubits, a distance which stones not much larger will traverse in a single pulse-beat.

    Salv.

    88 THE TWO NEW SCIENCES OF GALILEO

    SALV. The acflion of the medium in producing a greater retardation upon those bodies which have a less specific gravity has already been explained by showing that they experience a diminution of weight. But to explain how one and the same

    medium produces such different retardations in bodies which are made of the same material and have the same shape, but differ only in size, requires a discussion more clever than that by which one explains how a more expanded shape or an op-

    posing motion of the medium retards the speed of the moving body. The solution of the present problem lies, I think, in the roughness and porosity which are generally and almost neces-

    sarily found in the surfaces of solid bodies. When the body is in motion these rough places strike the air or other ambient me-

    dium. The evidence for this is found in the humming which accompanies the rapid motion of a body through air, even when that body is as round as possible. One hears not only humming, but also hissing and whistling, whenever there is any appreciable cavity or elevation upon the body. We observe also that a round solid body rotating in a lathe produces a current of air. But what more do we need ? When a top spins on the ground at its greatest speed do we not hear a distinct buzzing of high pitch? This sibilant note diminishes in pitch as the speed of rotation slackens, which is evidence that these small rugosities on the surface meet resistance in the air. There can be no doubt, therefore, that in the motion of falling bodies these rugosities strike the surrounding fluid and retard the speed; and this they do so much the more in proportion as the surface is larger, which is the case of small bodies as compared with greater.

    SIMP. Stop a moment please, I am getting confused. For al- though I understand and admit that f riction of the medium upon

    the surface of the body retards its motion and that, if other

    things are the same, the larger surface suffers greater retarda- tion, I do not see on what ground you say that the surface of the

    smaller body is larger. Besides if, as you say, the larger surface suffers greater retardation the larger solid should move more slowly, which is not the fact. But this objection can be easily

    met

    FIRST DAY 89

    met by saying that, although the larger body has a larger sur- face, it has also a greater weight, in comparison with which the

    resistance of the larger surface is no more than the resistance of the small surface in comparison with its smaller weight; so that the speed of the larger solid does not become less. I therefore see no reason for expecting any difference of speed so long as the

    driving weight \gravita movente] diminishes in the same propor- [133]

    tion as the retarding power [facolta ritardante] of the surface. SALV. I shall answer all your objections at once. You will

    admit, of course, Simplicio, that if one takes two equal bodies, of the same material and same figure, bodies which would therefore fall with equal speeds, and if he diminishes the weight of one of them in the same proportion as its surface (maintaining the similarity of shape) he would not thereby diminish the speed of this body.

    SIMP. This inference seems to be in harmony with your theory which states that the weight of a body has no effect in either accelerating or retarding its motion.

    SALV. I quite agree with you in this opinion from which it appears to follow that, if the weight of a body is diminished in greater proportion than its surface, the motion is retarded to a certain extent; and this retardation is greater and greater in proportion as the diminution of weight exceeds that of the sur- face.

    SIMP. This I admit without hesitation.

    SALV. Now you must know, Simplicio, that it is not possible to diminish the surface of a solid body in the same ratio as the weight, and at the same time maintain similarity of figure. For since it is clear that in the case of a diminishing solid the weight grows less in proportion to the volume, and since the volume always diminishes more rapidly than the surface, when the same shape is maintained, the weight must therefore dimin-

    ish more rapidly than the surface. But geometry teaches us that, in the case of similar solids, the ratio of two volumes is greater than the ratio of their surfaces; which, for the sake of better understanding, I shall illustrate by a particular case. Take,

    90 THE TWO NEW SCIENCES OF GALILEO Take, for example, a cube two inches on a side so that each

    face has an area of four square inches and the total area, i. e., the sum of the six faces, amounts to twenty-four square inches; now imagine this cube to be sawed through three times so as to divide it into eight smaller cubes, each one inch on the side, each face one inch square, and the total surface of each cube six square inches instead of twenty-four as in the case of the larger [134]

    cube. It is evident therefore that the surface of the little cube is

    only one-fourth that of the larger, namely, the ratio of six to twenty-four; but the volume of the solid cube itself is only one- eighth; the volume, and hence also the weight, diminishes there-

    fore much more rapidly than the surface. If we again divide the little cube into eight others we shall have, for the total surface of one of these, one and one-half square inches, which is one- sixteenth of the surface of the original cube; but its volume is only one-sixty-fourth part. Thus, by two divisions, you see that the volume is diminished four times as much as the surface. And, if the subdivision be continued until the original solid be reduced to a fine powder, we shall find that the weight of one of these smallest particles has diminished hundreds and hundreds of times as much as its surface. And this which I have illustrated in the case of cubes holds also in the case of all similar solids, where the volumes stand in sesquialteral ratio to their surfaces. Observe then how much greater the resistance, arising from con-

    tact of the surface of the moving body with the medium, in the case of small bodies than in the case of large; and when one considers that the rugosities on the very small surfaces of fine dust particles are perhaps no smaller than those on the surfaces of larger solids which have been carefully polished, he will see how important it is that the medium should be very fluid and offer no resistance to being thrust aside, easily yielding to a small force. You see, therefore, Simplicio, that I was not mistaken when, not long ago, I said that the surface of a small solid is comparatively greater than that of a large one.

    SIMP. I am quite convinced; and, believe me, if I were again beginning my studies, I should follow the advice of Plato and

    start

    FIRST DAY 91 start with mathematics, a science which proceeds very cautiously md admits nothing as established until it has been rigidly dem- onstrated.

    SAGR. This discussion has afforded me great pleasure; but Before proceeding further I should like to hear the explanation of i phrase of yours which is new to me, namely, that similar solids ire to each other in the sesquialteral ratio of their surfaces; for although I have seen and understood the proposition in which it s demonstrated that the surfaces of similar solids are in the

    [135]

    luplicate ratio of their sides and also the proposition which Droves that the volumes are in the triplicate ratio of their sides, ^et I have not so much as heard mentioned the ratio of the /olume of a solid to its surface.

    SALV. You yourself have suggested the answer to your ques- tion and have removed every doubt. For if one quantity is

    the cube of something of which another quantity is the square does it not follow that the cube is the sesquialteral of the square? Surely. Now if the surface varies as the square of its linear dimensions while the volume varies as the cube of these dimen-

    sions may we not say that the volume stands in sesquialteral ratio to the surface?

    SAGR. Quite so. And now although there are still some de- tails, in connection with the subject under discussion, con-

    :erning which I might ask questions yet, if we keep making one digression after another, it will be long before we reach the main topic which has to do with the variety of properties found in the resistance which solid bodies offer to fracture; and, therefore, if you please, let us return to the subject which we originally proposed to discuss.

    SALV. Very well; but the questions which we have already considered are so numerous and so varied, and have taken up so much time that there is not much of this day left to spend upon our main topic which abounds in geometrical demonstra-

    tions calling for careful consideration. May I, therefore, suggest that we postpone the meeting until to-morrow, not only for the reason just mentioned but also in order that I may bring with

    me

    92 THE TWO NEW SCIENCES OF GALILEO me some papers in which I have set down in an orderly way the theorems and propositions dealing with the various phases of this subject, matters which, from memory alone, I could not present in the proper order.

    SAGR. I fully concur in your opinion and all the more will- ingly because this will leave time to-day to take up some of

    my difficulties with the subject which we have just been dis- cussing. One question is whether we are to consider the re-

    sistance of the medium as sufficient to destroy the acceleration of a body of very heavy material, very large volume, and

    . spherical figure. I say spherical in order to select a volume which is contained within a minimum surface and therefore less sub-

    ject to retardation. Another question deals with the vibrations of pendulums

    which may be regarded from several viewpoints; the first is whether all vibrations, large, medium, and small, are performed in exactly and precisely equal times : another is to find the ratio of the times of vibration of pendulums supported by threads of unequal length.

    SALV. These are interesting questions : but I fear that here, as in the case of all other facts, if we take up for discussion any one of them, it will carry in its wake so many other facts and curious consequences that time will not remain to-day for the discussion of all.

    SAGR. If these are as full of interest as the foregoing, I would gladly spend as many days as there remain hours between now and nightfall; and I dare say that Simplicio would not be wearied by these discussions.

    SIMP. Certainly not; especially when the questions pertain to natural science and have not been treated by other philos- ophers.

    SALV. Now taking up the first question, I can assert without hesitation that there is no sphere so large, or composed of material so dense but that the resistance of the medium, al-

    though very slight, would check its acceleration and would, in time reduce its motion to uniformity; a statement which is strongly

    FIRST DAY 93 strongly supported by experiment. For if a falling body, as time goes on, were to acquire a speed as great as you please, no such speed, impressed by external forces [motore esterno], can be so great but that the body will first acquire it and then, owing to the resisting medium, lose it. Thus, for instance, if a cannon ball, having fallen a distance of four cubits through the air and having acquired a speed of, say, ten units [gradi\ were to strike the surface of the water, and if the resistance of the water were not able to check the momentum [impeto] of the shot, it would either increase in speed or maintain a uniform motion until the bottom were reached: but such is not the observed fact; on the contrary, the water when only a few cubits deep hinders and diminishes the motion in such a way that the shot delivers to the bed of the river or lake a very slight impulse. Clearly [137]

    then if a short fall through the water is sufficient to deprive a cannon ball of its speed, this speed cannot be regained by a fall of even a thousand cubits. How could a body acquire, in a fall of a thousand cubits, that which it loses in a fall of four? But what more is needed? Do we not observe that the enormous momen-

    tum, delivered to a shot by a cannon, is so deadened by passing through a few cubits of water that the ball, so far from injuring the ship, barely strikes it? Even the air, although a very yield-

    ing medium, can also diminish the speed of a falling body, as may be easily understood from similar experiments. For if a gun be fired downwards from the top of a very high tower the shot will make a smaller impression upon the ground than if the gun had been fired from an elevation of only four or six cubits; this is clear evidence that the momentum of the ball, fired from the top of the ̂ ower, diminishes continually from the instant it leaves the barrel until it reaches the ground. Therefore a fall from ever so great an altitude will not suffice to give to a body that momentum which it has once lost through the resistance of the air, no matter how it was originally acquired. In like manner, the destructive effect produced upon a wall by a shot fired from a gun at a distance of twenty cubits cannot be duplicated by the fall of the same shot from any altitude how-

    ever

    94 THE TWO NEW SCIENCES OF GALILEO

    ever great. My opinion is, therefore, that under the circum- stances which occur in nature, the acceleration of any body fall-

    ing from rest reaches an end and that the resistance of the medium finally reduces its speed to a constant value which is thereafter maintained.

    SAGR. These experiments are in my opinion much to the purpose; the only question is whether an opponent might not make bold to deny the fact in the case of bodies \moli] which are very large and heavy or to assert that a cannon ball, falling from the distance of the moon or from the upper regions of the atmos-

    phere, would deliver a heavier blow than if just leaving the muzzle of the gun.

    SALV. No doubt many objections may be raised not all of which can be refuted by experiment: however in this particular

    [138]

    case the following consideration must be taken into account, namely, that it is very likely that a heavy body falling from a height will, on reaching the ground, have acquired just as much momentum as was necessary to carry it to that height; as may be clearly seen in the case of a rather heavy pendulum which, when pulled aside fifty or sixty degrees from the vertical, will acquire precisely that speed and force which are sufficient to carry it to an equal elevation save only that small portion which it loses through friction on the air. In order to place a cannon ball at such a height as might suffice to give it just that momen-

    tum which the powder imparted to it on leaving the gun we need only fire it vertically upwards from the same gun; and we can then observe whether on falling back it delivers a blow equal to that of the gun fired at close range; in my opinion it would be much weaker. The resistance of the air would, therefore, I think, prevent the muzzle velocity from being equalled by a natural fall from rest at any height whatsoever. We come now to the other questions, relating to pendulums,

    a subject which may appear to many exceedingly arid, es- pecially to those philosophers who are continually occupied

    with the more profound questions of nature. Nevertheless, the problem is one which I do not scorn. I am encouraged by the

    example

    FIRST DAY 95 example of Aristotle whom I admire especially because he did not fail to discuss every subject which he thought in any degree worthy of consideration.

    Impelled by your queries I may give you some of my ideas concerning certain problems in music, a splendid subject, upon which so many eminent men have written: among these is Aristotle himself who has discussed numerous interesting acous-

    tical questions. Accordingly, if on the basis of some easy and tangible experiments, I shall explain some striking phenomena in the domain of sound, I trust my explanations will meet your approval.

    SAGR. I shall receive them not only gratefully but eagerly. For, although I take pleasure in every kind of musical instru- [139]

    ment and have paid considerable attention to harmony, I have never been able to fully understand why some combinations of tones are more pleasing than others, or why certain combina-

    tions not only fail to please but are even highly offensive. Then there is the old problem of two stretched strings in unison; when one of them is sounded, the other begins to vibrate and to emit its note; nor do I understand the different ratios of harmony [forme delle consonanze] and some other details.

    SALV. Let us see whether we cannot derive from the pendulum a satisfactory solution of all these difficulties. And first, as to the question whether one and the same pendulum really per-

    forms its vibrations, large, medium, and small, all in exactly the same time, I shall rely upon what I have already heard from our Academician. He has clearly shown that the time of descent is the same along all chords, whatever the arcs which

    subtend them, as well along an arc of 1 80° (i. e., the whole diameter) as along one of 100°, 60°, 10°, 2°, ̂°> or 4′. It is understood, of course, that these arcs all terminate at the lowest point of the circle, where it touches the horizontal plane.

    If now we consider descent along arcs instead of their chords

    then, provided these do not exceed 90°, experiment shows that they are all traversed in equal times; but these times are greater for the chord than for the arc, an effect which is all the more

    remarkable

    96 THE TWO NEW SCIENCES OF GALILEO remarkable because at first glance one would think just the opposite to be true. For since the terminal points of the two motions are the same and since the straight line included be-

    tween these two points is the shortest distance between them, it would seem reasonable that motion along this line should be executed in the shortest time; but this is not the case, for the

    shortest time — and therefore the most rapid motion — is that employed along the arc of which this straight line is the chord.

    As to the times of vibration of bodies suspended by threads of different lengths, they bear to each other the same proportion as the square roots of the lengths of the thread; or one might say the lengths are to each other as the squares of the times; so that if one wishes to make the vibration-time of one pendulum twice that of another, he must make its suspension four times as long. In like manner, if one pendulum has a suspension nine times as

    [140] long as another, this second pendulum will execute three vibra-

    tions during each one of the first; from which it follows that the lengths of the suspending cords bear to each other the [inverse] ratio of the squares of the number of vibrations performed in the same time.

    SAGR. Then, if I understand you corredlly, I can easily meas- ure the length of a string whose upper end is attached at any

    height whatever even if this end were invisible and I could see only the lower extremity. For if I attach to the lower end of this string a rather heavy weight and give it a to-and-fro motion, and if I ask a friend to count a number of its vibrations, while I, during the same time-interval, count the number of vibrations of a pendulum which is exactly one cubit in length, then knowing the number of vibrations which each pendulum makes in the given interval of time one can determine the length of the string. Suppose, for example, that my friend counts 20 vibra-

    tions of the long cord during the same time in which I count 240 of my string which is one cubit in length; taking the squares of the two numbers, 20 and 240, namely 400 and 57600, then, I say, the long string contains 57600 units of such length that my pendulum will contain 400 of them; and since the length of

    my

    FIRST DAY 97

    my string is one cubit, I shall divide 57600 by 400 and thus ob- tain 144. Accordingly I shall call the length of the string 144

    cubits.

    SALV. Nor will you miss it by as much as a hand’s breadth, especially if you observe a large number of vibrations.

    SAGR. You give me frequent occasion to admire the wealth and profusion of nature when, from such common and even trivial phenomena, you derive facts which are not only striking and new but which are often far removed from what we would

    have imagined. Thousands of times I have observed vibrations especially in churches where lamps, suspended by long cords, had been inadvertently set into motion; but the most which I could infer from these observations was that the view of those

    who think that such vibrations are maintained by the medium is highly improbable: for, in that case, the air must needs have considerable judgment and little else to do but kill time by push-

    ing to and fro a pendent weight with perfect regularity. But I never dreamed of learning that one and the same body, when

    [Hi] suspended from a string a hundred cubits long and pulled aside

    through an arc of 90° or even i° or ̂ 2°, would employ the same time in passing through the least as through the largest of these arcs; and, indeed, it still strikes me as somewhat unlikely. Mow I am waiting to hear how these same simple phenomena can

    \irnish solutions for those acoustical problems — solutions which will be at least partly satisfactory.

    SALV. First of all one must observe that each pendulum has ts own time of vibration so definite and determinate that

    t is not possible to make it move with any other period [altro periodo] than that which nature has given it. For let any one :ake in his hand the cord to which the weight is attached and :ry, as much as he pleases, to increase or diminish the frequency frequenza] of its vibrations; it will be time wasted. On the other hand, one can confer motion upon even a heavy pendulum which is at rest by simply blowing against it; by repeating these Dlasts with a frequency which is the same as that of the pendu-

    lum one can impart considerable motion. Suppose that by the first

    98 THE TWO NEW SCIENCES OF GALILEO first puff we have displaced the pendulum from the vertical b; say, half an inch; then if, after the pendulum has returned and about to begin the second vibration, we add a second puff, w shall impart additional motion; and so on with other bias provided they are applied at the right instant, and not when tl: pendulum is coming toward us since in this case the blast woul impede rather than aid the motion. Continuing thus wit many impulses [impulsi\ we impart to the pendulum sue momentum [impeto] that a greater impulse [forza] than that of single blast will be needed to stop it.

    SAGR. Even as a boy, I observed that one man alone h giving these impulses at the right instant was able to ring bell so large that when four, or even six, men seized the rof and tried to stop it they were lifted from the ground, all < them together being unable to counterbalance the momentu] which a single man, by properly-timed pulls, had given it.

    SALV. Your illustration makes my meaning clear and is quil as well fitted, as what I have just said, to explain the wonderfi phenomenon of the strings of the cittern [cetera] or of the spin<

    [142] [cimbalo], namely, the fact that a vibrating string will s< another string in motion and cause it to sound not only when tl: latter is in unison but even when it differs from the former by a octave or a fifth. A string which has been struck begins 1 vibrate and continues the motion as long as one hears tit sound [risonanza]; these vibrations cause the immediately su: rounding air to vibrate and quiver; then these ripples in the a expand far into space and strike not only all the strings of tli same instrument but even those of neighboring instrument; Since that string which is tuned to unison with the one plucke is capable of vibrating with the same frequency, it acquire: at the first impulse, a slight oscillation; after receiving tw< three, twenty, or more impulses, delivered at proper interval; it finally accumulates a vibratory motion equal to that of th plucked string, as is clearly shown by equality of amplitude i their vibrations. This undulation expands through the air an sets into vibration not only strings, but also any other bod

    whic

    FIRST DAY 99 which happens to have the same period as that of the plucked string. Accordingly if we attach to the side of an instrument small pieces of bristle or other flexible bodies, we shall observe that, when a spinet is sounded, only those pieces respond that have the same period as the string which has been struck; the remaining pieces do not vibrate in response to this string, nor do the former pieces respond to any other tone.

    If one bows the base string on a viola rather smartly and brings near it a goblet of fine, thin glass having the same tone [tuono] as that of the string, this goblet will vibrate and audibly resound. That the undulations of the medium are widely dispersed about the sounding body is evinced by the fact that a glass of water may be made to emit a tone merely by the friction of the finger-tip upon the rim of the glass; for in this water is produced a series of regular waves. The same phenomenon is observed to better advantage by fixing the base of the goblet upon the bottom of a rather large vessel of water filled nearly to the edge of the goblet; for if, as before, we sound the glass by friction of the finger, we shall see ripples spreading with the utmost regularity and with high speed to large distances about the glass. I have often remarked, in thus sounding a rather

    [143]

    large glass nearly full of water, that at first the waves are spaced with great uniformity, and when, as sometimes happens, the tone of the glass jumps an octave higher I have noted that at this moment each of the aforesaid waves divides into two; a phenomenon which shows clearly that the ratio involved in the octave [forma delV ottava] is two.

    SAGR. More than once have I observed this same thing, much to my delight and also to my profit. For a long time I have Deen perplexed about these different harmonies since the ex-

    planations hitherto given by those learned in music impress me as not sufficiently conclusive. They tell us that the diapa-

    son, i. e. the octave, involves the ratio of two, that the diapente which we call the fifth involves a ratio of 3 :2, etc. ; because if the open string of a monochord be sounded and afterwards a bridge be placed in the middle and the half length be sounded

    one

    ioo THE TWO NEW SCIENCES OF GALILEO

    one hears theocftave; and if the bridge be placed at 1/3 the lengt of the string, then on plucking first the open string and afte wards 2/3 of its length the fifth is given ; for this reason they sa that the octave depends upon the ratio of two to one [coi

    tenuta tra’l due e Vuno\ and the fifth upon the ratio of three t two. This explanation does not impress me as sufficient t establish 2 and 3/2 as the natural ratios of theocftave and thefiftl and my reason for thinking so is as follows. There are thrc different ways in which the tone of a string may be sharpenec namely, by shortening it, by stretching it and by making thinner. If the tension and size of the string remain constar one obtains the odlave by shortening it to one-half, i. e., by sounc ing first the open string and then one-half of it; but if length an size remain constant and one attempts to produce the octave b stretching he will find that it does not suffice to double th stretching weight; it must be quadrupled; so that, if the fundc mental note is produced by a weight of one pound, four will b required to bring out the octave. And finally if the length and tension remain constant, whii

    one changes the size * of the string he will find that in order t produce the odlave the size must be reduced to X tnat wnic gave the fundamental. And what I have said concerning th octave, namely, that its ratio as derived from the tension an size of the string is the square of that derived from the lengtl applies equally well to all other musical intervals [interval

    musici\. Thus if one wishes to produce a fifth by changing th length he finds that the ratio of the lengths must be sesquialtera in other words he sounds first the open string, then two-third of it; but if he wishes to produce this same result by stretching c thinning the string then it becomes necessary to square th ratio 3/2 that is by taking 9/4 [dupla sesquiquarta]; according!} if the fundamental requires a weight of 4 pounds, the highe note will be produced not by 6, but by 9 pounds; the same i true in regard to size, the string which gives the fundamental i larger than that which yields the fifth in the ratio of 9 to 4

    In view of these facts, I see no reason why those wise philos

    * For the exact meaning of “size” see p. 103 below. [Trans.]

    FIRST DAY 101

    ophers should adopt 2 rather than 4 as the ratio of the octave, or why in the case of the fifth they should employ the sesquialt-

    eral ratio, 3/2, rather than that of 9/4. Since it is impossible to count the vibrations of a sounding string on account of its high frequency, I should still have been in doubt as to whether a string, emitting the upper octave, made twice as many vibra-

    tions in the same time as one giving the fundamental, had it not been for the following fact, namely, that at the instant when the tone jumps to the octave, the waves which constantly ac-

    company the vibrating glass divide up into smaller ones which are precisely half as long as the former.

    SALV. This is a beautiful experiment enabling us to distin- guish individually the waves which are produced by the vibra-

    tions of a sonorous body, which spread through the air, bringing to the tympanum of the ear a stimulus which the mind translates into sound. But since these waves in the water last only so long as the friction of the finger continues and are, even then, not constant but are always forming and disappearing, would it not be a fine thing if one had the ability to produce waves which would persist for a long while, even months and years, so as to easily measure and count them?

    SAGR. Such an invention would, I assure you, command my admiration.

    SALV. The device is one which I hit upon by accident; my part consists merely in the observation of it and in the appreciation of its value as a confirmation of something to which I had given profound consideration; and yet the device is, in itself, rather common. As I was scraping a brass plate with a sharp iron [145]

    chisel in order to remove some spots from it and was running the chisel rather rapidly over it, I once or twice, during many strokes, heard the plate emit a rather strong and clear whistling sound; on looking at the plate more carefully, I noticed a long row of fine streaks parallel and equidistant from one another. Scraping with the chisel over and over again, I noticed that it was only when the plate emitted this hissing noise that any marks were left upon it; when the scraping was not accompanied

    by

    102 THE TWO NEW SCIENCES OF GALILEO

    by this sibilant note there was not the least trace of such marks. Repeating the trick several times and making the stroke, now with greater now with less speed, the whistling followed with a pitch which was correspondingly higher and lower. I noted also that the marks made when the tones were higher were closer together; but when the tones were deeper, they were farther apart. I also observed that when, during a single stroke, the speed increased toward the end the sound became sharper and the streaks grew closer together, but always in such a way as to remain sharply defined and equidistant. Besides whenever the stroke was accompanied by hissing I felt the chisel tremble in my grasp and a sort of shiver run through my hand. In short we see and hear in the case of the chisel precisely that which is seen and heard in the case of a whisper followed by a loud voice; for, when the breath is emitted without the production of a tone, one does not feel either in the throat or mouth any motion to speak of in comparison with that which is felt in the larynx and upper part of the throat when the voice is used, especially when the tones employed are low and strong.

    At times I have also observed among the strings of the spinet two which were in unison with two of the tones produced by the aforesaid scraping; and among those which differed most in pitch I found two which were separated by an interval of a perfect fifth. Upon measuring the distance between the mark-

    ings produced by the two scrapings it was found that the space which contained 45 of one contained 30 of the other, which is precisely the ratio assigned to the fifth.

    But now before proceeding any farther I want to call your attention to the fact that, of the three methods for sharpening a tone, the one which you refer to as the fineness of the string should be attributed to its weight. So long as the material of

    [146] the string is unchanged, the size and weight vary in the same ratio. Thus in the case of gut-strings, we obtain the octave by making one string 4 times as large as the other; so also in the case of brass one wire must have 4 times the size of the other; but if now we wish to obtain the octave of a gut-string, by use of

    brass

    FIRST DAY 103 brass wire, we must make it, not four times as large, but four times as heavy as the gut-string: as regards size therefore the metal string is not four times as big but four times as heavy. The wire may therefore be even thinner than the gut notwith-

    standing the fact that the latter gives the higher note. Hence if two spinets are strung, one with gold wire the other with brass, and if the corresponding strings each have the same length, diameter, and tension it follows that the instrument strung with gold will have a pitch about one-fifth lower than the other be-

    cause gold has a density almost twice that of brass. And here it is to be noted that it is the weight rather than the size of a moving body which offers resistance to change of motion [velocita del moto] contrary to what one might at first glance think. For it seems reasonable to believe that a body which is large and light should suffer greater retardation of motion in thrusting aside the medium than would one which is thin and heavy; yet here exactly the opposite is true.

    Returning now to the original subject of discussion, I assert that the ratio of a musical interval is not immediately deter-

    mined either by the length, size, or tension of the strings but rather by the ratio of their frequencies, that is, by the number of pulses of air waves which strike the tympanum of the ear, causing it also to vibrate with the same frequency. This fact established, we may possibly explain why certain pairs of notes, differing in pitch produce a pleasing sensation, others a less pleasant effect, and still others a disagreeable sensation. Such an explanation would be tantamount to an explanation of the more or less perfect consonances and of dissonances. The un-

    pleasant sensation produced by the latter arises, I think, from the discordant vibrations of two different tones which strike the

    ear out of time [sproporzionatamente]. Especially harsh is the dissonance between notes whose frequencies are incommen-

    surable; such a case occurs when one has two strings in unison and sounds one of them open, together with a part of the other [147]

    which bears the same ratio to its wnole length as the side of a square bears to the diagonal; this yields a dissonance similar

    to

    104 THE TWO NEW SCIENCES OF GALILEO

    to the augmented fourth or diminished fifth [tritono o semi- diapente].

    Agreeable consonances are pairs of tones which strike the ear with a certain regularity; this regularity consists in the fact that the pulses delivered by the two tones, in the same interval of time, shall be commensurable in number, so as not to keep the ear drum in perpetual torment, bending in two different directions in order to yield to the ever-discordant impulses. The first and most pleasing consonance is, therefore, the

    odtave since, for every pulse given to the tympanum by the lower string, the sharp string delivers two; accordingly at every other vibration of the upper string both pulses are delivered simultaneously so that one-half the entire number of pulses are delivered in unison. But when two strings are in unison their vibrations always coincide and the effect is that of a single string; hence we do not refer to it as consonance. The fifth is also a pleasing interval since for every two vibrations of the lower string the upper one gives three, so that considering the entire number of pulses from the upper string one-third of them will strike in unison, i. e., between each pair of concordant vibra-

    tions there intervene two single vibrations; and when the in- terval is a fourth, three single vibrations intervene. In case the

    interval is a second where the ratio is 9/8 it is only every ninth vibration of the upper string which reaches the ear simulta-

    neously with one of the lower; all the others are discordant and produce a harsh effect upon the recipient ear which interprets them as dissonances.

    SIMP. Won’t you be good enough to explain this argument a little more clearly? SALV. Let AB denote the length of a wave [lo spazio e la

    dilatazione d’una vibrazione] emitted by the lower string and CD that of a higher string which is emitting the odlave of AB; divide AB in the middle at E. If the two strings begin their motions at A and C, it is clear that when the sharp vibration has reached the end D, the other vibration will have travelled only as far as E, which, not being a terminal point, will emit no pulse; but there is a blow delivered at D. Accordingly when the one

    wave

    FIRST DAY 105

    wave comes back from D to C, the other passes on from E to B ; hence the two pulses from B and C strike the drum of the ear simultaneously. Seeing that these vibrations are repeated

    again and again in the same manner, we con- jv, E B elude that each alternate pulse from CD falls ‘ ‘ « in unison with one from AB. But each of the , – L

    [I48] C D

    pulsations at the terminal points, A and B, is A £ 0 B constantly accompanied by one which leaves al- * ‘ ‘ ‘ ways from C or always from D. This is clear t i i

    because if we suppose the waves to reach A and c m D C at the same instant, then, while one wave Fig. 13 travels from A to B, the other will proceed from C to D and back to C, so that waves strike at C and B simultaneously; during the passage of the wave from B back to A the disturbance at C goes to D and again returns to C, so that once more the pulses at A and C are simultaneous.

    Next let the vibrations AB and CD be separated by an in- terval of a fifth, that is, by a ratio of 3/2; choose the points E and

    O such that they will divide the wave length of the lower string into three equal parts and imagine the vibrations to start at the same instant from each of the terminals A and C. It is evident that when the pulse has been delivered at the terminal D, the wave in AB has travelled only as far as O; the drum of the ear receives, therefore, only the pulse from D. Then during the return of the one vibration from D to C, the other will pass from O to B and then back to O, producing an isolated pulse at B— a pulse which is out of time but one which must be taken into consideration.

    Now since we have assumed that the first pulsations started from the terminals A and C at the same instant, it follows that the second pulsation, isolated at D, occurred after an interval of time equal to that required for passage from C to D or, what is the same thing, from A to O; but the next pulsation, the one at B, is separated from the preceding by only half this interval, namely, the time required for passage from O to B. Next while the one vibration travels from O to A, the other travels from C to

    D,

    106 THE TWO NEW SCIENCES OF GALILEO

    D, the result of which is that two pulsations occur simulta- neously at A and D. Cycles of this kind follow one after an-

    other, i. e., one solitary pulse of the lower string interposed be- tween two solitary pulses of the upper string. Let us no\\

    imagine time to be divided into very small equal intervals then if we assume that, during the first two of these intervals, the disturbances which occurred simultaneously at A and C have travelled as far as O and D and have produced a pulse at D; anc if we assume that during the third and fourth intervals one disturbance returns from D to C, producing a pulse at C, while the other, passing on from O to B and back to O, produces i pulse at B ; and if finally, during the fifth and sixth intervals, the disturbances travel from O and C to A and D, producing c pulse at each of the latter two, then the sequence in which the pulses strike the ear will be such that, if we begin to count time from any instant where two pulses are simultaneous, the ea] drum will, after the lapse of two of the said intervals, receive c solitary pulse; at the end of the third interval, another solitary

    pulse; so also at the end of the fourth interval; and two in- tervals later, i. e., at the end of the sixth interval, will be hearc

    two pulses in unison. Here ends the cycle — the anomaly, so tc speak — which repeats itself over and over again.

    SAGR. I can no longer remain silent; for I must express to yoi; the great pleasure I have in hearing such a complete explanatior of phenomena with regard to which I have so long been ir darkness. Now I understand why unison does not differ from z single tone; I understand why the ocftave is the principal har-

    mony, but so like unison as often to be mistaken for it and alsc why it occurs with the other harmonies. It resembles unisor

    because the pulsations of strings in unison always occur simulta- neously, and those of the lower string of the octave are always

    accompanied by those of the upper string; and among the lattei is interposed a solitary pulse at equal intervals and in such a manner as to produce no disturbance; the result is that such a harmony is rather too much softened and lacks fire. But the

    fifth is characterized by its displaced beats and by the interposi-

    tion

    FIRST DAY 107

    tion of two solitary beats of the upper string and one solitary beat of the lower string between each pair of simultaneous pulses; these three solitary pulses are separated by intervals of time equal to half the interval which separates each pair of simultaneous beats from the solitary beats of the upper string. Thus the effect of the fifth is to produce a tickling of the ear drum such that its softness is modified with sprightliness, giving at the same moment the impression of a gentle kiss and of a bite.

    SALV. Seeing that you have derived so much pleasure from these novelties, I must show you a method by which the eye may enjoy the same game as the ear. Suspend three balls of lead, or other heavy material, by means of strings of different length such that while the longest makes two vibrations the shortest will make four and the medium three; this will take place when the longest string measures 16, either in hand breadths or in any other unit, the medium 9 and the shortest 4, all measured in the same unit.

    Now pull all these pendulums aside from the perpendicular and release them at the same instant; you will see a curious interplay of the threads passing each other in various manners but such that at the completion of every fourth vibration of the longest pendulum, all three will arrive simultaneously at the same terminus, whence they start over again to repeat the same cycle. This combination of vibrations, when produced on strings is precisely that which yields the interval of the octave and the intermediate fifth. If we employ the same disposition

    [150] of apparatus but change the lengths of the threads, always however in such a way that their vibrations correspond to those of agreeable musical intervals, we shall see a different crossing of these threads but always such that, after a definite interval of time and after a definite number of vibrations, all the threads, whether three or four, will reach the same terminus at the same instant, and then begin a repetition of the cycle.

    If however the vibrations of two or more strings are incom- mensurable so that they never complete a definite number of

    vibrations at the same instant, or if commensurable they return

    only

    io8 THE TWO NEW SCIENCES OF GALILEO

    only after a long interval of time and after a large number oi vibrations, then the eye is confused by the disorderly succession of crossed threads. In like manner the ear is pained by an irregular sequence of air waves which strike the tympanum with-

    out any fixed order. But, gentlemen, whither have we drifted during these many

    hours lured on by various problems and unexpected digressions ? The day is already ended and we have scarcely touched the subject proposed for discussion. Indeed we have deviated so far that I remember only with difficulty our early introduction and the little progress made in the way of hypotheses and principles for use in later demonstrations.

    SAGR. Let us then adjourn for to-day in order that our minds may find refreshment in sleep and that we may return to-

    morrow, if so please you, and resume the discussion of the main

    question. SALV. I shall not fail to be here to-morrow at the same hour,

    hoping not only to render you service but also to enjoy your company.

    END OF THE FIRST DAY.

    SECOND DAY

    GR. While Simplicio and I were awaiting

    your arrival we were trying to recall that last consideration which you advanced as a prin-

    ciple and basis for the results you intended to obtain; this consideration dealt with the resistance which all solids offer to fracflure

    and depended upon a certain cement which held the parts glued together so that they

    would yield and separate only under considerable pull [potente attrazzione]. Later we tried to find the explanation of this coherence, seeking it mainly in the vacuum; this was the occa-

    sion of our many digressions which occupied the entire day and led us far afield from the original question which, as I have already stated, was the consideration of the resistance [resistenza] that solids offer to fracfture.

    SALV. I remember it all very well. Resuming the thread of our discourse, whatever the nature of this resistance which solids offer to large tractive forces [violenta attrazzione] there can at least be no doubt of its existence; and though this resistance is very great in the case of a direct pull, it is found, as a rule, to be less in the case of bending forces [net violentargli per traverso]. Thus, for example, a rod of steel or of glass will sustain a longi-

    tudinal pull of a thousand pounds while a weight of fifty pounds would be quite sufficient to break it if the rod were fastened at right angles into a vertical wall. It is this second type of re-

    sistance which we must consider, seeking to discover in what [152]

    proportion

    i io THE TWO NEW SCIENCES OF GALILEO

    proportion it is found in prisms and cylinders of the same material, whether alike or unlike in shape, length, and thick-

    ness. In this discussion I shall take for granted the well-known mechanical principle which has been shown to govern the behavior of a bar, which we call a lever, namely, that the force bears to the resistance the inverse ratio of the distances which separate the fulcrum from the force and resistance respectively.

    SIMP. This was demonstrated first of all by Aristotle, in his Mechanics.

    SALV. Yes, I am willing to concede him priority in point of time; but as regards rigor of demonstration the first place must be given to Archimedes, since upon a single proposition proved

    in his book on Equilibrium * depends not only the law of the lever but also those of most other mechanical devices.

    SAGR. Since now this principle is fundamental to all the demonstrations which you propose to set forth would it not be advisable to give us a complete and thorough proof of this proposition unless possibly it would take too much time?

    SALV. Yes, that would be quite proper, but it is better I think to approach our subject in a manner somewhat different from that employed by Archimedes, namely, by first assuming merely that equal weights placed in a balance of equal arms will

    produce equilibrium — a principle also assumed by Archimedes — and then proving that it is no less true that unequal weights produce equilibrium when the arms of the steelyard have lengths inversely proportional to the weights suspended from them; in other words, it amounts to the same thing whether one places equal weights at equal distances or unequal weights at distances which bear to each other the inverse ratio of the weights.

    In order to make this matter clear imagine a prism or solid cylinder, AB, suspended at each end to the rod [linea] HI, and supported by two threads HA and IB; it is evident that if I attach a thread, C, at the middle point of the balance beam HI, the entire prism AB will, according to the principle assumed, hang in equilibrium since one-half its weight lies on one side, and the other half on the other side, of the point of suspension C. Now

    * Works of Archimedes. Trans, by T. L. Heath, pp. 189-220. [Trans]

    SECOND DAY in

    suppose the prism to be divided into unequal parts by a plane [153]

    through the line D, and let the part DA be the larger and DB the smaller: this division having been made, imagine a thread ED, attached at the point E and supporting the parts AD and DB, in order that these parts may remain in the same position relative to line HI: and since the relative position of the prism and the beam HI remains unchanged, there can be no doubt but that the prism will maintain its former state of equilibrium.

    H

    Fig. 14

    But circumstances would remain the same if that part of the prism which is now held up, at the ends, by the threads AH and D.E were supported at the middle by a single thread GL; and likewise the other part DB would not change position if held by a thread FM placed at its middle point. Suppose now the threads HA, ED, and IB to be removed, leaving only the two GL and FM, then the same equilibrium will be maintained so long as the suspension is at C. Now let us consider that we have here two heavy bodies AD and DB hung at the ends G and F, of a balance beam GF in equilibrium about the point C, so that the line CG is the distance from C to the point of suspension of the heavy body AD, while CF is the distance at which the other heavy body, DB, is supported. It remains now only to show that these distances bear to each other the inverse ratio of the weights themselves, that is, the distance GC is to the

    distance CF as the prism DB is to the prism DA — a proposition which we shall prove as follows: Since the line GE is the half of EH, and since EF is the half of El, the whole length GF will be

    half

    ii2 THE TWO NEW SCIENCES OF GALILEO

    half of the entire line HI, and therefore equal to CI : if now we subtract the common part CF the remainder GC will be equal to the remainder FI, that is, to FE, and if to each of these we add CE we shall have GE equal to CF: hence GE:EF=FC:CG. But GE and EF bear the same ratio to each other as do their doubles HE and El, that is, the same ratio as the prism AD to DB. Therefore, by equating ratios we have, convertendo, the distance GC is to the distance CF as the weight BD is to the weight DA, which is what I desired to prove. [154]

    If what precedes is clear, you will not hesitate, I think, to admit that the two prisms AD and DB are in equilibrium about the point C since one-half of the whole body AB lies on the right of the suspension C and the other half on the left; in other words, this arrangement is equivalent to two equal weights dis-

    posed at equal distances. I do not see how any one can doubt, if the two prisms AD and DB were transformed into cubes, spheres, or into any other figure whatever and if G and F were retained as points of suspension, that they would remain in equilibrium about the point C, for it is only too evident that change of figure does not produce change of weight so long as the mass [quantita di materia] does not vary. From this we may derive the general conclusion that any two heavy bodies are in equilibrium at distances which are inversely proportional to their weights.

    This principle established, I desire, before passing to any other subject, to call your attention to the fact that these forces, resistances, moments, figures, etc., may be considered either in the abstract, dissociated from matter, or in the concrete, asso-

    ciated with matter. Hence the properties which belong to figures that are merely geometrical and non-material must be modified when we fill these figures with matter and therefore give them weight. Take, for example, the lever BA which, resting upon the support E, is used to lift a heavy stone D. The principle just demonstrated makes it clear that a force ap-

    plied at the extremity B will just suffice to equilibrate the resistance offered by the heavy body D provided this force [momenta] bears to the force [momenta] at D the same ratio as the

    distance

    SECOND DAY 113

    distance AC bears to the distance CB ; and this is true so long as we consider only the moments of the single force at B and of the resistance at D, treating the lever as an immaterial body devoid of weight. But if we take into account the weight of the lever itself — an instrument which may be made either of wood or of iron — it is manifest that, when this weight has been added to the

    force at B, the ratio will be changed and must therefore be expressed in different terms. Hence before going further let

    B

    Fig. 15

    us agree to distinguish between these two points of view; when we consider an instrument in the abstract, i. e., apart from the

    weight of its own material, we shall speak of “taking it in an absolute sense ” [prendere assolutamente] ; but if we fill one of these simple and absolute figures with matter and thus give it weight,

    we shall refer to such a material figure as a “moment” or “compound force” [momenta oforza composta].

    SAGR. I must break my resolution about not leading you off into a digression; for I cannot concentrate my attention upon what is to follow until a certain doubt is removed from my mind, namely, you seem to compare the force at B with the

    total weight of the stone D, a part of which — possibly the greater part — rests upon the horizontal plane: so that . . .

    SALV. I understand perfectly: you need go no further. How- ever please observe that I have not mentioned the total weight

    of the stone; I spoke only of its force [momenta] at the point A, the extremity of the lever BA, which force is always less than the total weight of the stone, and varies with its shape and elevation.

    SAGR. Good : but there occurs to me another question about which

    ii4 THE TWO NEW SCIENCES OF GALILEO which I am curious. For a complete understanding of this matter, I should like you to show me, if possible, how one can determine what part of the total weight is supported by the underlying plane and what part by the end A of the lever.

    SALV. The explanation will not delay us long and I shall therefore have pleasure in granting your request. In the accom-

    panying figure, let us understand that the weight having its center of gravity at A rests with the end B upon the horizontal plane and with the other end upon the lever CG. Let N be the fulcrum of a lever to which the force [potenza] is applied at G. Let fall the perpendiculars, AO and CF, from the center A and the end C. Then I say, the magnitude [momenta] of the entire weight bears to the magnitude of the force [momenta delta potenza] at G a ratio compounded of the ratio between the two

    Fig. 16 distances GN and NC and the ratio between FB and BO. Lay off a distance X such that its ratio to NC is the same as that of BO to FB; then, since the total weight A is counterbalanced by the two forces at B and at C, it follows that the force at B is to that at C as the distance FO is to the distance OB. Hence,

    [156] componendo, the sum of the forces at B and C, that is, the total weight A [momento di tutto V peso A\ is to the force at C as the line FB is to the line BO, that is, as NC is to X: but the force [momento delta potenza} applied at C is to the force applied at G as the distance GN is to the distance NC; hence it follows,

    ex aquali in propprtione perturbata* that the entire weight A is to the force applied at G as the distance GN is to X. But the ratio of GN to X is compounded of the ratio of GN to NC and of NC to X, that is, of FB to BO; hence the weight A bears to the

    * For definition of perturbata see Todhunter’s Euclid. Book V, Def . 20.

    [Trans.]

    SECOND DAY 115 equilibrating force at G a ratio compounded of that of GN to NC and of FB to BO: which was to be proved.

    Let us now return to our original subject; then, if what has hitherto been said is clear, it will be easily understood that,

    PROPOSITION I

    A prism or solid cylinder of glass, steel, wood or other break- able material which is capable of sustaining a very heavy weight

    when applied longitudinally is, as previously remarked, easily broken by the transverse application of a weight which may be much smaller in proportion as the length of the cylinder exceeds its thickness.

    Let us imagine a solid prism ABCD fastened into a wall at the end AB, and supporting a weight E at the other end; under-

    stand also that the wall is vertical and that the prism or cylinder is fastened at right angles to the wall. It is clear that, if the cylinder breaks, fracture will occur at the point B where the edge of the mortise acts as a fulcrum for the lever BC, to which the force is applied; the thickness of the solid BA is the other arm of the lever along which is located the resistance. This resistance opposes the separation of the part BD, lying outside the wall, from that portion lying inside. From the preceding, it follows that the magnitude [momenta] of the force applied at C bears to the magnitude [momento] of the resistance, found in the thickness of the prism, i. e., in the attachment of the base BA to its con-

    tiguous parts, the same ratio which the length CB bears to half

    the length BA; if now we define absolute resistance to fracture [157]

    as that offered to a longitudinal pull (in which case the stretch-

    ing force acts in the same direction as that through which the body is moved), then it follows that the absolute resistance of the prism BD is to the breaking load placed at the end of the lever BC in the same ratio as the length BC is to the half of AB in the case of a prism, or the semidiameter in the case of a

    linden This is our first proposition.* Observe that in what

    * The one fundamental error which is implicitly introduced into this >roposition and which is carried through the entire discussion of the

    ii6 THE TWO NEW SCIENCES OF GALILEO

    has here been said the weight of the solid BD itself has been left out of consideration, or rather, the prism has been assumed to be devoid of weight. But if the weight of the prism is to be taken account of in conjunction with the weight E, we must add

    to the weight E one half

    that BD: of

    so the

    that

    prism

    if, for example, the latter weighs two

    pounds and the weight E is ten pounds w e must treat the weight E as if it were eleven

    pounds. SIMP. Why not

    twelve? SALV. The weight

    E, my dear Simp- licio, hanging at the extreme end C acts

    upon the lever BC with its full mo-

    ment of ten pounds : so also would the

    Fig. 17 solid BD if sus- pended at the same point exert its full moment of two pounds;

    but, as you know, this solid is uniformly distributed through-

    Second Day consists in a failure to see that, in such a beam, there must be equilibrium between the forces of tension and compression over any cross-section. The correct point of view seems first to have been found by E. Mariotte in 1680 and by A. Parent in 1713. Fortunately this error does not vitiate the conclusions of the subsequent propositions which deal only with proportions — not actual strength — of beams. Following K. Pearson (Todhunter’s History of Elasticity) one might say that Galileo’s mistake lay in supposing the fibres of the strained beam to be inextensible. Or, confessing the anachronism, one might say that the error consisted in taking the lowest fibre of the beam as the neutral axis.

    [Trans]

    SECOND DAY 117 out its entire length, BC, so that the parts which lie near the end B are less effective than those more remote.

    Accordingly if we strike a balance between the two, the weight of the entire prism may be considered as concentrated at its center of gravity which lies midway of the lever BC. But a weight hung at the extremity C exerts a moment twice as great as it would if suspended from the middle: therefore

    [158]

    if we consider the moments of both as located at the end C we

    must add to the weight E one-half that of the prism. SIMP. I understand perfectly; and moreover, if I mistake not,

    the force of the two weights BD and E, thus disposed, would exert the same moment as would the entire weight BD together with twice the weight E suspended at the middle of the lever BC.

    SALV. Precisely so, and a fact worth remembering. Now we can readily understand

    PROPOSITION II

    How and in what proportion a rod, or rather a prism, whose width is greater than its thickness offers more resistance to fracture when the

    loice is applied in the direction of its breadth than in the direction of its thickness.

    For the sake of clearness, take a ruler ad whose width is ac and

    whose thickness, Fig. 18 cb, is much less than its width. The question now is why will the ruler, if stood on edge, as in the first figure, withstand a great weight T, while, when laid flat, as in the second figure, it will not support the weight X which is less than T. The answer is evident when we remember that in the one case

    the

    ii8 THE TWO NEW SCIENCES OF GALILEO

    the fulcrum is at the line be, and in the other case at ca, while the distance at which the force is applied is the same in both cases, namely, the length bd: but in the first case the distance of the resistance from the fulcrum — half the line ca — is greater than in the other case where it is only half of be. Therefore the weight T is greater than X in the same ratio as half the width ca is greater than half the thickness be, since the former adls as a lever arm for ca, and the latter for cb, against the same resistance, namely, the strength of all the fibres in the cross-sedtion ab. We conclude, therefore, that any given ruler, or prism, whose width exceeds its thickness, will offer greater resistance to fradlure when standing on edge than when lying flat, and this in the ratio of the width to the thickness.

    PROPOSITION III

    Considering now the case of a prism or cylinder growing longer in a horizontal direction, we must find out in what ratio the moment of its own weight increases in comparison with its

    resistance to fradhzre. This moment I find increases in propor- [159]

    tion to the square of the length. In order to prove this let AD be a prism or cylinder lying horizontal with its end A firmly fixed in a wall. Let the length of the prism be increased by the addi-

    tion of the portion BE. It is clear that merely changing the length of the lever from AB to AC will, if we disregard its weight, increase the moment of the force [at the end] tending to produce fracfture at A in the ratio of CA to BA. But, besides this, the weight of the solid portion BE, added to the weight of the solid AB increases the moment of the total weight in the ratio of the weight of the prism AE to that of the prism AB, which is the same as the ratio of the length AC to AB.

    It follows, therefore, that, when the length and weight are simultaneously increased in any given proportion, the moment, which is the product of these two, is increased in a ratio which is the square of the preceding proportion. The conclusion is then that the bending moments due to the weight of prisms and cylinders which have the same thickness but different lengths,

    bear

    SECOND DAY 119 bear to each other a ratio which is the square of the ratio of their lengths, or, what is the same thing, the ratio of the squares of their lengths. We shall next show in what ratio the resistance to fradhire

    Fig. 19

    [bending strength], in prisms and cylinders, increases with in- [160]

    crease of thickness while the length remains unchanged. Here I say that

    PROPOSITION IV

    In prisms and cylinders of equal length, but of unequal thicknesses, the resistance to fradture increases in the same ratio as the cube of the diameter of the thickness, i. e., of the base.

    JLet A and B be two cylinders of equal lengths DG, FH; let their |bases be circular but unequal, having the diameters CD and EF.

    “hen I say that the resistance to fradlure offered by the cylinder B

    120 THE TWO NEW SCIENCES OF GALILEO

    B is to that offered by A as the cube of the diameter FE is to the cube of the diameter DC. For, if we consider the resistance to fracture by longitudinal pull as dependent upon the bases, i. e., upon the circles EF and DC, no one can doubt that the strength [resistenza] of the cylinder B is greater than that of A in the same proportion in which the area of the circle EF exceeds that of CD; because it is precisely in this ratio that the number of fibres binding the parts of the solid together in the one cylinder exceeds that in the other cylinder.

    But in the case of a force acting transversely it must be re- membered that we are employing two levers in which the forces

    are applied at distances DG, FH, and the fulcrums are located at the points D and F; but the resistances are applied at distances which are equal to the radii of the circles DC and EF, since the fibres distributed over

    F these entire cross-sections Fig. 20 act as if concentrated at the

    centers. Remembering this and remembering also that the arms, DG and FH, through which the forces G and H act are equal, we can understand that the resistance, located at the center of the base EF, acting against the force at H, is more effective [maggiore] than the resistance at the center of the base CD opposing the force G, in the ratio of the radius FE to the radius DC. Accordingly the resistance to fracture of-

    fered by the cylinder B is greater than that of the cylinder A in a ratio which is compounded of that of the area of the circles EF and DC and that of their radii, i. e., of their diameters; but the areas of circles are as the squares of their diameters. There-

    fore the ratio of the resistances, being the product of the two preceding ratios, is the same as that of the cubes of the diameters. This is what I set out to prove. Also since the volume of a cube

    [161]

    varies as the third power of its edge we may say that the re-

    sistance

    SECOND DAY 121

    sistance [strength] of a cylinder whose length remains constant varies as the third power of its diameter.

    From the preceding we are able to conclude that

    COROLLARY

    The resistance [strength] of a prism or cylinder of constant length varies in the sesquialteral ratio of its volume.

    This is evident because the volume of a prism or cylinder of constant altitude varies direclly as the area of its base, i. e., as the square of a side or diameter of this base; but, as just demon-

    strated, the resistance [strength] varies as the cube of this same side or diameter. Hence the resistance varies in the sesquialteral

    ratio of the volume — consequently also of the weight — of the solid itself.

    SIMP. Before proceeding further I should like to have one of my difficulties removed. Up to this point you have not taken into consideration a certain other kind of resistance which, it appears to me, diminishes as the solid grows longer, and this is quite as true in the case of bending as in pulling; it is precisely thus that in the case of a rope we observe that a very long one is less able to support a large weight than a short one. Whence, I believe, a short rod of wood or iron will support a greater weight than if it were long, provided the force be always applied longi-

    tudinally and not transversely, and provided also that we take into account the weight of the rope itself which increases with its ength. SALV. I fear, Simplicio, if I correcflly catch your meaning,

    hat in this particular you are making the same mistake as many others; that is if you mean to say that a long rope, one of perhaps .o cubits, cannot hold up so great a weight as a shorter length, ay one or two cubits, of the same rope. SIMP. That is what I meant, and as far as I see the proposition

    s highly probable. SALV. On the contrary, I consider it not merely improbable

    )ut false; and I think I can easily convince you of your error, t AB represent the rope, fastened at the upper end A: at the

    ower end attach a weight C whose force is just sufficient to break

    122 THE TWO NEW SCIENCES OF GALILEO

    break the rope. Now, Simplicio, point out the exact place where

    you think the break ought to occur.

    SIMP. Let us say D. SALV. And why at D? SIMP. Because at this point the rope is not strong enough to

    support, say, 100 pounds, made up of the portion of the rope DB and the stone C.

    SALV. Accordingly whenever the rope is stretched [violentata] with the weight of 100 pounds at D it will break there.

    SIMP. I think so. SALV. But tell me, if instead of attaching the weight at the

    end of the rope, B, one fastens it at a point nearer D, say, at E: or if, instead of fixing the upper end of the rope at A, one fastens it at some point F, just above D, will not the rope, at the point D, be sub j eel to the same pull of 100 pounds?

    SIMP. It would, provided you include with the stone C the portion of rope EB.

    SALV. Let us therefore suppose that the rope is stretched at the point D with a weight of 100 pounds, then according to your own admission it will break; but FE is only a small portion of AB; how can you therefore maintain that the long rope is weaker than the short one? Give up then this erroneous view which you share with many very intelligent people, and let us proceed. Now having demonstrated that, in the case of

    [uniformly loaded] prisms and cylinders of constant thickness, the moment of force tending to produce fracture [momenta sopra le proprie resistenze] varies

    as the square of the length; and having likewise shown that, when the length is constant and the thickness varies, the resist-

    ance to fracfture varies as the cube of the side, or diameter, of the base, let us pass to the investigation of the case of solids which simultaneously vary in both length and thickness. Here I observe that,

    Fig. 21

    SECOND DAY 123

    PROPOSITION V

    Prisms and cylinders which differ in both length and thickness offer resistances to fraclure [i. e., can support at their ends loads] which are direcflly proportional to the cubes of the diameters of their bases and inversely propor-

    tional to their lengths.

    Let ABC and DEF be two such cylinders; then the resistance [bending strength] of the cylinder AC bears to the resistance of the cylinder DF a ratio which is the producft of the cube of the diameter AB divided by the cube of the diameter DE, and of the length EF divided by the A length BC. Make EG equal to BC: let H be a third proportional to the lines AB and DE; let I be a fourth proportional, [AB/DE = H/I]: and let I:S=EF:BC Now since the resistance

    of the cylinder AC is to that of the cylinder DG as the cube of AB is to the cube of DE, that is, as the length AB is to the length I; and since the resistance Fig. 22 of the cylinder DG is to that of the cylinder DF as the length FE is to EG, that is, as I is to S, it follows that the length AB is to S as the resistance of the cylinder AC is to that of the cylinder DF. But the line AB bears to S a ratio which is the produdl of AB/I and I/S. Hence the resistance [bending strength] of the cylinder AC bears to the resistance of the cyl-

    inder DF a ratio which is the producl of AB/I (that is, AB3/ DE3) and of I/S (that is, EF/BC): which is what I meant to prove.

    This proposition having been demonstrated, let us next consider

    i24 THE TWO NEW SCIENCES OF GALILEO consider the case of prisms and cylinders which are similar. Concerning these we shall show that,

    PROPOSITION VI

    In the case of similar cylinders and prisms, the moments [stretching forces] which result from multiplying together their weight and length [i. e., from the moments produced by their own weight and length], which latter acts as a lever-arm, bear to each other a ratio which is the sesqui-

    alteral of the ratio between the resistances of their bases. In order to prove this let us indicate the two similar cylinders

    by AB and CD : then the magnitude of the force [momenta] in the cylinder AB, opposing the resistance of its base B, bears to the magnitude [momenta] of the force at CD, opposing the resistance of its base D, a ratio which is the sesquialteral of the ratio

    between the resistance of the base B and the resistance of the base D. And since the

    B solids AB and CD, are effective in opposing the resistances of their

    bases B and D, in pro- D portion to their weights

    and to the mechanical &* 23 advantages [forze] of

    their lever arms respectively, and since the advantage [forza] of the lever arm AB is equal to the advantage [forza] of the lever arm CD (this is true because in virtue of the similarity of the cylinders the length AB is to the radius of the base B as the length CD is to the radius of the base D), it follows that the total force [momenta] of the cylinder AB is to the total force [momenta] of the cylinder CD as the weight alone of the cylinder AB is to the weight alone of the cylinder CD, that is, as the volume of

    the cylinder AB [I’istesso cilindro AB] is to the volume CD [airistesso CD]: but these are as the cubes of the diameters of their bases B and D; and the resistances of the bases, being

    to

    SECOND DAY 125

    to each other as their areas, are to each other consequently as the squares of their diameters. Therefore the forces [momenti] of the cylinders are to each other in the sesquialteral ratio of the resistance of their bases.*

    SIMP. This proposition strikes me as both new and surprising: at first glance it is very different from anything which I my-

    self should have guessed: for since these figures are similar in all other respects, I should have certainly thought that the forces [momenti] and the resistances of these cylinders would have borne to each other the same ratio.

    SAGR. This is the proof of the proposition to which I referred, at the very beginning of our discussion, as one imperfectly un-

    derstood by me. SALV. For a while, Simplicio, I used to think, as you do, that

    the resistances of similar solids were similar; but a certain casual observation showed me that similar solids do not exhibit a

    strength which is proportional to their size, the larger ones being less fitted to undergo rough usage just as tall men are more apt than small children to be injured by a fall. And, as we re-

    marked at the outset, a large beam or column falling from a [165]

    given height will go to pieces when under the same circumstances a small scantling or small marble cylinder will not break. It was this observation which led me to the investigation of the fact which I am about to demonstrate to you: it is a very remarkable thing that, among the infinite variety of solids which are similar one to another, there are no two of which the forces [momenti], and the resistances of these solids are related in the same ratio.

    SIMP. You remind me now of a passage in Aristotle’s Questions

    * The preceding paragraph beginning with Prop. VI is of more than usual interest as illustrating the confusion of terminology current in the time of Galileo. The translation given is literal except in the case of those words for which the Italian is supplied. The facts which Galileo has in mind are so evident that it is difficult to see how one can here

    interpret “moment” to mean the force “opposing the resistance of its base” unless “the force of the lever arm AB” be taken to mean “the mechanical advantage of the lever made up of AB and the radius of the

    base B”; and similarly for “the force of the lever arm CD.” [Trans]

    126 THE TWO NEW SCIENCES OF GALILEO

    in Mechanics in which he tries to explain why it is that a wooden beam becomes weaker and can be more easily bent as it grows

    longer, notwithstanding the fact that the shorter beam is thin- ner and the longer one thicker: and, if I remember correctly,

    he explains it in terms of the simple lever. SALV. Very true: but, since this solution seemed to leave

    room for doubt, Bishop di Guevara,* whose truly learned com- mentaries have greatly enriched and illuminated this work,

    indulges in additional clever speculations with the hope of thus overcoming all difficulties ; nevertheless even he is confused as regards this particular point, namely, whether, when the length and thickness of these solid figures increase in the same ratio, their strength and resistance to fracture, as well as to bending, remain constant. After much thought upon this subject, I have reached the following result. First I shall show that,

    PROPOSITION VII

    Among heavy prisms and cylinders of similar figure, there is one and only one which under the stress of its own weight lies just on the limit between breaking and not breaking: so that every larger one is unable to carry the load of its own weight and breaks; while every smaller one is able to withstand some additional force tending to break it.

    Let AB be a heavy prism, the longest possible that will just sustain its own weight, so that if it be lengthened the least bit it will break. Then, I say, this prism is unique among all similar

    prisms — infinite in number — in occupying that boundary line between breaking and not breaking; so that every larger one

    [166] will break under its own weight, and every smaller one will not break, but will be able to withstand some force in addition to its own weight.

    Let the prism CE be similar to, but larger than, AB: then, I say, it will not remain intact but will break under its own weight. Lay off the portion CD, equal in length to AB. And, since, the resistance [bending strength] of CD is to that of AB as

    * Bishop of Teano; b. 1561 ; d. 1641. [Trans.]

    SECOND DAY 127 the cube of the thickness of CD is to the cube of the thickness of

    AB, that is, as the prism CE is to the similar prism AB, it follows that the weight of CE is the utmost load which a prism of the length CD can sustain; but the length of CE is greater; there-

    fore the prism CE will break. . Now take another prism FG which is smaller than AB.

    Let FH equal AB, then it can be shown in a similar manner

    that the resistance [bending strength] of FG is to that of Fig. 24 AB as the prism FG is to the prism AB provided the dis-

    tance AB that is FH, is equal to the distance FG; but AB is greater than FG, and therefore the moment of the prism FG applied at G is not sufficient to break the prism FG.

    SAGR. The demonstration is short and clear; while the proposi- tion which, at first glance, appeared improbable is now seen

    to be both true and inevitable. In order therefore to bring this prism into that limiting condition which separates breaking from not breaking, it would be necessary to change the ratio between thickness and length either by increasing the thickness or by diminishing the length. An investigation of this limiting state will, I believe, demand equal ingenuity.

    SALV. Nay, even more; for the question is more difficult; this I know because I spent no small amount of time in its discovery which I now wish to share with you.

    PROPOSITION VIII

    Given a cylinder or prism of the greatest length consist- ent with its not breaking under its own weight; and having

    given a greater length, to find the diameter of another cylinder or prism of this greater length which shall be the only and largest one capable of withstanding its own weight.

    Let BC be the largest cylinder capable of sustaining its own weight; and let DE be a length greater than AC: the problem is

    to find the diameter of the cylinder which, having the length [167] DE,

    128 THE TWO NEW SCIENCES OF GALILEO

    DE, shall be the largest one just able to withstand its own weight. Let I be a third proportional to the lengths DE and AC; let the diameter FD be to the diameter BA as DE is to I; draw the cylinder FE; then, among all cylinders having the same proportions, this is the largest and only one just capable of sustaining its own weight.

    Let M be a third proportional to DE and I: also let O be a fourth proportional to DE, I, and M; lay off FG equal to AC. Now since the diameter FD is to the diameter AB as the length DE is to I, and since O is a fourth proportional to DE, I and M,

    it follows that FD3:BA3=DE:O. But the resistance [bending

    A^.^^ AC strength] of the cylinder DG is B to the resistance of the cylinder

    ^ 7″ ‘ ** cylinder BC as the length DE is p, i to O. And since the moment Fig. 25 of the cylinder BC is held in

    equilibrium by [e equate alia} its resistance, we shall accomplish our end (which is to prove that the moment of the cylinder FE is equal to the resistance located at FD), if we show that the moment of the cylinder FE is to the moment of the cylinder BC as the resistance DF is to the resistance BA, that is, as the cube of FD is to the cube of BA, or as the length DE is to O. The moment of the cylinder FE is to the moment of the cylinder DG as the square of DE is to the square of AC, that is, as the length DE is to I; but the moment of the cylinder DG is to the moment of the cylinder BC, as the square of DF is to the square of BA, that is, as the square of DE is to the square of I, or as the square of I is to the square of M, or, as I is to O. Therefore by equating ratios, it results that the moment of the cylinder FE is to the moment of the cylinder BC as the length DE is to O, that is, as the cube of DF is to the cube of BA, or as the resistance of the base DF is to the resistance of the base BA; which was to be

    proven. SAGR. This demonstration, Salviati, is rather long and diffi-

    cult

    SECOND DAY 129

    cult to keep in mind from a single hearing. Will you not, therefore, be good enough to repeat it?

    SALV. As you like; but I would suggest instead a more direct and a shorter proof: this will, however, necessitate a different figure.

    [168] SAGR. The favor will be that much greater: nevertheless I

    hope you will oblige me by putting into written form the argu- ment just given so that I may study it at my leisure.

    SALV. I shall gladly do so. Let A denote a cylinder of diam- eter DC and the largest capable of sustaining its own weight:

    the problem is to determine a larger cylinder which shall be at once the maximum and the unique one capable of sustaining its own weight.

    Let E be such a cylinder, similar to A, having the assigned length, and having a diameter KL. Let MN be a third propor-

    tional to the two lengths DC and KL: let MN also be the diameter of an-

    other cylinder, X, having the same length as E: then, I say, X is the cyl- inder sought. Now since the resist-

    ance of the base DC is to the resist- ance of the base KL as the square of

    DC is to the square of KL, that is, as w the square of KL is to the square of MN, or, as the cylinder E is to the cylinder X, that is, as the moment E is to the moment X; and since also the resistance [bending strength] of the base KL is to the resistance of the base MN as the cube of KL is to the cube of MN, that is, as the cube of DC is to the cube of KL, or, as the cylinder A is to the cylinder E, that is, as the moment of A is to the moment

    of E; hence it follows, ex czquali in proportione perturbata* that the moment of A is to the moment of X as the resistance of the base DC is to the resistance of the base MN; therefore moment and resistance are related to each other in prism X precisely as they are in prism A.

    * For definition of perturbata see Todhunter’s Euclid, Book V, Def . 20. [Trans.]

    130 THE TWO NEW SCIENCES OF GALILEO Let us now generalize the problem; then it will read as

    follows : Given a cylinder AC in which moment and resistance [bending strength] are related in any manner whatsoever; let DE be the length of another cylinder; then determine what its thickness must be in order that the relation be-

    tween its moment and resistance shall be identical with that of the cylinder AC.

    Using Fig. 25 in the same manner as above, we may say that, since the moment of the cylinder FE is to the moment of the portion DG as the square of ED is to the square of FG, that is, as the length DE is to I; and since the moment of the cylinder FG is to the moment of the cylinder AC as the square of FD is to the square of AB, or, as the square of ED is to the square of I, or, as the square of I is to the square of M, that is, as the length I is to O; it follows, ex cequali, that the moment of the

    cylinder FE is to the moment of the cylinder AC as the length DE is to O, that is, as the cube of DE is to the cube of I, or, as the cube of FD is to the cube of AB, that is, as the resistance of the base FD is to the resistance of the base AB; which was to be proven.

    From what has already been demonstrated, you can plainly see the impossibility of increasing the size of structures to vast dimensions either in art or in nature; likewise the impossibility of building ships, palaces, or temples of enormous size in such a way that their oars, yards, beams, iron-bolts, and, in short, all their other parts will hold together; nor can nature produce trees of extraordinary size because the branches would break down under their own weight; so also it would be impossible to build up the bony structures of men, horses, or other animals so as to hold together and perform their normal functions if these animals were to be increased enormously in height; for this increase in height can be accomplished only by employing a material which is harder and stronger than usual, or by enlarging the size of the bones, thus changing their shape until the form and appearance of the animals suggest a monstrosity. This is

    perhaps

    SECOND DAY 131

    perhaps what our wise Poet had in mind, when he says, in describing a huge giant:

    “Impossible it is to reckon his height “So beyond measure is his size.” *

    To illustrate briefly, I have sketched a bone whose natural length has been increased three times and whose thickness has been multiplied until, for a correspondingly large animal, it would perform the same function which the small bone per-

    forms for its small animal. From the figures here shown you can see how out of proportion the enlarged bone appears. Clearly then if one wishes to maintain in a great giant the same proportion of limb as that found in an ordinary man he must either find a

    harder and stronger ma- terial for making the

    bones, or he must admit a diminution of strength in comparison with men of medium stature; for if his height be increased inordinately he will fall and be crushed under his own weight. Whereas, if the size of a body be diminished, the strength of that body is not diminished in the same proportion; indeed the smaller the body the greater its relative strength. Thus a small dog could probably carry on his back two or three dogs of his own size; but I believe that a horse could not carry even one of his own size.

    SIMP. This may be so; but I am led to doubt it on account of the enormous size reached by certain fish, such as the whale which, I understand, is ten times as large as an elephant; yet they all support themselves.

    SALV. Your question, Simplicio, suggests another principle,

    * Non si pub compartir quanta sia lungo, Si smisuratamentf e tutto grosso.

    Ariosto’s Orlando Furioso, XVII, 30 [Trans.]

    132 THE TWO NEW SCIENCES OF GALILEO one which had hitherto escaped my attention and which enables giants and other animals of vast size to support themselves and to move about as well as smaller animals do. This result may be secured either by increasing the strength of the bones and other parts intended to carry not only their weight but also the superincumbent load; or, keeping the proportions of the bony structure constant, the skeleton will hold together in the same manner or even more easily, provided one diminishes, in the proper proportion, the weight of the bony material, of the flesh, and of anything else which the skeleton has to carry. It is this second principle which is employed by nature in the structure of fish, making their bones and muscles not merely light but entirely devoid of weight.

    SIMP. The trend of your argument, Salviati, is evident. Since fish live in water which on account of its density [corpulenza] or, as others would say, heaviness [gravita] diminishes the weight [peso] of bodies immersed in it, you mean to say that, for this reason, the bodies of fish will be devoid of weight and will be supported without injury to their bones. But this is not all; for although the remainder of the body of the fish may be without weight, there can be no question but that their bones

    have weight. Take the case of a whale’s rib, having the dimen- sions of a beam; who can deny its great weight or its tendency to

    go to the bottom when placed in water? One would, therefore,

    hardly expect these great masses to sustain themselves. SALV. A very shrewd objection ! And now, in reply, tell me

    whether you have ever seen fish stand motionless at will under water, neither descending to the bottom nor rising to the top, without the exertion of force by swimming?

    SIMP. This is a well-known phenomenon. SALV. The fact then that fish are able to remain motionless

    under water is a conclusive reason for thinking that the material of their bodies has the same specific gravity as that of water; accordingly, if in their make-up there are certain parts which are heavier than water there must be others which are lighter, for otherwise they would not produce equilibrium.

    Hence

    SECOND DAY 133

    Hence, if the bones are heavier, it is necessary that the mus- cles or other constituents of the body should be lighter in order

    that their buoyancy may counterbalance the weight of the bones. In aquatic animals therefore circumstances are just reversed from what they are with land animals inasmuch as, in the latter, the bones sustain not only their own weight but also that of the flesh, while in the former it is the flesh which sup-

    ports not only its own weight but also that of the bones. We must therefore cease to wonder why these enormously large animals inhabit the water rather than the land, that is to say, the air.

    SIMP. I am convinced and I only wish to add that what we call land animals ought really to be called air animals, seeing that they live in the air, are surrounded by air, and breathe air.

    SAGR. I have enjoyed Simplicio’s discussion including both the question raised and its answer. Moreover I can easily understand that one of these giant fish, if pulled ashore, would not perhaps sustain itself for any great length of time, but would be crushed under its own mass as soon as the connections

    between the bones gave way. SALV. I am inclined to your opinion; and, indeed, I almost

    think that the same thing would happen in the case of a very big ship which floats on the sea without going to pieces under

    its load of merchandise and armament, but which on dry land and in air would probably fall apart. But let us proceed and show how:

    Given a prism or cylinder, also its own weight and the maximum load which it can carry, it is then possible to find a maximum length beyond which the cylinder cannot be prolonged without breaking under its own weight.

    Let AC indicate both the prism and its own weight; also let D represent the maximum load which the prism can carry at the end C without fradhire; it is required to find the max-

    imum to which the length of the said prism can be increased without breaking. Draw AH of such a length that the weight of the prism AC is to the sum of AC and twice the weight D

    as

    134 THE TWO NEW SCIENCES OF GALILEO as the length CA is to AH; and let AG be a mean proportional between CA and AH; then, I say, AG is the length sought. Since the moment of the weight [momento gravante} D attached at the point C is equal to the moment of a weight twice as large as D placed at the middle point AC, through which the weight of

    ‘”11\ – H t^ie Pr*sm AC acts, it fol- jG ‘ lows that the moment of the resistance of the prism

    AC located at A is equiva- lent to twice the weight D

    plus the weight of AC, both acting through the middle point of AC. And since we

    have agreed that the moment of the weights thus located, namely, twice D plus AC, bears to the moment of AC the same ratio which the length HA bears to CA and since AG is a mean proportional between these two lengths, it follows that the mo-

    ment of twice D plus AC is to the moment of AC as the square of GA is to the square of CA. But the moment arising from the weight [momento premente] of the prism GA is to the moment of AC as the square of GA is to the square of CA; thence AG is the maximum length sought, that is, the length up to which the prism AC may be prolonged and still support itself, but beyond which it will break.

    Hitherto we have considered the moments and resistances of prisms and solid cylinders fixed at one end with a weight applied at the other end; three cases were discussed, namely that in which the applied force was the only one acting, tha in which the weight of the prism itself is also taken into con sideration, and that in which the weight of the prism alone taken into consideration. Let us now consider these sam< [173]

    prisms and cylinders when supported at both ends or at a singl< point placed somewhere between the ends. In the first place. I remark that a cylinder carrying only its own weight and having the maximum length, beyond which it will break, will, whei supported either in the middle or at both ends, have twice th lengtl

    SECOND DAY 135 length of one which is mortised into a wall and supported only at one end. This is very evident because, if we denote the cylinder by ABC and if we assume that one-half of it, AB, is the greatest possible length capable of supporting its own weight with one end fixed at B, then, for the same jeason, if the cylinder is carried on the point G, the first half will be counter-

    balanced by the other half BC. So also in the case of the cylinder DEF, if its length be such that it will support only one-half this

    Fig. 29

    length when the end D is held fixed, or the other half when the end F is fixed, then it is evident that when supports, such as H and I, are placed under the ends D and F respectively the mo-

    ment of any additional force or weight placed at E will produce fracture at this point. A more intricate and difficult problem is the following:

    neglect the weight of a solid such as the preceding and find whether the same force or weight which produces fracture when applied at the middle of a cylinder, supported at both ends, will also break the cylinder when applied at some other point nearer one end than the other.

    Thus, for example, if one wished to break a stick by holding it with one hand at each end and applying his knee at the

    l middle, would the same force be required to break it in the same I manner if the knee were applied, not at the middle, but at some | point nearer to one end ?

    SAGR. This problem, I believe, has been touched upon by Aristotle in his Questions in Mechanics.

    Salv.

    136 THE TWO NEW SCIENCES OF GALILEO [174]

    SALV. His inquiry however is not quite the same; for he seeks merely to discover why it is that a stick may be more easily broken by taking hold, one hand at each end of the stick, that is, far removed from the knee, than if the hands were closer together. He gives a general explanation, referring it to the lengthened lever arms which are secured by placing the hands at the ends of the stick. Our inquiry calls for something more: what we want to know is whether, when the hands are retained at the ends of the stick, the same force is required to break it wherever the knee be placed.

    SAGR. At first glance this would appear to be so, because the two lever arms exert, in a certain way, the same moment, seeing that as one grows shorter the other grows correspondingly longer.

    SALV. Now you see how readily one falls into error and what caution and circumspection are required to avoid it. What you have just said appears at first glance highly probable, but on closer examination it proves to be quite far from true; as will be seen from the fact that whether the knee — the fulcrum of the two levers — be placed in the middle or not makes such a differ-

    ence that, if fradlure is to be produced at any other point than the middle, the breaking force at the middle, even wiien multi-

    plied four, ten, a hundred, or a thousand times would not suffice. To begin with we shall offer some general considerations and then pass to the determination of the ratio in which the breaking force must change in order to produce fradture at one point rather than another.

    Let AB denote a wooden cylinder which is to be broken in the middle, over the supporting point C, and let DE represent an identical cylinder which is to be broken just over the sup-

    porting point F which is not in the middle. First of all it is clear that, since the distances AC and CB are equal, the forces applied at the extremities B and A must also be equal. Secondly since the distance DF is less than the distance AC the moment of any force acting at D is less than the moment of the same force at A, that is, applied at the distance CA; and the moments are less in the ratio of the length DF to AC; consequently it is ecessary

    SECOND DAY

    B

    necessary to increase the force [momenta] at D in order to over- come, or even to balance, the resistance at F; but in comparison

    with the length AC the distance DF can be diminished in- definitely: in order therefore to counterbalance the resistance at

    F it will be necessary to increase indefinitely the force [forza] applied at D. On the other hand, in proportion as we in-

    [175] crease the distance FE over

    that of CB, we must diminish the force at E in order to counterbalance the resistance at F; but the distance FE, measured in terms of CB,

    cannot be increased indefi- Fig. 30

    nitely by sliding the fulcrum F toward the end D; indeed, it can- not even be made double the length CB. Therefore the force re-

    quired at E to balance the resistance at F will always be more than half that required at B. It is clear then that, as the fulcrum F approaches the end D, we must of necessity indefinitely in-

    crease the sum of the forces applied at E and D in order to balance, or overcome, the resistance at F.

    SAGR. What shall we say, Simplicio? Must we not confess that geometry is the most powerful of all instruments for sharpening the wit and training the mind to think corredtly? Was not Plato perfectly right when he wished that his pupils should be first of all well grounded in mathematics? As for myself, I quite understood the property of the lever and how, by increasing or diminishing its length, one can increase or diminish the moment of force and of resistance; and yet, in the solution of the present problem I was not slightly, but greatly, deceived.

    SIMP. Indeed I begin to understand that while logic is an ex- cellent guide in discourse, it does not, as regards stimulation to

    discovery, compare with the power of sharp distinction which belongs to geometry.

    SAGR. Logic, it appears to me, teaches us how to test the conclusiveness

    138 THE TWO NEW SCIENCES OF GALILEO

    conclusiveness of any argument or demonstration already dis- covered and completed; but I do not believe that it teaches

    us to discover correct arguments and demonstrations. But it would be better if Salviati were to show us in just what pro-

    portion the forces must be increased in order to produce fracture as the fulcrum is moved from one point to another along one and the same wooden rod.

    [176]

    SALV. The ratio which you desire is determined as follows:

    If upon a cylinder one marks two points at which frac- ture is to be produced, then the resistances at these two

    points will bear to each other the inverse ratio of the rectangles formed by the distances from the respective points to the ends of the cylinder.

    Let A and B denote the least forces which will bring about fracture of the cylinder at C; likewise E and F the smallest forces which will break it at D. Then, I say, that the sum of the forces A and B is to the sum of the forces E and F as the area of the rectangle AD.DB is to the area of the rectangle AC.CB. Because the sum of the forces A and B bears to the sum of the forces E and F a ratio which is the product of the three following ratios, namely, (A+B)/B, B/F, and F/(F+E); but the length BA is to the length CA as the sum of the forces A and B is to the

    force B; and, as the length DB is to the length CB, so is the force B to the force F; also as the length AD is to AB, so is the force F to the sum of the forces F andE.

    Hence it follows that the sum of the forces A and B bears to the sum of the forces E and F a ratio which is the product of the three following ratios, namely, BA/CA, BD/BC, and AD/AB. But DA/CA is the product of DA/BA and BA/CA. Therefore the sum of the forces A and B bears to the sum of the forces E and F a ratio which is the product of DArCA and DB:CB. But the rectangle AD.DB bears to the rectangle AC.CB a ratio which is the product of DA/CA and DB/CB. Accordingly

    SECOND DAY 139 Accordingly the sum of the forces A and B is to the sum of the forces E and F as the rectangle AD.DB is to the rectangle AC.CB, that is, the resistance to fracture at C is to the resistance to fracture at D as the rectangle AD.DB is to the rectangle AC.CB. Q. E. D. [177]

    Another rather interesting problem may be solved as a con- sequence of this theorem, namely,

    Given the maximum weight which a cylinder or prism can support at its middle-point where the resistance is a mini-

    mum, and given also a larger weight, find that point in the cylinder for which this larger weight is the maximum load that can be supported.

    Let that one of the given weights which is larger than the maximum weight supported at the middle of the cylinder AB bear to this maximum weight the same ratio which the length E bears to the length F. The problem is to find that point in the cylinder at which this larger weight becomes the max-

    imum that can be supported. Let G be a mean proportional between the lengths E and F. Draw AD and S so that they bear to each other the same ratio as E to G; accordingly S will be less than AD.

    Let AD be the diameter of a semicircle AHD, in which take AH equal to S; join the points H and D and lay off DR equal to HD. Then, I say, R is the point sought, namely, the point at which the given weight, greater than the maximum supported at the middle of the cylinder D, would become the maximum load.

    On AB as diameter draw the semicircle ANB: erect the per- pendicular RN and join the points N and D. Now since the

    sum of the squares on NR and RD is equal to the square of ND, that is, to the square of AD, or to the sum of the squares of AH and HD; and, since the square of HD is equal to the square of DR, it follows that the square of NR, that is, the rectangle AR.RB, is equal to the square of AH, also therefore to the square of S; but the square of S is to the square of AD as the length F is to the length E, that is, as the maximum weight

    supported

    140 THE TWO NEW SCIENCES OF GALILEO

    supported at D is to the larger of the two given weights. Hence the latter will be the maximum load which can be carried at the

    point R; which is the solution sought. SAGR. Now I understand thoroughly; and I am thinking that,

    since the prism AB grows constantly stronger and more resistant to the pressure of its load at points which are more and more removed from the middle, we could in the case of large heavy beams cut away a considerable portion near the ends which would notably lessen the weight,

    1 and which, in the beam work of large rooms, would prove to be of great utility and convenience.

    32 [178] It would be a fine thing if one could discover the proper shape

    to give a solid in order to make it equally resistant at every point, in which case a load placed at the middle would not produce fracture more easily than if placed at any other

    point.* SALV. I was just on the point of mentioning an interesting

    and remarkable fact connected with this very question. My meaning will be clearer if I draw a figure. Let DB represent

    a prism; then, as we have already shown, its resistance to frac- ture [bending strength] at the end AD, owing to a load placed at

    the end B, will be less than the resistance at CI in the ratio of the length CB to AB. Now imagine this same prism to be cut through diagonally along the line FB so that the opposite faces will be triangular; the side facing us will be FAB. Such a solid

    * The reader will notice that two different problems are here involved. That which is suggested in the last remark of Sagredo is the fol- lowing:

    To find a beam whose maximum stress has the same value when a constant load moves from one end of the beam to the other.

    The second problem — the one which Salviati proceeds to solve — is the following:

    To find a beam in all cross-sections of which the maximum stress is the same for a constant load in a fixed position. [Trans.]

    SECOND DAY 141

    will have properties different from those of the prism; for, if the load remain at B, the resistance against fracture [bending strength] at C will be less than that at A in the ratio of the length CB to the length AB. This is easily proved: for if CNO represents a cross-section parallel to AFD, then the length FA bears to the length CN, in the triangle FAB, the same ratio which the length AB bears to D I the length CB. Therefore, we imagine A and C to be the points at which the fulcrum is placed, the lever arms in the two cases BA, AF and BC, CN A fc will be proportional [simili]^ Fig. 33 Hence the moment of any force applied at B and acting through the arm BA, against a resistance placed at a distance AF will be equal to that of the same force at B acting through the arm BC against the same resistance located at a distance CN. But now, if the force still be applied at B, the resistance to be over-

    come when the fulcrum is at C, acting through the arm CN, is less than the resistance with the fulcrum at A in the same pro-

    portion as the rectangular cross-section CO is less than the rectangular cross-section AD, that is, as the length CN is less than AF, or CB than BA.

    Consequently the resistance to fracture at C, offered by the portion OBC, is less than the resistance to fracture at A, offered by the entire block DAB, in the same proportion as the length CB is smaller than the length AB.

    By this diagonal saw-cut we have now removed from the beam, or prism DB, a portion, i. e., a half, and have left the wedge, or triangular prism, FBA. We thus have two solids

    .[179] possessing opposite properties; one body grows stronger as it is shortened while the other grows weaker. This being so it would seem not merely reasonable, but inevitable, that there exists a line of section such that, when the superfluous material has been removed, there will remain a solid of such figure that it will offer the same resistance [strength] at all points. Simp.

    H2 THE TWO NEW SCIENCES OF GALILEO SIMP. Evidently one must, in passing from greater to less,

    encounter equality. SAGR. But now the question is what path the saw should

    follow in making the cut. SIMP. It seems to me that this ought not to be a difficult task:

    for if by sawing the prism along the diagonal line and removing half of the material, the remainder acquires a property just the opposite to that of the entire prism, so that at every point where the latter gains strength the former becomes weaker, then it seems to me that by taking a middle path, i. e., by removing half the former half, or one-quarter of the whole, the strength of the remaining figure will be constant at all those points where, in the two previous figures, the gain in one was equal to the loss in the other.

    SALV. You have missed the mark, Simplicio. For, as I shall presently show you, the amount which you can remove from the prism without weakening it is not a quarter but a third. It now remains, as suggested by Sagredo, to discover the path along which the saw must travel: this, as I shall prove, must be a parabola. But it is first necessary to demonstrate the following lemma :

    If the fulcrums are so placed under two levers or balances that the arms through which the forces act are to each other in the same ratio as the squares of the arms through which the resistances act, and if these resistances are to each other in the same ratio as the arms through which they act, then the forces will be equal.

    Let AB and CD represent two levers whose lengths are A * 1 , , ̂ divided by their fulcrums in

    Er M such a way as to make the dis-

    f , • .1 tance EB bear to the distance C F D FD a ratio which is equal to the

    Fig- 34 square of the ratio between the distances EA and FC. Let the resistances located at A and C

    [180] be to each other as EA is to FC. Then, I say, the forces which must be applied at B and D in order to hold in equilibrium the

    resistances

    SECOND DAY 143

    resistances at A and C are equal. Let EG be a mean propor- tional between EB and FD. Then we shall have BE:EG =

    EG:FD=AE:CF. But this last ratio is precisely that which we have assumed to exist between the resistances at A and C.

    And since EG:FD=AE:CF, it follows, permutando, that EG: AE = FD :CF. Seeing that the distances DC and GA are divided in the same ratio by the points F and E, it follows that the same force which, when applied at D, will equilibrate the resistance at C, would if applied at G equilibrate at A a resistance equal to that found at C.

    But one datum of the problem is that the resistance at A is to the resistance at C as the distance AE is to the distance CF, or as BE is to EG. Therefore the force applied at G, or rather at D, will, when applied at B, just balance the resistance located at A.

    Q. E. D. This being clear draw the parabola FNB in the face FB of the prism DB. Let the prism be sawed along this parabola whose vertex is at B. The portion of the solid which remains will be included between the base AD, the rectangular plane AG, the straight line BG and the surface DGBF, whose curvature is identical with that of the parabola FNB. This solid will have, I say, the same strength at every point. Let the solid be cut by a plane CO parallel to the plane AD. Imagine p the points A and C to be the fulcrums of two levers

    of which one will have the _

    arms BA and AF ; the other A”” c BCandCN. Then since in Fig. 35

    the parabola FBA, we have BA:BC=AF2: CN2, it is clear that the arm BA of one lever is to the arm BC of the other lever as the square of the arm AF is to the square of the other arm CN. Since the resistance to be balanced by the lever BA is to the resistance to be balanced by the lever BC in the same ratio as the rectangle DA is to the rectangle OC, that is as the length AF is to the length CN, which two lengths are the other arms of the levers, it follows, by the lemma just demonstrated, that

    the

    144 THE TWO NEW SCIENCES OF GALILEO the same force which, when applied at BG will equilibrate the resistance at DA, will also balance the resistance at CO. The

    [181] same is true for any other section. Therefore this parabolic solid is equally strong throughout.

    It can now be shown that, if the prism be sawed along the line of the parabola FNB, one-third part of it will be removed; because the rectangle FB and the surface FNBA bounded by the parabola are the bases of two solids included between two parallel planes, i. e., between the rectangles FB and DG; con-

    sequently the volumes of these two solids bear to each other the same ratio as their bases. But the area of the rectangle is one and a half times as large as the area FNBA under the parabola; hence by cutting the prism along the parabola we re-

    move one-third of the volume. It is thus seen how one can diminish the weight of a beam by as much as thirty-three per cent without diminishing its strength; a fact of no small utility in the construction of large vessels, and especially in supporting the decks, since in such structures lightness is of prime im- portance.

    SAGR. The advantages derived from this fact are so numerous that it would be both wearisome and impossible to mention them all; but leaving this matter to one side, I should like to learn just how it happens that diminution of weight is possible in the ratio above stated. I can readily understand that, when

    a section is made along the diagonal, one-half the weight is removed; but, as for the parabolic section removing one-third of the prism, this I can only accept on the word of Salviati who is always reliable; however I prefer first-hand knowledge to the word of another.

    SALV. You would like then a demonstration of the fact that the excess of the volume of a prism over the volume of what we have called the parabolic solid is one-third of the entire prism. This I have already given you on a previous occasion; however I shall now try to recall the demonstration in which

    I remember having used a certain lemma from Archimedes ‘ book On Spirals* namely, Given any number of lines, differing in

    * For demonstration of the theorem here cited, see ” Works of Arch-

    SECOND DAY

    length one from another by a common difference which is equal to the shortest of these lines; and given also an equal number of lines each of which has the same length as the longest of the first-mentioned series; then the sum of the squares of the lines of this second group will be less than three times the sum of the squares of the lines in the first group. But the sum of the squares of the second group will be greater than three times the sum of the squares of all excepting the longest of the first group.

    [182] Assuming this, inscribe in the rectangle ACBP the parabola

    AB. We have now to prove that the mixed triangle BAP whose sides are BP and PA, and whose base is the parabola BA, is a third part of the entire rectangle CP. If this is not true it will be either greater or less than a third. Suppose it to be less by an area which is represented by X. By drawing lines parallel to the sides BP and CA, we can divide the rectangle CP into equal parts; and if the process be continued we shall finally reach a division into parts so small that each of them will be smaller than the area X; let the rec- & V tangle OB represent one of these parts and, through the points where the other parallels cut the parabola, draw lines parallel to AP. Let us now ̂ describe about

    our “mixed triangle” a figure made up of rectangles such as BO, IN, HM, FL, EK, and GA; this figure will also be less than

    ~–^^

    T I

    ^-^^>

    | n

    ^ \

    R

    F x O

    E N

    C\

    C D*,

    Fig. 36

    a third part of the rectangle CP because the excess of this figure

    above the area of the “mixed triangle” is much smaller than the rectangle BO which we have already made smaller than X.

    SAGR. More slowly, please; for I do not see how the excess of

    this figure described about the “mixed triangle” is much smaller than the rectangle BO.

    SALV. Does not the rectangle BO have an area which is equal to the sum of the areas of all the little rectangles through which

    imedes” translated by T. L. Heath (Camb. Univ. Press 1897) p. 107 and p. 162. [Trans.]

    146 THE TWO NEW SCIENCES OF GALILEO

    the parabola passes? I mean the rectangles BI, IH, HF, FE,

    EG, and GA of which only a part lies outside the “mixed tri- angle.” Have we not taken the rectangle BO smaller than the

    area X? Therefore if, as our opponent might say, the triangle plus X is equal to a third part of this rectangle CP, the cir-

    cumscribed figure, which adds to the triangle an area less than X, will still remain smaller than a third part of the rectangle, CP. But this cannot be, because this circumscribed figure is larger than a third of the area. Hence it is not true that our

    “mixed triangle” is less than a third of the rectangle.

    [183] .

    SAGR. You have cleared up my difficulty; but it still remains to be shown that the circumscribed figure is larger than a third part of the rectangle CP, a task which will not, I believe, prove so easy.

    SALV. There is nothing very difficult about it. Since in the

    parabola DE2:ZG2=DA:AZ= rectangle KE: rectangle AG, seeing that the altitudes of these two rectangles, AK and KL, are

    equal, it follows that ED 2:ZG2= LA2 :AK2== rectangle KE: rectangle KZ. In precisely the same manner it may be shown that the other rectangles LF, MH, NI, OB, stand to one another in the same ratio as the squares of the lines MA, NA, OA, PA.

    Let us now consider the circumscribed figure, composed of areas which bear to each other the same ratio as the squares of a series of lines whose common difference in length is equal to the shortest one in the series; note also that the rectangle CP is made up of an equal number of areas each equal to the largest and each equal to the rectangle OB. Consequently, according to the lemma of Archimedes, the circumscribed figure is larger than a third part of the rectangle CP; but it was also smaller, which is

    impossible. Hence the “mixed triangle” is not less than a third part of the rectangle CP.

    Likewise, I say, it cannot be greater. For, let us suppose that it is greater than a third part of the rectangle CP and let the area X represent the excess of the triangle over the third part of the rectangle CP; subdivide the rectangle into equal rectangles and continue the process until one of these subdivisions is smaller

    than

    SECOND DAY 147

    than the area X. Let BO represent such a rectangle smaller than

    X. Using the above figure, we have in the “mixed triangle” an inscribed figure, made up of the rectangles VO, TN, SM, RL, and QK, which will not be less than a third part of the large rectangle CP.

    For the “mixed triangle” exceeds the inscribed figure by a quantity less than that by which it exceeds the third part of the rectangle CP; to see that this is true we have only to re-

    member that the excess of the triangle over the third part of the rectangle CP is equal to the area X, which is less than the rectangle BO, which in turn is much less than the excess of the triangle over the inscribed figure. For the rectangle BO is [184]

    made up of the small rectangles AG, GE, EF, FH, HI, and IB; and the excess of the triangle over the inscribed figure is less than half the sum of these little rectangles. Thus since the triangle exceeds the third part of the rectangle CP by an amount X, which is more than that by which it exceeds the inscribed figure, the latter will also exceed the third part of the rectangle, CP. But, by the lemma which we have assumed, it is smaller. For the rectangle CP, being the sum of the largest rectangles, bears to the component rectangles of the inscribed figure the same ratio which the sum of all the squares of the lines equal to the longest bears to the squares of the lines which have a common difference, after the square of the longest has been subtracted.

    Therefore, as in the case of squares, the sum total of the largest rectangles, i. e., the rectangle CP, is greater than three times the sum total of those having a common difference minus the largest; but these last make up the inscribed figure. Hence

    the “mixed triangle” is neither greater nor less than the third part of rectangle CP; it is therefore equal to it.

    SAGR. A fine, clever demonstration; and all the more so be- cause it gives us the quadrature of the parabola, proving it to be

    four-thirds of the inscribed * triangle, a fact which Archimedes demonstrates by means of two different, but admirable, series of

    * Distinguish carefully between this triangle and the “mixed tri- angle” above mentioned. [Trans.]

    148 THE TWO NEW SCIENCES OF GALILEO many propositions. This same theorem has also been recently

    established by Luca Valerio,* the Archimedes of our age; his demonstration is to be found in his book dealing with the centers of gravity of solids.

    SALV. A book which, indeed, is not to be placed second to any produced by the most eminent geometers either of the present or of the past; a book which, as soon as it fell into the hands of our Academician, led him to abandon his own researches along these lines; for he saw how happily everything had been treated and demonstrated by Valerio.

    .

    SAGR. When I was informed of this event by the Academician

    himself, I begged of him to show the demonstrations which he

    had discovered before seeing Valerio’s book; but in this I did not succeed. SALV. I have a copy of them and will show them to you; for

    you will enjoy the diversity of method employed by these two authors in reaching and proving the same conclusions; you will also find that some of these conclusions are explained in different ways, although both are in fact equally correct.

    SAGR. I shall be much pleased to see them and will consider it a great favor if you will bring them to our regular meeting. But in the meantime, considering the strength of a solid formed from a prism by means of a parabolic section, would it not, in view of the fact that this result promises to be both interesting and useful in many mechanical operations, be a fine thing if you were to give some quick and easy rule by which a mechanician might draw a parabola upon a plane surface?

    SALV. There are many ways of tracing these curves; I will mention merely the two which are the quickest of all. One of these is really remarkable; because by it I can trace thirty or forty parabolic curves with no less neatness and precision, and in a shorter time than another man can, by the aid of a compass, neatly draw four or six circles of different sizes upon paper. I take a perfectly round brass ball about the size of a walnut and project it along the surface of a metallic mirror held

    * An eminent Italian mathematician, contemporary with Galileo.

    [Trans.]

    SECOND DAY 149

    in a nearly upright position, so that the ball in its motion will

    press slightly upon the mirror and trace out a fine sharp para- bolic line; this parabola will grow longer and narrower as the

    angle of elevation increases. The above experiment furnishes clear and tangible evidence that the path of a projectile is a parabola; a fact first observed by our friend and demonstrated by him in his book on motion which we shall take up at our next meeting. In the execution of this method, it is advisable to slightly heat and moisten the ball by rolling in the hand in order that its trace upon the mirror may be more distinct. [I86J

    The other method of drawing the desired curve upon the face of the prism is the following: Drive two nails into a wall at a convenient height and at the same level; make the distance between these nails twice the width of the rectangle upon which it is desired to trace the semiparabola. Over these two nails hang a light chain of such a length that the depth of its sag is equal to the length of the prism. This chain will assume the

    form of a parabola,* so that if this form be marked by points on the wall we shall have described a complete parabola which can be divided into two equal parts by drawing a vertical line through a point midway between the two nails. The transfer of this curve to the two opposing faces of the prism is a matter of no difficulty; any ordinary mechanic will know how to do it.

    By use of the geometrical lines drawn upon our friend’s compass,t one may easily lay off those points which will locate this same curve upon the same face of the prism.

    Hitherto we have demonstrated numerous conclusions per- taining to the resistance which solids offer to fracture. As

    a starting point for this science, we assumed that the resistance offered by the solid to a straight-away pull was known; from this base one might proceed to the discovery of many other results and their demonstrations; of these results the number to

    * It is now well known that this curve is not a parabola but a catenary the equation of which was first given, 49 years after Galileo’s death, by James Bernoulli. [Trans.]

    t The geometrical and military compass of Galileo, described in Nat. Ed. Vol. 2. [Trans.]

    ISO THE TWO NEW SCIENCES OF GALILEO be found in nature is infinite. But, in order to bring our daily conference to an end, I wish to discuss the strength of hollow

    solids, which are employed in art — and still oftener in nature — in a thousand operations for the purpose of greatly increasing strength without adding to weight; examples of these are seen in the bones of birds and in many kinds of reeds which are light and highly resistant both to bending and breaking. For if a stem of straw which carries a head of wheat heavier than the

    entire stalk were made up of the same amount of material in

    solid form it would offer less resistance to bending and breaking. This is an experience which has been verified and confirmed in practice where it is found that a hollow lance or a tube of wood or metal is much stronger than would be a solid one of the same length and weight, one which would necessarily be thinner; men have discovered, therefore, that in order to make lances strong as well as light they must make them hollow. We shall now show that:

    In the case of two cylinders, one hollow the other solid but having equal volumes and equal lengths, their resist-

    ances [bending strengths] are to each other in the ratio of their diameters.

    Let AE denote a hollow cylinder and IN a solid one of the same weight and length;

    then, I say, that the resist- ance against fradlure ex-

    hibited by the tube AE bears to that of the solid cylinder IN the same ratio as the di-

    ameter AB to the diameter tf IL. This is very evident ; for

    Fig. 37 since the tube and the solid

    cylinder IN have the same volume and length, the area of the cir- cular base IL will be equal to that of the annulus AB which is the

    base of the tube AE. (By annulus is here meant the area which lies between two concentric circles of different radii.) Hence

    their resistances to a straight-away pull are equal; but in produc-

    ing

    SECOND DAY 151

    ing fracfture by a transverse pull we employ, in the case of the cylinder IN, the length LN as one lever arm, the point L as a fulcrum, and the diameter LI, or its half, as the opposing lever arm : while in the case of the tube, the length BE which plays the part of the first lever arm is equal to LN, the opposing lever arm beyond the fulcrum, B, is the diameter AB, or its half. Manifestly then the resistance [bending strength] of the tube exceeds that of the solid cylinder in the proportion in which the diameter AB exceeds the diameter IL’ which is the desired result.

    [188] Thus the strength of a hollow tube exceeds that of a solid cylinder in the ratio of their diameters whenever the two are made of the same material and have the same weight and length.

    It may be well next to investigate the general case of tubes and solid cylinders of constant length, but with the weight and the hollow portion variable. First we shall show that :

    Given a hollow tube, a solid cylinder may be determined which will be equal [eguale] to it.

    The method is very simple. Let AB denote the external and CD the internal diameter of the tube. In the larger circle lay off the line AE equal in length to the di- ameter CD; join the points E and B. Now since the angle at E inscribed in a semicircle, AEB, is a right-angle, the area of the circle whose diameter is AB is equal to the sum of the areas of the two circles

    whose respective diameters are AE and EB. But AE is the diameter of the hollow

    portion of the tube. Therefore the area of the circle whose diameter is EB is the

    same as the area of the annulus ACBD. FlS- 3 8 Hence a solid cylinder of circular base having a diameter EB will have the same volume as the walls of the tube of equal length.

    By use of this theorem, it is easy: To find the ratio between the resistance [bending strength] of any tube and that of any cylinder of equal length.

    Let

    152 THE TWO NEW SCIENCES OF GALILEO Let ABE denote a tube and RSM a cylinder of equal length : it is required to find the ratio between their resistances. Using the preceding proposition, determine a cylinder ILN which shall

    have the same volume and

    length as the tube. Draw a line V of such a length that it will be related to IL and

    RS (diameters of the bases of the cylinders IN and RM), as follows: V:RS=RS:IL. Then, I say, the resistance of the tube AE is to that of the cylinder RM as the length

    )-j£ of the line AB is to the length

    V. For, since the tube AE is Fl&- 39 equal both in volume and

    length, to the cylinder IN, the resistance of the tube will bear to the resistance of the cylinder the same ratio as the line AB to IL; but the resistance of the cylinder IN is to that of the cylinder RM as the cube of IL is to the cube of RS, that is, as the length IL is to length V: therefore, ex czquali, the resistance [bending strength] of the tube AE bears to the resistance of the cylinder RM the same ratio as the length AB to V. Q. E. D.

    END OF SECOND DAY.

    THIRD DAY [190]

    CHANGE OF POSITION. [De Motu Locali]

    Y purpose is to set forth a very new science dealing with a very ancient subject. There is, in nature, perhaps nothing older than motion, concerning which the books written by philosophers are neither few nor small; nevertheless I have discovered by experi-

    ment some properties of it which are worth knowing and which have not hitherto been

    cither observed or demonstrated. Some superficial observations have been made, as, for instance, that the free motion [naturalem

    molum] of a heavy falling body is continuously accelerated;* but to just what extent this acceleration occurs has not yet been announced; for so far as I know, no one has yet pointed out that the distances traversed, during equal intervals of time, by a body falling from rest, stand to one another in the same ratio as the odd numbers beginning with unity.f

    It has been observed that missiles and projectiles describe a curved path of some sort; however no one has pointed out the fact that this path is a parabola. But this and other facts, not few in number or less worth knowing, I have succeeded in proving; and what I consider more important, there have been opened up to this vast and most excellent science, of which my

    * “Natural motion” of the author has here been translated into “free motion” — since this is the term used to-day to distinguish the “natural” from the “violent” motions of the Renaissance. [Trans.]

    t A theorem demonstrated on p. 175 below. [Trans.]

    154 THE TWO NEW SCIENCES OF GALILEO work is merely the beginning, ways and means by which other minds more acute than mine will explore its remote corners.

    This discussion is divided into three parts; the first part deals with motion which is steady or uniform; the second treats of motion as we find it accelerated in nature; the third deals with the so-called violent motions and with projectiles.

    [I9

    UNIFORM MOTION

    In dealing with steady or uniform motion, we need a single definition which I give as follows :

    DEFINITION

    By steady or uniform motion, I mean one in which the dis- tances traversed by the moving particle during any equal

    intervals of time, are themselves equal.

    CAUTION

    We must add to the old definition (which defined steady mo- tion simply as one in which equal distances are traversed in

    equal times) the word “any,” meaning by this, all equal inter- vals of time; for it may happen that the moving body will

    traverse equal distances during some equal intervals of time and yet the distances traversed during some small portion of these time-intervals may not be equal, even though the time- intervals be equal.

    From the above definition, four axioms follow, namely:

    AXIOM I

    In the case of one and the same uniform motion, the distance traversed during a longer interval of time is greater than the distance traversed during a shorter interval of time.

    AXIOM II

    In the case of one and the same uniform motion, the time required to traverse a greater distance is longer than the time required for a less distance.

    THIRD DAY 155 AXIOM III

    In one and the same interval of time, the distance traversed at a greater speed is larger than the distance traversed at a less speed.

    [192] AXIOM IV

    The speed required to traverse a longer distance is greater than that required to traverse a shorter distance during the same time-interval.

    THEOREM I, PROPOSITION I

    If a moving particle, carried uniformly at a constant speed, traverses two distances the time-intervals required are to each other in the ratio of these distances.

    Let a particle move uniformly with constant speed through two distances AB, BC, and let the time required to traverse AB be represented by DE; the time required to traverse BC, by EF;

    i. …… Ipf Ly j . . .* i ….. IAIB f • .. — , — **—

    Fig. 40

    then I say that the distance AB is to the distance BC as the time DE is to the time EF.

    Let the distances and times be extended on both sides towards

    G, H and I, K; let AG be divided into any number whatever of spaces each equal to AB, and in like manner lay off in DI exactly the same number of time-intervals each equal to DE. Again lay off in CH any number whatever of distances each equal to BC; and in FK exactly the same number of time- intervals each equal to EF; then will the distance BG and the time El be equal and arbitrary multiples of the distance BA and the time ED; and likewise the distance HB and the time KE are equal and arbitrary multiples of the distance CB and the time FE. And since DE is the time required to traverse AB, the whole

    time

    156 THE TWO NEW SCIENCES OF GALILEO time El will be required for the whole distance BG, and when the motion is uniform there will be in El as many time-intervals each equal to DE as there are distances in BG each equal to BA; and likewise it follows that KE represents the time required to traverse HB.

    Since, however, the motion is uniform, it follows that if the distance GB is equal to the distance BH, then must also the time IE be equal to the time EK; and if GB is greater than BH,

    then also IE will be greater than EK; and if less, less.* There

    [193] .

    are then four quantities, the first AB, the second BC, the third DE, and the fourth EF; the time IE and the distance GB are arbitrary multiples of the first and the third, namely of the distance AB and the time DE.

    But it has been proved that both of these latter quantities are either equal to, greater than, or less than the time EK and the space BH, which are arbitrary multiples of the second and the fourth. Therefore the first is to the second, namely the distance AB is to the distance BC, as the third is to the fourth, namely the time DE is to the time EF. Q. E. D.

    THEOREM II, PROPOSITION II

    If a moving particle traverses two distances in equal in- tervals of time, these distances will bear to each other the

    same ratio as the speeds. And conversely if the distances are as the speeds then the times are equal.

    Referring to Fig. 40, let AB and BC represent the two distances traversed in equal time-intervals, the distance AB for instance with the velocity DE, and the distance BC with the velocity EF. Then, I say, the distance AB is to the distance BC as the velocity DE is to the velocity EF. For if equal multiples of both distances and speeds be taken, as above, namely, GB and IE of AB and DE respectively, and in like manner HB and KE of BC and EF, then one may infer, in the same manner as above, that the multiples GB and IE are either less than, equal

    * The method here employed by Galileo is that of Euclid as set forth in the famous 5th Definition of the Fifth Book of his Elements, for which see art. Geometry Ency. Brit, nth Ed. p. 683. [Trans.}

    THIRD DAY 157

    to, or greater than equal multiples of BH and EK. Hence the theorem is established.

    THEOREM III, PROPOSITION III

    In the case of unequal speeds, the time-intervals required to traverse a given space are to each other inversely as the speeds.

    Let the larger of the two unequal speeds be indicated by A; the smaller, by B; and let the motion corresponding to both traverse the given space CD. Then I say the time required to traverse the distance CD at speed A i <•• . i A is to the time required to trav-

    erse the same distance at speed ̂ f .f B, as the speed B is to the speed C £ 1> A. For let CD be to CE as A

    is to B ; then, from the preced- ̂ *””” ing, it follows that the time re- Fig- 41 quired to complete the distance CD at speed A is the same as [194]

    the time necessary to complete CE at speed B; but the time needed to traverse the distance CE at speed B is to the time re-

    quired to traverse the distance CD at the same speed as CE is to CD; therefore the time in which CD is covered at speed A is to the time in which CD is covered at speed B as CE is to CD, that is, as speed B is to speed A. Q. E. D.

    THEOREM IV, PROPOSITION IV

    If two particles are carried with uniform motion, but each with a different speed, the distances covered by them dur-

    ing unequal intervals of time bear to each other the com- pound ratio of the speeds and time intervals.

    Let the two particles which are carried with uniform motion be E and F and let the ratio of the speed of the body E be to that of the body F as A is to B ; but let the ratio of the time consumed by the motion of E be to the time consumed by the motion of F as C is to D. Then, I say, that the distance covered by E, with speed A in time C, bears to the space traversed by F with speed

    B

    158 THE TWO NEW SCIENCES OF GALILEO

    B in time D a ratio which is the product of the ratio of the speed A to the speed B by the ratio of the time C to the time D. For

    if G is the distance traversed by E at speed A during the time-

    A» ‘ ‘ ‘ ‘ • •»- —t interval Q and if G is to I as the speed A is to the speed B;

    1 and if also the time-interval C is to the time-interval D as I is to L, then it follows

    Fig. 42 that I is the distance trav-

    ersed by F in the same time that G is traversed by E since G is to I in the same ratio as the speed A to the speed B. And

    since I is to L in the same ratio as the time-intervals C and D, if I is the distance traversed by F during the interval C, then L will be the distance traversed by F during the interval D at the

    speed B. But the ratio of G to L is the product of the ratios G to I

    and I to L, that is, of the ratios of the speed A to the speed B

    and of the time-interval C to the time-interval D. Q. E. D. [195]

    THEOREM V, PROPOSITION V

    If two particles are moved at a uniform rate, but with un- equal speeds, through unequal distances, then the ratio of

    the time-intervals occupied will be the product of the ratio of the distances by the inverse ratio of the speeds.

    Let the two moving particles be denoted by A and B, and let the speed of A be to the speed of B in the ratio of V to T; in like manner let

    the distances trav- JU ersed be in the ratio Fig- 43

    of S to R; then I say that the ratio of the time-interval during which the motion of A occurs to the time-interval occupied by the motion of B is the product of the ratio of the speed T to the speed V by the ratio of the distance S to the distance R.

    Let C be the time-interval occupied by the motion of A, and

    let

    THIRD DAY 159

    let the time-interval C bear to a time-interval E the same ratio as the speed T to the speed V. And since C is the time-interval during which A, with speed

    V, traverses the distance S and since T, the speed of B, is to the

    speed V, as the time-interval C is to the time-interval E, then E will be the time required by the particle B to traverse the distance S. If now we let the time-interval E be to the time- interval G as the distance S is to the distance R, then it follows that G is the time required by B to traverse the space R. Since the ratio of C to G is the product of the ratios C to E and E to G (while also the ratio of C to E is the inverse ratio of the speeds of A and B respectively, i. e., the ratio of T to V) ; and since the ratio of E to G is the same as that of the distances S and R

    respectively, the proposition is proved.

    [196]

    THEOREM VI, PROPOSITION VI

    If two particles are carried at a uniform rate, the ratio of their speeds will be the product of the ratio of the distances

    traversed by the inverse ratio of the time-intervals occupied. Let A and B be the two particles which move at a uniform

    rate; and let the respective distances traversed by them have the ratio of V V” C — to T, but let the J£ *

    time-intervals be ̂ * “E as S to R. Then T

    I say the speed B _______ G — of A will bear R

    to the speed of FlS- 44 B a ratio which is the product of the ratio of the distance V to the distance T and the time-interval R to the time-interval S.

    Let C be the speed at which A traverses the distance V during the time-interval S; and let the speed C bear the same ratio to another speed E as V bears to T; then E will be the speed at which B traverses the distance T during the time-interval S. If now the speed E is to another speed G as the time-interval R is to the time-interval S, then G will be the speed at which the

    particle

    160 THE TWO NEW SCIENCES OF GALILEO

    particle B traverses the distance T during the time-interval R. Thus we have the speed C at which the particle A covers the distance V during the time S and also the speed G at which the particle B traverses the distance T during the time R. The ratio of C to G is the product of the ratio C to E and E to G; the ratio of C to E is by definition the same as the ratio of the distance V to distance T; and the ratio of E to G is the same as the ratio of R to S. Hence follows the proposition.

    SALV. The preceding is what our Author has written concern- ing uniform motion. We pass now to a new and more discrim-

    inating consideration of naturally accelerated motion, such as that generally experienced by heavy falling bodies; following is the title and introduction.

    [197]

    NATURALLY ACCELERATED MOTION

    The properties belonging to uniform motion have been dis- cussed in the preceding section; but accelerated motion remains

    to be considered.

    And first of all it seems desirable to find and explain a defini- tion best fitting natural phenomena. For anyone may invent an

    arbitrary type of motion and discuss its properties; thus, for instance, some have imagined helices and conchoids as described by certain motions which are not met with in nature, and have very commendably established the properties which these curves possess in virtue of their definitions; but we have decided to con-

    sider the phenomena of bodies falling with an acceleration such as actually occurs in nature and to make this definition of accelerated motion exhibit the essential features of observed accelerated motions. And this, at last, after repeated efforts we trust we have succeeded in doing. In this belief we are confirmed mainly by the consideration that experimental results are seen to agree with and exactly correspond with those properties which have been, one after another, demonstrated by us. Finally, in the investigation of naturally accelerated motion we were led, by hand as it were, in following the habit and custom of

    nature

    THIRD DAY 161

    nature herself, in all her various other processes, to employ only those means which are most common, simple and easy.

    For I think no one believes that swimming or flying can be accomplished in a manner simpler or easier than that instinc-

    tively employed by fishes and birds. When, therefore, I observe a stone initially at rest falling

    from an elevated position and continually acquiring new in- crements of speed, why should I not believe that such increases

    take place in a manner which is exceedingly simple and rather obvious to everybody? If now we examine the matter carefully we find no addition or increment more simple than that which repeats itself always in the same manner. This we readily understand when we consider the intimate relationship between time and motion; for just as uniformity of motion is defined by and conceived through equal times and equal spaces (thus we call a motion uniform when equal distances are traversed during

    equal time-intervals), so also we may, in a similar manner, through equal time-intervals, conceive additions of speed as taking place without complication; thus we may picfture to our

    mind a motion as uniformly and continuously accelerated when, during any equal intervals of time whatever, equal increments of speed are given to it. Thus if any equal intervals of time whatever have elapsed, counting from the time at which the moving body left its position of rest and began to descend, the amount of speed acquired during the first two time-intervals will be double that acquired during the first time-interval alone; so the amount added during three of these time-intervals will be treble; and that in four, quadruple that of the first time- interval. To put the matter more clearly, if a body were to continue its motion with the same speed which it had acquired during the first time-interval and were to retain this same uni-

    form speed, then its motion would be twice as slow as that which it would have if its velocity had been acquired during two time- intervals.

    And thus, it seems, we shall not be far wrong if we put the increment of speed as proportional to the increment of time;

    hence

    162 THE TWO NEW SCIENCES OF GALILEO

    hence the definition of motion which we are about to discuss may be stated as follows : A motion is said to be uniformly accelerated, when starting from rest, it acquires, during equal time-intervals, equal increments of speed.

    SAGR. Although I can offer no rational objection to this or indeed to any other definition, devised by any author whomso-

    ever, since all definitions are arbitrary, I may nevertheless without offense be allowed to doubt whether such a definition as the above, established in an abstract manner, corresponds to and describes that kind of accelerated motion which we meet in nature in the case of freely falling bodies. And since the Author apparently maintains that the motion described in his defini-

    tion is that of freely falling bodies, I would like to clear my mind of certain difficulties in order that I may later apply myself more earnestly to the propositions and their demon- strations.

    SALV. It is well that you and Simplicio raise these difficulties. They are, I imagine, the same which occurred to me when I first saw this treatise, and which were removed either by discus-

    sion with the Author himself, or by turning the matter over in my own mind.

    SAGR. When I think of a heavy body falling from rest, that is, starting with zero speed and gaining speed in proportion to the

    [199] .

    time from the beginning of the motion; such a motion as would, for instance, in eight beats of the pulse acquire eight degrees of speed; having at the end of the fourth beat acquired four de-

    grees; at the end of the second, two; at the end of the first, one: and since time is divisible without limit, it follows from all these considerations that if the earlier speed of a body is less than its present speed in a constant ratio, then there is no degree of speed however small (or, one may say, no degree of slowness however great) with which we may not find this body travelling after starting from infinite slowness, i. e., from rest. So that if that speed which it had at the end of the fourth beat was such that, if kept uniform, the body would traverse two miles in an hour, and if keeping the speed which it had at the end of the

    second

    THIRD DAY 163 second beat, it would traverse one mile an hour, we must infer

    that, as the instant of starting is more and more nearly ap- proached, the body moves so slowly that, if it kept on moving at

    this rate, it would not traverse a mile in an hour, or in a day, or in a year or in a thousand years; indeed, it would not traverse a span in an even greater time; a phenomenon which baffles the imagination, while our senses show us that a heavy falling body suddenly acquires great speed.

    SALV. This is one of the difficulties which I also at the begin- ning, experienced, but which I shortly afterwards removed; and

    the removal was effected by the very experiment which creates the difficulty for you. You say the experiment appears to show that immediately after a heavy body starts from rest it acquires a very considerable speed: and I say that the same experiment makes clear the fact that the initial motions of a falling body, no matter how heavy, are very slow and gentle. Place a heavy body upon a yielding material, and leave it there without any pressure except that owing to its own weight; it is clear that if one lifts this body a cubit or two and allows it to fall upon the same material, it will, with this impulse, exert a new and greater pressure than that caused by its mere weight; and this effect is brought about by the [weight of the] falling body together with the velocity acquired during the fall, an effect which will be greater and greater according to the height of the fall, that is according as the velocity of the falling body becomes greater. From the quality and intensity of the blow we are thus enabled to accurately estimate the speed of a falling body. But tell me, gentlemen, is it not true that if a block be allowed to fall upon a stake from a height of four cubits and drives it into the earth,

    [200] say, four finger-breadths, that coming from a height of two cubits it will drive the stake a much less distance, and from the height of one cubit a still less distance; and finally if the block be lifted only one finger-breadth how much more will it accomplish than if merely laid on top of the stake without percussion? Certainly very little. If it be lifted only the thickness of a leaf, the effect will be altogether imperceptible. And since the

    effect

    164 THE TWO NEW SCIENCES OF GALILEO

    effect of the blow depends upon the velocity of this striking body, can any one doubt the motion is very slow and the speed more than small whenever the effect [of the blow] is impercepti-

    ble ? See now the power of truth; the same experiment which at first glance seemed to show one thing, when more carefully examined, assures us of the contrary.

    But without depending upon the above experiment, which is doubtless very conclusive, it seems to me that it ought not to be difficult to establish such a fact by reasoning alone. Imagine a heavy stone held in the air at rest; the support is removed and the stone set free; then since it is heavier than the air it begins to fall, and not with uniform motion but slowly at the beginning and with a continuously accelerated motion. Now since velocity can be increased and diminished without limit, what reason is there to believe that such a moving body starting with infinite slowness, that is, from rest, immediately acquires a speed of ten degrees rather than one of four, or of two, or of one, or of a half, or of a hundredth; or, indeed, of any of the infinite number of small values [of speed] ? Pray listen. I hardly think you will refuse to grant that the gain of speed of the stone falling from rest follows the same sequence as the diminution and loss of this same speed when, by some impelling force, the stone is thrown to its former elevation : but even if you do not grant this, I do not see how you can doubt that the ascending stone, diminishing in speed, must before coming to rest pass through every possible degree of slowness.

    SIMP. But if the number of degrees of greater and greater slowness is limitless, they will never be all exhausted, therefore such an ascending heavy body will never reach rest, but will continue to move without limit always at a slower rate; but this is not the observed fact.

    SALV. This would happen, Simplicio, if the moving body were to maintain its speed for any length of time at each degree of velocity; but it merely passes each point without delaying more than an instant: and since each time-interval however

    [201] small may be divided into an infinite number of instants, these

    will

    THIRD DAY 165

    will always be sufficient [in number] to correspond to the infinite degrees of diminished velocity.

    That such a heavy rising body does not remain for any length of time at any given degree of velocity is evident from the fol-

    lowing: because if, some time-interval having been assigned, the body moves with the same speed in the last as in the first in-

    stant of that time-interval, it could from this second degree of elevation be in like manner raised through an equal height, just as it was transferred from the first elevation to the second, and by the same reasoning would pass from the second to the third and would finally continue in uniform motion forever.

    SAGR. From these considerations it appears to me that we may obtain a proper solution of the problem discussed by philosophers, namely, what causes the acceleration in the natural motion of heavy bodies ? Since, as it seems to me, the force [virtu] impressed by the agent projecting the body upwards diminishes continuously, this force, so long as it was greater than the contrary force of gravitation, impelled the body upwards; when the two are in equilibrium the body ceases to rise and passes through the state of rest in which the impressed impetus [impeto] is not destroyed, but only its excess over the weight of the body has been consumed — the excess which caused the body to rise. Then as the diminution of the outside impetus [impeto] continues, and gravitation gains the upper hand, the fall begins, but slowly at first on account of the opposing impetus [virtu impressa], a large portion of which still remains in the body; but as this continues to diminish it also continues to be more and

    more overcome by gravity, hence the continuous acceleration of motion.

    SIMP. The idea is clever, yet more subtle than sound; for even if the argument were conclusive, it would explain only the case in which a natural motion is preceded by a violent motion, in which there still remains active a portion of the external force [virtu esterna] ; but where there is no such remaining portion and the body starts from an antecedent state of rest, the cogency of the whole argument fails.

    SAGR. I believe that you are mistaken and that this distinc- tion

    166 THE TWO NEW SCIENCES OF GALILEO

    tion between cases which you make is superfluous or rather non- existent. But, tell me, cannot a projectile receive from the

    projector either a large or a small force [virtu] such as will throw it to a height of a hundred cubits, and even twenty or four or one?

    [202] SIMP. Undoubtedly, yes. SAGR. So therefore this impressed force [virtu impressa] may

    exceed the resistance of gravity so slightly as to raise it only a finger-breadth; and finally the force [virtu] of the projector may be just large enough to exadlly balance the resistance of gravity so that the body is not lifted at all but merely sus-

    tained. When one holds a stone in his hand does he do anything but give it a force impelling [virtu impellente] it upwards equal to the power [facolta] of gravity drawing it downwards ? And do you not continuously impress this force [virtu] upon the stone as long as you hold it in the hand? Does it perhaps diminish with the time during which one holds the stone?

    And what does it matter whether this support which prevents

    the stone from falling is furnished by one’s hand or by a table or by a rope from which it hangs? Certainly nothing at all. You must conclude, therefore, Simplicio, that it makes no difference whatever whether the fall of the stone is preceded by a period of rest which is long, short, or instantaneous provided only the fall does not take place so long as the stone is adled upon by a force [virtu] opposed to its weight and sufficient to hold it at rest.

    SALV. The present does not seem to be the proper time to investigate the cause of the acceleration of natural motion con-

    cerning which various opinions have been expressed by various philosophers, some explaining it by attraction to the center, others to repulsion between the very small parts of the body, while still others attribute it to a certain stress in the surrounding medium which closes in behind the falling body and drives it from one of its positions to another. Now, all these fantasies, and others too, ought to be examined; but it is not really worth while. At present it is the purpose of our Author merely to

    investigate

    THIRD DAY 167

    investigate and to demonstrate some of the properties of ac- celerated motion (whatever the cause of this acceleration may

    be) — meaning thereby a motion, such that the momentum of its velocity [i momenti delta sua velocita] goes on increasing after departure from rest, in simple proportionality to the time, which is the same as saying that in equal time-intervals the body receives equal increments of velocity; and if we find the proper-

    ties [of accelerated motion] which will be demonstrated later are realized in freely falling and accelerated bodies, we may conclude that the assumed definition includes such a motion of falling bodies and that their speed [accelerazione] goes on increasing as

    the time and the duration of the motion. [203]

    SAGR. So far as I see at present, the definition might have been put a little more clearly perhaps without changing the fundamental idea, namely, uniformly accelerated motion is such that its speed increases in proportion to the space traversed; so that, for example, the speed acquired by a body in falling four cubits would be double that acquired in falling two cubits and this latter speed would be double that acquired in the first cubit. Because there is no doubt but that a heavy body falling from the height of six cubits has, and strikes with, a momentum [impeto] double that it had at the end of three cubits, triple that which it had at the end of one.

    SALV. It is very comforting to me to have had such a com- panion in error; and moreover let me tell you that your proposi-

    tion seems so highly probable that our Author himself admitted, when I advanced this opinion to him, that he had for some time shared the same fallacy. But what most surprised me was to

    see two propositions so inherently probable that they com- manded the assent of everyone to whom they were presented,

    proven in a few simple words to be not only false, but im- possible.

    SIMP. I am one of those who accept the proposition, and believe that a falling body acquires force [vires] in its descent, its velocity increasing in proportion to the space, and that the momentum [momenta] of the falling body is doubled when it falls

    from

    168 THE TWO NEW SCIENCES OF GALILEO

    from a doubled height; these propositions, it appears to me, ought to be conceded without hesitation or controversy.

    SALV. And yet they are as false and impossible as that motion should be completed instantaneously; and here is a very clear demonstration of it. If the velocities are in proportion to the spaces traversed, or to be traversed, then these spaces are traversed in equal intervals of time; if, therefore, the velocity with which the falling body traverses a space of eight feet were double that with which it covered the first four feet (just as the one distance is double the other) then the time-intervals re-

    quired for these passages would be equal. But for one and the same body to fall eight feet and four feet in the same time is possible only in the case of instantaneous [discontinuous] motion; [204]

    but observation shows us that the motion of a falling body oc- cupies time, and less of it in covering a distance of four feet than

    of eight feet; therefore it is not true that its velocity increases in proportion to the space.

    The falsity of the other proposition may be shown with equal clearness. For if we consider a single striking body the difference of momentum in its blows can depend only upon difference of velocity; for if the striking body falling from a double height were to deliver a blow of double momentum, it would be neces-

    sary for this body to strike with a doubled velocity; but with this doubled speed it would traverse a doubled space in the same time-interval; observation however shows that the time required for fall from the greater height is longer.

    SAGR. You present these recondite matters with too much evidence and ease; this great facility makes them less appre-

    ciated than they would be had they been presented in a more abstruse manner. For, in my opinion, people esteem more lightly that knowledge which they acquire with so little labor than that acquired through long and obscure discussion.

    SALV. If those who demonstrate with brevity and clearness the fallacy of many popular beliefs were treated with contempt instead of gratitude the injury would be quite bearable; but on the other hand it is very unpleasant and annoying to see men,

    who

    THIRD DAY 169

    who claim to be peers of anyone in a certain field of study, take for granted certain conclusions which later are quickly and easily shown by another to be false. I do not describe such a feeling as one of envy, which usually degenerates into hatred and anger against those who discover such fallacies; I would call it a strong desire to maintain old errors, rather than accept newly discovered truths. This desire at times induces them to unite against these truths, although at heart believing in them, merely for the purpose of lowering the esteem in which certain others are held by the unthinking crowd. Indeed, I have heard from our Academician many such fallacies held as true but easily refutable; some of these I have in mind.

    SAGR. You must not withhold them from us, but, at the proper time, tell us about them even though an extra session be necessary. But now, continuing the thread of our talk, it would [205]

    seem that up to the present we have established the definition of uniformly accelerated motion which is expressed as follows:

    A motion is said to be equally or uniformly accelerated when, starting from rest, its momentum (celeritatis momenta) receives equal increments in equal times.

    SALV. This definition established, the Author makes a single assumption, namely,

    The speeds acquired by one and the same body moving down planes of different inclinations are equal when the heights of these planes are equal.

    By the height of an inclined plane we mean the perpendicular let fall from the upper end of the plane upon the horizontal line drawn through the lower end of the same plane. Thus, to illustrate, let the line AB be horizontal, and let the planes CA and CD be inclined to it; then the Author calls the perpendicular

    CB the “height” of the planes CA and CD; he supposes that he speeds acquired by one and the same body, descending ilong the planes CA and CD to the terminal points A and D are equal since the heights of these planes are the same, CB; and Iso it must be understood that this speed is that which would

    be acquired by the same body falling from C to B. Sagr.

    1 70 THE TWO NEW SCIENCES OF GALILEO SAGR. Your assumption appears to me so reasonable that it

    ought to be conceded without question, provided of course there are no chance or outside resistances, and that the planes are

    chard and smooth, and that the figure of the moving body is per- fedtly round, so that neither plane nor moving body is rough. All re? sistance and opposition having been removed, my reason tells

    me at once that a heavy and per- 45 fedtly round ball descending along

    the lines CA, CD, CB would reach the terminal points A, D, B, with equal momenta [impeti eguali\.

    SALV. Your words are very plausible; but I hope by experi- ment to increase the probability to an extent which shall be little

    short of a rigid demonstration. [206]

    Imagine this page to represent a vertical wall, with a nail driven into it; and from the nail let there be suspended a lead bullet of one or two ounces by means of a fine vertical thread, AB, say from four to six feet long, on this wall draw a horizontal line DC, at right angles to the vertical thread AB, which hangs

    about two finger-breadths in front of the wall. Now bring the thread AB with the attached ball into the position AC and set it free; first it will be observed to descend along the arc CBD, to pass the point B, and to travel along the arc BD, till it almost reaches the horizontal CD, a slight shortage being caused by the resistance of the air and the string; from this we may rightly infer that the ball in its descent through the arc CB acquired a momentum [impeto] on reaching B, which was just sufficient to carry it through a similar arc BD to the same height. Having repeated this experiment many times, let us now drive a nail into the wall close to the perpendicular AB, say at E or F, so that it projects out some five or six finger-breadths in order that the thread, again carrying the bullet through the arc CB, may strike upon the nail E when the bullet reaches B, and thus compel it to traverse the arc BG, described about E as center. From this

    we

    THIRD DAY 171

    we can see what can be done by the same momentum [impeto] which previously starting at the same point B carried the same body through the arc BD to the horizontal CD. Now, gentle-

    men, you will observe with pleasure that the ball swings to the point G in the horizontal, and you would see the same thing happen if the obstacle were placed at some lower point, say at F, about which the ball would describe the arc BI, the rise of the

    Fig. 46

    ball always terminating exactly on the line CD. But when the nail is placed so low that the remainder of the thread below it will not reach to the height CD (which would happen if the nail were placed nearer B than to the intersection of AB with the [207]

    horizontal CD) then the thread leaps over the nail and twists itself about it.

    This experiment leaves no room for doubt as to the truth of our supposition; for since the two arcs CB and DB are equal and similarly placed, the momentum [momenta] acquired by the fall through the arc CB is the same as that gained by fall through the arc DB; but the momentum [momenta] acquired at B, owing to fall through CB, is able to lift the same body [mobile] through the arc BD; therefore, the momentum acquired in the fall BD is equal to that which lifts the same body through the same arc from B to D; so, in general, every momentum acquired by fall through

    172 THE TWO NEW SCIENCES OF GALILEO through an arc is equal to that which can lift the same body through the same arc. But all these momenta [momenti\ which cause a rise through the arcs BD, BG, and BI are equal, since they are produced by the same momentum, gained by fall through CB, as experiment shows. Therefore all the momenta gained by fall through the arcs DB, GB, IB are equal.

    SAGR. The argument seems to me so conclusive and the ex- periment so well adapted to establish the hypothesis that we

    may, indeed, consider it as demonstrated. SALV. I do not wish, Sagredo, that we trouble ourselves too

    much about this matter, since we are going to apply this principle mainly in motions which occur on plane surfaces, and not upon curved, along which acceleration varies in a manner greatly different from that which we have assumed for planes.

    So that, although the above experiment shows us that the descent of the moving body through the arc CB confers upon it momentum [momenta] just sufficient to carry it to the same height through any of the arcs BD, BG, BI, we are not able, by similar means, to show that the event would be identical in the case of a perfectly round ball descending along planes whose inclinations are respectively the same as the chords of these arcs. It seems likely, on the other hand, that, since these planes form angles at the point B, they will present an obstacle to the ball which has descended along the chord CB, and starts to rise along the chord BD, BG, BI.

    In striking these planes some of its momentum [impeto] will be lost and it will not be able to rise to the height of the line CD; but this obstacle, which interferes with the experiment, once removed, it is clear that the momentum [impeto] (which gains

    [208] in strength with descent) will be able to carry the body to the

    same height. Let us then, for the present, take this as a pos- tulate, the absolute truth of which will be established when we

    find that the inferences from it correspond to and agree per- fectly with experiment. The author having assumed this single

    principle passes next to the propositions which he clearly dem- onstrates; the first of these is as follows :

    THIRD DAY

    THEOREM I, PROPOSITION I

    The time in which any space is traversed by a body start- ing from rest and uniformly accelerated is equal to the time

    in which that same space would be traversed by the same body moving at a uniform speed whose value is the mean of the highest speed and the speed just before acceleration began.

    Let us represent by the line AB the time in which the space CD is traversed by a body which starts from rest at C and is uniformly accelerated; let the final and highest value of the speed gained during the interval AB be represented by the line EB drawn at right angles to AB; draw the line AE, then all lines drawn from equidistant points on AB and parallel to BE will represent the increasing values of the speed, C beginning with the instant A. Let the point F bisect the line EB; draw FG parallel to BA, and GA parallel to FB, thus forming a parallel-

    ogram AGFB which will be equal in area to the triangle AEB, since the side GF bisects the side AE at the point I; for if the parallel lines in the triangle AEB are extended to GI, then the sum of all the parallels contained in the quadrilateral

    is equal to the sum of those contained in the tri- angle AEB; for those in the triangle IEF are

    equal to those contained in the triangle GIA, while those included in the trapezium AIFB are common. Since each and every instant of time in the time-interval AB has its corresponding point on the line AB, from which points par-

    allels drawn in and limited by the triangle AEB ** represent the increasing values of the growing Fig. 47 velocity, and since parallels contained within the rectangle rep-

    resent the values of a speed which is not increasing, but constant, it appears, in like manner, that the momenta [momenta] assumed by the moving body may also be represented, in the case of the accelerated motion, by the increasing parallels of the triangle

    AEB,

    174 THE TWO NEW SCIENCES OF GALILEO [209]

    AEB, and, in the case of the uniform motion, by the parallels of the redlangle GB. For, what the momenta may lack in the first part of the accelerated motion (the deficiency of the momenta being represented by the parallels of the triangle AGI) is made up by the momenta represented by the parallels of the triangle IEF. Hence it is clear that equal spaces will be traversed in equal

    times by two bodies, one of which, starting from rest, moves with A H a uniform acceleration, while the momentum of

    the other, moving with uniform speed, is one-half its maximum momentum under accelerated mo-

    tion. Q. E. D.

    THEOREM II, PROPOSITION II

    The spaces described by a body falling from rest with a uniformly accelerated motion are to each

    other as the squares of the time-intervals em- ployed in traversing these distances.

    Let the time beginning with any instant A be rep- resented by the straight line AB in which are taken

    any two time-intervals AD and AE. Let HI repre- sent the distance through which the body, starting

    from rest at H, falls with uniform acceleration. If

    HL represents the space traversed during the time- interval AD, and HM that covered during the in-

    terval AE, then the space MH stands to the space LH in a ratio which is the square of the ratio of the time AE to the time AD; or we may say simply that the distances HM and HL are related as the squares

    Fig. 48′ of AE and AD. Draw the line AC making any angle whatever with the line

    AB; and from the points D and E, draw the parallel lines DO and EP; of these two lines, DO represents the greatest velocity attained during the interval AD, while EP represents the max-

    imum velocity acquired during the interval AE. But it has

    just been proved that so far as distances traversed are con- cerned

    THIRD DAY 175 cerned it is precisely the same whether a body falls from rest with a uniform acceleration or whether it falls during an equal time-interval with a constant speed which is one-half the max-

    imum speed attained during the accelerated motion. It follows therefore that the distances HM and HL are the same as would

    be traversed, during the time-intervals AE and AD, by uniform velocities equal to one-half those represented by DO and EP respectively. If, therefore, one can show that the distances HM and HL are in the same ratio as the squares of the time- intervals AE and AD, our proposition will be proven.

    [210] But in the fourth proposition of the first book [p. 157 above]

    it has been shown that the spaces traversed by two particles in uniform motion bear to one another a ratio which is equal to the product of the ratio of the velocities by the ratio of the times. But in this case the ratio of the velocities is the same as the ratio

    of the time-intervals (for the ratio of AE to AD is the same as that of y2 EP to K DO or of EP to DO) . Hence the ratio of the spaces traversed is the same as the squared ratio of the time- intervals. Q. E. D.

    Evidently then the ratio of the distances is the square of the ratio of the final velocities, that is, of the lines EP and DO, since these are to each other as AE to AD.

    COROLLARY I

    Hence it is clear that if we take any equal intervals of time whatever, counting from the beginning of the motion, such as AD, DE, EF, FG, in which the spaces HL, LM, MN, NI are traversed, these spaces will bear to one another the same ratio as the series of odd numbers, I, 3, 5, 7; for this is the ratio of the differences of the squares of the lines [which represent time], differences which exceed one another by equal amounts, this excess being equal to the smallest line [viz. the one representing a single time-interval] : or we may say [that this is the ratio] of the differences of the squares of the natural numbers beginning with unity. While,

    I

    ~7

    /

    176 THE TWO NEW SCIENCES OF GALILEO While, therefore, during equal intervals of time the velocities

    increase as the natural numbers, the increments in the distances

    traversed during these equal time-intervals are to one another as the odd numbers beginning with unity.

    SAGR. Please suspend the discussion for a moment since there just occurs to me an idea which I want to illustrate by means of a diagram in order that it may be clearer both to you and tome.

    Let the line AI represent the lapse of time measured from the initial instant A; through A draw the straight line AF making

    Aany angle whatever; join the terminal points I and F; divide the time AI in half at C; draw CB parallel to IF. Let us consider CB as the maximum value of the velocity which increases from zero

    C at the beginning, in simple proportion- ality to the intercepts on the triangle

    ABC of lines drawn parallel to BC; or what is the same thing, let us suppose the velocity to increase in proportion to the

    r time; then I admit without question, in view of the preceding argument, that the space described by a body falling in the aforesaid manner will be equal to the space traversed by the same body during the same length of time travelling with a uniform speed equal to EC, the half of BC. Further let us imagine that the

    Fig- 49 [211] body has fallen with accelerated motion so that, at the in-

    stant C, it has the velocity BC. It is clear that if the body continued to descend with the same speed BC, without ac-

    celeration, it would in the next time-interval CI traverse double the distance covered during the interval AC, with the uniform speed EC which is half of BC; but since the falling body acquires equal increments of speed during equal increments of time, it follows that the velocity BC, during the next time-

    interval

    THIRD DAY 17?

    interval CI will be increased by an amount represented by the parallels of the triangle BFG which is equal to the triangle ABC. If, then, one adds to the velocity GI half of the velocity FG, the

    highest speed acquired by the accelerated motion and deter- mined by the parallels of the triangle BFG, he will have the

    uniform velocity with which the same space would have been described in the time CI; and since this speed IN is three times

    as great as EC it follows that the space described during the in- terval CI is three times as great as that described during the inter-

    val AC. Let us imagine the motion extended over another equal time-interval IO, and the triangle extended to APO; it is then evident that if the motion continues during the interval IO, at the constant rate IF acquired by acceleration during the time AI, the space traversed during the interval IO will be four times that traversed during the first interval AC, because the speed IF is four times the speed EC. But if we enlarge our triangle so as to include FPQ which is equal to ABC, still assuming the accelera-

    tion to be constant, we shall add to the uniform speed an incre- ment RQ, equal to EC; then the value of the equivalent uniform

    speed during the time-interval IO will be five times that during the first time-interval AC; therefore the space traversed will be quintuple that during the first interval AC. It is thus evident by simple computation that a moving body starting from rest and acquiring velocity at a rate proportional to the time, will, during equal intervals of time, traverse distances which are related to each other as the odd numbers beginning with unity,

    J> 3> 5; * or considering the total space traversed, that covered [2I2[

    in double time will be quadruple that covered during unit time; in triple time, the space is nine times as great as in unit time.

    * As illustrating the greater elegance and brevity of modern analytical methods, one may obtain the result of Prop. II directly from the fun-

    damental equation

    S = l/2 g (t\ ” *2l) = g/2 (*2 + tl) (*2 – *l)

    where g is the acceleration of gravity and j, the space traversed between the instants t\ and t-2. If now ̂ – 1\ = I , say one second, then s = g/2 (% + *i) where *2+*i, must always be an odd number, seeing that it is the sum of two consecutive terms in the series of natural numbers. [Trans.]

    178 THE TWO NEW SCIENCES OF GALILEO And in general the spaces traversed are in the duplicate ratio of the times, i. e., in the ratio of the squares of the times.

    SIMP. In truth, I find more pleasure in this simple and clear

    argument of Sagredo than in the Author’s demonstration which to me appears rather obscure; so that I am convinced that matters are as described, once having accepted the definition of uniformly accelerated motion. But as to whether this accelera-

    tion is that which one meets in nature in the case of falling bodies, I am still doubtful; and it seems to me, not only for my own sake but also for all those who think as I do, that this would be the proper moment to introduce one of those experi-

    ments— and there are many of them, I understand — which illustrate in several ways the conclusions reached.

    SALV. The request which you, as a man of science, make, is a

    very reasonable one; for this is the custom — and properly so — in those sciences where mathematical demonstrations are applied to natural phenomena, as is seen in the case of perspective, astronomy, mechanics, music, and others where the principles, once established by well-chosen experiments, become the founda-

    tions of the entire superstructure. I hope therefore it will not appear to be a waste of time if we discuss at considerable length this first and most fundamental question upon which hinge numerous consequences of which we have in this book only a small number, placed there by the Author, who has done so much to open a pathway hitherto closed to minds of speculative turn. So far as experiments go they have not been neglected by the Author; and often, in his company, I have attempted in the following manner to assure myself that the acceleration actually experienced by falling bodies is that above described. A piece of wooden moulding or scantling, about 12 cubits

    long, half a cubit wide, and three finger-breadths thick, was taken; on its edge was cut a channel a little more than one finger in breadth; having made this groove very straight, smooth, and polished, and having lined it with parchment, also as smooth and polished as possible, we rolled along it a hard, smooth, and very round bronze ball. Having placed this

    board

    THIRD DAY 179

    board in a sloping position, by lifting one end some one or two cubits above the other, we rolled the ball, as I was just saying, along the channel, noting, in a manner presently to be described, the time required to make the descent. We repeated this ex-

    periment more than once in order to measure the time with an accuracy such that the deviation between two observations never exceeded one-tenth of a pulse-beat. Having performed this operation and having assured ourselves of its reliability, we now rolled the ball only one-quarter the length of the channel; and having measured the time of its descent, we found it pre-

    cisely one-half of the former. Next we tried other distances, com- paring the time for, the whole length with that for the half, or with

    that for two-thirds, or three-fourths, or indeed for any fraction; in such experiments, repeated a full hundred times, we always found that the spaces traversed were to each other as the squares of the times, and this was true for all inclinations of the plane, i. e., of the channel, along which we rolled the ball. We also observed that the times of descent, for various inclinations of the plane, bore to one another precisely that ratio which, as we shall see later, the Author had predicted and demonstrated for them.

    For the measurement of time, we employed a large vessel of water placed in an elevated position; to the bottom of this vessel was soldered a pipe of small diameter giving a thin jet of water, which we collected in a small glass during the time of each descent, whether for the whole length of the channel or for a part of its length; the water thus collected was weighed, after each descent, on a very accurate balance; the differences and ratios of these weights gave us the differences and ratios of the times, and this with such accuracy that although the operation was re-

    peated many, many times, there was no appreciable discrepancy in the results.

    SIMP. I would like to have been present at these experiments; but feeling confidence in the care with which you performed them, and in the fidelity with which you relate them, I am satisfied and accept them as true and valid

    SALV. Then we can proceed without discussion.

    i8o THE TWO NEW SCIENCES OF GALILEO

    COROLLARY II

    Secondly, it follows that, starting from any initial point, if we take any two distances, traversed in any time-intervals whatso-

    ever, these time-intervals bear to one another the same ratio as one of the distances to the mean proportional of the two distances.

    For if we take two distances ST and SY measured from

    the initial point S, the mean proportional of which is SX, the time of fall through ST is to the time of fall through

    X SY as ST is to SX; or one may say the time of fall through SY is to the time of fall through ST as SY is to SX. Now since it has been shown that the spaces traversed are in

    LT the same ratio as the squares of the times ; and since, more- Fig. 5° over, the ratio of the space SY to the space ST is the square of the ratio SY to SX, it follows that the ratio of the times of fall through SY and ST is the ratio of the respective distances SY and SX.

    SCHOLIUM

    The above corollary has been proven for the case of vertical fall; but it holds also for planes inclined at any angle; for it is to be assumed that along these planes the velocity increases in the same ratio, that is, in proportion to the time, or, if you prefer, as the series of natural numbers.*

    SALV. Here, Sagredo, I should like, if it be not too tedious to Simplicio, to interrupt for a moment the present discussion in order to make some additions on the basis of what has already been proved and of what mechanical principles we have already learned from our Academician. This addition I make for the

    better establishment on logical and experimental grounds, of the principle which we have above considered; and what is more important, for the purpose of deriving it geometrically, after first demonstrating a single lemma which is fundamental in the science of motion [impeti].

    * The dialogue which intervenes between this Scholium and the follow- ing theorem was elaborated by Viviani, at the suggestion of Galileo.

    See National Edition, viii, 23. [Trans.}

    THIRD DAY 181

    SAGR. If the advance which you propose to make is such as will confirm and fully establish these sciences of motion, I will gladly devote to it any length of time. Indeed, I shall not only [215]

    be glad to have you proceed, but I beg of you at once to satisfy the curiosity which you have awakened in me concerning your proposition; and I think that Simplicio is of the same mind.

    SIMP. Quite right. SALV. Since then I have your permission, let us first of all con-

    sider this notable fact, that the momenta or speeds [i momenti o le velocita] of one and the same moving body vary with the inclina-

    tion of the plane. The speed reaches a maximum along a vertical direction, and

    for other directions diminishes as the plane diverges from the

    vertical. Therefore the impetus, ability, energy, [I’impeto, il talento, I’energia] or, one might say, the momentum [il momenta] of descent of the moving body is diminished by the plane upon which it is supported and along which it rolls.

    For the sake of greater clearness erect the line AB perpendicular to the horizontal AC; next draw AD, AE, AF, etc., at different inclinations to the horizontal. Then I say that all the momentum of the falling body is along the vertical and is a maximum when it falls in that direction; the momentum is less along DA and still less along EA, and even less yet along the more inclined plane FA. Finally on the horizontal plane the mo-

    mentum vanishes altogether; the body finds itself in a condition of indifference

    as to motion or rest; has no inherent tend- ency to move in any direction, and offers no resistance to being set in motion. For just as a heavy body or system of bodies cannot of itself move upwards, or recede from the common center [comun centra] HO} toward which all heavy things tend, so it j is impossible for any body of its own ac- O cord to assume any motion other than Fig. 51 one which carries it nearer to the aforesaid common center. Hence, along the horizontal, by which we understand a surface, every point of which is equidistant from this same common center, the body will have no momentum whatever.

    This

    182 THE TWO NEW SCIENCES OF GALILEO

    [216] This change of momentum being clear, it is here necessary for

    me to explain something which our Academician wrote when in Padua, embodying it in a treatise on mechanics prepared solely for the use of his students, and proving it at length and conclu-

    sively when considering the origin and nature of that marvellous machine, the screw. What he proved is the manner in which the momentum [impeto] varies with the inclination of the plane, as for instance that of the plane FA, one end of which is elevated through a vertical distance FC. This direction FC is that along which the momentum of a heavy body becomes a maximum; let us discover what ratio this momentum bears to that of the same body moving along the inclined plane FA. This ratio, I say, is the inverse of that of the aforesaid lengths. Such is the lemma preceding the theorem which I hope to demonstrate a little later.

    It is clear that the impelling force [impeto] acting on a body in descent is equal to the resistance or least force [resistenza o forza minima] sufficient to hold it at rest. In order to measure this force and resistance [forza e resistenza] I propose to use the weight of another body. Let us place upon the plane FA a body G con-

    nected to the weight H by means of a cord passing over the point F; then the body H will ascend or descend, along the perpendicular, the same distance which the body G ascends or descends along the inclined plane FA; but this distance will not be equal to the rise or fall of G along the vertical in which direction alone G, as other bodies, exerts its force [resistenza]. This is clear. For if we consider the motion of the body G, from A to F, in the triangle AFC to be made up of a horizontal component AC and a vertical component CF, and remember that this body experiences no resistance to motion along the horizontal (because by such a [217]

    motion the body neither gains nor loses distance from the common center of heavy things) it follows that resistance is met only in consequence of the body rising through the vertical distance CF. Since then the body G in moving from A to F offers resistance only in so far as it rises through the vertical distance CF, while the other body H must fall vertically through the entire distance FA, and since this ratio is maintained whether the motion be large or small, the two bodies being inextensibly connected, we are able to assert positively that, in case of equilibrium (bodies at rest) the

    momenta,

    THIRD DAY 183

    momenta, the velocities, or their tendency to motion [propensioni al moto], i. e., the spaces which would be traversed by them in equal times, must be in the inverse ratio to their weights. This is what has been demonstrated in every case of mechanical motion.* So that, in order to hold the weight G at rest, one must give H a weight smaller in the same ratio as the distance CF is smaller than FA. If we do this, FA:FC = weight G: weight H; then equilibrium will occur, that is, the weights H and G will have the same impell-

    ing forces [momenti eguali], and the two bodies will come to rest. And since we are agreed that the impetus, energy, momentum

    or tendency to motion of a moving body is as great as the force or least resistance [forza o resistenza minima] sufficient to stop it, and since we have found that the weight H is capable of preventing motion in the weight G, it follows that the less weight H whose en-

    tire force [momenta totale] is along the perpendicular, FC, will be an exact measure of the component of force [momenta parziale] which the larger weight G exerts along the plane FA. But the measure of the total force [total momento] on the body G is its own weight, since to prevent its fall it is only necessary to balance it with an equal weight, provided this second weight be free to move verti-

    cally; therefore the component of the force [momento parziale] on G along the inclined plane FA will bear to the maximum and total force on this same body G along the perpendicular FC the same ratio as the weight H to the weight G. This ratio is, by con-

    struction, the same which the height, FC, of the inclined plane bears to the length FA. We have here the lemma which I pro-

    posed to demonstrate and which, as you will see, has been as- sumed by our Author in the second part of the sixth proposition

    of the present treatise. SAGR. From what you have shown thus far, it appears to me

    that one might infer, arguing ex aequali con la proportione per- turbata, that the tendencies [momenti] of one and the same body to move along planes differently inclined, but having the same verti-

    cal height, as FA and FI, are to each other inversely as the lengths of the planes.

    [218] SALV. Perfectly right. This point established, I pass to the

    demonstration of the following theorem:

    * A near approach to the principle of virtual work enunciated by John Bernoulli in 1717. [Trans.]

    184 THE TWO NEW SCIENCES OF GALILEO If a body falls freely along smooth planes inclined at any angle whatsoever, but of the same height, the speeds with which it reaches the bottom are the same.

    First we must recall the fact that on a plane of any inclination whatever a body starting from rest gains speed or momentum

    [la quantita dell’impeto] in direct proportion to the time, in agree- ment with the definition of naturally accelerated motion given by

    the Author. Hence, as he has shown in the preceding proposition, the distances traversed are proportional to the squares of the times and therefore to the squares of the speeds. The speed relations are here the same as in the motion first studied [i. e., vertical motion], since in each case the gain of speed is proportional to the time.

    Let AB be an inclined plane whose height above the level BC is

    AC. As we have seen above the force impelling [I’impeto] a body to fall along the vertical AC is to the force which drives the same body along the in-

    clined plane AB as AB is to AC. On the incline AB, lay off AD a third propor-

    tional to AB and AC; then the force pro- ducing motion along AC is to that along

    AB (i. e., along AD) as the length AC is to the length AD. And therefore the body will traverse the space AD, along the in-

    cline AB, in the same time which it would occupy in falling the ver- tical distance AC, (since the forces [momenti] are in the same ratio

    as these distances) ; also the speed at C is to the speed at D as the distance AC is to the distance AD. But, according to the defini-

    tion of accelerated motion, the speed at B is to the speed of the same body at D as the time required to traverse AB is to the time required for AD; and, according to the last corollary of the second proposition, the time of passing through the distance AB bears to the time of passing through AD the same ratio as the distance AC (a mean proportional between AB and AD) to AD. Accord-

    ingly the two speeds at B and C each bear to the speed at D the same ratio, namely, that of the distances AC and AD; hence they are equal. This is the theorem which I set out to prove.

    From the above we are better able to demonstrate the following third proposition of the Author in which he employs the following principle, namely, the time required to traverse an inclined plane

    is

    THIRD DAY 185 is to that required to fall through the vertical height of the plane in the same ratio as the length of the plane to its height. [219]

    For, according to the second corollary of the second proposition, if BA represents the time required to pass over the distance BA, the time required to pass the distance AD will be a mean propor-

    tional between these two distances and will be represented by the line AC; but if AC represents the time needed to traverse AD it will also represent the time required to fall through the distance AC, since the distances AC and AD are traversed in equal times; consequently if AB represents the time required for AB then AC will represent the time required for AC. Hence the times required to traverse AB and AC are to each other as the distances AB and AC.

    In like manner it can be shown that the time required to fall through AC is to the time required for any other incline AE as the length AC is to the length AE; therefore, ex aequali, the time of fall along the incline AB is to that along AE as the distance AB is to the distance AE, etc.*

    One might by application of this same theorem, as Sagredo will readily see, immediately demonstrate the sixth proposition of the Author; but let us here end this digression which Sagredo has perhaps found rather tedious, though I consider it quite important for the theory of motion.

    SAGR. On the contrary it has given me great satisfaction, and indeed I find it necessary for a complete grasp of this principle.

    SALV. I will now resume the reading of the text. [215]

    THEOREM III, PROPOSITION III If one and the same body, starting from rest, falls along an inclined plane and also along a vertical, each having the same height, the times of descent will be to each other as the lengths of the inclined plane and the vertical.

    Let AC be the inclined plane and AB the perpendicular, each having the same vertical height above the horizontal, namely, BA; then I say, the time of descent of one and the same body

    * Putting this argument in a modern and evident notation, one has

    AC = 1/2 rt and AD = 1/2 si ̂ If now AC2 = AB- *&> h follows at once that td = k. [Trans.] Q. D. E.

    186 THE TWO NEW SCIENCES OF GALILEO [216]

    along the plane AC bears a ratio to the time of fall along the perpendicular AB, which is the same as the ratio of the length AC to the length AB. Let DG, El and LF be any lines parallel

    y^to the horizontal CB; then it follows from what has preceded that a body starting from A will acquire the same speed at the point G as at D, since in each case the vertical fall is the same; in like manner the speeds at I and E will be the same; so also those at L and F. And in general the speeds at the two extremi-

    ties of any parallel drawn from any point on AB to the corresponding point on AC will be

    C M. B equal. Fig- 53 Thus the two distances AC and AB are

    traversed at the same speed. But it has already been proved [217]

    that if two distances are traversed by a body moving with equal speeds, then the ratio of the times of descent will be the ratio of the distances themselves; therefore, the time of descent along AC is to that along AB as the length of the plane AC is to the vertical distance AB. Q. E. D.

    [218] SAGR. It seems to me that the above could have been proved

    clearly and briefly on the basis of a proposition already demon- strated, namely, that the distance traversed in the case of

    accelerated motion along AC or AB is the same as that covered [219]

    by a uniform speed whose value is one-half the maximum speed, CB; the two distances AC and AB having been traversed at the same uniform speed it is evident, from Proposition I, that the times of descent will be to each other as the distances.

    COROLLARY

    Hence we may infer that the times of descent along planes having different inclinations, but the same vertical height stand

    to

    THIRD DAY 187

    to one another in the same ratio as the lengths of the planes. For consider any plane AM extending from A to the horizontal CB; then it may be demonstrated in the same manner that the time of descent along AM is to the time along AB as the dis-

    tance AM is to AB ; but since the time along AB is to that along AC as the length AB is to the length AC, it follows, ex (zqualiy that as AM is to AC so is the time along AM to the time along AC.

    THEOREM IV, PROPOSITION IV

    The times of descent along planes of the same length but of different inclinations are to each other in the inverse ratio of the square roots of their heights

    From a single point B draw the planes BA and BC, having the same length but different inclinations; let AE and CD be horizontal lines drawn to meet the perpendicular BD; and

    [220] let BE represent the height of the plane AB, and BD the height of BC; also let BI be a mean proportional to BD and BE; then the ratio of BD to BI is equal to the B square root of the ratio of BD to BE. Now, I say, the ratio of the times of de-

    scent along BA and BC is the ratio of BD to BI; so that the time of descent along BA is related to the height of the other plane BC, namely BD as the time along BC is related to the height BI. Now it

    must be proved that the time of descent A ^ / along BA is to that along BC as the length / BD is to the length BI. C L Draw IS parallel to DC; and since it Fig. 54

    has been shown that the time of fall along BA is to that along the vertical BE as BA is to BE; and also that the time along BE is to that along BD as BE is to BI; and likewise that the time along BD is to that along BC as BD is to BC, or as BI to BS; it follows, ex czquali, that the time along BA is to that along BC as BA to BS, or BC to BS. However, BC is to BS as BD is to BI ; hence follows our proposition.

    1 88 THE TWO NEW SCIENCES OF GALILEO

    THEOREM V, PROPOSITION V

    The times of descent along planes of different length, slope and height bear to one another a ratio which is equal to the product of the ratio of the lengths by the square root of the inverse ratio of their heights.

    Draw the planes AB and AC, having different inclinations, lengths, and heights. My theorem then is that the ratio of the

    time of descent along AC to that along AB is equal to the product of the ratio of AC to AB by the square root of the inverse ratio of their heights.

    For let AD be a perpendicular to which are drawn the horizontal lines BG and CD; also let AL be a mean proportional to the heights AG and AD; from the point L draw a hori- zontal line meeting AC in F; accordingly AF will be a mean proportional between AC and AE. Now since the time of descent along AC is to that along AE as the length AF is to

    Fig. 55 AE; and since the time along AE is to that along AB as AE is to AB, it is clear that the time along AC is to that along AB as AF is to AB.

    [221] Thus it remains to be shown that the ratio of AF to AB is equal

    to the product of the ratio of AC to AB by the ratio of AG to AL, which is the inverse ratio of the square roots of the heights DA and GA. Now it is evident that, if we consider the line AC in connection with AF and AB, the ratio of AF to AC is the same as that of AL to AD, or AG to AL which is the square root of the ratio of the heights AG and AD; but the ratio of AC to AB is the ratio of the lengths themselves. Hence follows the theorem.

    THEOREM VI, PROPOSITION VI

    If from the highest or lowest point in a vertical circle there be drawn any inclined planes meeting the circumference the

    times

    B

    THIRD DAY 189

    times of descent along these chords are each equal to the other.

    On the horizontal line GH construct a vertical circle. From its

    lowest point — the point of tangency with the horizontal — draw the diameter FA and from the highest point, A, draw inclined planes to B and C, any points whatever on the circumference; then the times of descent along these are equal. Draw BD and CE perpendicular to the diame-

    ter; make AI a mean propor- tional between the heights of the

    planes, AE and AD; and since the rectangles FAAE and FA. AD are respectively equal to the squares of AC and AB, while the

    rectangle FA.AE is to the rect- angle FAAD as AE is to AD,

    it follows that the square of AC Fi&- 56 is to the square of AB as the length AE is to the length AD. But since the length AE is to AD as the square of AI is to the square of AD, it follows that the squares on the lines AC and AB are to each other as the squares on the lines AI and AD, and hence also the length AC is to the length AB as AI is to AD. But it has previously been demonstrated that the ratio of the time of descent along AC to that along AB is equal to the product of the two ratios AC to AB and AD to AI; but this last ratio is the same as that of AB to AC. Therefore the ratio of the time of descent along AC to that along AB is the product of the two ratios, AC to AB and AB to AC. The ratio of these times is therefore unity. Hence follows our proposition.

    By use of the principles of mechanics [ex mechanicis] one may obtain the same result, namely, that a falling body will require equal times to traverse the distances CA and DA, indicated in the following figure. Lay off BA equal to DA, and let fall the

    ^ [222] perpendiculars BE and DF; it follows from the principles of

    mechanics

    190 THE TWO NEW SCIENCES OF GALILEO mechanics that the component of the momentum [momentum ponderis] acting along the inclined plane ABC is to the total momentum [i. e., the momentum of the body falling freely] as

    BE is to BA; in like manner the momentum along the

    plane AD is to its total mo- mentum [i. e., the momentum

    of the body falling freely] as

    DFistoDA,ortoBA. There- fore the momentum of this

    same weight along the plane DA is to that along the plane ABC as the length DF is to the length BE; for this reason, this same weight will in equal times according to the second proposition of the first book,

    traverse spaces along the planes CA and DA which are to each other as the lengths BE and DF. But it can be shown that CA is to DA as BE is to DF. Hence the falling body will traverse the two paths CA and DA in equal times.

    Moreover the fadl that CA is to DA as BE is to DF may be demonstrated as follows: Join C and D; through D, draw the line DGL parallel to AF and cutting the line AC in I; through B draw the line BH, also parallel to AF. Then the angle ADI will be equal to the angle DCA, since they subtend equal arcs LA and DA, and since the angle DAC is common, the sides of the triangles, CAD and DAI, about the common angle will be proportional to each other; accordingly as CA is to DA so is DA to LA, that is as BA is to IA, or as HA is to GA, that is as BE is to DF. E. D.

    The same proposition may be more easily demonstrated as follows : On the horizontal line AB draw a circle whose diameter DC is vertical. From the upper end of this diameter draw any inclined plane, DF, extending to meet the circumference; then, I say, a body will occupy the same time in falling along the plane DF as along the diameter DC. For draw FG parallel

    to

    THIRD DAY 191 to AB and perpendicular to DC; join FC; and since the time of fall along DC is to that along DG as the mean proportional [223]

    between CD and GD is to GD itself; and since also DF is a

    mean proportional between DC and DG, the angle DFC in- scribed in a semicircle being a right-

    angle, and FG being perpendicular to DC, it follows that the time of fall along DC is to that along DG as the length FD is to GD. But it has already been demonstrated that thep time of descent along DF is to that along DG as the length DF is to DG; hence the times of descent along DF and DC each bear to the time of fall

    along DG the same ratio; conse- quently they are equal.

    In like manner it may be shown that if one draws the chord CE from

    the lower end of the diameter, also the line EH parallel to the horizon, and joins the points E and D, the time of descent along EQwill be the same as that along the diameter, DC.

    COROLLARY I

    From this it follows that the times of descent along all chords drawn through either C or D are equal one to another.

    COROLLARY II

    It also follows that, if from any one point there be drawn a vertical line and an inclined one along which the time of descent is the same, the inclined line will be a chord of a semicircle of which the vertical line is the diameter.

    COROLLARY III

    Moreover the times of descent along inclined planes will be equal when the vertical heights of equal lengths of these planes

    are

    192 THE TWO NEW SCIENCES OF GALILEO are to each other as the lengths of the planes themselves; thus it is clear that the times of descent along CA and DA, in the figure just before the last, are equal, provided the vertical height of AB (AB being equal to AD), namely, BE, is to the vertical height DF as CA is to DA.

    SAGR. Please allow me to interrupt the ledlure for a moment in order that I may clear up an idea which just occurs to me; one which, if it involve no fallacy, suggests at least a freakish and [224]

    interesting circumstance, such as often occurs in nature and in the realm of necessary consequences.

    If, from any point fixed in a horizontal plane, straight lines be drawn extending indefinitely in all directions, and if we imagine a point to move along each of these lines with constant speed, all starting from the fixed point at the same instant and moving with equal speeds, then it is clear that all of these mov-

    ing points will lie upon the circumference of a circle which grows larger and larger, always having the aforesaid fixed point as its center; this circle spreads out in precisely the same manner as the little waves do in the case of a pebble allowed to drop into quiet water, where the impact of the stone starts the motion in all directions, while the point of impact remains the center of these ever-expanding circular waves. But imagine a vertical plane from the highest point of which are drawn lines inclined at every angle and extending indefinitely; imagine also that heavy particles descend along these lines each with a naturally accelerated motion and each with a speed appropriate to the inclination of its line. If these moving particles are always visible, what will be the locus of their positions at any instant? Now the answer to this question surprises me, for I am led by the preceding theorems to believe that these particles will always lie upon the circumference of a single circle, ever increasing in size as the particles recede farther and farther from the point at which their motion began. To be more definite, let A be the fixed point from which are drawn the lines AF and AH inclined at any angle whatsoever. On the perpendicular AB take any two points C and D about which, as centers, circles are described

    passing

    THIRD DAY

    passing through the point A, and cutting the inclined lines at the points F, H, B, E, G, I. From the preceding theorems it is clear that, if particles start, at the same instant, from A and descend along these lines, when one is at E another will be at G and another at I ; at a later instant A they will be found simultaneously at F, H and B; these, and indeed an

    infinite number of other particles [225]

    travelling along an infinite number of different slopes will at successive instants always lie upon a single ever-expanding circle. The two kinds of motion occurring in nature give rise therefore to two infinite series

    of circles, at once resembling and Fig. 59 differing from each other; the one takes its rise in the center of an infinite number of concentric circles; the other has its origin in the contact, at their highest points, of an infinite number of eccentric circles; the former are produced by motions which are equal and uniform; the latter by motions which are neither uniform nor equal among themselves, but which vary from one to another according to the slope.

    Further, if from the two points chosen as origins of motion, we draw lines not only along horizontal and vertical planes but in all directions then just as in the former cases, beginning at a single point ever-expanding circles are produced, so in the latter case an infinite number of spheres are produced about a single point, or rather a single sphere which expands in size without limit; and this in two ways, one with the origin at the center, the other on the surface of the spheres.

    SALV. The idea is really beautiful and worthy of the clever mind of Sagredo.

    SIMP. As for me, I understand in a general way how the two kinds of natural motions give rise to the circles and spheres; and yet as to the production of circles by accelerated motion and its proof, I am not entirely clear; but the fact that one can take

    the

    194 THE TWO NEW SCIENCES OF GALILEO

    the origin of motion either at the inmost center or at the very top of the sphere leads one to think that there may be some great mystery hidden in these true and wonderful results, a mystery related to the creation of the universe (which is said to be spherical in shape), and related also to the seat of the first cause \prima causa].

    SALV. I have no hesitation in agreeing with you. But pro- found considerations of this kind belong to a higher science than

    ours [a piu alte dottrine che le nostre]. We must be satisfied to belong to that class of less worthy workmen who procure from the quarry the marble out of which, later, the gifted sculptor produces those masterpieces which lay hidden in this rough and shapeless exterior. Now, if you please, let us proceed.

    [226] THEOREM VII, PROPOSITION VII

    If the heights of two inclined planes are to each other in the same ratio as the squares of their lengths, bodies starting from rest will traverse these planes in equal times.

    Take two planes of different lengths and different inclinations, AE and AB, whose heights are AF and AD: let AF be to AD as

    A the square of AE is to the square of AB; then, I say, that a body, starting from rest at A, will traverse the planes AE and AB in equal times. From the vertical line, draw the horizontal par- allel lines EF and DB, the latter cut-

    ‘ ting AE at G. Since FA : DA = EA2 : BA2, and since FA:DA=EA:GA, it follows that EA:GA = EA2:BA2. Hence BA is a mean proportional be-

    . 60 tween £A anj GA. Now since the time of descent along AB bears to the time along AG the same ratio which AB bears to AG and since also the time of descent

    along AG is to the time along AE as AG is to a mean propor- tional between AG and AE, that is, to AB, it follows, ex aquali,

    that

    THIRD DAY 195

    that the time along AB is to the time along AE as AB is to itself. Therefore the times are equal. Q. E. D.

    THEOREM VIII, PROPOSITION VIII

    The times of descent along all inclined planes which intersedl one and the same vertical circle, either at its highest or lowest point, are equal to the time of fall along the vertical diameter; for those planes which fall short of this diameter the times are shorter; for planes which cut this diameter, the times are longer.

    Let AB be the vertical diameter of a circle which touches the horizontal plane. It has already been proven that the times of de-

    scent along planes drawn from either end, A or B, to the cir-

    cumference are equal. In order to show that the time of descent

    [227]

    along the plane DF which falls short of the diameter is shorter

    we may draw the plane DB which

    is both longer and less steeply in- clined than DF; whence it follows

    that the time along DF is less than that along DB and consequently along AB. In like manner, it is shown that the time of de-

    scent along CO which cuts the diameter is greater: for it is both longer and less steeply inclined than CB. Hence follows the theorem.

    THEOREM IX, PROPOSITION IX

    If from any point on a horizontal line two planes, inclined at any angle, are drawn, and if they are cut by a line which makes with them angles alternately equal to the angles be-

    tween these planes and the horizontal, then the times re- quired to traverse those portions of the plane cut off by

    the aforesaid line are equal. Through

    Fig. 62

    196 THE TWO NEW SCIENCES OF GALILEO Through the point C on the horizontal line X, draw two

    planes CD and CE inclined at any angle whatever: at any point in the line CD lay off the angle CDF equal to the angle XCE; let the line DF cut CE at F so that the angles CDF and CFD are alternately equal to XCE and LCD; then, I say, the

    times of descent over CD and CF are equal. Now since the angle CDF is equal to the angle XCE by construction, it is evident that the angle CFD must be equal to the angle DCL. For if the common angle DCF be subtracted from

    the three angles of the tri- angle CDF, together equal

    to two right angles, (to which are also equal all the angles which can be described about the point C on the lower side of the line LX) there remain in the triangle two angles, CDF and CFD, equal to the two angles XCE and LCD; but, by hypothesis, the angles CDF and XCE are equal; hence the remaining angle CFD is equal to the remainder DCL. Take CE equal to CD ; from the points D and E draw DA and EB perpendicular to the horizontal line XL; and from the point C draw CG perpendicular to DF. Now since the angle CDG is equal to the angle ECB and since DGC and CBE are right angles, it follows that the triangles CDG and CBE are equiangular ; consequently DC :CG = CE :EB . But DC is equal to CE, and therefore CG is equal to EB. Since also the angles at C and at A, in the triangle DAC, are equal to the angles at F and G in the triangle CGF, we have CD:DA = FC:CG and, permutando, DC:CF=DA:CG=DA:BE. Thus the ratio of the heights of the equal planes CD and CE is

    the same as the ratio of the lengths DC and CF. Therefore, by [228]

    Corollary I of Prop. VI, the times of descent along these planes will be equal. Q. E. D.

    An alternative proof is the following: Draw FS perpendicular to

    THIRD DAY 197

    to the horizontal line AS. Then, since the triangle CSF is similar to the triangle DGC, we have SF:FC=GC:CD; and since the triangle CFG is similar to the triangle DCA, we have FC:CG=CD:DA.L A c Hence, ex czquali, SF: CG=CG:DA. There-

    fore CG is a mean pro- portional between SF

    and DA, while DA :SF=

    DA2:CG2. Again since the triangle ACD is sim-

    ilar to the triangle CGF, we have DADC=GC: C F and, permutando, DA:CG = DC:CF:also Fig. 63

    DA2:CG2=DC2:CF2._J3utjt has been shown that DA2:CG2= DA:SF. Therefore DC2 :CF2=DA:FS. Hence from the above Prop. VII, since the heights DA and FS of the planes CD and CF are to each other as the squares of the lengths of the planes, it follows that the times of descent along these planes will be equal.

    THEOREM X, PROPOSITION X

    The times of descent along inclined planes of the same height, but of different slope, are to each other as the lengths of these planes; and this is true whether the motion starts from rest or whether it is preceded by a fall from a constant height.

    Let the paths of descent be along ABC and ABD to the horizon- tal plane DC so that the falls along BD and BC are preceded by

    the fall along AB; then, I say, that the time of descent along BD is to the time of descent along BC as the length BD is to BC. Draw the horizontal line AF and extend DB until it cuts this

    [229]

    line at F; let FE be a mean proportional between DF and FB; draw EO parallel to DC; then AO will be a mean proportional between CA and AB. If now we represent the time of fall along

    AB

    198 THE TWO NEW SCIENCES OF GALILEO AB by the length AB, then the time of descent along FB will be represented by the distance FB; so also the time of fall through the entire distance AC will be represented by the mean pro-

    portional AO: and for the entire distance FD by FE. Hence the time of fall along the remainder, BC, will be represented by

    jfL “F BO, and that along the remainder, BD, by BE; but since BE:BO=BD:BC, it follows, if we allow the bodies to fall first along AB and FB, or, what is the same thing, along the common stretch AB, that the times of descent along BD and BC will be to each other as the

    lengths BD and BC. But we have previously proven that

    the time of descent, from rest at B, along Fig- 64 BD is to the time along BC in the ratio

    which the length BD bears to BC. Hence the times of descent along different planes of constant height are to each other as the lengths of these planes, whether the motion starts from rest or is preceded by a fall from a constant height. Q. E. D.

    THEOREM XI, PROPOSITION XI

    If a plane be divided into any two parts and if motion along it starts from rest, then the time of descent along the first part is to the time of descent along the remainder as the length of this first part is to the excess of a

    mean proportional between this first part and the en- tire length over this first part.

    Let the fall take place, from rest at A, through the entire distance AB which is divided at any point C; also let AF be a mean proportional between the entire length BA and the first part AC; then CF will denote the excess of the mean proportional FA over the first part AC. Now, I say, the time of descent along AC will be to the time of subsequent fall through CB as the length AC is to CF. Fig. 65 This is evident, because the time along AC is to the time along the entire distance AB as AC is to the mean proportional AF.

    Therefore,

    A

    THIRD DAY 199

    Therefore, dividendo, the time along AC will be to the time

    along the remainder CB as AC is to CF. If we agree to repre- sent the time along AC by the length AC then the time along

    CB will be represented by CF. Q. E. D.

    [230]

    In case the motion is not along the straight line ACB but along the broken line ACD to the horizon-

    tal line BD, and if from F we draw the horizontal line FE, it may in like manner be proved that the time along AC is to the time along the in-

    clined line CD as AC is to CE. For

    the time along AC is to the time along CB as AC is to CF; but it

    has already been shown that the – time along CB, after the fall through the distance AC, is to the time along CD, after descent through the same distance AC, as CB is to CD, or, as CF is to CE; there-

    fore, ex <zquali, the time along AC will be to the time along CD as the length AC is to the length CE.

    THEOREM XII, PROPOSITION XII

    If a vertical plane and any inclined plane are limited by two horizontals, and if we take mean proportionals between the lengths of these planes and those portions of them which lie between their point of intersection and the upper horizontal, then the time of fall along the perpendicular bears to the time required to traverse the upper part of the perpendicular plus the time required to traverse the lower part of the intersecting plane the same ratio which the entire length of the vertical bears to a length which is the sum of the mean proportional on the vertical plus the excess of the entire length of the inclined plane over its mean proportional.

    Let AF and CD be two horizontal planes limiting the vertical plane AC and the inclined plane DF; let the two last-mentioned planes intersect at B. Let AR be a mean proportional between

    the

    200 THE TWO NEW SCIENCES OF GALILEO

    the entire vertical AC and its upper part AB; and let FS be a mean proportional between FD and its upper part FB. Then, I say, the time of fall along the entire vertical path AC bears to the time of fall along its upper portion AB plus the time of fall

    along the lower part of the inclined plane, namely, BD, the same ratio which the length AC bears to the mean proportional on the vertical, namely, AR, plus the length SD which is the excess of the entire plane DF over its mean proportional FS.

    Join the points R and S giving a horizontal line RS. Now since the

    time of fall through the entire dis- 6? tance AC is to the time along the

    portion AB as CA is to the mean proportional AR it follows that, if we agree to represent the time of fall through AC by the distance AC, the time of fall through the distance AB will be represented by AR; and the time of descent through the re-

    mainder, BC, will be represented by RC. But, if the time along AC is taken to be equal to the length AC, then the time along FD will be equal to the distance FD ; and we may likewise infer that the time of descent along BD, when preceded by a fall along FB or AB, is numerically equal to the distance DS. Therefore

    [231] the time required to fall along the path AC is equal to AR plus RC; while the time of descent along the broken line ABD will be equal to AR plus SD. Q. E. D.

    The same thing is true if, in place of a vertical plane, one takes any other plane, as for instance NO; the method of proof is also the same.

    PROBLEM I, PROPOSITION XIII

    Given a perpendicular line of limited length, it is required to find a plane having a vertical height equal to the given perpendicular and so inclined that a body, having fallen from rest along the perpendicular, will make its descent

    along

    THIRD DAY 201

    along the inclined plane in the same time which it occu- pied in falling through the given perpendicular.

    Let AB denote the given perpendicular: prolong this line to C making BC equal to AB, and draw the horizontal lines CE and AG. It is required to draw a plane from B to the horizontal line CE such that after a body starting from rest at A has fallen through the distance AB, it will complete its path along

    this plane in an equal time. – Lay off CD equal to BC, and draw the line BD. Construe!: the line BE equal to the sum of BD and DC; then, I say, BE is the required plane. Prolong EB till it intersects the horizontal AG at G. Let GF be a mean pro-

    portional between GE and GB;

    then EF:FB=EG:GF, and EF2: FB2=EG2:GF2=EG:GB. But EG is twice GB; hence the square of EF is twice the square of FB ; so also is the square of DB twice the square of BC. Consequently

    et permutando, EBJDB + BC= D BF:BC. But EB=DB + BC; Fig- 68

    hence BF =BC=BA. If we agree that the length AB shall rep- resent the time of fall along the line AB, then GB will represent

    the time of descent along GB, and GF the time along the entire distance GE; therefore BF will represent the time of descent along the difference of these paths, namely, BE, after fall from G or from A. Q. E. F.

    [232] PROBLEM II, PROPOSITION XIV

    Given an inclined plane and a perpendicular passing

    through it, to find a length on the upper part of the per- pendicular through which a body will fall from rest in the

    same time which is required to traverse the inclined plane after fall through the vertical distance just determined.

    Let AC be the inclined plane and DB the perpendicular. It is required to find on the vertical AD a length which will be

    traversed

    202 THE TWO NEW SCIENCES OF GALILEO

    traversed by a body, falling from rest, in the same time which is needed by the same body to traverse the plane AC after the aforesaid fall. Draw the horizontal CB; lay off AE such that BA + 2AC:AC=AC:AE, and lay off AR such that BA:AC= EAiAR. From R draw RX perpendicular to DB; then, I say, X is the point sought. For since BA + 2AC:AC=AC:AE, it follows, dividendo, that BA -f AC:AC=CE:AE. And since BA:AC=EA:AR, we have, componendo, BA+AC:AC=ER: RA. But BA + AC:AC=CE:AE, hence CE:EA=ER:RA= sum of the antecedents: sum of the consequents =CRJRE.

    Thus RE is seen to be a mean propor- tional between CR and RA. Moreover

    since it has been assumed that BA:

    .JfcAC=EA:AR, and since by similar tri- angles we have BA:AC=XA:AR, it

    follows that EA iAR =XA :AR. Hence EA and XA are equal. But if we agree that the time of fall through RA shall be represented by the length RA, then the time of fall along RC will be repre-

    sented by the length RE which is a

    ^ mean proportional between RA and RC ; Fig- 69 likewise AE will represent the time of

    descent along AC after descent along RA or along AX. But the time of fall through XA is represented by the length XA, while RA represents the time through RA. But it has been shown that XA and AE are equal. Q. E. F.

    PROBLEM III, PROPOSITION XV

    Given a vertical line and a plane inclined to it, it is re- quired to find a length on the vertical line below its point

    of intersection which will be traversed in the same time as

    the inclined plane, each of these motions having been pre- ceded by a fall through the given vertical line.

    Let AB represent the vertical line and BC the inclined plane; it is required to find a length on the perpendicular below its point of intersection, which after a fall from A will be traversed in the

    same

    THIRD DAY

    same time which is needed for BC after an identical fall from A.

    Draw the horizontal AD, intersecting the prolongation of CB at D; let DE be a mean proportional between CD and DB; lay [233]

    off BF equal to BE; also let AG be a third proportional to BA and AF. Then, I say, BG is the distance which a body, after falling through AB, will traverse in the A D same time which is needed for the plane BC after the same preliminary fall. For if we assume that the time of fall

    along AB is represented by AB, then the time for DB will be represented by DB. And since DE is a mean propor-

    tional between BD and DC, this same DE will represent the time of descent along the entire distance DC while BE will represent the time required for the difference of these paths, namely, BC, provided in each case the fall is from rest at D or at A. In like manner we

    may infer that BF represents the time FlS- ?° of descent through the distance BG after the same preliminary fall; but BF is equal to BE. Hence the problem is solved.

    THEOREM XIII, PROPOSITION XVI If a limited inclined plane and a limited vertical line are drawn from the same point, and if the time required for a body, starting from rest, to traverse each of these is the same, then a body falling from any higher altitude will trav-

    erse the inclined plane in less time than is required for the vertical line.

    Let EB be the vertical line and CE the inclined plane, both starting from the common point E, and both traversed in equal times by a body starting from rest at E; extend the vertical line upwards to any point A, from which falling bodies are allowed to start. Then, I say that, after the fall through AE, the inclined plane EC will be traversed in less time than the per-

    pendicular

    204 THE TWO NEW SCIENCES OF GALILEO

    pendicular EB. Join CB, draw the horizontal AD, and prolong CE backwards until it meets the latter in D; let DF be a mean proportional between CD and DE while AG is made a mean

    proportional between BA and AE. Draw FG and DG; then [234]

    since the times of descent along EC and EB, starting from rest at E, are equal, it follows, according to Corollary II of Proposi-

    tion VI that the angle at C is a right angle; but the angle at A is also a right angle and the angles at the vertex E are equal; hence the triangles AED and CEB are equiangular and the sides about the equal angles are proportional; hence BE: EC = DE : EA. Consequently the rectangle BE.EA is equal to the rectangle CE.ED; and

    since the rectangle CD.DE ex- ceeds the rectangle CE.ED by

    the square of ED, and since the rectangle BA.AE exceeds the rectangle BE.EA by the square of EA, it follows that the excess of the rectangle CD.DE over the rectangle BAAE, or what is the same thing, the excess of the square of FD over the

    Flg- 7I square of AG, will be equal to the excess of the square of DE over the square of AE, which ex-

    cess is_equal to the square of AD. Therefore FD2 =GA2 + AD2 =GD2. Hence DF is equal to DG, and the angle DGF is equal to the angle DFG while the angle EGF is less than the angle EFG, and the opposite side EF is less than the opposite side EG. If now we agree to represent the time of fall through AE by the length AE, then the time along DE will be represented by DE. And since AG is a mean proportional between BA and

    AE,

    THIRD DAY 205 AE, it follows that AG will represent the time of fall through the total distance AB, and the difference EG will represent the time of fall, from rest at A, through the difference of path EB.

    In like manner EF represents the time of descent along EC, starting from rest at D or falling from rest at A. But it has been shown that EF is less than EG; hence follows the theorem.

    COROLLARY

    From this and the preceding proposition, it is clear that the vertical distance covered by a freely falling body, after a pre-

    liminary fall, and during the time-interval required to traverse an inclined plane, is greater than the length of the inclined plane, but less than the distance traversed on the inclined plane during an equal time, without any preliminary fall. For since we have just shown that bodies falling from an elevated point A will traverse the plane EC in Fig. 71 in a shorter time than the vertical EB, it is evident that the distance along EB which will be traversed during a time equal to that of descent along EC will be less than the whole of EB. But now in order to show that this vertical distance is greater than the length of the inclined plane EC, we reproduce Fig. 70 of the preceding theorem in which the vertical length BG is trav-

    ersed in the same time as BC after a

    preliminary fall through AB. That BG is greater than BC is shown as follows: since BE and FB are equal

    [235] while BA is less than BD, it follows that FB will bear to BA a greater ratio than EB bears to BD; and, compon- endo, FA will bear to BA a greater

    ratio than ED to DB; but FA ‘AB = Or GF:FB (since AF is a mean propor- Fig. 72 tional between BA and AG) and in like manner ED:BD = CE: EB. Hence GB bears to BF a greater ratio than CB bears to BE; therefore GB is greater than BC.

    B

    206 THE TWO NEW SCIENCES OF GALILEO

    PROBLEM IV, PROPOSITION XVII

    Given a vertical line and an inclined plane, it is required to lay off a distance along the given plane which will be trav-

    ersed by a body, after fall along the perpendicular, in the same time-interval which is needed for this body to fall from rest through the given perpendicular.

    Let AB be the vertical line and BE the inclined plane. The problem is to determine on BE a distance such that a body,

    A. p after falling through AB, will traverse it in a time equal to that required to traverse the perpendicular AB itself, starting from rest.

    Draw the horizontal AD and extend the plane until it meets this line in D. Lay off FB equal to BA; and choose the point E such that BD:FD=DF:DE. Then, I say, the time of descent along BE, after fall through AB, is equal to the time of fall,

    Fig. 73 from rest at A, through AB. For, if we assume that the length AB represents the time of fall through AB, then the time of fall through DB will be represented by the time DB; and since BD:FD =DF:DE, it follows that DF will represent the time of descent along the entire plane DE while BF represents the time through the portion BE starting from rest at D; but the time of descent along BE after the prelimi-

    nary descent along DB is the same as that after a preliminary fall through AB. Hence the time of descent along BE after AB will be BF which of course is equal to the time of fall through AB from rest at A/ Q. E. F.

    [236]

    PROBLEM V, PROPOSITION XVIII

    Given the distance through which a body will fall vertically from rest during a given time-interval, and given also a smaller time-interval, it is required to locate another [equal]

    vertical

    THIRD DAY

    vertical distance which the body will traverse during this

    given smaller time-interval. Let the vertical line be drawn through A, and on this line lay off the distance AB which is traversed by a body falling from rest at A, during a time which may also be represented by AB. Draw the horizontal line CBE, and on it lay off BC to represent the given interval of time which is shorter than AB. It is required to locate, in the perpendicular above mentioned, a distance which is equal to AB and which will be de-

    scribed in a time equal to BC. Join the points A and C; then, since BC<BA, it follows that the angle BAC<angle BCA. Construct the angle CAE equal to BCA and let E be the point where AE intersects the horizontal line; draw ED at right angles to AE, cutting the vertical at D ; lay off DF equal to BA. Then, I say, that FD is that portion of the vertical

    Fig. 74

    which a body starting from rest at A will traverse during the as- signed time-interval BC. For, if in the right-angled triangle

    AED a perpendicular be drawn from the right-angle at E to the opposite side AD, then AE will be a mean proportional between DA and AB while BE will be a mean proportional between BD and BA, or between FA and AB (seeing that FA is equal to DB) ; and since it has been agreed to represent the time of fall through AB by the distance AB, it follows that AE, or EC, will represent the time of fall through the entire distance AD, while EB will represent the time through AF. Consequently the re-

    mainder BC will represent the time of fall through the remain- ing distance FD. Q. E. F.

    [237]

    PROBLEM VI, PROPOSITION XIX

    Given the distance through which a body falls in a vertical line from rest and given also the time of fall, it is required to find the time in which the same body will, later, traverse

    an

    208 THE TWO NEW SCIENCES OF GALILEO

    an equal distance chosen anywhere in the same vertical line.

    On the vertical line AB, lay off AC equal to the distance fallen from rest at A, also locate at random an equal distance DB.

    Let the time of fall through AC be represented by the length AC. It is required to find the time necessary to traverse DB after fall from rest at A. About the entire length AB describe the semicircle AEB; from C draw CE perpendicular to AB; join the points A and E; the line AE will be longer than EC; lay off EF equal to EC. Then, I say, the difference FA will represent the time required for fall through DB. For since AE is a mean proportional be-

    tween BA and AC and since AC rep- resents the time of fall through AC,

    it follows that AE will represent the time through the entire distance AB. And since CE is a mean pro-

    portional between DA and AC (see- ing that DA=BC) it follows that

    CE, that is, EF, will represent the Hence the difference AF will represent

    Q. E. D.

    Fig. 75

    time of fall through AD. the time of fall through the difference DB.

    COROLLARY

    Hence it is inferred that if the time of fall from rest through any given distance is represented by that distance itself, then the time of fall, after the given distance has been increased by a certain amount, will be represented by the excess of the mean proportional between the increased distance and the original distance over the mean proportional between the original dis-

    tance and the increment. Thus, for instance, if we agree that

    AB

    THIRD DAY 209

    AB represents the time of fall, from rest at A, through the dis- tance AB, and that AS is the increment, the time required to

    traverse AB, after fall through SA, will be the excess of the mean proportional between SB and BA over the mean proportional between BA and AS.

    [238] PROBLEM VII, PROPOSITION XX

    Given any distance whatever and a portion of it laid off from the point at which motion begins, it is re-

    quired to find another portion which lies at the other end of the distance and which is traversed in the same

    time as the first given portion. Fig. 76 Let the given distance be CB and let CD be that part of it

    ! which is laid off from the beginning of motion. It is required to find another part, at the end B, which is traversed in the same time as the assigned portion CD. Let BA be a mean proportional between BC and CD; also let CE be a third proportional to BC and CA. Then, I say, EB will be the distance which,

    B after fall from C, will be traversed in the same time as CD itself. For if we agree that CB shall repre-

    sent the time through the entire distance CB, then BA A (which, of course, is a mean proportional between BC

    and CD) will represent the time along CD; and since CA is a mean proportional between BC and CE, it fol-

    lows that CA will be the time through CE; but the total length CB represents the time through the total

    B distance CB. Therefore the difference BA will be the

    Fig- 77 time along the difference of distances, EB, after falling from C; but this same BA was the time of fall through CD. Consequently the distances CD and EB are traversed, from rest at A, in equal times. Q. E. F.

    THEOREM XIV, PROPOSITION XXI

    If, on the path of a body falling vertically from rest, one lays off a portion which is traversed in any time you please

    and

    210 THE TWO NEW SCIENCES OF GALILEO

    and whose upper terminus coincides with the point where the motion begins, and if this fall is followed by a motion defledled along any inclined plane, then the space traversed along the inclined plane, during a time-interval equal to that occupied in the previous vertical fall, will be greater than twice, and less than three times, the length of the vertical fall.

    Let AB be a vertical line drawn downwards from the horizon- tal line AE, and let it represent the path of a body falling from

    rest at A; choose any portion AC of this path. Through C draw any inclined plane, CG, along which the motion is con-

    tinued after fall through AC. Then, I say, that the distance [239]

    traversed along this plane CG, during the time-interval equal to that of the fall through AC, is more than twice, but less

    A F than three times, this same distance AC. Let us lay off

    CF equal to AC, and ex- tend the plane GC until it

    meets the horizontal in E; choose G such that CE: EF=EF:EG. If now we assume that the time of fall along AC is represented by the length AC, then CE will represent the time of descent along CE, while CF, or CA, will represent the time of descent along lg’

    CG. It now remains to be shown that the distance CG is more than twice, and less than three times, the distance CA itself. Since CE:EF=EF:EG, it follows that CE:EF=CF:FG; but EC<EF; therefore CF will be less than FG and GC will be more than twice FC, or AC. Again since FE<2EC (for EC is greater than CA, or CF), we have GF less than twice FC, and also GC less than three times CF, or CA. Q. E. D.

    This proposition may be stated in a more general form; since

    what

    THIRD DAY 211

    what has been proven for the case of a vertical and inclined plane holds equally well in the case of motion along a plane of any inclination followed by motion along any plane of less steepness, as can be seen from the adjoining figure. The method of proof is the same.

    [240] PROBLEM VIII, PROPOSITION XXII

    Given two unequal time-intervals, also the distance through which a body will fall along a vertical line, from rest, during the shorter of these intervals, it is required to pass through the highest point of this vertical line a plane so inclined that the time of descent along it will be equal to the longer of the given intervals.

    Let A represent the longer and B the shorter of the two un- equal time-intervals, also let CD represent the length of the

    Fig. 79

    vertical fall, from rest, during the time B. It is required to pass

    through the point C a plane of such a slope that it will be trav- ersed in the time A.

    Draw from the point C to the horizontal a line CX of such a length that B:A=CD:CX. It is clear that CX is the plane along which a body will descend in the given time A. For it has been shown that the time of descent along an inclined plane bears to the time of fall through its vertical height the same ratio which the length of the plane bears to its vertical height. Therefore the time along CX is to the time along CD as the

    length CX is to the length CD, that is, as the time-interval A is to

    212 THE TWO NEW SCIENCES OF GALILEO

    to the time-interval B : but B is the time required to traverse the vertical distance, CD, starting from rest; therefore A is the time required for descent along the plane CX.

    PROBLEM IX, PROPOSITION XXIII

    Given the time employed by a body in falling through a cer- tain distance along a vertical line, it is required to pass

    through the lower terminus of this vertical fall, a plane so inclined that this body will, after its vertical fall, traverse on this plane, during a time-interval equal to that of the vertical fall, a distance equal to any assigned distance, pro-

    vided this assigned distance is more than twice and less than three times, the vertical fall.

    Let AS be any vertical line, and let AC denote both the length of the vertical fall, from rest at A, and also the time

    T K N ft required for this fall. Let IR be a distance more than twice and less than three times, AC. It is required to pass a plane through the point C so inclined that a body, after fall through AC, will, during the time AC, traverse a distance equal to

    S IR. Lay off RN and NM Fig. 80 each equal to AC. Through

    the point C, draw a plane CE meeting the horizontal, AE, at such a point that IMMN=AC:CE. Extend the plane to O, and lay off CF, FG and GO equal to RN, NM, and MI respectively. Then, I say, the time along the inclined plane CO, after fall through AC, is equal to the time of fall, from rest at A, through AC. For since OG:GF=FC:CE, it follows, componendo, that OF:FG=OF:FC=FE:EC, and since an antecedent is to its consequent as the sum of the antecedents is to the sum of the consequents, we have OE:EF=EF:EC. Thus EF is a mean proportional between OE and EC. Having agreed to

    represent

    THIRD DAY 213

    represent the time of fall through AC by the length AC it follows that EC will represent the time along EC, and EF the time along the entire distance EO, while the difference CF will represent the time along the difference CO; but CF=CA; therefore the problem is solved. For the time CA is the time of fall, from rest at A, through CA while CF (which is equal to CA) is the time required to traverse CO after descent along EC or after fall through AC. Q. E. F.

    It is to be remarked also that the same solution holds if the

    antecedent motion takes place, not along a vertical, but along an inclined plane. This case is illustrated in the following figure where the antecedent motion is along the inclined plane AS

    [242] underneath the horizontal AE. The proof is identical with the preceding.

    SCHOLIUM

    On careful attention, it will be clear that, the nearer the given line IR approaches to three times the length AC, the nearer the

    I M N K

    E

    inclined plane, CO, along which the second motion takes place, approaches the perpendicular along which the space traversed, during the time AC, will be three times the distance AC. For if IR be taken nearly equal to three times AC, then IM will be almost equal to MN; and since, by construction,

    IM:

    214 THE TWO NEW SCIENCES OF GALILEO

    IMMN =AC:CE, it follows that CE is but little greater than CA : consequently the point E will lie near the point A, and the lines CO and CS, forming a very acute angle, will almost coin-

    cide. But, on the other hand, if the given line, IR, be only the least bit longer than twice AC, the line IM will be very short; from which it follows that AC will be very small in comparison with CE which is now so long that it almost coincides with the horizontal line drawn through C. Hence we can infer that, if, after descent along the inclined plane AC of the adjoining figure, the motion is continued along a horizontal line, such as CT, the distance traversed by a body, during a time equal to the time of fall through AC, will be exactly twice the distance AC. The argument here employed is the same as the preceding. For it is clear, since OE:EF=EF:EC, that FC measures the time of descent along CO. But, if the horizontal line TC which is twice as long as CA, be divided into two equal parts at V then this line must be extended indefinitely in the direction of X before it will intersect the line AE produced; and accordingly the ratio of the infinite length TX to the infinite length VX is the same as the ratio of the infinite distance VX to the infinite distance CX.

    The same result may be obtained by another method of ap- proach, namely, by returning to the same line of argument which

    was employed in the proof of the first proposition. Let us [243]

    consider the triangle ABC, which, by lines drawn parallel to its base, represents for us a velocity increasing in proportion to the time; if these lines are infinite in number, just as the points in the line AC are infinite or as the number of instants in any interval of time is infinite, they will form the area of the triangle. Let us now suppose that the maximum velocity attained — that represented by the line BC — to be continued, without accelera-

    tion and at constant value through another interval of time equal to the first. From these velocities will be built up, in a similar manner, the area of the parallelogram ADBC, which is twice that of the triangle ABC; accordingly the distance traversed with these velocities during any given interval of time will be

    twice

    THIRD DAY 215 twice that traversed with the velocities represented by the triangle during an equal interval of time. But along a horizontal plane the motion is uniform since here it experiences neither acceleration nor retardation; therefore we con-j> elude that the distance CD traversed during a time- interval equal to AC is twice the distance AC; for the latter is covered by a motion, starting from rest and increasing in speed in proportion — 4- to the parallel lines in the triangle, while the — -4- former is traversed by a motion represented by *f- the parallel lines of the parallelogram which, being also infinite in number, yield an area twice that of the triangle. Fig. 82

    Furthermore we may remark that any velocity once imparted to a moving body will be rigidly maintained as long as the external causes of acceleration or retardation are removed, a condition which is found only on horizontal planes; for in the case of planes which slope downwards there is already present a cause of acceleration, while on planes sloping upward there is retardation; from this it follows that motion along a horizontal plane is perpetual; for, if the velocity be uniform, it cannot be diminished or slackened, much less destroyed. Further, al-

    though any velocity which a body may have acquired through natural fall is permanently maintained so far as its own nature [suapte natura] is concerned, yet it must be remembered that if, after descent along a plane inclined downwards, the body is deflected to a plane inclined upward, there is already existing in this latter plane a cause of retardation; for in any such plane this same body is subject to a natural acceleration downwards. Accordingly we have here the superposition of two different states, namely, the velocity acquired during the preceding fall which if acting alone would carry the body at a uniform rate to infinity, and the velocity which results from a natural accelera-

    tion downwards common to all bodies. It seems altogether reasonable, therefore, if we wish to trace the future history of a body which has descended along some inclined plane and has been deflected along some plane inclined upwards, for us to

    assume

    216 THE TWO NEW SCIENCES OF GALILEO

    assume that the maximum speed acquired during descent is permanently maintained during the ascent. In the ascent, however, there supervenes a natural inclination downwards, namely, a motion which, starting from rest, is accelerated at the [244]

    usual rate. If perhaps this discussion is a little obscure, the following figure will help to make it clearer.

    Let us suppose that the descent has been made along the downward sloping plane AB, from which the body is deflecled so as to continue its motion along the upward sloping plane BC; and first let these planes be of equal length and placed so as to make equal angles with the horizontal line GH. Now it is well known that a body, starting from rest at A, and descending along AB, acquires a speed which is proportional to the time,

    j^ which is a maximum -at B, and which is maintained by the

    body so long as all causes of fresh ac-

    celeration or retarda- tion are removed;

    Fi£- 83 the acceleration to which I refer is that to which the body would be subject if its motion were continued along the plane AB extended, while the retardation is that which the body would encounter if its motion were deflected along the plane BC inclined up-

    wards; but, upon the horizontal plane GH, the body would maintain a uniform velocity equal to that which it had ac-

    quired at B after fall from A; moreover this velocity is such that, during an interval of time equal to the time of descent through AB, the body will traverse a horizontal distance equal to twice AB. Now let us imagine this same body to move with the same uniform speed along the plane BC so that here also during a time-interval equal to that of descent along AB, it will traverse along BC extended a distance twice AB; but let us suppose that, at the very instant the body begins its ascent it is subjected, by its very nature, to the same influences which

    surrounded

    THIRD DAY 217

    surrounded it during its descent from A along AB, namely, it descends from rest under the same acceleration as that which was

    effective in AB, and it traverses, during an equal interval of time, the same distance along this second plane as it did along AB ; it is clear that, by thus superposing upon the body a uniform motion of ascent and an accelerated motion of descent, it will be carried along the plane BC as far as the point C where these two velocities become equal.

    If now we assume any two points D and E, equally distant from the vertex B, we may then infer that the descent along BD takes place in the same time as the ascent along BE. Draw DF parallel to BC; we know that, after descent along AD, the body will ascend along DF; or, if, on reaching D, the body is carried along the horizontal DE, it will reach E with the same momentum [impetus] with which it left D; hence from E the body will ascend as far as C, proving that the velocity at E is the same as that at D.

    From this we may logically infer that a body which descends [245]

    along any inclined plane and continues its motion along a plane inclined upwards will, on account of the momentum acquired,

    ascend to an -equal height above the horizontal; so that if the descent is along 1% rf x e AB the body will be carried up the plane BC as far as the horizontal line ACD: and this is

    true whether the B

    inclinations of the “* 84 planes are the same or different, as in the case of the planes AB and BD. But by a previous postulate [p. 184] the speeds acquired by fall along variously inclined planes having the same vertical height are the same. If therefore the planes EB and BD have the same slope, the descent along EB will be able to drive the body along BD as far as D; and since this propulsion comes from the speed acquired on reaching

    the

    218 THE TWO NEW SCIENCES OF GALILEO

    the point B, it follows that this speed at B is the same whether the body has made its descent along AB or EB. Evidently then the body will be carried up BD whether the descent has been made along AB or along EB. The time of ascent along BD is however greater than that along BC, just as the descent along EB occupies more time than that along AB; moreover it has been demonstrated that the ratio between the lengths of these times is the same as that between the lengths of the planes. We must next discover what ratio exists between the distances traversed in equal times along planes of different slope, but of the same elevation, that is, along planes which are included between the same parallel horizontal lines. This is done as follows:

    THEOREM XV, PROPOSITION XXIV

    Given two parallel horizontal planes and a vertical line con- necting them; given also an inclined plane passing through

    the lower extremity of this vertical line; then, if a body fall freely along the vertical line and have its motion reflected along the inclined plane, the distance which it will traverse along this plane, during a time equal to that of the verti-

    cal fall, is greater than once but less than twice the vertical line.

    Let BC and HG be the two horizontal planes, connected by the perpendicular AE; also let EB represent the inclined plane

    E AC

    H EG Fig. 85

    along which the motion takes place after the body has fallen along AE and has been reflected from E towards B. Then, I say, that, during a time equal to that of fall along AE, the body will ascend the inclined plane through a distance which is

    greater

    THIRD DAY 219

    greater than AE but less than twice AE. Lay off ED equal to AE and choose F so that EB:BD=BD:BF. First we shall

    [246] show that F is the point to which the moving body will be carried after reflection from E towards B during a time equal to that of fall along AE; and next we shall show that the distance EF is greater than EA but less than twice that quantity.

    Let us agree to represent the time of fall along AE by the length AE, then the time of descent along BE, or what is the same thing, ascent along EB will be represented by the distance EB. Now, since DB is a mean proportional between EB and BF,

    and since BE is the time of descent for the entire distance BE, it follows that BD will be the time of descent through BF, while the remainder DE will be the time of descent along the remainder FE. But the time of descent along the fall from rest at B is the same as the time of ascent from E to F after reflection from E with the speed acquired during fall either through AE or BE. Therefore DE represents the time occupied by the body in passing from E to F, after fall from A to E and after reflection along EB. But by construction ED is equal to AE. This concludes the first part of our demonstration. Now since the whole of EB is to the whole of BD as the

    portion DB is to the portion BF, we have the whole of EB is to the whole of BD as the remainder ED is to the remainder

    DF; but EB>BD and hence ED>DF, and EF is less than twice DE or AE. Q. E. D. The same is true when the initial motion occurs, not along a

    perpendicular, but upon an inclined plane: the proof is also the same provided the upward sloping plane is less steep, i. e., longer, than the downward sloping plane.

    THEOREM XVI, PROPOSITION XXV

    If descent along any inclined plane is followed by motion along a horizontal plane, the time of descent along the in-

    clined plane bears to the time required to traverse any as- signed length of the horizontal plane the same ratio which

    twice

    220 THE TWO NEW SCIENCES OF GALILEO

    twice the length of the inclined plane bears to the given horizontal length.

    Let CB be any horizontal line and AB an inclined plane; after descent along AB let the motion continue through the assigned

    horizontal distance BD. Then, I say, the time of descent along AB bears to the time spent in traversing BD the same ratio

    — which twice AB bears to BD. Fig- 86 For, lay off BC equal to twice

    AB then it follows, from a previous proposition, that the time of descent along AB is equal to the time required to traverse BC; but the time along BC is to the time along DB as the length CB is to the length BD. Hence the time of descent along AB [247]

    is to the time along BD as twice the distance AB is to the dis- tance BD. Q. E. D.

    PROBLEM X, PROPOSITION XXVI

    Given a vertical height joining two horizontal parallel lines; given also a distance greater than once and less than twice this vertical height, it is required to pass through the foot of the given perpendicular an inclined plane such that, after fall through the given vertical height, a body whose mo-

    tion is deflected along the plane will traverse the assigned distance in a time equal to the time of vertical fall.

    Let AB be the vertical distance separating two parallel horizontal lines AO and BC; also let FE be greater than once and less than twice BA. The problem is to pass a plane through B, extending to the upper horizontal line, and such that a body, after having fallen from A to B, will, if its motion be deflected along the inclined plane, traverse a distance equal to EF in a time equal to that of fall along AB. Lay off ED equal to AB; then the remainder DF will be less than AB since the entire length EF is less than twice this quantity; also lay off DI equal to DF, and choose the point X such that EI:ID=DF:FX; from B, draw the plane BO equal in length to EX. Then, I say,

    that

    THIRD DAY 221

    that the plane BO is the one along which, after fall through AB, a body will traverse the assigned distance FE in a time equal to the time of fall through AB. Lay off BR and RS equal to ED and DF respectively; then since EI:ID=DFiFX, we have, component, ED:DI =DX:XF=ED:DF=EX:XD =BOOR =

    O A

    X F D IE Fig. 87

    ROrOS. If we represent the time of fall along AB by the length AB, then OB will represent the time of descent along [248] ̂

    OB, and RO will stand for the time along OS, while the re- mainder BR will represent the time required for a body starting

    from rest at O to traverse the remaining distance SB. But the time of descent along SB starting from rest at O is equal to the time of ascent from B to S after fall through AB. Hence BO is that plane, passing through B, along which a body, after fall through AB, will traverse the distance BS, equal to the assigned distance EF, in the time-interval BR or BA. Q. E. F.

    THEOREM XVII, PROPOSITION XXVII

    If a body descends along two inclined planes of different lengths but of the same vertical height, the distance which it will traverse, in the lower part of the longer plane, during a time-interval equal to that of descent over the shorter plane, is equal to the length of the shorter plane plus a portion of it to which the shorter plane bears the same ratio which the longer plane bears to the excess of the longer over the shorter plane.

    Let AC be the longer plane, AB, the shorter, and AD the common elevation; on the lower part of AC lay off CE equal

    to

    222 THE TWO NEW SCIENCES OF GALILEO

    to AB. Choose F such that CA:AE=CA:CA-AB =CE:EF. Then, I say, that FC is that distance which will, after fall from A, be traversed during a time-interval equal to that required for

    descent along AB. For since CA:AE=CE:EF, it follows that the remainder EA: the remainder AF = CA : AE.

    Therefore AE is a mean pro- portional between AC and

    AF. Accordingly if the length DAB is employed to measure

    Fig- 88 the time of fall along AB, then the distance AC will measure the time of descent through AC; but the time of descent through AF is measured by the length AE, and that through FC by EC. Now EC =AB; and hence follows the proposition.

    [249]

    PROBLEM XI, PROPOSITION XXVIII

    Let AG be any horizontal line touching a circle; let AB be the diameter passing through the point of contact; and let AE and EB represent any two chords. The problem is to determine what ratio the time of fall through A G AB bears to the time of descent over both AE and EB. Extend BE till it meets the tangent at G, and draw AF so as to bisect the angle BAE. Then, I say, the time through AB is to the sum of the times along AE and EB as the length AE is to the sum of the lengths AE and EF. For since the angle FAB is equal to the angle FAE, while the angle EAG is equal to the angle ABF it follows that the entire angle GAF is equal to the sum of the angles FAB and ABF. But the angle GFA is also equal to the sum of these two angles. Hence the length GF is equal to the length

    GA

    THIRD DAY 223 GA; and since the rectangle BG.GE is equal to the square of GA, it will also be equal to the square of GF, or BG:GF = GFiGE. If now we agree to represent the time of descent alpng AE by the length AE, then the length GE will represent the time of descent along GE, while GF will stand for the time of descent through the entire distance GB; so also EF will denote the time through EB after fall from G or from A along AE. Consequently the time along AE, or AB, is to the time

    along AE and EB as the length AE is to AE-f EF. Q. E. D. A shorter method is to lay off GF equal to GA, thus making

    GF a mean proportional between BG and GE. The rest of the proof is as above.

    THEOREM XVIII, PROPOSITION XXIX

    Given a limited horizontal line, at one end of which is

    erected a limited vertical line whose length is equal to one- half the given horizontal line; then a body, falling through this given height and having its motion deflected into a horizontal direction, will traverse the given horizontal dis-

    tance and vertical line in less time than t O

    will any other verti- – -A cal distance plus the

    given horizontal dis- tance.

    [250]

    N

    B

    Let BC be the given dis- * c tance in a horizontal

    plane; at the end B erect ___*___J_______

    a perpendicular, on which ~~*C (D ” lay off BA equal to half Fig. 90 BC. Then, I say, that the time required for a body, starting from rest at A, to traverse the two distances, AB and BC, is the least of all possible times in which this same distance BC together with a vertical portion, whether greater or less than AB, can be traversed.

    Lay off EB greater than AB, as in the first figure, and less than

    224 THE TWO NEW SCIENCES OF GALILEO than AB, as in the second. It must be shown that the time required to traverse the distance EB plus BC is greater than that required for AB plus BC. Let us agree that the length AB shall represent the time along AB, then the time occupied in traversing the horizontal portion BC will also be AB, seeing that BC=2AB; consequently the time required for both AB and BC will be twice AB. Choose the point O such that EB: BO=BO:BA, then BO will represent the time of fall through EB. Again lay off the horizontal distance BD equal to twice BE; whence it is plear that BO represents the time along BD after fall through EB. Seledt a point N such that DB:BC = EB:BA=OB:BN. Now since the horizontal motion is uni-

    form and since OB is the time occupied in traversing BD, after fall from E, it follows that NB will be the time along BC after fall through the same height EB. Hence it is clear that OB plus BN represents the time of traversing EB plus BC; and, since twice BA is the time along AB plus BC, it remains to be shown thatOB+BN>2BA.

    _But since EB:BO=BO:BA, it follows that EB:BA=OB2: BA2._Moreover since EB :BA =OB :BN it follows that OB :BN = OB2:BA2. But OB:BN = (OB:BA)(BA:BN), and therefore AB:BN=OB:BA, that is, BA is a mean proportional between BO and BN. Consequently OB+BN>2BA. Q. E. D.

    [251] THEOREM XIX, PROPOSITION XXX

    A perpendicular is let fall from any point in a horizontal line; it is required to pass through any other point in this same horizontal line a plane which shall cut the perpendicu-

    lar and along which a body will descend to the perpendicular in the shortest possible time. Such a plane will cut from the

    perpendicular a portion equal to the distance of the as- sumed point in the horizontal from the upper end of the

    perpendicular. Let AC be any horizontal line and B any point in it from which

    is dropped the vertical line BD. Choose any point C in the horizontal line and lay off, on the vertical, the distance BE

    equal

    THIRD DAY 225

    equal to BC; join C and E. Then, I say, that of all inclined planes that can be passed through C, cutting the perpendicular, CE is that one along which the descent to the perpendicular is accomplished in the shortest time. For, draw the plane CF cutting the vertical above E, and the plane CG cutting the vertical below E; and draw IK, a parallel vertical line, touching at C a cir-

    cle described with BC as radius. Let EK be drawn parallel to CF, and extended to meet the tan-

    gent, after cutting the circle at L. Now it is clear that the time of fall along LE is equal to the time along CE; but the time along KE is greater than along LE; therefore the time along KE is greater than along CE. But the time along KE is equal to the time along CF, since they have the same length and the same

    slope; and, in like manner, it fol- lows that the planes CG and IE,

    having the same length and the same slope, will be traversed in equal times. Also, since HE< IE, the time along HE will be less than the time along IE. Therefore also the time along CE (equal to the time along HE), will be shorter than the time along IE. Q. E. D.

    THEOREM XX, PROPOSITION XXXI

    If a straight line is inclined at any angle to the horizontal and if, from any assigned point in the horizontal, a plane of quickest descent is to be drawn to the inclined line, that plane will be the one which bisects the angle contained

    [252] between two lines drawn from the given point, one per-

    pendicular

    226 THE TWO NEW SCIENCES OF GALILEO

    pendicular to the horizontal line, the other perpendicular to the inclined line.

    Let CD be a line inclined at any angle to the horizontal AB; and from any assigned point A in the horizontal draw AC per-

    pendicular to AB, and AE perpendicular to CD; draw FA so as to bisect the angle CAE. Then, I say, that of all the planes which can be drawn through the point A, cutting the line CD

    at any points whatsoever AF is the one of quickest descent [in quo tempore omnium brevissimo fiat de- scensus]. Draw FG par-

    allel to AE; the alternate angles GFA and FAE will be equal; also the angle EAF is equal to the angle FAG. Therefore the sides GF and GA of the triangle

    FGA are equal. Accord- ingly if we describe a circle

    about G as center, with GA as radius, this circle will pass through the point F,

    p.

    and will touch the horizontal at the point A and the inclined line at F; for GFC is a right angle, since GF and AE are parallel. It is clear therefore that all lines drawn from A to the inclined

    line, with the single exception of FA, will extend beyond the circumference of the circle, thus requiring more time to traverse any of them than is needed for FA. Q. E. D.

    LEMMA

    If two circles one lying within the other are in contact, and if any straight line be drawn tangent to the inner circle, cutting the outer circle, and if three lines be drawn from the point at which the circles are in contact to three points on the tangential straight line, namely, the point of tangency on the inner circle and the two points where the

    straight

    THIRD DAY

    straight line extended cuts the outer circle, then these three lines will contain equal angles at the point of contact.

    Let the two circles touch each other at the point A, the center of the smaller being at B, the center of the larger at C. Draw [253]

    the straight line FG touching the inner circle at H, and cutting the outer at the points F and G; also draw the three lines AF, AH, and AG. Then, I say, the angles contained by these lines, FAH and GAH, are equal. Pro-

    long AH to the circumference at I; from the centers of the circles, draw BH and CI; join the centers B and C and extend the line until

    it reaches the point of contact at A and cuts the circles at the points O and N. But now the lines BH and CI are parallel, be-

    cause the angles ICN and HBO are equal, each being twice the angle IAN. And since BH, drawn from the center to the point of contact is perpendicular to FG, it follows that CI will also be perpendicular to FG and that the arc FI is equal to the arc IG; consequently the angle FAI is equal to the angle IAG. Q. E. D.

    THEOREM XXI, PROPOSITION XXXII

    If in a horizontal line any two points are chosen and if through one of these points a line be drawn inclined towards the other, and if from this other point a straight line is drawn to the inclined line in such a direction that it cuts

    off from the inclined line a portion equal to the distance between the two chosen points on the horizontal line, then the time of descent along the line so drawn is less than along any other straight line drawn from the same point to the same inclined line. Along other lines which make equal angles on opposite sides of this line, the times of descent are the same.

    Let

    Fig. 93

    228 THE TWO NEW SCIENCES OF GALILEO

    Let A and B be any two points on a horizontal line: through B draw an inclined straight line BC, and from B lay off a dis-

    tance BD equal to BA; join the points A and D. Then, I say, the time of descent along AD is less than along any other line drawn from A to the inclined line BC. From the point A draw AE perpendicular to BA; and from the point D draw DE per-

    pendicular to BD, intersecting AE at E. Since in the isosceles

    U j^ triangle ABD, we have the — angles BAD and BDA equal, [254]

    their complements DAE and EDA are equal. Hence if, with E as center and EA as radius, we describe a circle it will pass through D and will touch the lines BA and BD

    at the points A and D. Now since A is the end of the verti-

    cal line AE, the descent along AD will occupy less time than along any other line drawn

    from the extremity A to the line BC and extending beyond the circumference of the circle; which concludes the first part of the proposition.

    If however, we prolong the perpendicular line AE, and choose any point F upon it, about which as center, we describe a circle of radius FA, this circle, AGC, will cut the tangent line in the points G and C. Draw the lines AG and AC which will accord-

    ing to the preceding lemma, deviate by equal angles from the median line AD. The time of descent along either of these lines is the same, since they start from the highest point A, and terminate on the circumference of the circle AGC.

    PROBLEM XII, PROPOSITION XXXIII

    Given a limited vertical line and an inclined plane of equal height, having a common upper terminal; it is required to find a point on the vertical line, extended upwards, from

    which

    94

    THIRD DAY

    which a body will fall and, when deflected along the inclined plane, will traverse it in the same time-interval which is required for fall, from rest, through the given vertical height.

    Let AB be the given limited vertical line and AC an in- clined plane having the same altitude. It is required to find on

    the vertical BA, extended above A, a point from which a falling body will traverse the distance AC in the same time which is spent in falling, from rest at A, through the given vertical line AB. Draw the line DCE at right angles to AC, and lay off CD equal to AB; also join the points A and D; then the angle ADC will be greater than the angle CAD, since the side CA is greater than either AB or CD. Make the angle DAE equal to the angle [255]

    ADE, and draw EF perpendicular to AE; then EF will cut the inclined plane, ex-

    tended both ways, at F. Lay off AI and AG each equal to CF ; through G draw the horizontal line GH.

    Then, I say, H is the point sought.

    For, if we agree to let the length AB represent the time of fall along the verti-

    cal AB, then AC will likewise represent the time of descent from

    rest at A, along AC; Fig. 95 and since, in the right-angled triangle AEF, the line EC has been drawn from the right angle at E perpendicular to the base AF, it follows that AE will be a mean proportional between FA and AC, while CE will be a mean proportional between AC and CF, that is between CA and AI. Now, since AC represents the time of descent from A along AC, it follows that AE will be the time along the entire distance AF, and EC the time along AI. But

    since

    230 THE TWO NEW SCIENCES OF GALILEO since in the isosceles triangle AED the side EA is equal to the side ED it follows that ED will represent the time of fall along AF, while EC is the time of fall along AI. Therefore CD, that is AB, will represent the time of fall, from rest at A, along IF; which is the same as saying that AB is the time of fall, from G or from H, along AC. E. F.

    PROBLEM XIII, PROPOSITION XXXIV

    Given a limited inclined plane and a vertical line having their highest point in common, it is required to find a point in the vertical line extended such that a body will fall from it and then traverse the inclined plane in the same time which is required to traverse the inclined plane alone starting from rest at the top of said plane.

    Let AC and AB be an inclined plane and a vertical line respectively, having a common highest point at A. It is re-

    quired to find a point in the vertical line, above A, such that a body, falling from it and afterwards having its motion directed along AB, will traverse both the assigned part of the vertical

    [256]

    line and the plane AB in the same time which is required for the plane AB alone, starting from rest at A. Draw BC a hori-

    zontal line and lay off AN equal to AC; choose the point L so that AB:BN =AL:LC, and lay off AI equal to AL; choose the point E such that CE, laid off on the vertical AC produced, will be a third proportional to AC and BI. Then, I say, CE is the distance sought; so that, if the vertical line is extended above A and if a portion AX is laid off equal to CE, then a body falling from X will traverse both the distances, XA and AB, in the same time as that required, when starting from A, to traverse AB alone.

    Draw XR parallel to BC and intersecting BA produced in R; next draw ED parallel to BC and meeting BA produced in D; on AD as diameter describe a semicircle; from B draw BF

    perpendicular to AD, and prolong it till it meets the circum- ference of the circle; evidently FB is a mean proportional

    between AB and BD, while FA is a mean proportional between

    DA

    THIRD DAY 231

    DAandAB. Take BS equal to BI and FH equal to FB. Now since AB:BD=AC:CE and since BF is a mean proportional [257]

    between AB and BD, while BI is a mean proportional between AC and CE, it follows that BA:AC=FB:BS, and since BA: AC=BA:BN=FB:BS we shall have, convertendo, BF:FS = AB:BN =AL:LC. Consequently the rectangle formed by FB

    Fig. 96

    and CL is equal to the rectangle whose sides are AL and SF; moreover, this rectangle AL.SF is the excess of the rectangle AL.FB, or AI.BF, over the rectangle ALBS, or AI.IB. But the rectangle FB.LC is the excess of the rectangle AC.BF over the rectangle AL.BF; and moreover the rectangle AC.BF is equal to the rectangle AB.BI since BA:AC=FB:BI; hence the excess of the rectangle AB.BI over the rectangle AI.BF, or AI.FH, is equal to the excess of the rectangle AI.FH over the rectangle AI.IB; therefore twice the rectangle AI.FH is equal to the sum

    of

    232 THE TWO NEW SCIENCES OF GALILEO

    of_the rectangles AB.BI and AI.IB, or 2AI.FH=2AI.IB-f

    BI2. AddAI2 to each side_,_then 2AI.IB+BI2 +AI2=AB2 =

    2AI.FH+AP. Again add_BF2 to each side,jhen AE^+BF2 = AF2 = 2AI.FH + AI2 + BF2 = 2AI.FH + AI2 + FH2. But AF2=2AH.HF+AH2+HF2; and hence 2ALFH+AP + FH2=2AH.HF+AH2+HF2. Subtracting HF2 from each side we have 2AI.FH+AI2=2AH.HF+AH2. Since now FH is a fadlor common to both recftangles, it follows that AH is equal to AI; for if AH were either greater or smaller than AI, then the two redlangles AH.HF plus the square of HA would be either larger or smaller than the two recftangles AI.FH plus the square of LA, a result which is contrary to what we have just demonstrated.

    If now we agree to represent the time of descent along AB by the length AB, then the time through AC will likewise be measured by AC; and IB, which is a mean proportional between AC and CE, will represent the time through CE, or XA, from rest at X. Now, since AF is a mean proportional between DA and AB, or between RB and AB, and since BF, which is equal to FH, is a mean proportional between AB and BD, that is between AB and AR, it follows, from a preceding proposition [Proposition XIX, corollary], that the difference AH represents the time of descent along AB either from rest at R or after fall from X, while the time of descent along AB, from rest at A, is measured by the length AB. But as has just been shown, the time of fall through XA is measured by IB, while the time of descent along AB, after fall, through RA or through XA, is IA. Therefore the time of descent through XA plus AB is measured by the length AB, which, of course, also measures the time of descent, from rest at A, along AB alone. Q. E. F.

    [258] PROBLEM XIV, PROPOSITION XXXV

    Given an inclined plane and a limited vertical line, it is re- quired to find a distance on the inclined plane which a body,

    starting from rest, will traverse in the same time as that needed to traverse both the vertical and the inclined plane.

    Let

    THIRD DAY 233 Let AB be the vertical line and BC the inclined plane. It is

    required to lay off on BC a distance which a body, starting from rest, will traverse in a time equal to that which is occupied by fall through the vertical AB and by descent of the plane. Draw the horizontal line AD, which intersects at E the prolongation of the inclined plane CB ; lay off BF equal to BA, and about E as center, with EF as radius describe the circle FIG. Prolong FE until it intersects the circumference at G. Choose a point H such that GB:BF =BH:HF. Draw the line HI tangent to the

    Fig. 97

    circle at I. At B draw the line BK perpendicular to FC, cutting the line EIL at L; also draw LM perpendicular to EL and cutting

    BC at M. Then, I say, BM is the distance which a body, start- ing from rest at B, will traverse in the same time which is re-

    quired to descend from rest at A through both distances, AB and BM. Lay off EN equal to EL; then since GB:BF = BH:HF, we shall have, permutando, GB:BH=BF:HF, and, dividendo, GH:BH=BH:HF. Consequently the redlangle GH.HF is equal to the square on BH; but this same redlangle is also equal to the square on HI; therefore BH is equal to HI. Since, in the quadrilateral ILBH, the sides HB and HI are

    equal

    234 THE TWO NEW SCIENCES OF GALILEO equal and since the angles at B and I are right angles, it follows that the sides BL and LI are also equal: but El =EF; therefore [259]

    the total length LE, or NE, is equal to the sum of LB and EF. If we subtract the common part EF, the remainder FN will be equal to LB: but, by construction, FB=BA and, therefore, LB = AB -fBN. If again we agree to represent the time of fall through AB by the length AB, then the time of descent along EB will be measured by EB ; moreover since EN is a mean pro-

    portional between ME and EB it will represent the time of descent along the whole distance EM; therefore the difference of these distances, BM, will be traversed, after fall from EB, or AB, in a time which is represented by BN. But having already assumed the distance AB as a measure of the time of fall through AB, the time of descent along AB and BM is measured by AB + BN. Since EB measures the time of fall, from rest at E, along EB, the time from rest at B along BM will be the mean pro-

    portional between BE and BM, namely, BL. The time there- fore for the path AB +

    BM, starting from rest at A is AB+BN; but the time for BM alone, starting from rest at B, is BL; and since it has already been shown that BL = AB+BN,

    the proposition follows. Another and shorter

    proof is the following: Fig. 98 Let BC be the inclined

    plane and BA the vertical; at B draw a perpendicular to EC, extending it both ways; lay off BH equal to the excess of BE over BA; make the angle HEL equal to the angle BHE; prolong EL until it cuts BK in L; at L draw LM perpendicular to EL and extend it till it meets BC in M; then, I say, BM is the portion of BC sought. For, since the angle MLE is a right angle, BL will be a mean proportional between MB and BE,

    while

    THIRD DAY

    while LE is a mean proportional between ME and BE; lay off EN equal to LE; then NE =EL =LH, and HB =NE-BL. But also HB=NE-(NB+BA); therefore BN+BA==BL. If now we assume the length EB as a measure of the time of descent along EB, the time of descent, from rest at B, along BM will be represented by BL; but, if the descent along BM is from rest at E or at A, then the time of descent will be measured by BN; and AB will measure the time along AB. Therefore the time re-

    quired to traverse AB and BM, namely, the sum of the distances AB and BN, is equal to the time of descent, from rest at B, along BM alone. Q. E. F.

    [260] LEMMA

    Let DC be drawn perpendicular to the diameter BA; from the extremity B draw the line BED at random; draw the line FB. Then, I say, FB is a mean proportional be-

    tween DB and BE. Join the points E and F. Through B, draw the tangent BG which will be parallel to CD. Now, since the angle DBG is equal to the angle FDB, and since the alternate angle of GBD is equal to EFB, it follows that the triangles FDB and FEB are similar and hence BD:BF=FB:BE.

    LEMMA

    Let AC be a line which is longer than DF, and let the ratio of AB to BC be greater than that of DE to EF. Then, I say, . AB is greater than DE. For, if AB

    * 5 S bears to BC a ratio greater than that of D B G P -^ to ̂ > t^ien ̂ ^ w^ kear to some 1 — • — ‘ • length shorter than EF, the same ratio

    Fig. 100 which AB bears to BC. Call this length EG; then since AB:BC =DE:EG, it follows, componendo et con- vertendo,

    Fig. 99

    236 THE TWO NEW SCIENCES OF GALILEO

    vertendo, that CArAB =GD:DE. But since CA is greater than GD, it follows that BA is greater than DE.

    LEMMA

    Let ACIB be the quadrant of a circle; from B draw BE parallel to AC; about any point in the line BE describe a circle BOES, touch-

    ing AB at B and intersecting the circumference of the quadrant at I. Join the points C and B; draw the line CI, prolonging it to S. Then, I say, the line CI is always less than CO. Draw the line AI touching the circle BOE. Then,

    if the line DI be drawn, it will be equal to DB; but, since DB touches the quadrant, DI will also be tangent to it and will be at right angles to AI; thus AI touches the circle BOE at I. And since the

    angle AIC is greater than the angle ABC, subtending as it does a larger arc, it follows that the angle SIN is also greater than the angle ABC. Wherefore the arc IES is

    greater than the arc BO, and the line CS, being nearer the center, is longer than CB. Consequently CO is greater than CI, since SC: CB=OC:CI.

    This result would be all the Fig. ioi more marked if, as in the second

    figure, the arc BIC were less than a quadrant. For the per- pendicular DB would then cut the circle CIB; and so also would

    DI

    THIRD DAY 237 DI which is equal to BD; the angle DIA would be obtuse and therefore the line AIN would cut the circle BIE. Since the angle ABC is less than the angle AIC, which is equal to SIN, and still less than the angle which the tangent at I would make with the line SI, it follows that the arc SEI is far greater than the arc BO; whence, etc. Q. E. D.

    THEOREM XXII, PROPOSITION XXXVI

    If from the lowest point of a vertical circle, a chord is drawn subtending an arc not greater than a quadrant, and if from the two ends of this chord two other chords

    be drawn to any point on the arc, the time of descent along the two latter chords will be shorter than along the first, and shorter also, by the same amount, than along the lower of these two latter chords.

    [262]

    Let CBD be an arc, not exceeding a quadrant, taken from a vertical circle whose lowest point is C; let CD be the chord [planum elevatum] sub- tending this arc, and let | there be two other chords drawn from C

    and D to any point B on the arc. Then, I say, the time of descent along the two chords [plana] DB and BC is shorter

    than along DC alone, or along BC alone, starting fromrestatB. Through the point D, draw the horizontal line MDA

    cutting CB extended at

    T G S K P

    Fig. 102

    A: draw DN and MC at right angles to MD, and BN at right

    angles to BD; about the right-angled triangle DBN describe the semicircle DFBN, cutting DC at F. Choose the point O such that DO will be a mean proportional between CD and DF; in like

    manner

    238 THE TWO NEW SCIENCES OF GALILEO manner select V so that AV is a mean proportional between CA and AB. Let the length PS represent the time of descent along the whole distance DC or BC, both of which require the same time. Lay off PR such that CD £>O = timePS . timePR. Then PR will represent the time in which a body, starting from D, will traverse the distance DF, while RS will measure the time in which the remaining distance, FC, will be traversed. But since PS is also the time of descent, from rest at B, along BC, and if we choose T such that BC:CD =PS:PT then PT will measure the time of descent from A to C, for we have already shown [Lemma] that DC is a mean proportional between AC and CB. Finally choose the point G such that CA AV =PT:PG, then PG will be the time of descent from A to B, while GT will be the residual time of descent along BC following descent from A to B. But, since the diameter, DN, of the circle DFN is a vertical line, the chords DF and DB will be traversed in equal times; wherefore if one can prove that a body will traverse BC, after descent along DB, in a shorter time than it will FC after descent along DF he will have proved the theorem. But a body descending from D along DB will traverse BC in the same time as if it had

    come from A along AB, seeing that the body acquires the same [263]

    momentum in descending along DB as along AB. Hence it remains only to show that descent along BC after AB is quicker than along FC after DF. But we have already shown that GT represents the time along BC after AB; also that RS measures the time along FC after DF. Accordingly it must be shown that RS is greater than GT, which may be done as follows: Since SP:PR=CD:DO, it follows, invertendo et convertendo, that RS:SP=OC:CD; also we have SP:PT=DC:CA. And since TP:PG=CA:AV, it follows, invertendo, that PT:TG = AC:CV, therefore, ex czquali, RS:GT=OC:CV. But, as we shall presently show, OC is greater than CV; hence the time RS is greater than the time GT, which was to be shown. Now, since [Lemma] CF is greater than CB and FD smaller than BA, it follows that CD:DF>CA:AB. But CD:DF=CO:OF,

    seeing that CDOX) =DO:DF; and CA:AB =CV2:VB2. There- fore

    THIRD DAY

    fore CO£)F>CV:VB, and, according to the preceding lemma, CO>CV. Besides this it is clear that the time of descent along DC is to the time along DEC as DOC is to the sum of DO andCV.

    SCHOLIUM

    From the preceding it is possible to infer that the path of quickest descent [lationem omnium velocissimam] from one point to another is not the shortest path, namely, a straight line, but the arc of a circle.* In the quadrant BAEC, having the side BC vertical, divide the arc AC into any number of equal parts, AD, DE, EF, FG, GC, and from C draw straight lines to the points A, D, E, F, G; -g draw also the straight lines AD, DE, EF, FG, GC. Evidently de-

    scent along the path ADC is quicker [264]

    than along AC alone or along DC from rest at D. But a body, start-

    ing from rest at A, will traverse DC more quickly than the path ADC; while, if it starts from rest at A, it will traverse the path DEC in a shorter time than DC alone.

    Hence descent along the three Fig. 103 chords, ADEC, will take less time than along the two chords ADC. Similarly, following descent along ADE, the time required to traverse EFC is less than that needed for EC alone. There-

    | fore descent is more rapid along the four chords ADEFC than I along the three ADEC. And finally a body, after descent along ADEF, will traverse the two chords, FGC, more quickly than FC alone. Therefore, along the five chords, ADEFGC, descent will be more rapid than along the four, ADEFC. Consequently

    * It is well known that the first correct solution for the problem of quickest descent, under the condition of a constant force was given by John Bernoulli (1667-1748). [Trans.]

    240 THE TWO NEW SCIENCES OF GALILEO the nearer the inscribed polygon approaches a circle the shorter is the time required for descent from A to C. What has been proven for the quadrant holds true also for

    smaller arcs; the reasoning is the same.

    PROBLEM XV, PROPOSITION XXXVII

    Given a limited vertical line and an inclined plane of equal altitude; it is required to find a distance on the inclined plane which is equal to the vertical line and which is traversed in an interval equal to the time of fall along the vertical line.

    Let AB be the vertical line and AC the inclined plane. We must locate, on the inclined plane, a distance equal to the vertical

    A. line AB and which will be traversed by a body starting from rest at A in the same time needed for fall along the vertical line. Lay off AD equal to AB, and bisect the remainder DC at I. Choose

    C^ B the point E such that AC:CI Fig. 104 =CI:AE and lay off DO

    equal to AE. Clearly EG is equal to AD, and also to AB. And further, I say that EG is that distance which will be traversed by a body, starting from rest at A, in the same time which is required for that body to fall through the distance AB. For since AC:CI =CI:AE =ID:DG, we have, convertendo, CA: AI =DI:IG. And since the whole of CA is to the whole of AI

    as the portion CI is to the portion IG, it follows that the re- [265]

    mainder IA is to the remainder AG as the whole of CA is to the

    wrhole of AI. Thus AI is seen to be a mean proportional be- tween CA and AG, while CI is a mean proportional between CA

    and AE. If therefore the time of fall along AB is represented by the length AB, the time along AC will be represented by AC, while CI, or ID, will measure the time along AE. Since AI is a mean proportional between CA and AG, and since CA is a

    measure

    THIRD DAY 241

    measure of the time along the entire distance AC, it follows that AI is the time along AG, and the difference 1C is the time along the difference GC; but DI was the time along AE. Conse-

    quently the lengths DI and 1C measure the times along AE and CG respectively. Therefore the remainder DA represents the time along EG, which of course is equal to the time along AB.

    Q. E. F.

    COROLLARY

    From this it is clear that the distance sought is bounded at each end by portions of the inclined plane which are traversed in equal times.

    PROBLEM XVI, PROPOSITION XXXVIII

    Given two horizontal planes cut by a vertical line, it is required to find a point on the upper part of the vertical line from which bodies may fall to the horizontal planes and there, having their motion deflected into a horizontal direction, will, during an interval equal to the time of fall, traverse distances which bear to each other any assigned ratio of a smaller quantity to a larger.

    Let CD and BE be the horizontal planes cut by the vertical \CB, and let the ratio of the smaller quantity to the larger be that of N to FG. It is required to find in the upper part Df the vertical line, AB, a point from which a body falling to the plane CD and there having its motion deflected along this plane, will traverse, during an interval equal to its time of fall a distance such that if another body, falling from this same point to the plane BE, there have its motion deflected along this plane and continued during an interval equal to its time of fall, will traverse a distance which bears to the former distance the

    J266] ratio of FG to N. Lay off GH equal to N, and select the point L so that FHiHG =BC:CL. Then, I say, L is the point sought. For, if we lay off CM equal to twice CL, and draw the line LM cutting the plane BE at O, then BO will be equal to twice

    BL

    242 THE TWO NEW SCIENCES OF GALILEO

    BL. And since FH:HG=BC:CL, we have, componendo et convertendo, HG:GF=N:GF=CL:LB =CM:BO. It is clear that, since CM is double the distance LC, the space CM is that which a body falling from L through LC will traverse in the plane CD; and, for the same reason, since BO is twice the distance BL, it is clear that BO is the distance which a body,

    B

    <N

    H

    Fig. 105

    after fall through LB, will traverse during an interval equal to the time of its fall through LB. Q. E. F.

    SAGR. Indeed, I think we may concede to our Academician, without flattery, his claim that in the principle [principio, i. e., accelerated motion] laid down in this treatise he has established a new science dealing with a very old subject. Observing with what ease and clearness he deduces from a single principle the proofs of so many theorems, I wonder not a little how such a question escaped the attention of Archimedes, Apollonius, Euclid and so many other mathematicians and illustrious philosophers, especially since so many ponderous tomes have

    been devoted to the subject of motion. [267]

    SALV. There is a fragment of Euclid which treats of motion,

    but

    THIRD DAY 243

    but in it there is no indication that he ever began to investigate the property of acceleration and the manner in which it varies with slope. So that we may say the door is now opened, for the first time, to a new method fraught with numerous and wonder-

    ful results which in future years will command the attention of other minds.

    SAGR. I really believe that just as, for instance, the few properties of the circle proven by Euclid in the Third Book of his Elements lead to many others more recondite, so the prin-

    ciples which are set forth in this little treatise will, when taken up by speculative minds, lead to many another more remarkable result; and it is to be believed that it will be so on account of the nobility of the subject, which is superior to any other in nature. During this long and laborious day, I have enjoyed these

    simple theorems more than their proofs, many of which, for their complete comprehension, would require more than an hour each; this study, if you will be good enough to leave the book in my hands, is one which I mean to take up at my leisure after we have read the remaining portion which deals with the motion of projectiles; and this if agreeable to you we shall take up to- morrow.

    SALV. I shall not fail to be with you.

    END OF THE THIRD DAY.

    [268]

    FOURTH DAY

    ALVIATT. Once more, Simplicio is here on time; so let us without delay take up the question of motion. The text of our Author is as follows :

    THE MOTION OF PROJECTILES

    In the preceding pages we have discussed the properties of uniform motion and of motion naturally accel-

    erated along planes of all inclinations. I now propose to set forth those properties which belong to a body whose motion is compounded of two other motions, namely, one uniform and one naturally accelerated; these properties, well worth knowing, I propose to demonstrate in a rigid manner. This is the kind of motion seen in a moving projectile; its origin I conceive to be as follows :

    Imagine any particle projected along a horizontal plane with- out friction; then we know, from what has been more fully

    explained in the preceding pages, that this particle will move along this same plane with a motion which is uniform and perpetual, provided the plane has no limits. But if the plane is limited and elevated, then the moving particle, which we imag-

    ine to be a heavy one, will on passing over the edge of the plane acquire, in addition to its previous uniform and perpetual motion, a downward propensity due to its own weight; so that the resulting motion which I call projection \projectio], is com-

    pounded of one which is uniform and horizontal and of another which is vertical and naturally accelerated. We now proceed to

    demonstrate

    FOURTH DAY 245

    demonstrate some of its properties, the first of which is as fol- lows:

    [269]

    THEOREM I, PROPOSITION I

    A projectile which is carried by a uniform horizontal motion compounded with a naturally accelerated vertical motion describes a path which is a semi-parabola.

    SAGR. Here, Salviati, it will be necessary to stop a little while for my sake and, I believe, also for the benefit of Sim- plicio; for it so happens that I have not gone very far in my study of Apollonius and am merely aware of the fact that he treats of the parabola and other conic sections, without an un-

    derstanding of which I hardly think one will be able to follow the proof of other propositions depending upon them. Since even in this first beautiful theorem the author finds it necessary to prove that the path of a projectile is a parabola, and since, as I imagine, we shall have to deal with only this kind of curves, it will be absolutely necessary to have a thorough acquaintance, if not with all the properties which Apollonius has demonstrated for these figures, at least with those which are needed for the present treatment.

    SALV. You are quite too modest, pretending ignorance of facts which not long ago you acknowledged as well known — I mean at the time when we were discussing the strength of materials and needed to use a certain theorem of Apollonius which gave you no trouble.

    SAGR. I may have chanced to know it or may possibly have assumed it, so long as needed, for that discussion; but now when we have to follow all these demonstrations about such curves we

    ought not, as they say, to swallow it whole, and thus waste time and energy.

    SIMP. Now even though Sagredo is, as I believe, well equipped for all his needs, I do not understand even the elementary terms; for although our philosophers have treated the motion of pro-

    jectiles, I do not recall their having described the path of a projectile except to state in a general way that it is always a

    curved

    246 THE TWO NEW SCIENCES OF GALILEO

    curved line, unless the projection be vertically upwards. But [270]

    if the little Euclid which I have learned since our previous dis-

    cussion does not enable me to understand the demonstrations

    which are to follow, then I shall be obliged to accept the the- orems on faith without fully comprehending them.

    SALV. On the contrary, I desire that you should understand them from the Author himself, who, when he allowed me to see this work of his, was good enough to prove for me two of the principal properties of the parabola because I did not happen to have at hand the books of Apollonius. These properties, which are the only ones we shall need in the present discussion, he proved in such a way that no prerequisite knowledge was re-

    quired. These theorems are, indeed, given by Apollonius, but after many preceding ones, to follow which would take a long while. I wish to shorten our task by deriving the first property

    purely and simply from the mode of gen- eration of the parabola and proving the

    second immediately from the first. Beginning now with the first, imagine

    a right cone, erected upon the circular base ibkc with apex at /. The section of this cone made by a plane drawn parallel to the side Ik is the curve which is called

    a parabola. The base of this parabola be cuts at right angles the diameter ik of the circle ibkc, and the axis ad is parallel to the side Ik; now having taken any point / in the curve bfa draw the straight line fe

    Fig. 1 06 parallel to bd; then, I say, the square of bd is to the square of fe in the same ratio as the axis ad is to the portion ae. Through the point e pass a plane parallel to the circle ibkc, producing in the cone a circular section whose diameter is the line geh. Since bd is at right angles to ik in the circle ibk, the square of bd is equal to the rectangle formed by id and dk; so also in the upper circle which passes through the points gfh the square of fe is equal to the rectangle formed by

    f

    FOURTH DAY 247

    ge and eh; hence the square of bd is to the square of fe as the rectangle id.dk is to the rectangle ge.eh. And since the line ed is parallel to hk, the line eh, being parallel to dk, is equal to it; therefore the rectangle idjdk is to the rectangle ge.eh as id is to

    [271] ge, that is, as da is to ae; whence also the rectangle id.dk is to the rectangle ge.eh, that is, the square of bd is to the square of fe, as the axis da is to the portion ae. Q. E. D.

    The other proposition necessary for this discussion we demon- strate as follows. Let us draw a parabola whose axis ca is pro- longed upwards to a point d; from any point b draw the line be

    parallel to the base of the parabola; if now the point d is chosen so that da = ca, then, I say, the straight line drawn through the points b and d will be tangent to the parabola at b. For imagine, if possible, that this line cuts the par-

    abola above or that its prolonga- tion cuts it below, and through any

    point g in it draw the straight line fge. And since the square of fe is greater than the square of ge, the square of fe will bear a greater ratio to the square of be than the square of ge to that of be; and since, by the preceding proposition, the square of fe is to that of be as the line ea is to ca, it follows that the line ea

    will bear to the line ca a greater jlL. ratio than the square of ge to that W

    of be, or, than the square of ed to /[ that of cd (the sides of the triangles

    deg and deb being proportional). FlS- I07 But the line ea is to ca, or da, in the same ratio as four times the rectangle ea.ad is to four times the square of ad, or, what is the same, the square of cd, since this is four times the square of ad; hence four times the rectangle ea.ad bears to the square of cd

    a

    248 THE TWO NEW SCIENCES OF GALILEO

    a greater ratio than the square of ed to the square of c d; but that would make four times the rectangle ea.ad greater than the square of ed; which is false, the fact being just the oppo-

    site, because the two portions ea and ad of the line ed are not equal. Therefore the line db touches the parabola without cutting i . Q. E. D.

    SIMP. Your demonstration proceeds too rapidly and, it seems

    to me, you keep on assuming that all of Euclid’s theorems are [272]

    as familiar and available to me as his first axioms, which is far from true. And now this fact which you spring upon us, that four times the redtangle ea.ad is less than the square of de because the two portions ea and ad of the line de are not equal brings me little composure of mind, but rather leaves me in suspense.

    SALV. Indeed, all real mathematicians assume on the part of the reader perfect familiarity with at least the elements of Euclid; and here it is necessary in your case only to recall a proposition of the Second Book in which he proves that when a line is cut into equal and also into two unequal parts, the rec-

    tangle formed on the unequal parts is less than that formed on the equal (i. e., less than the square on half the line), by an amount which is the square of the difference between the equal and unequal segments. From this it is clear that the square of the whole line which is equal to four times the square of the half is greater than four times the rectangle of the unequal parts. In order to understand the following portions of this treatise it will be necessary to keep in mind the two elemental theorems from conic sections which we have just demonstrated; and these two theorems are indeed the only ones which the Author uses. We can now resume the text and see how he demonstrates his first proposition in which he shows that a body falling with a motion compounded of a uniform horizontal and a naturally accelerated [naturale descendente] one describes a semi-parabola.

    Let us imagine an elevated horizontal line or plane ab along which a body moves with uniform speed from a to b. Suppose

    this

    TV

    io8

    FOURTH DAY 249 this plane to end abruptly at b; then at this point the body will, on account of its weight, acquire also a natural motion down-

    wards along the perpendicular bn. Draw the line be along the plane ba to represent the flow, or measure, of time; divide this line into a number of segments, be, cd, de, representing equal intervals of time; from the points b, c, d, e, let fall lines which are

    parallel to the per- pendicular bn. On the

    first of these lay off any distance ci, on the second a distance four

    times as long, df; on

    . [273] the third, one nine times as long, eh; and so on, in proportion to the squares of cb, db, eb, or, we may say, in the squared ratio of these same lines. Accordingly we see that while the body moves from b to c with uniform speed, it also falls perpendicularly through the distance ci, and at the end of the time-interval be finds itself at the point i. In like manner at the end of the time-interval bd, which is the double of be, the vertical fall will be four times the first distance ci; for it has been shown in a previous discussion that the distance traversed by a freely falling body varies as the square of the time; in like manner the space eh traversed during the time be will be nine times ci; thus it is evident that the distances eh, df, ci will be to one another as the squares of the lines be, bd, be. Now from the points i, f, h draw the straight lines io, fg, hi parallel to be; these lines kl,fg, io are equal to eb, db and eb, respectively; so also are the lines bo, bg, bl respectively equal to ci, df, and eh. The square of hi is to that of fg as the line Ib is to bg; and the square of fg is to that of io as gb is to bo; therefore the points i,f, h, lie on one and the same parabola. In like manner it may be shown that, if we take equal time-intervals of any size whatever, and if we imagine the particle to be carried by a similar compound motion,

    the

    THE TWO NEW SCIENCES OF GALILEO

    the positions of this particle, at the ends of these time-intervals, will lie on one and the same parabola. Q. E. D.

    SALV. This conclusion follows from the converse of the first of the two propositions given above. For, having drawn a parabola through the points b and h, any other two points,/ and i, not falling on the parabola must lie either within or without; consequently the line/g is either longer or shorter than the line which terminates on the parabola. Therefore the square of hi will not bear to the square of fg the same ratio as the line Ib to bg, but a greater or smaller; the fact is, however, that the square of hi does bear this same ratio to the square of fg. Hence the point/ does lie on the parabola, and so do all the others.

    SAGR. One cannot deny that the argument is new, subtle and conclusive, resting as it does upon this hypothesis, namely, that the horizontal motion remains uniform, that the vertical motion continues to be accelerated downwards in proportion to the square of the time, and that such motions and velocities as these combine without altering, disturbing, or hindering each

    other,* so that as the motion proceeds the path of the projectile does not change into a different curve: but this, in my opinion,

    . [274] is impossible. For the axis of the parabola along which we imagine the natural motion of a falling body to take place stands perpendicular to a horizontal surface and ends at the center of the earth; and since the parabola deviates more and more from its axis no projectile can ever reach the center of the earth or, if it does, as seems necessary, then the path of the projectile must transform itself into some other curve very different from the

    parabola. SIMP. To these difficulties, I may add others. One of these is

    that we suppose the horizontal plane, which slopes neither up nor down, to be represented by a straight line as if each point on this line were equally distant from the center, which is not the case; for as one starts from the middle [of the line] and goes toward either end, he departs farther and farther from the center [of the earth] and is therefore constantly going uphill. Whence it follows that the motion cannot remain uniform

    * A very near approach to Newton’s Second Law of Motion. [Trans.]

    FOURTH DAY 251 through any distance whatever, but must continually diminish. Besides, I do not see how it is possible to avoid the resistance of the medium which must destroy the uniformity of the horizon-

    tal motion and change the law of acceleration of falling bodies. These various difficulties render it highly improbable that a result derived from such unreliable hypotheses should hold true in practice.

    SALV. All these difficulties and objections which you urge are so well founded that it is impossible to remove them; and, as for me, I am ready to admit them all, which indeed I think our Author would also do. I grant that these conclusions proved in the abstract will be different when applied in the concrete and will be fallacious to this extent, that neither will the horizon-

    tal motion be uniform nor the natural acceleration be in the

    ratio assumed, nor the path of the projectile a parabola, etc. But, on the other hand, I ask you not to begrudge our Author that which other eminent men have assumed even if not strictly true. The authority of Archimedes alone will satisfy everybody. In his Mechanics and in his first quadrature of the parabola he takes for granted that the beam of a balance or steelyard is a straight line, every point of which is equidistant from the common center of all heavy bodies, and that the cords by which heavy bodies are suspended are parallel to each other.

    Some consider this assumption permissible because, in prac- tice, our instruments and the distances involved are so small in

    comparison with the enormous distance from the center of the earth that we may consider a minute of arc on a great circle as a straight line, and may regard the perpendiculars let fall from its two extremities as parallel. For if in adlual practice one had to

    . consider such small quantities, it would be necessary first of all to criticise the architects who presume, by use of a plumbline, to erect high towers with parallel sides. I may add that, in all their discussions, Archimedes and the others considered them-

    selves as located at an infinite distance from the center of the

    earth, in which case their assumptions were not false, and therefore their conclusions were absolutely correct. When we

    wish

    252 THE TWO NEW SCIENCES OF GALILEO

    wish to apply our proven conclusions to distances which, though finite, are very large, it is necessary for us to infer, on the basis of demonstrated truth, what correction is to be made for the fact that our distance from the center of the earth is not really infinite, but merely very great in comparison with the small dimensions of our apparatus. The largest of these will be the

    range of our projectiles — and even here we need consider only the artillery — which, however great, will never exceed four of those miles of which as many thousand separate us from the center of the earth; and since these paths terminate upon the surface of the earth only very slight changes can take place in their parabolic figure which, it is conceded, would be greatly altered if they terminated at the center of the earth. As to the perturbation arising from the resistance of the

    medium this is more considerable and does not, on account of its manifold forms, submit to fixed laws and exact description. Thus if we consider only the resistance which the air offers to the motions studied by us, we shall see that it disturbs them all and disturbs them in an infinite variety of ways corresponding to the infinite variety in the form, weight, and velocity of the pro-

    jectiles. For as to velocity, the greater this is, the greater will be the resistance offered by the air; a resistance which will be greater as the moving bodies become less dense [men gravi\. So that although the falling body ought to be displaced [andare accelerandosi\ in proportion to the square of the duration of its motion, yet no matter how heavy the body, if it falls from a very considerable height, the resistance of the air will be such as to prevent any increase in speed and will render the motion

    [276]

    uniform; and in proportion as the moving body is less dense [men grave} this uniformity will be so much the more quickly attained and after a shorter fall. Even horizontal motion which, if no impediment were offered, would be uniform and constant is altered by the resistance of the air and finally ceases; and here again the less dense \piu leggiero] the body the quicker the process. Of these properties [accuLenli\ of weight, of velocity, and also of form [figura], infinite in number, it is not possible to

    FOURTH DAY 253 give any exact description; hence, in order to handle this matter in a scientific way, it is necessary to cut loose from these difficul-

    ties; and having discovered and demonstrated the theorems, in the case of no resistance, to use them and apply them with such limitations as experience will teach. And the advantage of this method will not be small; for the material and shape of the projectile may be chosen, as dense and round as possible, so that it will encounter the least resistance in the medium. Nor

    will the spaces and velocities in general be so great but that we shall be easily able to correct them with precision.

    In the case of those projectiles which we use, made of dense [grave] material and round in shape, or of lighter material and cylindrical in shape, such as arrows, thrown from a sling or crossbow, the deviation from an exact parabolic path is quite insensible. Indeed, if you will allow me a little greater liberty, I can show you, by two experiments, that the dimensions of our apparatus are so small that these external and incidental re-

    sistances, among which that of the medium is the most con- siderable, are scarcely observable.

    I now proceed to the consideration of motions through the air, since it is with these that we are now especially concerned; the resistance of the air exhibits itself in two ways: first by offering greater impedance to less dense than to very dense bodies, and secondly by offering greater resistance to a body in rapid motion than to the same body in slow motion.

    Regarding the first of these, consider the case of two balls having the same dimensions, but one weighing ten or twelve times as much as the other; one, say, of lead, the other of oak, both allowed to fall from an elevation of 150 or 200 cubits.

    Experiment shows that they will reach the earth with slight difference in speed, showing us that in both cases the retardation caused by the air is small; for if both balls start at the same moment and at the same elevation, and if the leaden one be slightly retarded and the wooden one greatly retarded, then the former ought to reach the earth a considerable distance in

    advance of the latter, since it is ten times as heavy. But this [2771

    does

    254 THE TWO NEW SCIENCES OF^ GALILEO does not happen; indeed, the gain in distance of one over the other does not amount to the hundredth part of the entire fall. And in the case of a ball of stone weighing only a third or half as much as one of lead, the difference in their times of reaching the earth will be scarcely noticeable. Now since the speed [impeto] acquired by a leaden ball in falling from a height of 200 cubits is so great that if the motion remained uniform the ball would, in an interval of time equal to that of the fall, traverse 400 cubits, and since this speed is so considerable in comparison with those which, by use of bows or other machines except fire arms, we are able to give to our projectiles, it follows that we may, without sensible error, regard as absolutely true those propositions which we are about to prove without considering the resistance of the medium.

    Passing now to the second case, where we have to show that the resistance of the air for a rapidly moving body is not very much greater than for one moving slowly, ample proof is given by the following experiment. Attach to two threads of equal

    length — say four or five yards — two equal leaden balls and suspend them from the ceiling; now pull them aside from the perpendicular, the one through 80 or more degrees, the other through not more than four or five degrees; so that, when set free, the one falls, passes through the perpendicular, and de-

    scribes large but slowly decreasing arcs of 160, 150, 140 degrees, etc. ; the other swinging through small and also slowly diminish-

    ing arcs of 10, 8, 6, degrees, etc. In the first place it must be remarked that one pendulum

    passes through its arcs of 1 80°, 160°, etc., in the same time that the other swings through its 10°, 8°, etc., from which it follows that the speed of the first ball is 16 and 18 times greater than that of the second. Accordingly, if the air offers more resistance to the high speed than to the low, the frequency of vibration in

    the large arcs of 180° or 160°, etc., ought to be less than in the small arcs of 10°, 8°, 4°, etc., and even less than in arcs of 2°, or i°; but this prediction is not verified by experiment; because if two persons start to count the vibrations, the one the large, the other the small, they will discover that after counting tens

    and

    FOURTH DAY 255 and even hundreds they will not differ by a single vibration, not even by a f raction of one.

    [278]

    This observation justifies the two following propositions,

    namely, that vibrations of very large and very small amplitude all occupy the same time and that the resistance of the air does not affect motions of high speed more than those of low speed, contrary to the opinion hitherto generally entertained.

    SAGR. On the contrary, since we cannot deny that the air hinders both of these motions, both becoming slower and finally vanishing, we have to admit that the retardation occurs in the same proportion in each case. But how? How, indeed, could the resistance offered to the one body be greater than that offered to the other except by the impartation of more momen-

    tum and speed [impeto e velocita] to the fast body than to the slow? And if this is so the speed with which a body moves is at once the cause and measure [cagione e misura] of the resistance which it meets. Therefore, all motions, fast or slow, are hin-

    dered and diminished in the same proportion; a result, it seems to me, of no small importance.

    SALV. We are able, therefore, in this second case to say that the errors, neglecting those which are accidental, in the results which we are about to demonstrate are small in the case of our

    machines where the velocities employed are mostly very great and the distances negligible in comparison with the semi- diameter of the earth or one of its great circles.

    SIMP. I would like to hear your reason for putting the pro- jectiles of ire arms, i. e., those using powder, in a different class

    from the projectiles employed in bows, slings, and crossbows, on the ground of their not being equally subject to change and resistance from the air.

    SALV. I am led to this view by the excessive and, so to speak, supernatural violence with which such projectiles are launched; for, indeed, it appears to me that without exaggeration one might say that the speed of a ball fired either from a musket or from a piece of ordnance is supernatural. For if such a ball be allowed to fall from some great elevation its speed will, owing to the

    resistance

    256 THE TWO NEW SCIENCES OF GALILEO resistance of the air, not go on increasing indefinitely; that which happens to bodies of small density in falling through short distances — I mean the reduction of their motion to uniformity- will also happen to a ball of iron or lead after it has fallen a few thousand cubits; this terminal or final speed [terminata velocita] is the maximum which such a heavy body can naturally acquire [279]

    in falling through the air. This speed I estimate to be much smaller than that impressed upon the ball by the burning pow- der.

    An appropriate experiment will serve to demonstrate this facft. From a height of one hundred or more cubits fire a gun [archibuso] loaded with a lead bullet, vertically downwards upon a stone pavement; with the same gun shoot against a similar stone from a distance of one or two cubits, and observe which of the two balls is the more flattened. Now if the ball which has come from the greater elevation is found to be the less flattened of the two, this will show that the air has hin-

    dered and diminished the speed initially imparted to the bullet by the powder, and that the air will not permit a bullet to ac-

    quire so great a speed, no matter from what height it falls; for if the speed impressed upon the ball by the fire does not exceed that acquired by it in falling freely [naturalmente] then its down-

    ward blow ought to be greater rather than less. This experiment I have not performed, but I am of the opinion

    that a musket-ball or cannon-shot, falling from a height as great as you please, will not deliver so strong a blow as it would if fired into a wall only a few cubits distant, i. e., at such a short range that the splitting or rending of the air will not be sufficient to rob the shot of that excess of supernatural violence given it by the powder.

    The enormous momentum [impeto] of these violent shots may cause some deformation of the trajectory, making the beginning of the parabola flatter and less curved than the end; but, so far as our Author is concerned, this is a matter of small consequence in practical operations, the main one of which is the preparation of a table of ranges for shots of high elevation, giving the dis-

    tance

    FOURTH DAY 257 tance attained by the ball as a function of the angle of eleva-

    tion; and since shots of this kind are fired from mortars [mortari] using small charges and imparting no supernatural momentum [impeto sopranaturale] they follow their prescribed paths very exactly. But now let us proceed with the discussion in which the

    Author invites us to the study and investigation of the motion of a body [impeto del mobile] when that motion is compounded of two others; and first the case in which the two are uniform, the one horizontal, the other vertical.

    [280] THEOREM II, PROPOSITION II

    When the motion of a body is the resultant of two uniform motions, one horizontal, the other perpendicular, the square of the resultant momentum is equal to the sum of the

    squares of the two component momenta.* Let us imagine any body urged by two uniform motions and

    let ab represent the vertical displacement, while be represents the displacement which, in the same interval of time, takes place in a horizontal direc-

    tion. If then the distances ab and be are c

    traversed, during the same time-interval, ‘ iwith uniform motions the corresponding FlS- I09 ‘momenta will be to each other as the distances ab and be are to

    ‘each other; but the body which is urged by these two motions describes the diagonal ac; its momentum is proportional to ac. Also the square of ac is equal to the sum of the squares of ab and be. Hence the square of the resultant momentum is equal to the sum of the squares of the two momenta ab and be. Q. E. D.

    SIMP. At this point there is just one slight difficulty which needs to be cleared up; for it seems to me that the conclusion

    * In the original this theorem reads as follows: ” Si aliquod mobile duplici motu esquabili moveatur, nempe orizontali et

    “perpendicularly impetus seu momentum lationis ex utroque motu com- posites erit potentia czqualis ambobus momentis priorum motuum”

    For the justification of this translation of the word “potentia” and of the use of the adjective “resultant” see p. 266 below. [Trans.]

    258 THE TWO NEW SCIENCES OF GALILEO

    just reached contradicts a previous proposition * in which it is claimed that the speed [impeto] of a body coming from a to b is equal to that in coming from a to c; while now you conclude that the speed [impeto] at c is greater than that at b.

    SALV. Both propositions, Simplicio, are true, yet there is a great difference between them. Here we are speaking of a body urged by a single motion which is the resultant of two uniform motions, while there we were speaking of two bodies each urged with naturally accelerated motions, one along the vertical ab the other along the inclined plane ac. Besides the time-intervals were there not supposed to be equal, that along the incline ac being greater than that along the vertical ab; but the motions of which we now speak, those along ab, be, ac, are uniform and simultaneous.

    SIMP. Pardon me; I am satisfied; pray go on.

    SALV. Our Author next undertakes to explain what happens when a body is urged by a motion compounded of one which is horizontal and uniform and of another which is vertical but naturally accelerated; from these two components results the path of a projectile, which is a parabola. The problem is to determine the speed [impeto] of the projectile at each point. With this purpose in view our Author sets forth as follows the manner, or rather the method, of measuring such speed [impeto] along the path which is taken by a heavy body starting from rest and falling with a naturally accelerated motion.

    THEOREM III, PROPOSITION III

    Let the motion take place along the line ab, starting from rest at a, and in this line choose any point c. Let ac represent the time, or the measure of the time, required for the body to fall through the space ac; let ac also represent the velocity [impetus sen momentum] at c acquired by a fall through the distance ac. In the line ab select any other point b. The prob-

    lem now is to determine the velocity at b acquired by a body in falling through the distance ab and to express this in terms of the velocity at c, the measure of which is the length ac. Take

    * See p. 169 above. [Trans.]

    FOURTH DAY 259

    as a mean proportional between ac and ab. We shall prove that the velocity at b is to that at c as the length as is to the length ac. Draw the horizontal line cd, having twice the length of ac, and be, having twice the length of ba. It then follows, from the preceding theorems, that a body falling through the distance ac, and turned so as to move along the horizontal cd Fig. 1 10 with a uniform speed equal to that acquired on reaching c

    [282] will traverse the distance cd in the same interval of time as

    that required to fall with accelerated motion from a to c. Like- wise be will be traversed in the same time as ba. But the time

    of descent through ab is as; hence the horizontal distance be is also traversed in the time as. Take a point / such that the time as is to the time ac as be is to bl; since the motion along be is uniform, the distance bl, if traversed with the speed [mo- mentum celeritatis] acquired at b, will occupy the time ac; but in this same time-interval, ac, the distance cd is traversed with the speed acquired in c. Now two speeds are to each other as the distances traversed in equal intervals of time. Hence the speed at c is to the speed at b as cd is to bl. But since dc is to be as their halves, namely, as ca is to ba, and since be is to bl as ba is to sa; it follows that dc is to bl as ca is to sa. In other words, the speed at c is to that at b as ca is to sa, that is, as the time of fall through ab.

    The method of measuring the speed of a body along the direc- tion of its fall is thus clear; the speed is assumed to increase

    direcflly as the time. But before we proceed further, since this discussion is to

    deal with the motion compounded of a uniform horizontal one

    and one accelerated vertically downwards — the path of a pro- jecftile, namely, a parabola — it is necessary that we define some common standard by which we may estimate the velocity, or momentum \velocitatem, impetum sen momentum] of both mo-

    tions

    26o THE TWO NEW SCIENCES OF GALILEO

    tions; and since from the innumerable uniform velocities one only, and that not selected at random, is to be compounded with a velocity acquired by naturally accelerated motion, I can think of no simpler way of selecting and measuring this than to

    assume another of the same kind.* For the sake of clearness, draw the vertical line ac to meet the horizontal line be. Ac is

    the height and be the amplitude of the semi-parabola ab, which

    is the resultant of the two motions, one that of a body falling [283]

    from rest at a, through the distance ac, with naturally ac-

    celerated motion, the other a uniform motion along the horizon- tal ad. The speed acquired at c by a fall

    / through the distance ac is determined by the height ac; for the speed of a body fall-

    ing from the same elevation is always one and the same; but along the horizontal one may give a body an infinite number of uni-

    <k form speeds. However, in order that I may select one out of this multitude and sepa-

    rate it from the rest in a perfectly definite manner, I will extend the height ca upwards to e just as far as is necessary and will call

    this distance ae the “sublimity.” Imagine a body to fall from rest at e; it is clear that we may make its terminal speed at a the

    same as that with which the same body- Fig, in travels along the horizontal line ad; this

    speed will be such that, in the time of descent along ea, it will describe a horizontal distance twice the length of ea. This preliminary remark seems necessary.

    The reader is reminded that above I have called the horizontal

    line cb the “amplitude” of the semi-parabola ab; the axis ac of this parabola, I have called its “altitude”; but the line ea the fall along which determines the horizontal speed I have called

    the “sublimity.” These matters having been explained, I proceed with the demonstration.

    * Galileo here proposes to employ as a standard of velocity the terminal speed of a body falling freely from a given height. [ Trans.]

    FOURTH DAY 261

    SAGR. Allow me, please, to interrupt in order that I may point out the beautiful agreement between this thought of the Author and the views of Plato concerning the origin of the various uniform speeds with which the heavenly bodies revolve. The latter chanced upon the idea that a body could not pass from rest to any given speed and maintain it uniformly except by passing through all the degrees of speed intermediate between the given speed and rest. Plato thought that God, after having created the heavenly bodies, assigned them the proper and uniform speeds with which they were forever to revolve; and that He made them start from rest and move over definite dis-

    tances under a natural and rectilinear acceleration such as governs the motion of terrestrial bodies. He added that once these bodies had gained their proper and permanent speed, their rectilinear motion was converted into a circular one, the only

    . motion capable of maintaining uniformity, a motion in which the body revolves without either receding from or approaching its desired goal. This conception is truly worthy of Plato; and it is to be all the more highly prized since its underlying princi-

    ples remained hidden until discovered by our Author who re- moved from them the mask and poetical dress and set forth the

    idea in correct historical perspective. In view of the fact that astronomical science furnishes us such complete information concerning the size of the planetary orbits, the distances of these bodies from their centers of revolution, and their velocities, I cannot help thinking that our Author (to whom this idea of Plato was not unknown) had some curiosity to discover whether

    or not a definite “sublimity” might be assigned to each planet, such that, if it were to start from rest at this particular height and to fall with naturally accelerated motion along a straight line, and were later to change the speed thus acquired into uniform motion, the size of its orbit and its period of revolution would be those actually observed.

    SALV. I think I remember his having told me that he once made the computation and found a satisfactory correspondence with observation. But he did not wish to speak of it, lest in

    view

    262 THE TWO NEW SCIENCES OF GALILEO

    view of the odium which his many new discoveries had already brought upon him, this might be adding fuel to the fire. But if any one desires such information he can obtain it for himself from the theory set forth in the present treatment. We now proceed with the matter in hand, which is to prove:

    PROBLEM I, PROPOSITION IV

    To determine the momentum of a projectile at each particular point in its given parabolic path.

    Let bee be the semi-parabola whose amplitude is cd and whose height is db, which latter extended upwards cuts the tan-

    gent of the parabola ca in a. Through the vertex draw the horizontal line bi parallel to cd. Now if the amplitude cd is equal to the entire height da, then bi will be equal to ba and also to bd; and if we take ab as the measure of the time re-

    quired for fall through the distance ab and also of the momen- tum acquired at b in consequence of its fall from rest at a, then

    if. we turn into a horizontal direction the momentum acquired by fall through ab [impetum ab] the space traversed in the same interval of time will be represented by dc which is twice bi. But a body which falls from rest at b along the line bd will during

    the same time-interval fall through the height of the parabola [285]

    bd. Hence a body falling from rest at a, turned into a horizontal direction with the speed ab will traverse a space equal to dc. Now if one superposes upon this motion a fall along bd, travers-

    ing the height bd while the parabola be is described, then the momentum of the body at the terminal point c is the resultant of a uniform horizontal momentum, whose value is represented by ab, and of another momentum acquired by fall from b to the terminal point d or c; these two momenta are equal. If, there-

    fore, we take ab to be the measure of one of these momenta, say, the uniform horizontal one, then bi, which is equal to bd, will represent the momentum acquired at d or c; and ia will represent the resultant of these two momenta, that is, the total

    momentum with which the projectile, travelling along the pa- rabola, strikes at c.

    With

    FOURTH DAY 263

    With this in mind let us take any point on the parabola, say e, and determine the momentum with which the projectile passes that point. Draw the horizontal ef and take bg a mean proportional between bd and bf. Now since ab, or bd, is as-

    sumed to be the measure of the

    time and of the momentum [mo- mentum velocitatis] acquired by fall-

    ing from rest at b through the dis- tance bdy it follows that bg will

    measure the time and also the

    momentum [impetus] acquired at / by fall from b. If therefore we lay off bo, equal to bg, the diagonal line joining a and o will represent the-

    momentum at the point e; because -‘ the length ah has been assumed to represent the momentum at b Fig- which, after diversion into a horizontal direction, remains con-

    stant; and because bo measures the momentum at / or e, ac- quired by fall, from rest at b, through the height bf. But the

    square of ao equals the sum of the squares of ab and bo. Hence the theorem sought.

    SAGR. The manner in which you compound these different momenta to obtain their resultant strikes me as so novel that my mind is left in no small confusion. I do not refer to the composition of two uniform motions, even when unequal, and when one takes place along a horizontal, the other along a vertical direction; because in this case I am thoroughly con-

    vinced that the resultant is a motion whose square is equal to the sum of the squares of the two components. The confusion arises when one undertakes to compound a uniform horizontal motion with a vertical one which is naturally accelerated. I

    trust, therefore, we may pursue this discussion more at length. [286]

    SIMP. And I need this even more than you since I am not yet as clear in my mind as I ought to be concerning those funda-

    mental propositions upon which the others rest. Even in the case

    264 THE TWO NEW_ SCIENCES OF GALILEO case of the two uniform motions, one horizontal, the other perpendicular, I wish to understand better the manner in which you obtain the resultant from the components. Now, Salviati, you understand what we need and what we desire.

    SALV. Your request is altogether reasonable and I will see whether my long consideration of these matters will enable me to make them clear to you. But you must excuse me if in the explanation I repeat many things already said by the Author.

    Concerning motions and their velocities or momenta [movi- menti e lor velocita o impeti\ whether uniform or naturally ac-

    celerated, one cannot speak definitely until he has established a measure for such velocities and also for time. As for time we have the already widely adopted hours, first minutes and second minutes. So for velocities, just as for intervals of time, there is need of a common standard which shall be understood and

    accepted by everyone, and which shall be the same for all. As has already been stated, the Author considers the velocity of a freely falling body adapted to this purpose, since this velocity increases according to the same law in all parts of the world; thus for instance the speed acquired by a leaden ball of a pound weight starting from rest and falling vertically through the

    height of, say, a spear’s length is the same in all places; it is therefore excellently adapted for representing the momentum [impeto] acquired in the case of natural fall.

    It still remains for us to discover a method of measuring momentum in the case of uniform motion in such a way that all who discuss the subject will form the same conception of its size and velocity [grandezza e velocita]. This will prevent one person from imagining it larger, another smaller, than it really is; so that in the composition of a given uniform motion with one which is accelerated different men may not obtain different values for the resultant. In order to determine and represent

    such a momentum and particular speed [impeto e velocita partico- lare] our Author has found no better method than to use the momentum acquired by a body in naturally accelerated motion.

    [287].

    The speed of a body which has in this manner acquired any momentum

    FOURTH DAY 265 momentum whatever will, when converted into uniform motion, retain precisely such a speed as, during a time-interval equal to that of the fall, will carry the body through a distance equal to twice that of the fall. But since this matter is one which is

    fundamental in our discussion it is well that we make it per- fedlly clear by means of some particular example.

    Let us consider the speed and momentum acquired by a body falling through the height, say, of a spear [picca] as a standard which we may use in the measurement of other speeds and momenta as occasion demands; assume for instance that the

    time of such a fall is four seconds [minuti secondi d’ora]; now in order to measure the speed acquired from a fall through any other height, whether greater or less, one must not conclude that these speeds bear to one another the same ratio as the

    ‘heights of fall; for instance, it is not true that a fall through four times a given height confers a speed four times as great as that acquired by descent through the given height; because the speed of a naturally accelerated motion does not vary in pro-

    portion to the time. As has been shown above, the ratio of the spaces is equal to the square of the ratio of the times.

    If, then, as is often done for the sake of brevity, we take the same limited straight line as the measure of the speed, and of the time, and also of the space traversed during that time, it follows that the duration of fall and the speed acquired by the same body in passing over any other ,

    distance, is not represented by this second distance, but ” by a mean proportional between the two distances.

    This I can better illustrate by an example. In the ver- .J.J ^ical line ac, lay off the portion ab to represent the dis-

    nce traversed by a body falling freely with acceler- ted motion: the time of fall may be represented by any imited straight line, but for the sake of brevity, we shall represent it by the same length ab; this length may also

    employed as a measure of the momentum and speed “• .cquired during the motion; in short, let ab be a measure of he various physical quantities which enter this discussion. Having agreed arbitrarily upon ab as a measure of these

    three

    266 THE TWO NEW SCIENCES OF GALILEO

    three different quantities, namely, space, time, and momentum,

    our next task is to find the time required for fall through a [288]

    given vertical distance ac, also the momentum acquired at the terminal point c, both of which are to be expressed in terms of the time and momentum represented by ab. These two required quantities are obtained by laying off ad, a mean proportional between ab and ac; in other words, the time of fall from a to c is represented by ad on the same scale on which we agreed that the time of fall from a tab should be represented by ab. In like manner we may say that the momentum [impeto o grado di velocita] acquired at c is related to that acquired at b, in the same manner that the line ad is related to ab, since the velocity varies directly as the time, a conclusion, which although employed as a postulate in Proposition III, is here amplified by the Author.

    This point being clear and well-established we pass to the consideration of the momentum [impeto] in the case of two compound motions, one of which is compounded of a uniform horizontal and a uniform vertical motion, while the other is compounded of a uniform horizontal and a naturally accelerated vertical motion. If both components are uniform, and one at right angles to the other, we have already seen that the square of the resultant is obtained by adding the squares of the compo-

    nents [p. 257] as will be clear from the following illustration. Let us imagine a body to move along the vertical ab with a

    uniform momentum [impeto] of 3, and on reaching b to move a toward c with a momentum [velocita ed

    impeto] of 4, so that during the same time- interval it will traverse 3 cubits along the vertical and 4 along the horizontal. But a

    t particle which moves with the resultant ve- Fig. 114 locity [velocita] will, in the same time, trav-

    erse the diagonal ac, whose length is not 7 cubits — the sum of ab (3) and be (4) — but 5, which is in potenza equal to the sum of 3 and 4, that is, the squares of 3 and 4 when added make 25, which is the square of ac, and is equal to the sum of the squares

    FOURTH DAY 267

    of ab and be. Hence ac is represented by the side — or we may say the root — of a square whose area is 25, namely 5.

    As a fixed and certain rule for obtaining the momentum which [289]

    results from two uniform momenta, one vertical, the other horizontal, we have therefore the following: take the square of each, add these together, and extra (ft the square root of the sum, which will be the momentum resulting from the two. Thus, in the above example, the body which in virtue of its vertical motion would strike the horizontal plane with a momentum [forza] of 3, would owing to its horizontal motion alone strike at if with a momentum of 4; but if the body strikes with a momen-

    tum which is the resultant of these two, its blow will be that of a body moving with a momentum [velocita e forza] of 5 ; and such a blow will be the same at all points of the diagonal ac, since its components are always the same and never increase or diminish.

    Let us now pass to the consideration of a uniform horizontal motion compounded with the vertical motion of a freely falling body starting from rest. It is at once clear that the diagonal which represents the motion compounded of these two is not a

    straight line, but, as has been demonstrated, a semi-parabola, in which the momentum [impeto] is always increasing because the speed [velocita] of the vertical component is always increas-

    ing. Wherefore, to determine the momentum [impeto] at any given point in the parabolic diagonal, it is necessary first to £x upon the uniform horizontal momentum [impeto] and then,

    [treating the body as one falling freely, to find the vertical momentum at the given point; this latter can be determined t>nly by taking into account the duration of fall, a consideration which does not enter into the composition of two uniform mo- ions where the velocities and momenta are always the same; nit here where one of the component motions has an initial value of zero and increases its speed [velocita] in direcft proportion to the time, it follows that the time must determine the speed velocita] at the assigned point. It only remains to obtain the momentum resulting from these two components (as in the case of uniform motions) by placing the square of the resultant equal to

    268 THE TWO NEW SCIENCES OF GALILEO

    to the sum of the squares of the two components. But her again it is better to illustrate by means of an example. On the vertical ac lay off any portion ab which we shall em

    ploy as a measure of the space traversed by a body falling freely along the perpendicular, likewise as a measure of the time and also of the speed [grado di velocita] or, we may say, of the mo-

    menta [impeti\. It is at once clear that if the momentum of a

    [290] body at b, after having fallen from rest at a, be diverted along the horizontal direction bd, with uniform motion, its speed will be such that, during the time-interval ab, it will traverse a distance which is represented by the line bd and which is twice as

    great as ab. Now choose a 4 point c, such that be shall be equal to ab, and through c draw the line ce equal and parallel to bd; through the points b and e draw the pa-

    rabola bei. And since, during the time-interval ab, the hori-

    zontal distance bd or ce, double the length ab, is traversed with

    c the momentum ab, and since

    during an equal time-interval / the vertical distance be is trav-

    ersed, the body acquiring at c a momentum represented by

    the same horizontal, bd, it fol-

    t

    IX5

    lows that during the time ab the body will pass from b to e along the parabola be, and will reach e with a momentum compounded of two momenta each equal to ab. And since one of these is horizontal and the other vertical, the square of the resultant mo-

    mentum is equal to the sum of the squares of these two compo- nents, i. e., equal to twice either one of them.

    Therefore, if we lay off the distance bf, equal to ba, and draw the diagonal af, it follows that the momentum [impeto e per- cossa] at e will exceed that of a body at b after having fallen from

    FOURTH DAY 269

    a, or what is the same thing, will exceed the horizontal momen- tum [percossa deirimpeto] along bd, in the ratio of of to ab.

    Suppose now we choose for the height of fall a distance bo which is not equal to but greater than ab, and suppose that bg represents a mean proportional between ba and bo; then, still re-

    taining ba as a measure of the distance fallen through, from rest at a, to b9 also as a measure of the time and of the momentum which the falling body acquires at b, it follows that bg will be the measure of the time and also of the momentum which the

    body acquires in falling from b to o. Likewise just as the momen- tum ab during the time ab carried the body a distance along the

    horizontal equal to twice ab, so now, during the time-interval bg, the body will be carried in a horizontal direction through a distance which is greater in the ratio of bg to ba. Lay off Ib equal to bg and draw the diagonal al, from which we have a quantity compounded of two velocities [impeti\ one horizontal, the other vertical ; these determine the parabola. The horizontal and uniform velocity is that acquired at b in falling from a; the other is that acquired at o, or, we may say, at i, by a body falling through the distance bo, during a time measured by the line bg,

    [291] which line bg also represents the momentum of the body. And in like manner we may, by taking a mean proportional between the two heights, determine the momentum [impeto] at the extreme end of the parabola where the height is less than the sublimity ab; this mean proportional is to be drawn along the horizontal in place of bf, and also another diagonal in place of af, which diagonal will represent the momentum at the extreme end of the parabola. To what has hitherto been said concerning the momenta,

    blows or shocks of projectiles, we must add another very im- portant consideration; to determine the force and energy of the

    shock [forza ed energia della percossa] it is not sufficient to con- sider only the speed of the projectiles, but we must also take into

    account the nature and condition of the target which, in no small degree, determines the efficiency of the blow. First of all it is well known that the target suffers violence from the speed

    [velocita]

    270 THE TWO NEW SCIENCES OF GALILEO

    [velocita] of the projedlile in proportion as it partly or entirely stops the motion; because if the blow falls upon an object which yields to the impulse [velocita del percuziente] without resistance such a blow will be of no effect; likewise when one attacks his enemy with a spear and overtakes him at an instant when he is fleeing with equal speed there will be no blow but merely a harmless touch. But if the shock falls upon an object which yields only in part then the blow will not have its full effect, but the damage will be in proportion to the excess of the speed of the projectile over that of the receding body; thus, for exam-

    ple, if the shot reaches the target with a speed of 10 while the latter recedes with a speed of 4, the momentum and shock [impeto e percossa] will be represented by 6. Finally the blow will be a maximum, in so far as the projedlile is concerned, when the target does not recede at all but if possible completely re-

    sists and stops the motion of the projedlile. I have said in so far as the projectile is concerned because if the target should approach the projectile the shock of collision [colpo e Vincontro\ would be greater in proportion as the sum of the two speeds is greater than that of the projedlile alone. Moreover it is to be observed that the amount of yielding in

    the target depends not only upon the quality of the material, as regards hardness, whether it be of iron, lead, wool, etc., but

    [292] also upon its position. If the position is such that the shot strikes it at right angles, the momentum imparted by the blow [impeto del colpo] will be a maximum; but if the motion be oblique, that is to say slanting, the blow will be weaker; and more and more so in proportion to the obliquity; for, no matter how hard the material of the target thus situated, the entire momentum [impeto e moto] of the shot will not be spent and stopped; the projectile will slide by and will, to some extent, continue its motion along the surface of the opposing body.

    All that has been said above concerning the amount of momen- tum in the projedlile at the extremity of the parabola must be

    understood to refer to a blow received on a line at right angles to this parabola or along the tangent to the parabola at the given

    FOURTH DAY 271

    point; for, even though the motion has two components, one horizontal, the other vertical, neither will the momentum along the horizontal nor that upon a plane perpendicular to the horizontal be a maximum, since each of these will be received obliquely.

    SAGR. Your having mentioned these blows and shocks recalls to my mind a problem, or rather a question, in mechanics of which no author has given a solution or said anything which diminishes my astonishment or even partly relieves my mind. My difficulty and surprise consist in not being able to see

    whence and upon what principle is derived the energy and im- mense force [energia eforza immensa] which makes its appearance

    in a blow; for instance we see the simple blow of a hammer, weighing not more than 8 or 10 Ibs., overcoming resistances which, without a blow, would not yield to the weight of a body producing impetus by pressure alone, even though that body weighed many hundreds of pounds. I would like to discover a

    method of measuring the force [fprza] of such a percussion. I can hardly think it infinite, but incline rather to the view that it has its limit and can be counterbalanced and measured by other forces, such as weights, or by levers or screws or other mechanical instruments which are used to multiply forces in a manner which I satisfactorily understand.

    SALV. You are not alone in your surprise at this effect or in obscurity as to the cause of this remarkable property. I studied this matter myself for a while in vain; but my confusion merely increased until finally meeting our Academician I received from [293]

    him great consolation. First he told me that he also had for a long time been groping in the dark; but later he said that, after having spent some thousands of hours in speculating and con-

    templating thereon, he had arrived at some -notions which are far removed from our earlier ideas and which are remarkable

    for their novelty. And since now I know that you would gladly hear what these novel ideas are I shall not wait for you to ask but promise that, as soon as our discussion of projectiles is completed, I will explain all these fantasies, or if you please,

    vagaries

    272 THE TWO NEW SCIENCES OF GALILEO

    vagaries, as far as I can recall them from the words of our Academician. In the meantime we proceed with the proposi-

    tions of the author.

    PROPOSITION V, PROBLEM

    Having given a parabola, find the point, in its axis extended upwards, from which a particle must fall in order to describe this same parabola.

    Let ab be the given parabola, hb its amplitude, and he its axis extended. The problem is to find the point e from which a body must fall in order that, after the momentum which it acquires at a has been diverted into a horizontal direction, it will describe the parabola ab. Draw the horizontal ag, parallel to bh, and

    _e having laid off af equal to ah, draw the straight line bf which will be a tangent to the parab- ola at b, and will intersect the horizontal ag at g: choose e such that ag will be a mean propor-

    tional between af and ae. Now I say that e is the point above sought. That is, if a body falls

    r from rest at this point ey and if the momentum acquired at the

    II6 point a be diverted into a hori- zontal direction, and compounded with the momentum ac-

    quired at h in falling from rest at a, then the body will describe the parabola ab. For if we understand ea to be the measure of the time of fall from e to a, and also of the momentum acquired at a, then ag (which is a mean proportional between ea and af) will represent the time and momentum of fall from / to a or, what is the same thing, from a to h; and since a body falling from e, during the time ea, will, owing to the momentum acquired at a, traverse at uniform speed a horizontal distance which is twice ea, it follows that, the body will if impelled by the same momentum, during the time-interval ag traverse a distance equal to twice ag which is the half of bh. This is true because,

    in

    FOURTH DAY 273

    in the case of uniform motion, the spaces traversed vary di- rectly as the times. And likewise if the motion be vertical and

    start from rest, the body will describe the distance ah in the [294]

    time ag. Hence the amplitude bh and the altitude ah are trav- ersed by a body in the same time. Therefore the parabola ab

    will be described by a body falling from the sublimity of e.

    Q. E. F.

    COROLLARY

    Hence it follows that half the base, or amplitude, of the semi- parabola (which is one-quarter of the entire amplitude) is a mean proportional between its altitude and the sublimity from which a falling body will describe this same parabola.

    PROPOSITION VI, PROBLEM

    Given the sublimity and the altitude of a parabola, to find its amplitude.

    Let the line ac, in which lie the given altitude cb and sub- limity aby be perpendicular to

    the horizontal line cd. The prob- lem is to find the amplitude,

    along the horizontal cdy of the semi-parabola which is described with the sublimity ba and alti-

    tude be. Lay off cd equal to twice the mean proportional be-

    tween cb and ba. Then cd will

    be the amplitude sought, as is evident from the preceding prop- Jt osition. Fig. 117

    THEOREM. PROPOSITION VII

    If projectiles describe semi-parabolas of the same ampli- tude, the momentum required to describe that one whose

    amplitude is double its altitude is less than that required for any other.

    Let

    274 THE TWO NEW SCIENCES OF GALILEO

    Let bd be a semi-parabola whose amplitude cd is double its altitude cb; on its axis extended upwards lay off ba equal to its altitude be. Draw the line ad which will be a tangent to the parabola at d and will cut the horizontal line be at the point ^, making be equal to be and also to ba. It is evident that this parabola will be described by a projectile whose uniform horizon-

    tal momentum is that which it would acquire at b in falling from rest at a and whose naturally accelerated vertical momentum is that of the body falling to c, from rest at b. From this it follows

    Fig. 118

    that the momentum at the terminal point d, compounded of these two, is represented by the diagonal ae, whose square is equal to the sum of the squares of the two components. Now let gd be any other parabola whatever having the same ampli-

    tude cd, but whose altitude eg is either greater or less than the altitude be. Let hd be the tangent cutting the horizontal [295]

    through g at k. Select a point / such that hg:gk =gk:gl. Then from a preceding proposition [V], it follows that gl will be the

    height

    FOURTH DAY 275

    height from which a body must fall in order to describe the parabola gd.

    Let gm be a mean proportional between ab and gl; then gm will [Prop. IV] represent the time and momentum acquired at g by a fall from /; for ab has been assumed as a measure of both time and momentum. Again let gn be a mean proportional between be and eg; it will then represent the time and momen-

    tum which the body acquires at c in falling from g. If now we join m and n, this line mn will represent the momentum at d of the projectile traversing the parabola dg; which momentum is, I say, greater than that of the projectile travelling along the parabola bd whose measure was given by ae. For since gn has been taken as a mean proportional between be and gc; and since be is equal to be and also to kg (each of them being the half of dc) it follows that cg:gn =gn:gk, and as eg or (kg) is to gk so is

    ng2 to gk2: but by contraction hg:gk=gk:gl. Hence ng2: gk2=gk:gl. But gk:gl=gk2: gm2, since gm is a mean propor-

    tional between kg and gl. Therefore the three ̂ quares ng, kg,

    mg form a continued proportion, gn2: gk2 = gk2: gm.2 And the sum of the two extremes which is equal to the square of mn is greater than twice the square of gk; but the square of ae is double the square of gk. Hence the square of mn is greater than the square of ae and the length mn is greater than the length ae.

    Q. E. D.

    [296] COROLLARY

    Conversely it is evident that less momentum will be required to send a projectile from the terminal point d along the parabola bd than along any other parabola having an elevation greater or less than that of the parabola bd, for which the tangent at d

    makes an angle of 45° with the horizontal. From which it follows that if projectiles are fired from the terminal point d, all having the same speed, but each having a different elevation, the maximum range, i. e., amplitude of the semi-parabola or of the

    entire parabola, will be obtained when the elevation is 45°: the other

    276 THE TWO NEW SCIENCES OF GALILEO other shots, fired at angles greater or less will have a shorter range.

    SAGR. The force of rigid demonstrations such as occur only in mathematics fills me with wonder and delight. From ac-

    counts given by gunners, I was already aware of the fact that in the use of cannon and mortars, the maximum range, that is the one in which the shot goes farthest, is obtained when the

    elevation is 45° or, as they say, at the sixth point of the quad- rant; but to understand why this happens far outweighs the

    mere information obtained by the testimony of others or even by repeated experiment.

    SALV. What you say is very true. The knowledge of a single fact acquired through a discovery of its causes prepares the mind to understand and ascertain other facts without need of recourse to experiment, precisely as in the present case, where by argumentation alone the Author proves with certainty

    that the maximum range occurs when the elevation is 45°. He thus demonstrates what has perhaps never been observed in experience, namely, that of other shots those which exceed or

    fall short of 45° by equal amounts have equal ranges; so that if the balls have been fired one at an elevation of 7 points, the other at 5, they will strike the level at the same distance: the same is true if the shots are fired at 8 and at 4 points, at 9 and at

    3, etc. Now let us hear the demonstration of this. [297]

    THEOREM. PROPOSITION VIII

    The amplitudes of two parabolas described by projectiles fired with the same speed, but at angles of elevation which

    exceed and fall short of 45° by equal amounts, are equal to each other.

    In the triangle mcb let the horizontal side be and the vertical cm, which form a right angle at c, be equal to each other; then the angle mbc will be a semi-right angle; let the line cm be pro-

    longed to d, such a point that the two angles at b, namely mbe and mbd, one above and the other below the diagonal mb, shall be equal. It is now to be proved that in the case of two parabolas

    described

    FOURTH DAY 277 described by two projectiles fired from b with the same speed, one at the angle of ebc, the other at the angle of dbc, their am-

    plitudes will be equal. Now since the external angle bmc is equal to the sum of the internal angles mdb and dbm we may also equate to them the angle mbc; but if we re-

    place the angle dbm by mbe, then this same angle mbc is equal to the two mbe and bdc: and if we subtract from each side of this

    equation the angle mbe, we have the remain- der bdc equal to the remainder ebc. Hence

    the two triangles deb and bee are similar. Bisect the straight lines dc and ec in the points h and /: and draw the lines hi and fg parallel to the horizontal cb, and choose / such that dh:hi = ih:hl. Then the triangle ihl will be similar to ihd, and also to the triangle

    egf; and since ih and gf are equal, each being *%• * *9 half of be, it follows that hi is equal to fe and also to fc; and if we add to each of these the common part fh, it will be seen that ch is equal to//.

    Let us now imagine a parabola described through the points h and b whose altitude is he and sublimity hi. Its amplitude will be cb which is double the length hi since hi is a mean proportional between dh (or ch) and hi. The line db is tangent to the parabola at b, since ch is equal to hd. If again we imagine a parabola described through the points / and b, with a sublimity // and altitude fc, of which the mean proportional is fg, or one-half of cb, then, as before, will cb be the amplitude and the line eb a tangent at b; for ef andfc are equal.

    [298] But the two angles dbc and ebc9 the angles of elevation, differ

    by equal amounts from a 45° angle. Hence follows the proposi- tion.

    THEOREM. PROPOSITION IX

    The amplitudes of two parabolas are equal when their alti- tudes and sublimities are inversely proportional.

    Let

    Fig. 120

    278 THE TWO NEW SCIENCES OF GALILEO

    Let the altitude gf of the parabola fk bear to the altitude cb of the parabola bd the same ratio which the sublimity ba bears to the sublimity />; then I say the amplitude kg is equal to the amplitude dc. For since the first of these quantities, gf, bears to

    <Tthe second cb the same ratio which the third, ba, bears to the fourth fe, it follows that the area of the rectangle

    -gf.fe is equal to that of b the rectangle cb.ba; therefore squares which are equal to these rectangles are equal to each other.

    But [by Proposition VI] the square of half of gh is equal to the rectangle gf.fe; and the square of half of cd is equal to the rec-

    tangle cb.ba. Therefore these squares and their sides and the doubles of their sides are equal. But these last are the ampli-

    tudes gh and cd. Hence follows the proposition.

    LEMMA FOR THE FOLLOWING PROPOSITION

    If a straight line be cut at any point whatever and mean pro- portionals between this line and each of its parts be taken,

    the sum of the squares of these mean proportionals is equal to the square of the entire line.

    Let the line ab be cut at c. Then I say that the square of the mean proportional between ab and ac plus the square of the mean proportional between ab and cb is equal to the square of the whole line ab. This is evident as soon as we de-

    scribe a semicircle upon the entire line ab, al erect a perpendicular cd at c, and draw da and db. For da is a mean proportional between ab and ac while [299]

    db is a mean proportional between ab and be: and since the angle adb, inscribed in a semicircle, is a right angle the sum of

    the

    I21

    FOURTH DAY 279 the squares of the lines da and db is equal to the square of the entire line ab. Hence follows the proposition.

    THEOREM. PROPOSITION X

    The momentum [impetus sen momentum] acquired by a particle at the terminal point of any semi-parabola is equal to that which it would acquire in falling through a vertical distance equal to the sum of the sublimity and die altitude of the semi-parabola.*

    Let ab be a semi-parabola having a sublimity da and an altitude ac, the sum of which is the perpendicular dc. Now , I say the momentum of the particle at b is the same as that which it would acquire in falling freely from d to c. Let us take the length of dc itself as a measure of time and momentum, and lay off cf equal to the mean proportional between cd and da; also lay off ce a mean proportional between cd and ca. Now cf is the measure of the time and of the momentum acquired by fall, from rest at d, through the distance da; while ce is the time

    and momentum of fall, from rest at 0, through^ the distance ca; also the diagonal ef will repre-$ sent a momentum which is the resultant of these Fig. I22 two, and is therefore the momentum at the terminal point of the parabola, b. And since dc has been cut at some point a and since cf and

    ce are mean proportionals between the whole of cd and its parts, da and ac, it follows, from the preceding lemma, that the sum of the squares of these mean proportionals is equal to the square of the whole: but the square of ef is also equal to the sum of these same squares; whence it follows that the line ef is equal to dc.

    Accordingly the momentum acquired at c by a particle in falling from d is the same as that acquired at b by a particle traversing the parabola ab. Q. E. D.

    * In modern mechanics this well-known theorem assumes the following form: The speed of a projectile at any point is that produced by a fall from the directrix. [Trans.]

    280 THE TWO NEW SCIENCES OF GALILEO

    COROLLARY

    Hence it follows that, in the case of all parabolas where the sum of the sublimity and altitude is a constant, the momentum at the terminal point is a constant.

    PROBLEM. PROPOSITION XI

    Given the amplitude and the speed [impetus] at the ter- minal point of a semi-parabola, to find its altitude.

    Let the given speed be represented by the vertical line ab, and the amplitude by the horizontal line be; it is required to find the sublimity of the semi-parabola whose terminal speed is ab and amplitude be. From what precedes [Cor. Prop. V] it is clear that half the amplitude be is a mean proportional between

    [300] the altitude and sublimity of the parabola of which the terminal speed is equal, in accordance with the preceding proposition, to

    the speed acquired by a body in falling from rest at a through the distance ab.

    <7 Therefore the line ba must be cut at a point such that the rectangle formed by its two parts will be equal to the square of half be, namely bd. Necessarily, there-

    e fore, bd must not exceed the half of ba; for of all the rectangles formed by parts of a straight line the one of greatest area is obtained when the line is divided into

    two equal parts. Let e be the middle I point of the line ab; and now if bd be / equal to be the problem is solved; for be

    will be the altitude and ea the sublimity Flg- I23 of the parabola. (Incidentally we may

    observe a consequence already demonstrated, namely: of all parabolas described with any given terminal speed that for

    which the elevation is 45° will have the maximum amplitude.) But suppose that bd is less than half of ba which is to be

    divided

    FOURTH DAY 281

    divided in such a way that the rectangle upon its parts may be equal to the square of bd. Upon ea as diameter describe a semi-

    circle efa, in which draw the chord of, equal to bd: join fe and lay off the distance eg equal to fe. Then the rectangle bg.ga plus the square of eg will be equal to the square of ea, and hence also to the sum of the squares of af andfe. If now we subtract the equal squares of fe and ge there remains the rectangle bg.ga equal to the square of af, that is, of bd, a line which is a mean proportional between bg and ga; from which it is evident that the semi-parabola whose amplitude is be and whose terminal speed [impetus] is represented by ba has an altitude bg and a sublimity ga.

    If however we lay off bi equal to ga, then bi will be the alti- tude of the semi-parabola ic, and ia will be its sublimity. From

    the preceding demonstration we are able to solve the following problem.

    PROBLEM. PROPOSITION XII

    To compute and tabulate the amplitudes of all semi- parabolas which are described by projectiles fired with the same initial speed [impetus],

    From the foregoing it follows that, whenever the sum of the altitude and sublimity is a constant vertical height for

    any set of parabolas, these parabolas are described by pro- jectiles having the same initial speed; all vertical heights thus

    [301] obtained are therefore included between two parallel horizontal lines. Let cb represent a horizontal line and ab a vertical line of equal length; draw the diagonal ac; the angle acb will be one

    of 45° ; let d be the middle point of the vertical line ab. Then the semi-parabola dc is the one which is determined by the sub-

    limity ad and the altitude db, while its terminal speed at c is that which would be acquired at b by a particle falling from rest at a. If now ag be drawn parallel to be, the sum of the altitude and sublimity for any other semi-parabola having the same terminal speed will, in the manner explained, be equal to the distance between the parallel lines ag and be. Moreover, since

    it

    282 THE TWO NEW SCIENCES OF GALILEO

    it has already been shown that the amplitudes of two semi- parabolas are the same when their angles of elevation differ from

    45° by like amounts, it follows that the same computation which is employed for the larger elevation will serve also for the smaller.

    Let us also assume 10000 as the

    ® greatest amplitude for a parabola whose angle of elevation is 45°; this then will be the length of the line

    0 ba and the amplitude of the semi- a, parabola be. This number, 10000,

    is selected because in these calcula- tions we employ a table of tangents

    /in which this is the value of the

    tangent of 45°. And now, coming J down to business, draw the straight

    line ce making an acute angle ecb greater than acb: the problem now is to draw the semi-parabola to which the line ec is a tangent and for which the sum of the sublimity

    Fig. 124 and the altitude is the distance ba.

    Take the length of the tangent* be from the table of tan- gents, using the angle bee as an argument: let /be the middle

    point of be; next find a third proportional to bf and bi (the half of be), which is of necessity greater than ja.\ Call this fo. We have now discovered that, for the parabola inscribed

    [302] in the triangle ecb having the tangent ce and the amplitude cb, the altitude is bf and the sublimity fo. But the total length of bo exceeds the distance between the parallels ag and cb, while our problem was to keep it equal to this distance: for both the parabola sought and the parabola dc are described

    * The reader will observe that the word “tangent” is here used in a sense somewhat different from that of the preceding sentence. The

    “tangent ec” is a line which touches the parabola at c; but the “tan- gent eb” is the side of the right-angled triangle which lies opposite the

    angle ecb, a line whose length is proportional to the numerical value of the tangent of this angle. [Trans.}

    f This fact is demonstrated in the third paragraph below. [Trans.]

    FOURTH DAY 283 by projectiles fired from c with the same speed. Now since an infinite number of greater and smaller parabolas, similar to each other, may be described within the angle bee we must find another parabola which like cd has for the sum of its altitude and sublimity the height ba, equal to be.

    Therefore lay off cr so that, ob:ba =bc:cr; then cr will be the amplitude of a semi-parabola for which bee is the angle of eleva-

    tion and for which the sum of the altitude and sublimity is the distance between the parallels ga and cb, as desired. The process is therefore as follows: One draws the tangent of the given angle bee; takes half of this tangent, and adds to it the quantity, fo, which is a third proportional to the half of this tangent and the half of be; the desired amplitude cr is then found from the following proportion ob:ba=be:er. For example let the angle

    ecb be one of 50°; its tangent is 11918, half of which, namely bf, is 5959; half of be is 5000; the third proportional of these halves is 4195, which added to bf gives the value 10154 for bo. Further,

    as ob is to ah, that is, as 10154 ‘1S to loooo, so is be, or loooo (each being the tangent of 45°) to er, which is the amplitude sought and which has the value 9848, the maximum amplitude being be, or loooo. The amplitudes of the entire parabolas are double these, namely, 19696 and 20000. This is also the amplitude of a

    parabola whose angle of elevation is 40°, since it deviates by an equal amount from one of 45°.

    [303] SAGR. In order to thoroughly understand this demonstration

    I need to be shown how the third proportional of bf and bi is, as the Author indicates, necessarily greater than fa.

    SALV. This result can, I think, be obtained as follows. The square of the mean proportional between two lines is equal to the rectangle formed by these two lines. Therefore the square of bi (or of bd which is equal to hi) must be equal to the rectangle formed by fb and the desired third proportional. This third proportional is necessarily greater than fa because the rectangle formed by bf and fa is less than the square of bd by an amount equal to the square of df, as shown in Euclid, II. I. Besides it is to be observed that the point f, which is the middle point of the

    tangent

    284 THE TWO NEW SCIENCES OF GALILEO tangent eb, falls in general above a and only once at a; in which cases it is self-evident that the third proportional to the half of the tangent and to the sublimity bi lies wholly above a. But the Author has taken a case where it is not evident that the third proportional is always greater than fa, so that when laid off above the point/ it extends beyond the parallel ag. Now let us proceed. It will be worth while, by the use of this table, to compute another giving the altitudes of these semi- parabolas described by projectiles having the same initial speed. The construction is as follows : Amplitudes of semi-parabolas described with the same in-itial speed.

    By Amjad Izhar
    Contact: amjad.izhar@gmail.com
    https://amjadizhar.blog

  • Modern Physics Relativity: Space, Time, and Motion

    Modern Physics Relativity: Space, Time, and Motion

    This collection of texts explores the evolution of physics from classical mechanics to modern quantum theory. It begins by examining the limitations of classical physics when applied to phenomena at very high speeds or very small scales. The discussion covers topics like electromagnetism, relativity, and the concept of the aether, highlighting experimental evidence that challenged classical assumptions. It introduces special relativity with its postulates, consequences such as time dilation, and the role of the Lorentz transformation. Furthermore, the source addresses the development of quantum mechanics, including the wave-particle duality of light and matter, the Schrodinger equation, and its probabilistic nature. **Finally, it explores the structure of atoms, the quantization of energy, and the limits of absolute knowledge imposed by quantum mechanics.

    Physics Study Guide: Vectors, Electromagnetism, Quantum Mechanics, and Thermodynamics

    Quiz (Short Answer)

    1. Explain the difference between a dot product and a cross product, including the type of result each produces. A dot product results in a scalar (a single number), calculated using the magnitudes of the vectors and the cosine of the angle between them. A cross product yields a vector, whose magnitude depends on the magnitudes of the vectors and the sine of the angle between them and whose direction is perpendicular to both original vectors.
    2. State Newton’s First Law of Motion and briefly describe its significance. Newton’s First Law states that an object’s velocity remains constant unless acted upon by an external force. This law emphasizes the concept of inertia, establishing that objects naturally resist changes in their state of motion and requires an external influence to alter this state.
    3. Summarize Gauss’s Law for electric fields and what it implies. Gauss’s Law for electric fields states that the divergence of the electric field is proportional to the charge density. This implies that electric fields originate from and terminate on electric charges, directly linking the presence of charge to the existence of an electric field.
    4. What is the significance of Maxwell’s equations in understanding the nature of light? Maxwell’s equations demonstrate that light is an electromagnetic wave, a result derived from the equations in a region of space with no charges or currents. These equations link the behavior of electric and magnetic fields to the propagation of light, revealing its fundamental nature as an electromagnetic phenomenon.
    5. Explain what is meant by the “lifetime” of an unstable particle. The lifetime of an unstable particle refers to the characteristic time after which approximately 63.2% of a sample of those particles will have decayed. It is also equivalent to the average time that any randomly selected unstable particle will exist before decaying.
    6. Describe the key difference between relativistic and classical kinetic energy. Relativistic kinetic energy accounts for the increase in mass as an object approaches the speed of light, whereas classical kinetic energy assumes mass remains constant. At speeds much lower than the speed of light, the relativistic formula reduces to the classical formula (1/2)mv^2.
    7. How does the ideal gas law connect microscopic properties to macroscopic properties? The ideal gas law (PV=nRT) relates the macroscopic properties of pressure, volume, and temperature to the microscopic properties of the number of moles of gas. Through classical physics, one can relate the average kinetic energy of gas molecules to temperature, thus connecting microscopic motion to macroscopic measurements.
    8. Explain the concept of emissivity and its range of values. Emissivity describes a body’s efficiency in radiating or absorbing thermal radiation. Its value ranges from 0 (no emission) to 1 (perfect emission or a black body), indicating how effectively a surface emits energy compared to a theoretical perfect emitter.
    9. What is the Balmer series, and what did it reveal about the hydrogen atom? The Balmer series is a set of visible spectral lines emitted by hydrogen, described by a specific mathematical relationship. It suggested that the energy levels within the hydrogen atom are quantized, leading to the emission of photons with only certain discrete wavelengths.
    10. Explain how the de Broglie hypothesis contributed to the Bohr model of the atom. The de Broglie hypothesis, postulating that particles have wave-like properties, helped explain the Bohr model’s quantization of electron orbits. By suggesting that electron wavelengths must be integer multiples of the orbital circumference, it provided a physical basis for the allowed energy levels in the atom.

    Quiz Answer Key

    1. A dot product results in a scalar (a single number), calculated using the magnitudes of the vectors and the cosine of the angle between them. A cross product yields a vector, whose magnitude depends on the magnitudes of the vectors and the sine of the angle between them and whose direction is perpendicular to both original vectors.
    2. Newton’s First Law states that an object’s velocity remains constant unless acted upon by an external force. This law emphasizes the concept of inertia, establishing that objects naturally resist changes in their state of motion and requires an external influence to alter this state.
    3. Gauss’s Law for electric fields states that the divergence of the electric field is proportional to the charge density. This implies that electric fields originate from and terminate on electric charges, directly linking the presence of charge to the existence of an electric field.
    4. Maxwell’s equations demonstrate that light is an electromagnetic wave, a result derived from the equations in a region of space with no charges or currents. These equations link the behavior of electric and magnetic fields to the propagation of light, revealing its fundamental nature as an electromagnetic phenomenon.
    5. The lifetime of an unstable particle refers to the characteristic time after which approximately 63.2% of a sample of those particles will have decayed. It is also equivalent to the average time that any randomly selected unstable particle will exist before decaying.
    6. Relativistic kinetic energy accounts for the increase in mass as an object approaches the speed of light, whereas classical kinetic energy assumes mass remains constant. At speeds much lower than the speed of light, the relativistic formula reduces to the classical formula (1/2)mv^2.
    7. The ideal gas law (PV=nRT) relates the macroscopic properties of pressure, volume, and temperature to the microscopic properties of the number of moles of gas. Through classical physics, one can relate the average kinetic energy of gas molecules to temperature, thus connecting microscopic motion to macroscopic measurements.
    8. Emissivity describes a body’s efficiency in radiating or absorbing thermal radiation. Its value ranges from 0 (no emission) to 1 (perfect emission or a black body), indicating how effectively a surface emits energy compared to a theoretical perfect emitter.
    9. The Balmer series is a set of visible spectral lines emitted by hydrogen, described by a specific mathematical relationship. It suggested that the energy levels within the hydrogen atom are quantized, leading to the emission of photons with only certain discrete wavelengths.
    10. The de Broglie hypothesis, postulating that particles have wave-like properties, helped explain the Bohr model’s quantization of electron orbits. By suggesting that electron wavelengths must be integer multiples of the orbital circumference, it provided a physical basis for the allowed energy levels in the atom.

    Essay Questions

    1. Discuss the limitations of classical physics in explaining phenomena at the atomic level and how quantum mechanics addresses these limitations. Provide specific examples from the source material.
    2. Explain how Maxwell’s equations revolutionized our understanding of electromagnetism and the nature of light. What are the key implications of these equations?
    3. Describe the Bohr model of the atom and its significance in the development of quantum theory. How did the de Broglie hypothesis contribute to a more complete understanding of the atom?
    4. Explain the concept of wave-particle duality and provide experimental evidence that supports this concept.
    5. Discuss the relationship between temperature and the average kinetic energy of the constituents of a material body, and how this relationship leads to an understanding of heat energy transfer.

    Glossary of Key Terms

    • Vector: A quantity with both magnitude and direction.
    • Magnitude (of a vector): The length or size of a vector.
    • Unit Vector: A vector with a magnitude of one, used to indicate direction.
    • Dot Product: A scalar quantity resulting from the multiplication of two vectors, calculated as the product of their magnitudes and the cosine of the angle between them.
    • Cross Product: A vector quantity resulting from the multiplication of two vectors, with a magnitude equal to the product of their magnitudes and the sine of the angle between them, and a direction perpendicular to both vectors.
    • Newton’s First Law: An object at rest stays at rest, and an object in motion stays in motion with the same speed and in the same direction unless acted upon by an unbalanced force.
    • Charge Density: The amount of electric charge per unit volume.
    • Electric Field: A region around an electric charge where a force would be exerted on other charges.
    • Magnetic Field: A region around a moving electric charge where a force would be exerted on other moving charges.
    • Maxwell’s Equations: A set of four fundamental equations describing the behavior of electric and magnetic fields, and how they are related to electric charges and currents.
    • Gauss’s Law (Electric): The total electric flux out of a closed surface is equal to the charge enclosed divided by the permittivity.
    • Gauss’s Law (Magnetic): Magnetic monopoles do not exist; the total magnetic flux out of a closed surface is zero.
    • Faraday-Maxwell Law: A time-varying magnetic field creates an electric field.
    • Ampere-Maxwell Law: A magnetic field is created by electric current, or by a changing electric field.
    • Lifetime (of a particle): The average time an unstable particle exists before decaying.
    • Relativistic Kinetic Energy: The kinetic energy of an object, taking into account relativistic effects at high speeds, where mass increases with velocity.
    • Ideal Gas Law: An equation of state relating pressure, volume, temperature, and the number of moles of an ideal gas: PV = nRT.
    • Emissivity: The ratio of energy radiated by a material to energy radiated by a black body at the same temperature.
    • Black Body: An idealized object that absorbs all incident electromagnetic radiation and emits radiation based only on its temperature.
    • Spectral Radiance: The power emitted per unit area per unit solid angle per unit wavelength.
    • Balmer Series: A set of spectral lines in the visible region of the electromagnetic spectrum emitted by hydrogen atoms.
    • De Broglie Hypothesis: The concept that all matter has wave-like properties, with a wavelength inversely proportional to its momentum.
    • Bohr Model: A model of the atom in which electrons orbit the nucleus in specific, quantized energy levels.
    • Quantization: The concept that energy, momentum, and other physical quantities can only exist in discrete values.
    • Complex Number: A number that can be expressed in the form a + bi, where a and b are real numbers, and i is the imaginary unit, satisfying i2 = −1.
    • Complex Conjugate: For a complex number a + bi, its complex conjugate is a – bi.
    • Schrödinger Equation: A fundamental equation in quantum mechanics that describes the time evolution of a quantum mechanical system.
    • Postulates of Quantum Mechanics: The fundamental assumptions that underlie quantum mechanics, including the representation of the state of a system by a wave function, the representation of observable quantities by operators, and the probabilistic nature of measurement outcomes.
    • Operator (quantum mechanics): A mathematical function that when applied to a quantum state (wavefunction) yields another state and can represent a physical observable.
    • Eigenvalue: A characteristic number associated with an operator; the only possible result of a measurement of an observable quantity in quantum mechanics.

    Physics: From Mechanics to Quantum Theory

    Okay, here’s a briefing document summarizing the key themes and ideas from the provided excerpts.

    Briefing Document: Key Concepts in Mechanics, Electromagnetism, and Quantum Mechanics

    Overview:

    The provided excerpts cover fundamental concepts in physics, spanning classical mechanics, electromagnetism, and the introduction of quantum mechanical ideas. The text progresses from basic vector operations and Newtonian mechanics to Maxwell’s equations and then to the breakdown of classical physics in the realm of unstable particles, radiation, and atomic structure. It then introduces core principles of quantum mechanics and some applications. A recurring theme is the tension between classical and quantum descriptions of the universe and how the limitations of classical physics necessitate the development of quantum theory.

    Key Themes and Ideas:

    1. Vector Operations and Newtonian Mechanics:
    • Vectors: Vectors are defined with magnitude and direction, broken down into components. Unit vectors (I, J, K) denote directions along coordinate axes. “Vectors even though they carry both length information and direction information can be summarized as having a singular length that characterizes the full straight line distance that you would have to go to get from the beginning of the vector to the end of the vector and this is known as its length or its magnitude.”
    • Vector Addition/Subtraction: Vector components are added or subtracted to find the resultant vector.
    • Dot Product: Produces a scalar. C = A · B = |A||B|cos(θ). “In component notation you can calculate this by taking the X components and multiplying them together together taking the Y components and multiplying them together Etc and then adding all of those products together and again this yields a pure scalar a pure number with no direction.”
    • Cross Product: Produces a vector, perpendicular to both input vectors. A x B = C, where |C| = |A||B|sin(θ).
    • Newton’s Laws: The first law states that objects maintain constant velocity unless acted upon by an external force. “The first law states that the state of motion that is the velocity of an object remains constant unless the object is acted upon by an external Force absent external influences the natural state of an object is to maintain whatever velocity it presently has.” ΣF = 0 implies a = 0.
    1. Electromagnetism and Maxwell’s Equations:
    • Electric and Magnetic Fields: Electric charge density is the source of electric fields, while electric current density (moving charges) is the source of magnetic fields. “A density of electric charge however is the source of the electric field of force Mass has nothing to do with the electric field of force… now an electric current density that is a flow of electric charge is the source of a magnetic field of force.”
    • Nabla Operator (∇): A vector operator made of derivatives with respect to space, used to express Maxwell’s equations concisely. “I’m going to define a symbol it’s this funny triangular symbol known as nabla because it resembles a ancient harp of the same name…it’s made of derivatives.”
    • Maxwell’s Equations: Four equations governing electric and magnetic fields.
    • Gauss’s Law for Electric Fields: Relates charge density to the electric field. “a charge density that is a charge per unit volume row is the source of an electric field on the left hand side we have this operator I defined above which is just a triplet of space derivatives acting on an electric field via the action of the dot product so this thing returns a number and that number is equal to the charge density divided by Epsilon knot which is a constant of nature”
    • Gauss’s Law for Magnetic Fields: Indicates no magnetic monopoles. “What this equation tells you is that so far as we know there are no such thing as a magnetic charge”
    • Faraday-Maxwell Law: A time-changing magnetic field generates an electric field. “The Faraday Maxwell law tells me that if I have a time-changing magnetic field this can generate an electric field”
    • Ampere-Maxwell Law: A time-changing electric field or current density generates a magnetic field.
    • Light as Electromagnetic Waves: Maxwell’s equations explain light as an electromagnetic wave. By solving Maxwell’s equations in empty space (no charges, no currents), wave-like solutions for electric and magnetic fields are obtained.
    1. Relativistic Kinematics
    • Relativistic Momentum and Energy: The text addresses relativistic corrections to kinetic energy and momentum, “these quantities for momentum and kinetic energy have all the right behaviors they don’t look like what they looked like in their assumed classical forms they reduce to their classical forms in the appropriate limit and they leave laws of physics invariant where they can be applied”
    • Kinetic Energy: An equation for relativistic kinetic energy is presented: Kinetic energy of a particle is simply given by the quantity of its gamma Factor minus 1 times mc^2.”
    1. Unstable Particles and Decay:
    • Radioactive Decay: Mathematics of unstable particles was developed to understand radioactive decay.
    • Half-life and Lifetime: The concept of half-life (T1/2) and characteristic lifetime (τ) is introduced to describe the exponential decay of unstable particles. “If you allow enough time to pass that one time constant’s worth of time goes by you find that 63.2% of the original number of objects are gone.” τ is the time constant and also the average lifetime of an unstable particle. “The halflife is equal to the time constant time the natural log of 2”
    • Exponential Decay: The number of unstable objects decreases exponentially with time: N(t) = N0 * e^(-λt), where λ = 1/τ.
    1. Heat, Radiation, and Blackbody Radiation:
    • Heat Transfer: Heat can be transferred through conduction, convection, and radiation.
    • Stefan-Boltzmann Law: The power radiated/absorbed by a body is proportional to its emissivity, surface area, and the fourth power of its temperature: P = εσAT^4. “The power radiated or absorbed by a body that that is to say the change in heat energy per unit change in time is given by the product of four numbers Sigma which is a constant of nature known as the Stefan boltzman constant… the stuff on boltzman constant is multiplied by another number which is this curly lowercase Greek Epsilon Epsilon is the emissivity of the surface of a body”
    • Blackbody Radiation: A perfect emitter and absorber of radiation (ε = 1).
    • Spectral Radiance: The power emitted per unit wavelength, denoted as B(λ).
    • Rayleigh-Jeans Law: A classical attempt to predict spectral radiance, which fails at short wavelengths (the “ultraviolet catastrophe”). “for say a spherical body that’s heated to uh a certain temperature T and that body has a certain surface area a the shorter the wavelength of the radiation you consider being emitted from the body the more and more power is radiated around that wavelength if true this would be a catastrophic feature of nature”.
    • Planck’s Law: Provides the correct description of blackbody radiation, demonstrating the quantization of energy. Plank’s law Nails it Max Plank’s law as he derived it in the early 1900s was the Cornerstone of the correct description of the radiation from heated matter”
    1. Ideal Gas Law and Kinetic Theory:
    • Ideal Gas Law: PV = nRT, relates pressure, volume, number of moles, ideal gas constant, and temperature.
    • Molar Mass: Molar mass is defined as the mass per mole (mass per 6.022×10^23 molecules/atoms).
    • Kinetic Theory: Connects microscopic properties (molecular speed, mass) to macroscopic properties (pressure, temperature). The text details how force and pressure can be related to the average kinetic energy of gas molecules.
    • Average Kinetic Energy: Kavg = (3/2)kT, where k is the Boltzmann constant (R/Na). “… the average kinetic energy of a constituent of an ideal gas is given simply by a number three halves times another number the gas constant divided by avagadro’s number times a single variable the temperature of that gas”
    1. Atomic Structure and Quantum Mechanics:
    • Atomic Emission Spectra: Excited atoms emit discrete lines of light, unique to each element. These lines could not be explained by classical physics. “Atoms in general when excited by an ionizing electric potential emit not a continuous rainbow of colors but rather a discrete set of color colors”
    • Balmer Series: A mathematical relationship between the wavelengths of visible light emitted by hydrogen. Balmer noted that the wavelength of each of these lines is given by a simple formula a constant btimes this ratio an integer n^ 2 / by that same integer n^ 2us 2^ 2 or 4″
    • Early Atomic Models:Thomson Model: Electrons embedded in a uniform positive charge (“plum pudding”).
    • Rutherford Model: A central, positively charged nucleus surrounded by orbiting electrons. This was based on alpha particle scattering experiments.
    • Bohr Model:Quantization of Angular Momentum: Bohr postulated that angular momentum is quantized (L = nħ). “Since h and H bar the reduced plunks constant have units of angular momentum that is jewles time seconds it might be in an atom that the angular momentum L is a multiple an integer multiple of H bar”
    • Matter Wave Hypothesis: de Broglie’s hypothesis (λ = h/p) implies that only certain wavelengths are allowed for an electron orbiting the nucleus.
    • Allowed Radii and Energies: Combining the quantization condition with classical mechanics leads to discrete allowed radii (rn = n^2a0) and energy levels (En = -13.6 eV / n^2). ” the fact that just using a classical model of the atom combined with matter wave nature of the electron one can immediately reproduce a pattern in the world around you in this case the Balmer series of atomic emission spectrum lines this is incredible”
    1. Simple Harmonic Motion and Waves
    • SHM: a model of oscillatory behavior.
    • Wave Equation: A mathematical description of wave behavior relating spatial and temporal derivatives of the wave amplitude.
    1. Complex Numbers and their Use in Describing Waves:
    • Complex Numbers: The text introduces complex numbers (Z = x + iy) and the imaginary unit (i = √-1). It explains how to find the magnitude of a complex number using the complex conjugate (Z* = x – iy): |Z|^2 = Z*Z = x^2 + y^2. “To get a real number you need to do something like this and this is part of what defines the algebra of complex numbers you’re going to take Z and you’re going to multiply it by a special version of itself known as Zar uh this is just x + i y the original complex number times x minus i y”
    • Euler’s Formula: The excerpts allude to but don’t explicitly state Euler’s formula (e^(ix) = cos(x) + i sin(x)), which is used to simplify wave equations. It also sets up the use of complex conjugates with respect to calculations of wavefunctions.
    1. Quantum Mechanics Postulates
    • The state of a system is completely described by a wavefunction.
    • Every observable property of the system has a corresponding mathematical “operator”. (e.g. energy is measured by a Time derivative).

    Quotes Illustrating Key Shifts in Understanding:

    • “…but the real laws of nature that we encounter in a course on introductory mechanics are Newton’s famous three laws of motion…” – This highlights the starting point of classical mechanics.
    • “Now what’s amazing about the laws of electricity and magnetism Maxwell’s equations is that when you consider them in a particular situation it finally clarifies what the heck the nature of light is…” – This marks a transition to electromagnetism and its explanatory power.
    • “If true this would be a catastrophic feature of nature…” – This reflects the crisis in classical physics when applied to blackbody radiation.
    • “They are all solutions to a wave equation that is an equation that describes how changes in space relate to changes in time.”

    Conclusion:

    The excerpts provide a concise overview of fundamental physics concepts, from basic mechanics to the dawn of quantum theory. The progression highlights how classical physics, while successful in many domains, breaks down when applied to phenomena at very small scales, requiring the development of quantum mechanics. The discussion of the blackbody problem and atomic structure exemplifies this shift in paradigm. The introcuction of complex numbers provides a basic tool for QM, and the postulates begin to lay out the mathematical structure for the theory.

    Vectors, Fields, and Particle Physics: Key Concepts

    Vectors and Unit Vectors

    Q1: What are vectors and how are they represented? Vectors are quantities that have both magnitude (length) and direction. They are typically represented with an arrow over the variable, such as ‘a’. In a Cartesian coordinate system, a vector can be broken down into components along the x, y (and z in 3D) axes (Ax, Ay, Az). Unit vectors (like i-hat, j-hat, and k-hat) denote direction along the x, y, and z axes, respectively.

    Vector Magnitude and Operations

    Q2: How is the magnitude (or length) of a vector calculated? The magnitude of a vector ‘a’ can be denoted as ‘a’ (without the arrow) or as |a|. It is calculated using the Pythagorean theorem in multiple dimensions. The square root of the sum of the squares of its components (e.g., √(Ax^2 + Ay^2)). Q3: How are vectors added and subtracted? Vectors are added component-wise. If you have vectors ‘a’ and ‘b’, the resulting vector ‘c’ (c = a + b) has components Cx = Ax + Bx, Cy = Ay + By, and Cz = Az + Bz. Subtraction follows the same principle, just with subtraction instead of addition. Q4: What are dot product and cross product and what do they return? The dot product (a · b) is a scalar (a single number) quantity calculated as the magnitude of a times the magnitude of b times the cosine of the angle between them. In component form, it’s (Ax * Bx) + (Ay * By) + (Az * Bz). The cross product (a × b) yields another vector ‘c’, perpendicular to both ‘a’ and ‘b’. The magnitude of the result is the magnitude of a times the magnitude of b times the sine of the angle between the two vectors.

    Force Fields and Maxwell’s Equations

    Q5: How are electric and magnetic fields created, and what are Maxwell’s equations? Electric fields are created by electric charge density (charge per unit volume), while magnetic fields are created by a flow of electric charge, aka current density. Maxwell’s equations are a set of four equations that describe these fields. They relate the electric and magnetic fields to their sources (charges and currents) and to each other:

    1. Gauss’s law for electric fields
    2. Gauss’s law for magnetic fields
    3. Faraday-Maxwell law
    4. Ampere-Maxwell law These equations also clarify the nature of light as electromagnetic waves.

    Unstable Particles and Half-Life

    Q6: What is the relationship between lifetime, time constant, and half-life for an unstable particle? Unstable particles decay over time, described by exponential decay. The lifetime (τ) is the average time an unstable particle exists. The time constant (τ) is the time it takes for approximately 63.2% of a sample of unstable particles to decay. The half-life (t1/2) is the time it takes for half of the particles to decay. The relationship is t1/2 = τ * ln(2).

    Relativistic Kinetic Energy

    Q7: How is relativistic kinetic energy calculated, and how does it relate to classical kinetic energy? Relativistic kinetic energy (K) is given by K = (γ – 1)mc^2, where γ is the Lorentz factor (gamma), m is mass, and c is the speed of light. In the limit where velocity (u) is much smaller than the speed of light, this equation reduces to the classical kinetic energy formula K = (1/2)mu^2.

    Ideal Gas Law and Kinetic Theory

    Q8: How does the ideal gas law relate macroscopic properties to microscopic properties of gas molecules? The ideal gas law (PV = nRT) relates the pressure (P), volume (V), number of moles (n), ideal gas constant (R), and temperature (T) of a gas. From that law it can be derived that the average kinetic energy of a single gas molecule is directly proportional to the temperature of the gas (KE = (3/2)kT), where k is the Boltzmann constant. Furthermore, the average speed of the gas molecules relates to the temperature and molar mass of the gas.

    Understanding the Speed of Light: Definition, Constancy, and Implications

    The speed of light is a crucial concept in modern physics, particularly in the context of special relativity. Here’s a breakdown:

    • Definition: The speed of light refers to the rate at which light travels a certain distance in a specific amount of time. It is the number of meters that light can travel once emitted by a source in a certain amount of time.
    • Numerical Value: Light travels at 299,792,458 meters per second. This is often approximated as 2.998 * 10^8 m/s.
    • Constancy: The speed of light is constant for all observers, regardless of their state of motion or the motion of the light source. This principle is a cornerstone of Einstein’s theory of special relativity and was established through experiments like the Michelson-Morley experiment.
    • Implications for Space and Time: The constant speed of light leads to some surprising consequences:
    • Relativity of time and space: Measurements of space and time are not absolute and can differ for observers in different states of motion.
    • Time dilation: Observers in motion relative to others will experience a slower passage of time.
    • Length contraction: Objects in motion will appear to be shorter in the direction of motion to a stationary observer.
    • Notion of simultaneity: Events that appear simultaneous in one frame of reference may not be simultaneous in another.
    • Historical context:
    • Early attempts at measurement: Galileo attempted to measure the speed of light using lanterns and assistants on distant hills, but the speed of light was too fast for his methods.
    • Rømer’s determination: Ole Rømer used observations of Jupiter’s moon Io to make a reasonable determination that light travels at a finite speed.
    • Modern establishment: By the time of Einstein’s publications, the speed of light had been established to be within approximately 50 km/s of today’s methods.
    • Practical Considerations: A useful rule of thumb is that light travels approximately one foot in one billionth of a second (one foot per nanosecond).

    The speed of light, therefore, is not just a measure of how fast light travels, but a fundamental constant that shapes our understanding of space, time, and the universe.

    Understanding Wave Phenomena: Properties, Effects, and Duality

    Wave phenomena are characterized by oscillatory behavior distributed in space and time. Here’s an overview of key concepts related to wave phenomena, as discussed in the sources:

    • Gross properties of waves: To understand waves, it’s helpful to focus on a specific point and observe its repetitions, which represent the wave’s spatial or temporal distribution.
    • Wavelength ((\lambda)): The distance between two corresponding points on a wave, such as the distance between crests or the distance between maximum displacement from zero.
    • Period (T): The time it takes for a wave to complete one full cycle, or the time between successive maximums or minimums passing a fixed point in space.
    • Frequency (f or (\nu)): The number of cycles per unit of time, typically measured in Hertz (Hz) or per second, and is the inverse of the period ((f = 1/T)).
    • Wave front: A line or plane that indicates the location in space of a maximum of the traveling wave.
    • Speed of a wave: The speed ((v)) at which a wave moves through space is the product of its wavelength and frequency ((v = \lambda f)). For light waves in a vacuum, this speed is the speed of light ((c)), approximately (2.998 \times 10^8) m/s.
    • Doppler effect: The change in frequency or wavelength of a wave for an observer who is moving relative to the wave source.
    • Classical Doppler Effect: Occurs when a source emitting a wave is moving relative to an observer. For example, if a car is moving and honking its horn, the sound waves in front of the car will be compressed (shorter wavelength, higher frequency), while the sound waves behind the car will be stretched out (longer wavelength, lower frequency).
    • Relativistic Doppler Effect: A modification of the classical Doppler effect that accounts for the effects of special relativity on space and time measurements.
    • Wave equations: Mathematical expressions describing how changes in space relate to changes in time for a wave. Examples include:
    • Mechanical waves: For a vibrating string, the vertical displacement ((Y)) varies with horizontal position ((x)) and time ((t)), described by a wave equation involving second derivatives with respect to both space and time.
    • Electromagnetic waves: Maxwell’s equations lead to a wave equation describing the propagation of oscillating electric and magnetic fields in space.
    • Energy in waves:
    • Mechanical waves: The kinetic and potential energy of a small segment of a vibrating string can be calculated, with the total energy being constant and dependent on factors like angular frequency, linear mass density, and displacement.
    • Electromagnetic waves: The energy of a light quantum (photon) is related to its frequency by (E = hf), where (h) is Planck’s constant. The momentum ((p)) of a photon is related to its wavelength by (p = h/\lambda).
    • Wave-particle duality: Light and matter exhibit both wave-like and particle-like properties.
    • Light: Demonstrates wave-like behavior through phenomena like diffraction and interference, but also exhibits particle-like behavior as seen in the photoelectric effect and black body radiation.
    • Matter: Particles like electrons can also exhibit wave-like behavior, as demonstrated by electron diffraction experiments. De Broglie proposed that the wavelength of a matter wave is inversely proportional to its momentum: (\lambda = h/p).
    • Interference: When waves overlap, they can either constructively interfere (amplitudes add up) or destructively interfere (waves cancel out). A classic example is the double-slit experiment, where light or matter waves passing through two slits create an interference pattern of bright and dark spots.
    • Scale and wave behavior: The manifestation of wave or particle behavior depends on the size of the system relative to the wavelength of the radiation.
    • If the wavelength is much larger than the system, wave-like behavior dominates.
    • If the wavelength is much smaller than the system, particle-like behavior dominates.
    • Matter waves: de Broglie’s hypothesis suggests that matter also has wave properties, with the energy and momentum of a particle related to the frequency and wavelength of its corresponding matter wave. This is described by (E = hf) and (p = h/\lambda).

    Understanding Relative Motion: Classical and Special Relativity

    Relative motion is a concept that describes how the motion of an object is perceived differently depending on the observer’s own motion. It’s a fundamental aspect of both classical and modern physics and is essential for understanding how different observers can disagree on the measurements of space and time.

    Here’s an overview of relative motion, as discussed in the sources:

    • Classical (Galilean) Relativity:
    • Basic principle: In introductory physics, relative motion is often introduced with examples like a person on a moving train throwing a ball. Observers in different frames of reference (e.g., the person on the train and someone standing on the ground) will perceive the motion of the ball differently.
    • Galilean transformation: This set of equations mathematically relates spatial coordinates and velocities of objects from a frame considered at rest to a frame considered moving, assuming time passes the same for all observers.
    • Intuitive but limited: Classical physics and Galilean relativity provide an intuitive understanding of relative motion at everyday human scales and speeds.
    • Ether: In classical physics, it was presumed that light, like mechanical waves, needed a medium to travel through, called the ether. The speed of light was thought to be constant with respect to this ether.
    • Einstein’s Theory of Special Relativity:
    • Postulates: Einstein’s theory is based on two key postulates:
    • The laws of physics are the same for all observers, regardless of their state of relative motion.
    • The speed of light in a vacuum is the same for all observers, regardless of their state of motion or the motion of the light source.
    • Rejection of Absolute Time: Einstein challenged the idea that time is experienced the same way by all observers.
    • Consequences:
    • Time dilation: Moving observers experience time passing more slowly relative to stationary observers.
    • Length contraction: Objects in motion appear shorter in the direction of motion to a stationary observer.
    • Relativity of simultaneity: Events that are simultaneous in one frame of reference may not be simultaneous in another.
    • Frames of Reference:
    • A frame of reference is any object or system, all of whose parts move at the same velocity with respect to an agreed-upon reference point in space.
    • Descriptions of events depend on the frame of reference in which the observation is made.
    • It is impossible to determine whether you are in motion by making observations in your frame of reference.
    • Velocity Addition:
    • Non-intuitive at high speeds: In special relativity, velocities do not simply add linearly as they do in classical mechanics.
    • Velocity Transformation Equation: Provides a way to correctly add velocities, ensuring that the speed of light remains constant for all observers.
    • Light Speed Limit: Nothing can move faster than the speed of light.
    • Practical Implications and Examples:
    • Muon Decay: The behavior of muons, subatomic particles with short lifetimes, provides experimental evidence for time dilation and length contraction. Muons created in the upper atmosphere travel much farther than expected because their lifetimes are extended due to their high speeds.
    • Doppler Effect: The change in frequency or wavelength of a wave (e.g., light) due to the relative motion between the source and the observer. This effect is used to infer relative velocities on a cosmic scale.

    Understanding relative motion requires careful consideration of the observer’s frame of reference and the effects of special relativity, especially when dealing with high speeds. The speed of light remains the same for all observers. The laws of physics remain the same for all observers.

    General Relativity: Spacetime, Gravity, and Einstein’s Theory

    General relativity is a theory of space, time, and gravitation developed by Albert Einstein that extends his earlier theory of special relativity. It provides a framework for understanding gravity not as a force, but as a consequence of the curvature of spacetime caused by mass and energy.

    Here’s an overview of general relativity, based on the information in the sources:

    • Transition from Special Relativity:
    • Einstein sought to generalize special relativity into a complete theory of space and time, aiming to explain gravity.
    • This endeavor required a decade of struggle and relearning advanced mathematics.
    • Key Concepts:
    • Spacetime: General relativity treats space and time as a single, four-dimensional framework called spacetime.
    • Curvature of Spacetime: Mass and energy cause spacetime to curve, bend, or warp.
    • Gravity as Curvature: What we perceive as gravity is the result of objects following the curvature of spacetime. Objects are not pulled down by the mass of the Earth. Rather, they are following a path in SpaceTime that’s curved due to the presence of the mass energy of the earth bending that space and time.
    • Equivalence Principle: There is no difference between an accelerated frame of reference and a frame experiencing a gravitational field.
    • Implications and Predictions:
    • Bending of Light: Gravity bends the path of light. The amount of bending predicted by general relativity is twice as large as that predicted by Newtonian physics.
    • Gravitational Lensing: Massive objects can bend light from distant objects, allowing us to see objects that would otherwise be obscured. The degree of light deflection can be used to measure the mass of the intervening object.
    • Gravitational Time Dilation: Time is affected by gravity. Clocks at different heights in a gravitational field run at different rates. This must be accounted for in systems such as the Global Positioning System (GPS).
    • Cosmological Implications: General relativity is essential for understanding the origin and fate of the universe, including the Big Bang and the expansion of the universe.
    • Inertial Mass vs. Gravitational Mass:
    • Equivalence: General relativity is built on the observation that inertial mass (resistance to acceleration) and gravitational mass (response to gravity) are equivalent.
    • Eötvös Experiment: Confirmed the equivalence of inertial and gravitational mass with great precision.
    • Spacetime and Gravity:
    • Space and time tell energy and matter how to move.
    • Energy and matter tell space and time how to bend or curve.
    • Mathematical Complexity:
    • General relativity relies on complex mathematics beyond the scope of introductory physics courses.

    Matter Waves: Wave-Particle Duality and Quantum Mechanics

    Matter waves describe the wave-like behavior of particles, a concept that revolutionized physics in the early 20th century. This idea, primarily associated with Louis de Broglie, suggests that matter exhibits both particle-like and wave-like properties.

    Here’s a detailed discussion of matter waves, based on information from the sources:

    • De Broglie’s Hypothesis:
    • In his 1924 PhD thesis, Louis de Broglie postulated that matter, like light, has wave properties.
    • He proposed that the energy ((E)) and momentum ((p)) of a particle are related to the frequency ((f)) and wavelength ((\lambda)) of its corresponding matter wave through the same equations that apply to photons:
    • (E = hf)
    • (p = h/\lambda)
    • Where (h) is Planck’s constant.
    • Experimental Verification:
    • De Broglie’s hypothesis was experimentally verified through scattering experiments, where matter, such as electrons, is scattered off a target.
    • These experiments revealed interference patterns, demonstrating the wave nature of matter.
    • For instance, when electrons are scattered off crystals, the resulting diffraction patterns are similar to those observed when light passes through a diffraction grating.
    • These diffraction patterns could not be explained if electrons were solely particles.
    • Wave-Particle Duality:
    • Matter waves imply that particles, which had been primarily experienced as definite objects with specific locations in space and time, also exhibit wave-like behavior.
    • The manifestation of wave or particle behavior depends on the scale of the system and the wavelength of the matter wave.
    • When the wavelength is much smaller than the size of the object it interacts with, the particle-like aspects of the matter are observed.
    • When the wavelength is comparable to or larger than the size of the object, the wave-like aspects are observed.
    • Implications for Atomic Structure:
    • The concept of matter waves helps explain the discrete atomic spectra of elements.
    • In the Bohr model of the atom, electrons orbiting the nucleus can only exist in specific energy levels.
    • This quantization arises from the requirement that the electron’s matter wave must form a standing wave around the circumference of the orbit.
    • Only certain wavelengths (and therefore energies) satisfy this condition, leading to the observed discrete emission spectra.
    • Heisenberg Uncertainty Principle:
    • The wave nature of matter imposes a limit on the precision with which certain pairs of physical properties, such as position and momentum, can be known.
    • This is formalized in the Heisenberg Uncertainty Principle, which states that the more accurately the position of a particle is known, the less accurately its momentum can be known, and vice versa.
    • This principle arises because determining position requires localizing the wave, while determining momentum requires knowing its wavelength (and thus its spatial distribution).
    • Mathematical Description:
    • Schrödinger Equation: The wave equation for matter is the Schrödinger equation. It describes how matter waves propagate and evolve over time.
    • The solutions to the Schrödinger equation are wave functions, which are generally complex functions.
    • The square of the magnitude of the wave function represents the probability density of finding the particle at a particular location in space and time.
    • Born interpretation (not stated explicitly in the source but implied), which is the most practical interpretation, gives a probabilistic meaning to the square of the wave function.
    • Analogy for Understanding:
    • Consider the example of a beaker in a tank of water. When struck by a long wavelength, the beaker exhibits gradual motion. When struck by a short wavelength, the beaker is violently hit.
    • This illustrates how a wavelength short compared to the target will exhibit particle-like behavior, whereas a wavelength long compared to the target will exhibit wave-like behavior.
    Modern Physics Modern Physics Full Lecture Course

    The Original Text

    [Music] hello I’m Professor Steven sakula and I’d like to welcome you to Modern physics at SMU modern physics is the first course past the two semester introductory sequence where we begin to get closer to the physical principles that are at play in the modern technological and scientific world now like all good Sciences physics Builds on the past discoveries that have driven the field forward to continue to move through the frontier of human knowledge and modern physics is going to challenge you to Step Beyond the comfortable confines of introductory physics and into the modern view of the universe particularly space time energy and matter which are the four subjects at the heart of the science known as physics now introductory physics tends to leave off at the end of electricity and magnetism and that period coincides with the end of the 19th century the late 1800s which was a feverish time of experimentation the foundations of space and time had been laid in the centuries before by people like Isaac Newton and the laws of electricity and magnetism which were relatively the New Kids on the Block as regards scientific law had been established firmly in the mid 1800s it was believed at the time that for the most part everything that needed to be known about the universe had been established and all that was really left was to sort out some lingering puzzles that hadn’t quite yet been fit into the framework of Newton’s mechanics and the laws of electricity and magnetism now one of those phenomena is light it’s fairly straightforward to make a light source in the modern world all we have to do for instance is take a sealed tube filled with gas for instance and expose it to a strong electric field a large electric potential and we can coax it into emitting light now light is of course all around us it’s what’s literally Illuminating the scene here but what light was was only thinly established by the end of the 1800s we’ll look at the foundations of light and electricity and magnetism and mechanics in this video but one of the puzzles that was left over at the end of the 19th century was why certain elements emitted certain kinds of light but not others so for instance this sealed tube here is containing hydrogen gas and you’ll notice that it gives off a fairly strong reddish color on the other hand if I replace it with another tube of gas this one of mercury vapor Mercury being a a metal the only Elemental metal that’s liquid at room temperature in this case sealed as a vapor in the tube you’ll notice that in this case we get a very strong blue light from this particular element if you study the Fingerprints of these light emissions very closely you’ll observe that they have strong colors in some places but not in others in what is known as their atomic spectrum why was that the mystery of atomic Spectra would only be fully understood in the early part of the 20th century with the Advent of what we now call Modern physics specifically quantum physics now another interesting phenomenon that had been observed in the 1800s but which was not fully understood had to do with electric currents so what I’m going to do here is using the tribo electric effect using friction to build up an electric charge on a piece of plastic and then placing that plas plastic in contact with a conductor so that it can soak up that excess charge you’ll see that I’ve now caused a net electric charge to sit on this myar material attached to this aluminum soda can well nothing dramatic here so far the charges can freely move on the conductor and in this case they don’t like to be near each other because they’re electrons and they all have the same electric charge and so they Rush as far apart as they can get while remaining on the conductor so they’re trapped on this conductor they can’t escape but they’ve done everything in their power to get away from each other and in the process they’ve exerted an electric force that mechanically causes the myar to spread out in space now while I’ve been talking the light in this room which comes from something like a dozen light fixtures in the ceiling has been blasting this metal and yet while there’s a breeze in the room that moves the myar sheets around nothing’s really draining the charge off of this we don’t see these sheets appreciably falling down so so we have many watts of natural light coming in from light sources here nothing happens but it was observed in the 1800s that if you expose certain metals to certain colors of light or even colors of light that are Beyond human perception like ultraviolet they’ll begin to allow an electric current to flow this is the so-called photoelectric effect and it was unexplainable by Newton’s mechanics and the laws of electricity and magnetism as they were established in the 1800s so all I have to do according to the photoelectric effect idea is take something that emits ultraviolet radiation in this case a sanitizing wand for a sink or a counter or a toilet or something like that just go ahead and switch it on and it will begin to emit ultraviolet C radiation or light down toward the surface of the table and if I move it over the aluminum can it will begin to almost instantaneous ously drain the electric charge off the myar and if I sit here long enough it will eventually pull almost all of that electric charge off the myar leaving it hanging back down in a more vertical position this is four watts of UV light compared with tens of watts of natural light coming from the light fixtures in the ceiling why why does the color of the light matter to this effect more than the intensity of the light that was a Myst left over at the end of the 1800s now another mystery which would ultimately combine to lead to a firmer understanding of matter energy space and time had to do with heat energy now heat energy is something that we will explore in this class if I light this burner on fire so that it emits a nice blue flame I can take a biim metallic Strip This is a strip of two metals bonded together back to back and and I can place it in the flame now we’ll we’ll take a look at some of mechanically what’s going on here later in the course when we establish the foundations of heat energy uh in about a month or so but if I leave this metal in the flame not only because does it begin to bend but after enough time it will also begin to glow on its own originally it had a silvery color room temperature but as I continue to expose it to the source of heat energy this Open Flame and heat energy is transferred into the metal eventually the metal begins to Glow of its own accord with its own light now this wasn’t such a mystery to physicists and chemists of the 1800s but what was a mystery had to do with the degree of absorption of energy and the degree of reemission of energy at other at other frequencies and wavelengths other kinds of light that you can see with your eye and the exact relationship between heat energy and temperature and the kinds of radiation that should be emitted from a heated body proved a real challenge to mechanics electricity and magnetism and the laws of heat energy transfer or thermodynamics that were also established in the 1800s so modern physics is your gateway into a world that’s more consistent with the kind of world we live in today not the world of the 17 and 1800s but the world of the 1900s the 20th century and now the 21st century the foundations that we will establish in this course will lay the groundwork for a variety of important technological revolutions non-invasive Imaging of the human body the harnessing of the energy at the heart of every atom the construction of semiconductor devices which revolutionized our ability to do computations quickly and efficiently and a host of other advancements whose root Roots were all laid down in a revolutionary period in the transition between the 1800s and the 1900s that led into the era of modern physics welcome to this course for the rest of the video we’ll do a foundations review of introductory physics to refresh your memory about the most Salient things from the past two semesters of material and then we’ll move on to the foundations of modern physics in this lecture we’ll re-explore the foundations of introductory physics the basic concepts that should have been communicated to you in the first two semesters of introductory physics physics Builds on the past like all Sciences the discoveries of an earlier era influence our understanding of new discoveries and how to adapt our mathematical descriptions of nature in order to describe what we know know from the past but include new observations that don’t quite fit into the original framework that we had developed the big picture that I want you to take away from this foundation’s lecture can be broken into four large Parts first of all a foundation of the physics that you have learned so far is Newton’s mechanics these are laws of motion they link forces that act on objects to changes in the states of motion of the those objects and a state of motion is characterized by the velocity of an object the laws of mechanics were first established by Isaac Newton in his foundational publication philosophia naturalis prinkipia Mathematica or the prinkipia published in 1687 this set of laws illuminates how velocity the state of motion of an object can be influenced by external forces and codifies mathematically using geometry algebra and the newly invented calculus the way in which you can describe the interaction of these things in order to understand the natural world what would also be developed over the following centuries were a series of what we now call conservation laws these are principles that establish that certain quantities appear to be conserved that is left unchanged even by complex phenomena in nature these include things like the total energy of a system including internal forms of energy like chemical energy the total linear momentum of a system and the total angular momentum of a system and for closed and isolated systems where no external forces especially of the non-conservative variety those that can’t store in release energy and some kind of potential in those systems conservation laws will absolutely hold and they were established through careful chemical and physical work up through the 1700s and they continued to be built on in work on heat energy in the 1800s heat energy and the laws that govern its transfer from the mechanical form to the thermal form will be Revisited in later lectures the second part of the foundations of modern physics the third key idea is Newton’s law of gravitation that is the law that relates the distance between material bodies and the force between them a force that requires no actual physical contact no medium to be present between two things in order for them to exert a force on each other and this was also established in Newton’s prinkipia and finally the last set of laws of physics that we have to accept as a foundation for what’s going to happen in this course are the laws of electromagnetism these are the rules of electricity and magnetism describing them as forces in the same way that gravity is a force that can induce changes in states of motion again without physical contact between material bodies electric forces and magnetic forces can operate even if there’s no medium between the two bodies that are interacting with each other via these forces they were established in the 17 to 1800s and they were finally codified formally in four equations known as Maxwell’s equations in 1862 one of the mathematical foundations of describing nature in physics is a kind of number known as a vector the these are essential to describing any multi-dimensional quantity and they have a well-defined algebra which you should have exercised in previous physics courses you probably have also exercised these in a dedicated math or engineering or both course vectors are numbers that can be built from scalers scalers are numbers that have no directional information so for instance a good example of a scaler would be if you asked for directions to somebody’s house and they told you go 10 miles well that might eventually by going 10 miles get you to their house but without some crucial directional information how far east south north or west should I go to add up to those 10 miles you’re probably not going to make the journey successfully scalers however can be assembled using for instance component notation into a vector so here for instance is demonstrated a vector denoted a with a little arrow over its head and it’s broken into components it has a component that lies entirely along the x coordinate axis in a cartisian coordinate system with length a with a subscript X and similarly it has a component along the Y AIS in a cartisian coordinate system a with a subscript Y and these little vectors here I with a little triangular hat over at J with a little triangular hat over it we’ll come back to those in a moment but they’re essential in indicating a dedicated Direction either only along the x or y or Z axis vectors even though they carry both length information and direction information can be summarized as having a singular length that characterizes the full straight line distance that you would have to go to get from the beginning of the vector to the end of the vector and this is known as its length or its magnet magnitude this can be denoted in one of several ways either just drawing the vector with no Arrow over it so a in this case or putting absolute value signs around the vector that’s another common notation for length or magnitude of a vector and this can be computed using the sums of the squares of the components and then you take the square root of that total sum in two Dimensions this will recall the familiar Pythagorean theorem which given the lengths of of the sides of a right triangle will tell you the length of the hypotenuse then there are unit vectors this is a a subspecies of vector and they’re special because they are vectors whose length is always exactly one in whatever unit system you choose to use unit vectors are denoted with that little triangular hat symbol so for instance I hat J hat and K hat as they would be denoted in spoken terms are special and they’re unit vectors that point only along the X Y and Z axis respectively of a cartisian coordinate system this also means that because the angles between the X and Y Y and Z and z and x axis are 90° the angles between these unit vectors are also always 90° for any pair you can add vectors so for instance if I have a vector a and a vector B and I want to know what the resulting Vector for in C with a little Vector Arrow over its head looks like all I have to do is take the X components and add them together noting that they point along the IAT Direction take the Y components and add them together noting that they point along the y direction Etc and this will give you the resulting sum of two vectors you can replace the sum with a minus sign to get the difference of two vectors but the math is the same there are two kinds of multiplicative products of vectors the dot product which gives you a number and the cross product which returns a vector the dot product is given by the following notation C can be represented as the dotproduct of two vectors A and B with a little dot between them and it’s a number it’s a pure scalar whose size is the magnitude of a Time the magnitude of B time the cosine of the angle between a and b in component notation you can calculate this by taking the X components and multiplying them together together taking the Y components and multiplying them together Etc and then adding all of those products together and again this yields a pure scalar a pure number with no direction on the other hand the cross product the other multiplicative operation between two vectors yields a vector so in this case the cross product of two vectors A and B would yield a third Vector C the cross product is denoted by putting a cross multiplicative sign in between the two vectors A and B this one’s a little bit more complicated and you have to be a bit more careful with this I like component notation because you can essentially distribute the multiplication algebraically between the two vectors A and B and you wind up with terms that look like the X component of a and the Y component of b with this cross product of unit vectors next to it and then the Y component of a and the X component of b with the reverse cross product of I and J hat next to it and then a bunch of other terms that look similar to this depending on how many dimensions this thing has and in the end this yields a pure Vector with a length given by the magnitude of a Time the magnitude of B time the sign of the angle between the two vectors A and B also the vector c will always point at exactly a right angle to both a vector and B Vector that’s one of the natural consequences of the cross product now the cross products of coordinate axis unit vectors like IH hat J hat and K hat obey the following rules the cross product of any unit Vector with itself is zero because there is no Vector that’s perpendicular at the same time to both I and itself there’s an infinite number of those vectors and the cross product yields a result of zero for this similarly with J cross J and K cross K now the rule of thumb for computing all of the other cross products is that I cross J is K and then if you kind of conveyor belt K to the beginning of this operation move I to where J is and move j to where K is you get one of the other cross products K cross I is J and then similarly doing this conveyor belt permutation one more time socalled cyclic permutation you get J cross K is I now what about J cross I I cross K or k cross J well if you swap the order on the left side of these equations then the right side changes by a minus sign so J cross I would be negative Kat I cross K would be negative J hat and so forth vectors are an essential building block of everything that happens in mechanics but the real laws of nature that we encounter in a course on introductory mechanics are Newton’s famous three laws of motion the first law states that the state of motion that is the velocity of an object remains constant unless the object is acted upon by an external Force absent external influences the natural state of an object is to maintain whatever velocity it presently has this can be summarized in an equation as follows the sum of all forces with subscript I and there can be from 1 2 3 all the way up to capital N forces acting on an object if all of those add up and cancel each other out so that there is zero net force acting on an object then the resulting acceleration that is the change of velocity with respect to time or the change of the state of motion with respect to time given by the second derivative of a position Vector of the object is zero no net force no change in state of motion the more general form of this equation is given by Newton’s Second Law which relates the net unbalanced force acting on an object to any resulting acceleration or change in the state of motion of that object the change in Newton’s second law is proportional to something force and acceleration can be related to each other by a simple equation and the constant of proportionality between force F and acceleration a is given by m the so-called inertial mass of an object because you can write the acceleration as the second derivative with respect to time of the position Vector of an object I’ve put here the calculus notation for the acceleration in three dimensions where R Vector is a position Vector X Y and Z that not only can change with time but whose change with time can be further altered by having an external force act on it that is an acceleration and then finally there’s Newton’s third law that in every interaction of two material objects let’s call them A and B two forces are in action the direction of the force exerted by object a on object B is the opposite of the force of object B on object a but they are otherwise equal in magnitude so if I take my hand and push on the surface of a table the table pushes back against my hand with an equal magnitude but opposite direction Force that’s why my hand doesn’t go through the table now usually after learning about Newton’s Laws of Motion we then learn about quantities that are associated with motion these are known as energy and momentum what what is common between these quantities is that they vary in some proportion to the degree of motion so for example the quantity of energy associated with a moving object so-called kinetic energy is proportional to mass and to the square of the velocity of an object it is a scalar because you square the velocity you lose all directional information about it and the exact equation for kinetic energy is determined to be 1/2 * the mass times the velocity squared or the speed squared of an object there is a Direction full quantity of motion and that is known as linear momentum it is proportional to mass and directly to the velocity of a body at least in this classical physics and this introductory mechanics we learn about this is observed to be the thing that appears to also be conserved in nature like energy linear momentum is denoted by the letter P with a vector hat over it and it’s the product of inertial mass and the velocity of the object we can write this in calculus notation as the mass times the first derivative of the position Vector with respect to time now there’s another momentum quantity that’s associated with a body that can rotate as well so the degree of its rotation around some axis imparts some angular momentum to the system and we also learn that enclosed and isolated systems this quantity can be conserved it’s proportional not to the mass of the body but to the distribution of mass around the axis of rotation the so-called moment of inertia and to the rotational velocity of that body all points on a rigid body that can rotate about an axis will have the same rotational velocity regardless of their distance from the axis of rotation and the moment of inertia describes using an integral which is shown here I is the integral of r² DM where R is the distance from the axis of rotation for the little bit of DM mass that you’re considering at the time the product of these two things yields the angular momentum and this is observed to be conserved in systems that are closed and isolated now if an external conservative Force acts one where the work done by the force in moving an object from point A to point B is the negative of the work moving from point B to point a by any path that you can take then there is an Associated potential energy as well which we denote U this is another kind of energy so there’s kinetic energy and then for conservative forces where those things like gravity for instance can act on a system you have an Associated potential energy you can lose kinetic energy and store it in potential energy when you can lose potential energy and gain it in kinetic energy there’s an interplay in these kinds of energy in systems and the total energy can be conserved on the other hand for external non-c conserv ative forces such as friction or Air drag there is no Associated potential energy but other forms of energy such as heat which is the motion of atoms in a material object can result from losses of kinetic energy through the action of those forces now as I’ve hinted at before energy and momentum can be conserved and for a system that is acted upon only by conservative forces which have an Associated potential energy and is otherwise close osed to and isolated from all other kinds of forces in that specific case what is known as mechanical energy is completely conserved mechanical energy is the sum of all kinetic and all potential energy in the system at any moment so for instance there might be some initial moment of time where there’s a total kinetic energy Ki and a total potential energy UI and if the system obeys the constraints I’ve listed above that I can look at any other time say some time final later denoted with an f and I can see that although kinetic and potential energies may have morphed one into the other the sum of these two things across all objects in the system is the same sum as I had at the earlier time now for a non-closed and nonisolated system and especially where non-conservative forces can act total energy will be conserved but not just mechanical energy and total energy is the sum of kinetic potential and all other forms of internal energy like heat due to friction or drag or even chemical energy if for instance mechanical energy has been converted into stored chemical energy through some chemical and mechanical and electrical process then you can retain the energy in that form and you may be able to get it back later in the form of either potential or kinetic I energy depending on what kinds of non-conservative forces are acting in the system but if you can figure out all the energy buckets where energy can go in a system even one where non-conservative forces can act then you can still see that the total energy in all of those buckets added up remains constant over time even if you can’t recover mechanical energy when it’s lost into forms like heat or chemical energy and for a closed and isolated system of objects total momentum both kinds linear and angular is conserved so if I sum up all the linear and all the angular momentum at one time initial TI I will find later on that the sum of all momentum and all angular momentum all linear momentum and all angular momentum is the same even if it’s been interchanged between objects maybe they’ve collided with each other things like that now if only elastic Collision of these objects are possible that is the number and mass of the objects never changes then the total momentum and kinetic energy are conserved in that case but if in elastic collisions are possible where objects can stick together for appreciable periods of time or if they can lose mass or gain mass then only momentum will be conserved but again you have to be very careful with how closed and isolated the system is now another law that we encounter in introductory physics which seems a strange Beast compared to the other kinds of mechanical phenomena that we encounter in in these courses is the law of gravitation which governs the gravitational force between any two bodies with mass it acts without physical contact and it does so even across empty space and I’ve Illustrated that here by showing you the planet Jupiter which is the heaviest planet in our solar system and four of its moons the ones that were first spotted by Galileo when he turned his telescope to the night sky to see what he could see these are the so-called Galilean moons they’re the biggest moons of Jupiter Jupiter has many more moons than this but these are the four most visible the most easily visible even with a modest uh Aid to the eye and those are IO Europa ganam and Kalisto and the these four moons do an orbital dance around Jupiter they don’t orbit the earth they orbit this planet and this was a remarkable observation in the days of Galileo that you had objects in the night sky that didn’t go around the earth and they do this under the influence of gravity the same force that holds our moon in orbit around our planet and our planet and all the other planets of the solar system in orbit around the Central Star our sun it’s gravity gravity explains all of this stuff now the gravitational force that an object a exerts on an object B is proportional to the masses of both objects and inversely proportional to the square of the distance between them and this is codified in the law of gravitation that is the gravitational force between any two bodies so for instance the force on a that’s exerted by B is proportional to the product of their masses divided by the distance squared between them the constant of proportionality G I’ll get to in a moment but the force points from the object that’s acted Upon A toward the object that’s doing the acting B so it’s an attractive Force now again this is the force that a experiences exerted by B now G is this Universal constant of proportionality it must be determined by experimental methods and it’s currently known to be about 6.67 * 1011 new m s per kilogram squar not a very big number gravity may seem like a strong force but that’s because we’re being pulled on for instance by all the atoms of the planet Earth and that’s why when we try to jump off the surface of the planet Earth we get pulled back down to the surface so all the atoms of the earth below us are pulling back on us as we attempt to accelerate away and it re accelerates us back to the Center of the Earth but of course we don’t go through the surface of the Earth when we hit it why is that that’s because another set of forces electromagnetism governs the interactions between atoms and atoms tend to repel each other because they have clouds of electrons around them and the electrons have the same electric charge and in the laws of electromagnetism this causes a repulsive Force to occur and so while gravity may seem strong the truth is because we don’t get pulled through the surface of the Earth and down to the core of the planet is because of the strength of electromagnetism which overcomes an entire planet’s worth of atoms pulling on you now what’s worse gravity seems like a strong force but it’s not and also this this Force law doesn’t really tell us its origin it it has something to do with mass and it it w weakens or or strengthens depending on your distance squared between two objects it tells you what direction it points but it doesn’t explain what the origin of gravity actually is what is this Force where does it come from so one of the unsatisfying things about the law of gravitation is that it’s very descriptive but it is by no means explanatory and this was something that even Isaac Newton recognized and because he could provide no evidence to explain the origin of the force known as gravity he preferred not to speculate on it and left it open for the people that would come after him to try to figure out but it was certainly one of those puzzles he never managed to resolve in his lifetime and its resolution would be left until the modern era of physics now speaking of the laws of electricity and magnetism let’s take a look at those and I’m going to do so in a form that may not be very familiar to you but it will be beneficial to you later even if you don’t completely understand the notation now electric and magnetic forces have something in common with gravitation they can act without physical contact across stretches of empty space however it’s pretty much right about there that they part ways from Gravity there strength is proportional to a completely different physical property of nature electric charge which various bits of matter like the electron for instance appear to carry as a fundamental property now like Gravity the strength of say the electric or magnetic force appears to vary inversely with the square distance between charges or flows of charges and depending on the situation we’re talking about here but I can wave my hands sort of make that rough approximation a density of electric charge however is the source of the electric field of force Mass has nothing to do with the electric field of force it has something to do with the gravitational field of force but again this is roughly where gravity and electricity and magnetism all part ways now an electric current density that is a flow of electric charge is the source of a magnetic field of force so a static electric charge just sitting there in space will exert an electric force on another charge somewhere nearby but in order to get a magnetic interaction to occur one of those charges has to be moving relative to the other now I’m going to define a symbol it’s this funny triangular symbol known as nabla because it resembles a ancient harp of the same name it’s got a little Vector sign over it which immediately tells you that whatever this thing is it has directional information and it’s funny because it’s not made of numbers it’s made of derivatives and specifically it’s made with either the uh full or partial derivatives with respect to space so for instance the derivative of something with respect to X the derivative of something with respect to Y and the derivative of something with respect to Z this exposed triplet of derivatives is known as an operator it doesn’t itself return a number but when used on another thing like another Vector it can return a number so you can think of it as a function that when finally given something on the right hand side to act on will give you some information back but on its own it doesn’t really give you information it’s just prepared to tell you how something changes in space now you may not have seen this symbol before and that’s okay but by defining it it allows me to write the laws of electricity and magnetism so-called Maxwell’s equations in four compact mathematical lines now the laws governing these electric and magnetic fields are four in Number the first one is known as gauss’s law for electric fields and believe it or not from this this compact little equation here you can under special conditions derive kul’s law which is probably what you really learned was the law of the electric force in introductory physics there is a simple exercise one can go through to show that this reduces to kul’s law but this is the most preferred in general form of this particular law of electricity and magnetism and in English what it tells me is it tells me that a charge density that is a charge per unit volume row is the source of an electric field on the left hand side we have this operator I defined above which is just a triplet of space derivatives acting on an electric field via the action of the dot product so this thing returns a number and that number is equal to the charge density divided by Epsilon knot which is a constant of nature the second law is gauss’s law for itic fields and this one is probably the simplest of the four it’s that same operator action the nabla symbol with a DOT product with the magnetic field but on the right hand side you get zero and what this equation tells you is that so far as we know there are no such thing as a magnetic charge in order to create a magnetic field you have to have moving electric charge and so far as we know and many experiments have tried and many experiments have failed uh there is no such thing as a magnetic charge that’s what this equation codifies then there’s the Faraday Maxwell law the Faraday Maxwell law tells me that if I have a time-changing magnetic field this can generate an electric field now I have a different Vector operation on the left hand side I have this nabla symbol the vector cross product with the electric field which returns a vector and indeed I have a vector on the right hand side as well the time derivative of a vector field is also a vector and then finally there’s the Ampere Maxwell law and this tells me something a little bit similar to the Faraday Maxwell law and that is that if there’s a time-changing electric field or if there’s a current density of electric charge a flow of electric charge or both then this results in a magnetic field so the left hand side tells me that there’s a magnetic field that exists the right hand side tells me where those magnetic fields might come from either from a charge current density or from a Time varying electric field and mu KN here is another fundamental constant of nature Epsilon KN and mu you should have encountered in introductory physics and you can go ahead and look up their values now what’s amazing about the laws of electricity and magnetism Maxwell’s equations is that when you consider them in a particular situation it finally clarifies what the heck the nature of light is light is an amazing phenomenon it carries information from one place to another and it does so at a seemingly immense speed and it turns out that by solving Maxwell’s equations in a certain regime you find out what light is it’s a very rewarding exercise one that you would presumably go through in a more advanced course than this one but I’ll tease it here so for instance if you consider empty space where there are no electric charges no row no charge densities and where there are no electric currents no JS with the vector hat over the top of it um nonetheless Maxwell’s equations are not just simply all zero so let’s take a look at those equations under those conditions I’ve Rewritten the four equations with no electric charges and no current densities so I have this uh nabla e Vector is zero NAA B Vector is zero I have NAA cross e Vector is just negative dbdt and no across B is something proportional to the time derivative of e so there is a trivial solution to this e and B can be zero that works out just fine but there’s another solution to this that isn’t the so-called trivial solution and the non-trivial solutions are vector functions of space and time and this is what they look like the electric field and the magnetic field as a function of space and time that also satisfy these four equations are these time and space varying functions over here they’re cosinusoidal and they can all be written in terms of the electric field they describe some kind of oscillatory phenomenon oscillatory phenomena like waves are things you should have learned about in an introductory mechanics Class K hat here simply indicates a unit Vector that’s in the direction of travel of the phenomenon and this number c with a zero subscript that turns out to be the speed of the phenomenon in empty space because that’s the kind of space we’re considering here empty no matter no charges no currents and it turns out that you can solve for that speed and you find out that it’s equal to 1 over the square root of those fundamental constants of nature mu * Epsilon KN and if you plug those numbers in you get an amazing fact out of this that whatever this phenomenon is it travels at 2998 * 108 m/s and for the astute among you this is the speed of light so what Maxwell’s equations in empty space tell us is that when solved they describe a phenomenon that can travel from point A to point B seemingly through empty space and it does so at precisely the speed at which light was known to travel in the days when this was solved so light is what is known as an electromagnetic wave and like a mechanical wave which was the only analogy that physicists had at the time it was originally assumed that it must travel in a medium sound travels in air water waves travel in water they are distortions of a medium and so it was presumed that light must too be some kind of mechanical wave and that means that seemingly empty space couldn’t really be empty something’s got to be there that distorts to allow this wave to travel that was the Assumption based on mechanics now finally I want to go into the subject of Relativity which would have been introduced to you probably under the phrase relative motion in introductory physics you get some exposure to relative motion that is a person standing on a train the train is moving relative to somebody on the ground the person on the train throws a ball up in the air what do the person on the ground see that’s usually the way in which this is couched the person on the train for instance who throws the ball straight up in the air will see it go up gravity will accelerate it and eventually it will come straight back down into their hand so it just goes up slows to a stop and then accelerates down back to their hand all along a straight vertical line that’s what the person on the train sees a person on the ground watching this sees the ball follow a parabolic trajectory because the ball and the person have a horizontal velocity because they’re standing on the train so the ball goes up and comes down yes but it doesn’t land at the same coordinate along the horizontal that it started at it appears to follow a parabola and so the two observers will disagree on the motion of the ball the person on the train says no no no it goes straight up and then comes back down to my hand and the person on the on the ground says well no it didn’t go straight up it followed a parabolic trajectory but your hand moved too and so it was there to catch it when it came back down and it’s possible to use mathematics to relate these differ observations of space and time uh and to do this you assume that time passes the same for all observers the person on the train and the person on the ground all experience time the same way and when you make that assumption you get out of this something known as as the Galilean transformation that allows you to relate spatial coordinates and velocities of objects from a frame you consider to be at rest to a frame that you consider to be moving so in our case you might consider the platform or the ground next to the train to be the rest frame you might consider the train to be the moving frame and these equations shown down here will relate coordinates velocities and times in the moving frame with the primes next to them to things in the rest frame the numbers without the primes attached to them okay so that’s not so bad it’s actually one of the more complicated things that most students encounter in introductory physics because it forces you to think in two different frames of reference and this is not always as straightforward as it seems but the math itself is not that bad it’s more the conceptual issues that go along with this that that pose a particular challenge for most people who see this the first time so that is basically a summary of of what we now call classical physics introductory mechanics and the laws of electricity and magnetism or semester 1 and semester 2 physics and even though classical physics is challenging there are many difficult things that you have to do there’s new math you haven’t seen before you’re often learning calculus at the same time you’re expected to use calculus in introductory physics nonetheless at the end of the day if you stop and look at all of this stuff you’ll often say okay the mathematical or some conceptual difficulties side all of the stuff feels to me very intuitive I can throw a ball up in the air I can catch it I can watch somebody do that in a train and see it moves in a parabolic Arc okay yeah we disagree on on what’s happening but we can explain to each other why we see what we see it’s all very you know normal day-to-day human scale stuff really this is intuitive it just had to be described by mathematics and that that often is the difficult part but you have to be very careful about intuition intuition is largely based on experience with events that involve the following things speeds that turn out to be very close to zero you know driving at 70 M an hour may seem really fast to you as a human being or getting on a rocket ship that goes into Earth orbit might seem really extreme and they are for human beings but compared to the fastest known phenomenon in the universe which is light 2998 x 108 m/s 70 M an hour seems pretty pathetically slow and in fact is so close to zero that from the perspective of light it might as well be nearly at rest not very impressive to light so you have to be careful one because the speeds that you’re used to encountering are really close to it turns out zero and so your intuition is built on a very narrow spectrum of experience in the universe the other thing that you may take for granted is that the sizes of things that we usually think about In classical physics with the exception of electrons and protons and electricity and magnetism the sizes of those things tend to be very large by comparison to what are known to be the building blocks of the material universe and for the stuff around us that’s mostly going to be atoms that’s the day-to-day stuff that we are interacting with but when you interact with a table that table has like avagadro number worth of atoms in it that is a huge number of atoms and the scale of the structure built from those atoms is vast by comparison to the atoms themselves and so as a result as we begin to encounter phenomena and this was true of physicists at the end of the 1800s as you begin to encounter phenomena that are very fast or very small so objects moving very close to the speed of light or objects that are really more at the atomic or even the subatomic scale the things that make up the atoms you begin to find that classical physics needs to be be modified to describe the universe more completely it works for slow things at large scales like human scales or plane size scales or even bigger but it breaks down in regimes where it was never designed to operate the very fast and the very small so as a result you’re often going to find as you go into modern physics that what you think to be true about the universe is based on intuition from a limited set of experiences in the cosmos and as a result your intuition is actually fundamentally wrong but the good news is is that this only means that you are finally finally experiencing the breadth of the universe all it has to offer at all of its scales in speed and size rather than that limited scale of phenomena closer to Human Experience so let’s use classical physics and let’s make some predictions to set ourselves up for where people started to go really wrong with these ideas in roughly the late 1800s now the tenets of classical physics which I can summarize based on the earlier part of this lecture are encoded largely in Newton’s laws and Maxwell’s equations and they should if this is all there is to the universe apply to all phenomena in the natural world after all if this was really the complete set of all the laws of nature that had been disced discovered in the 16 and 1700s that it must be true that they describe everything otherwise they’re not a complete set of laws so let’s take a look at light what would the framework of classical physics then insist be true about light well from Maxwell’s equations we know that light is some kind of oscillatory phenomenon like a wave and so our experience with waves in the 1800s was that they must be mechanical in nature they must represent the Distortion of a medium so they gave it a name they named it before they ever discovered it and they called it The Ether and it was believed to be the thing that actually fills empty space empty space isn’t empty it’s made of this substance called The Ether that we normally can’t experience but light experiences it and the Distortion of the ether is what we call light that was the hypothesis based on the mechanical understanding of wave phenomena so the speed of light in so called empty space the number that we got from Maxwell’s equations that isn’t really the speed of light in empty space it’s the speed of light measured relative to an observer at rest with respect to the ether The Ether is the universal reference frame for light and if you can be at rest with respect to the ether then you will observe that light moves at 2998 * 10 8 m/s it’s a big number okay but this would then make ether the universal rest frame that is the the frame that you could Define to always and absolutely be at rest and then everything else is in motion relative to it that would be awesome the Galilean relativistic and Newtonian mechanical view of the universe would have allowed something like this to exist now the problem was that sort of the new kid on the Block Maxwell’s equations which really only emerged in the you know second half of the 19th century they were silent on the topic of The Ether they described no substance that required this electromagnetic wave called light to propagate so it was assumed that they must be incomplete that the new kit on the Block they’re probably not complete they need to be completed and The Ether would complete them so it was assumed that Newton’s mechanical view of the universe the laws of motion and all that stuff that that was correct but that Maxwell’s equations was just incomplete and needed needed to be completed with this mechanical substance The Ether so if we then apply this thinking to a problem involving light and travel and time what would we predict let’s put ourselves in the role of sort of late 19th century physicists we’ve learned all this stuff it’s been solid for 200 years so what are we going to predict so let’s do a thought experiment a thought experiment is a kind of experiment that you can carry out entirely inside of your head what you do is you imagine a scenario you analyze the scenario using the understood principles of nature or laws of physics and you look to see if the conclusions of running this imaginary experiment would in any way violate logical or physical consistency and if you determine that that’s the case you may have hit upon a useful inconsistency in our our understanding of nature that could then be used to figure out what the correct description of nature might be so to do our thought experiment let’s imagine that we are in a space that is filled with ether the medium in which light traveling as a wave disturbs the medium and propagates at 2998 * 108 m/s now imagine into this volume of ether we place two cars one car at the left one car at the right and the car at the right has its headlights aimed at the car on the left so that an observer in the car on the left could look back out the window and if the headlights of the car behind them were on they should be able to see the light but let’s put a 30 km gap between the front of the right car and the back of the left car so that light if it wants to go from the car on the right to the car on the left has to cross this gap of 30 km okay fine so we’ve placed the cars in The Ether the cars are at rest with respect to the ether so they’re in the frame of reference of The Ether and the car on the right switches on its headlights how long does it take for an observer in the car on the left the second car to see the light reach them well this seems pretty straightforward right you know the distance it’s 30 km from where the light leaves the right hand car and arrives at the left-hand car and Maxwell’s equations tells us that light travels at a fixed speed it doesn’t say anything about The Ether but we’ve invented The Ether to help us to have electromagnetic waves comport with all prior knowledge of mechanical waves so it’s a medium with mechanical properties that can stretch and squash and those stretchings and squashing are electromagnetic waves and in that medium light travels at 2998 * 10 8 m/s okay everything’s at rest with respect to the ether light travels at the speed of light in ether so we just run the numbers we take the distance we divide by the speed and we get the time that is required to make this journey and we find that that time comes out to be about .1 milliseconds 1 * 10-4 seconds okay nothing hugely revelatory here but let’s take our thought experiment One More Level forward now let’s imagine that both cars have been plopped into this ether volume and they accelerate at the same time up to a constant velocity of half the speed of light that’s a speed of 1.5 * 108 m/s and let’s imagine that the cars are both moving together at the same velocity from right to left so they’re traveling from the right to the left in The Ether at all times they maintain a fixed distance between the front end of the right car and the Observer at the back end of the left car of exactly 30 km the car on the right turns on its headlights now how long does it take the light to reach the observer in the other car well let’s review what we think we know about light speed and this so-called ether that distorts to allow electromagnetic waves to propagate light travels at C the number given by Maxwell’s equations 2998 * 10 8 m/s in the rest frame of The Ether but now from the perspective of the cars the ether is a wind that’s rushing past them still air on a calm day leaves no sensation on your body but if you were to start running forward you would perceive a wind hitting you in the face and that’s sort of the equivalent situation here both of these cars are now traveling through the ether they’re doing so at half the speed of light and so from their perspective The Ether is rushing past them as a wind and its speed is also half the speed of light it’s as if they perceive themselves to be at rest and The Ether to be rushing past them at half the speed of light so the velocity of this wind is the negative of their velocity with respect to the ether now Gile relativity and Newtonian mechanics demand that from the perspective of observers in the car that the light that leaves the car on the right while it’s traveling at 2.99 x 108 m/s in the rest frame of the ether is encountering this wind of ether that has the apparent effect of slowing it down this is sort of like sound waves or water waves in their respective media if the medium is moving then the medium’s speed can add or subtract from the velocity of the wave in that medium and so Galilean relativity and Newtonian mechanics are going to demand that the observed speed of light in the frame of the cars is the speed of light in the rest frame of The Ether minus the velocity of the cars and so you would actually see the light leaving from the right hand car and traveling the gap between the right hand car and the left-hand car and what seems like a slowed speed as if it’s encountering resistance as it moves forward it’s not moving at 2998 m/s anymore it’s moving at about half that and so you would answer that well the distance between the cars is still the same it’s 30 km and the speed of light has been reduced by The Ether wind and so you you would predict based on all knowledge at this stage that the time it takes for the light to get to the other car is greater than it was before it’s about2 milliseconds now twice the time that was required when the cars were at rest with respect to the ether now that’s a prediction and it comports with all prior experience in the pre-20th century World it comports with ideas about how velocities add in relative motion it comports with the idea that waves can only travel because they distortions in some kind of medium a mechanical explanation for waves that’s consistent with Newton’s mechanics all of this seems to be perfectly acceptable from the perspective of the Bare Bones introductory physics to which you would have been exposed but a fair question to ask is this is the outcome of a thought experiment what would be observed in a real experiment in the real world and we’ll take a look at that so let’s review the basic ideas that are the foundations for modern physics the groundwork for modern physics are Newton’s mechanics the concepts of energy and momentum quantities associated with motion that can be conserved under certain conditions the law of gravitation and the laws of electromagnetism however these were largely built to describe phenomena that comport with typical human experiences phenomena at our size scales or slightly larger or smaller essentially within our ability to see the world around us including with a mic microscope or a telescope that would all be within the human scale um the exception however is Maxwell’s equations they were developed by studying electric charges which are very small and they are really beyond the scale of everyday experience except in their large scale macroscopic effects like electric and magnetic forces electric currents lightning strikes refrigerator magnets things like that they have these big macroscopic effects that feel familiar to us but but at the individual level of a an electron let’s say things are not typical compared to the human world by the end of the 1800s chemists and physicists were beginning to directly interact with scales that really were Beyond Human Experience so for example the electron is discovered in 1897 and it turns out to be the first subatomic particle although that really wouldn’t be fully understood for several more decades in addition an invisible radiation like for instance what we now call x-rays this was discovered at the end of the 1800s and 1895 in the case of X-rays and these phenomena and other phenomena at the same scale even atoms themselves or other General forms of light they turn out to be Way Beyond Human Experience and so trying to adapt our intuition in the form of Newton’s Mechanics for instance to these phenomena would lead to spectacular fails now not only were such new phenomena small they also turned out to be capable of moving extremely fast x-rays move at the speed of light electrons with minimal effort can be compelled to move at almost the speed of light such speeds are also very much Beyond human day-to-day experience although you might lead yourself foolishly to think that you understand them really well so this concludes a foundational lecture a review of the material you should have been exposed to already in semester 1 and semester 2 physics I know that I’ve couched this in some ways that are unfamiliar but I’m trying to Rattle you out of any complacency you might be in after having had a couple of introductory semesters of physics and we’re going to begin to explore the consequences of these classical physics predictions on phenomena like light in class and then we will build on what we conclude from those Explorations into the first steps of modern phys [Music] physics in this lecture we will learn the transition in thinking that led from Galilean relativity to the special theory of relativity in 1905 we will learn the postulates of special relativity which are the basis of the mathematics of the framework and we will look at some of the consequences of those postulates even before we delve into the mathematical framework itself in class we looked at the lessons of the Michaelson Morley experiment which can be summarized ized as follows first light travels at a fixed and constant speed in any medium regardless of the relative velocity of the light source and the light Observer this is unlike any other phenomenon described in mechanics and it implies that Newton’s mechanics is actually the incomplete theory of nature no medium is actually required for that light to propagate unlike a mechanical oscillatory phenomenon a wave to exist light requires no medium to be distorted it is not mechanical in origin and this implies that Maxwell’s equations are complete or at least sufficiently complete to understand light these lessons however would not be fully absorbed until about 1905 when Albert Einstein one of the most famous physicists in history published the definitive papers explaining how to reconcile mechanics electricity and magnetism and the results of the Michaelson Morley experiment now interestingly the mathematics that Einstein would come to rely on for encoding the relationship between space and time measurements in one frame and space and time measurements in another frame were actually laid down much earlier by Hendrick Loren in a famous paper on the compression of bodies in The Ether the mathematics that would later become a replacement for the Galilean relativity equations would actually be kind of sketched out but for a completely different purpose than they would ultimately be used for Loren was considering the effects of The Ether on bodies that are moving through it now these bodies are held together by chemical bonds they’re made of atoms and those atoms are chemically bonded to each other but chemical bonds are just electromagnetism in action and So based on this he arrived at a few hypotheses should The Ether exist first that mechanical bodies would compress along the direction of motion in The Ether and this has a precise mathematical description for the process and second in transforming observations from The Ether frame to other frames of reference he would conceive of an alteration of time that also had a very firm mathematical description now Loren conceived of this during a period when The Ether was still very much believed to exist the results of the Michaelson Morley experiment were not fully digested during this period the ether’s existence would ultimately be disproven or at least shown not to be necessary to explain anything that was then known about nature in the decades that would follow this work but the mathematics laid down by Loren during this period would still prove extremely useful and today we know this as the Loren’s transformation the replacement of the Galilean transformation from frame to frame we’ll come back to that in a later lecture let’s talk about Albert Einstein and his miracle year of 1905 so Albert Einstein in 1905 was a young PhD physicist who was laboring doing physics as sort of side work in what was otherwise supposed to be his regular work at the Swiss patent office in Burn Switzerland he had this job because he was unable to secure for instance a faculty job after completing his PhD and in part this was because Einstein really couldn’t get any recommendations out of any of the professors that had supervised his education because he had so irritated them with his behavior during what we would consider graduate school including skipping out entirely on classes uh in particular for instance math classes for mathematics he didn’t consider to be physically useful uh and also for challenging his professors challenging their Authority thinking of them as idiots and so forth now Einstein was a very bright young man but he was also a bit arrogant and temperamental and this didn’t do him any favors when he was trying to get a job now ultimately it was the thinking that culminated at the end of his PhD work and then into the years leading up to 1905 that would lead to a change in the way that the community of physicists thought about the supremacy of the assumptions made in Newton’s mechanics versus what the laws of electromagnetism that is Maxwell’s equations had to say about light and space and time and in 1905 he published the work that had resulted from his PhD research in a series of about four papers and this was his so-called miracle year this is a highly productive year for a young and relatively unknown physicist in this day in doing so he reframed assumptions about space and time and what is and what is not invariant to all observers and all frames of reference recall that in the neonian and Galilean view of

    space and time time is experienced the same way by all observers regardless of their relative states of motion time would be referred to then as an invariant but what Einstein proposed challenged thinking about what was and what was not invariant in space and time now it in short here’s what Einstein did he accepted the conclusion of the Michaelson Morley experiments that light has a fixed speed regardless of the motion of the source relative to The Observer of the light from the source this then implied that there’s no ether as well using a simple thought experiment like the one that we did in the foundations lecture involving car headlights and The Ether he explained also why time is not absolute even in Newton’s mechanics time itself self is not an invariant concept and he did a quick thought experiment that showed that it wasn’t even true under Newton’s way of thinking and so he was free to abandon time as the constant thing in Transformations from one frame to another frame instead he chose to preserve overall the forms of the laws of physics and the speed of light which the Michaelson morly experiment implied was constant regardless of your state of motion this then led to the foundation of two postulates that allowed him to then Define all the mathematics that would follow the first postulate is what I hinted at a moment ago the forms of the laws of physics that is f equals ma for instance or Maxwell’s equations will be the same for all observers regardless of their state of relative motion that is their frame of reference the second postulate is that the speed of light is the same for all observers regardless of their frame of reference their state of relative motion now let’s begin by breaking down the concepts that we need to dig into so that we can really understand where all of this is headed we need to take these postulates and parse them into some phrases and words Define those things and then go forward from there this will allow us to build back up to a more complete understanding of the math that we’ll eventually need in order to understand relativity and relative motion going forward in modern physics first off there’s the word event you might think you know what this is but in physics it is given a very precise definition so that we can always try to define the concept mathematically so that everyone can agree on what an event is and what an event is not another phrase that’s deceptive and may seem to have a common definition for you but where we have to be careful about this in physics is the phrase frame of reference we need to Define it it comes up a lot in our discussions and because descriptions of events can depend on the frame of reference in which the observation is made we have to carefully Define this concept simultaneity is another word probably the one that causes the most consternation among people who are making the transition from introductory mechanics and electricity and magnetism into modern physics because simultaneity has probably been implied in a lot of things in the past but we have to put it on some firm footing conceptually here so that we can use it and explore it going forward it turns out that the concept of simultaneity is actually essential to many things you take for granted all the time you’ve just never been forced to think about it before this concept turns out to be a subset of the discussion of events and it’s going to play a very important role so we’re going to have to Define this and then finally you might think you’re comfortable with this idea but the phrase speed of light would benefit from some context and some description we should really try to understand the number that is behind this phrase It’s a ridiculously large number compared to most things on the day-to-day human scale but it actually turns out that this speed is only impressive on the scale of things that are roughly the size of planets I’ll even allow solar systems uh and maybe smaller depending on how you define a solar system but it turns out the speed of light is not as fast as we would like it to be um and certainly on the scale of things like the entirety of the universe it is pathetically slow so let’s get started in the next few slides trying to Define each of these things very carefully first of all let’s talk about the concept of an event an event is quite simply anything with a location in space and time so let’s practice this concept I will show you an event and I want you to try to describe it with words and numbers go ahead and pause the video while you’re doing this when prompted see you can come up with a short sentence that describes the event using the definition that an event is anything with a location in space and time this is excellent practice for defining events in any new situation that you will encounter as a exercise in setting up a problem for eventual solving I’ve given you a one-dimensional axis so an X AIS and let’s say that the numbers here have units of meters that’ll make it easy for us to very precisely describe an event I’ve also given you a timer the timer is capable of ticking out about 12 seconds and the units on each of these tick marks 1 two 3 and so forth are seconds okay so you have a spatial reference and you have a time reference given that information let’s go ahead and proceed with looking at an event and attempting to describe it I want you to describe the event depicted above on the x-axis go ahead and pause the video come up with a short sentence that uses the definition of an event to describe it and then resume the video when you’re ready to compare to my answer you should have come up with something like the following the dot is at position xal 0 m at time tal 0 seconds that’s an example of describing an event in physics the dot is at a spatial location that is defined at a time that is also defined x and t space and time locations if you didn’t feel comfortable doing this now that you’ve seen me go through it once let’s try another event you try to describe it and let’s see what you come up with describe the event depicted above now on the x-axis go ahead and pause the video write down a short sentence that uses the definition of an event to describe this event and then resume the the video when you’re ready to see what I came up with so what I decided to do is to describe this as follows the dot is at position x = 2 m at time T = 2 seconds make sure to check your space reference and your time reference when presented with an event so that you correctly Mark in say x and t or XYZ and T the coordinates of an event an event is something that has well- defined coordinates in space and time a location in space and time let’s now talk about a frame of reference a frame of reference is any object or system all of whose Parts move at the same velocity with respect to an agreed upon reference point in space that’s quite a mouthful let’s go ahead and illustrate this with an example I want you to consider the three objects shown below labeled black a black dot blue a blue dot and red a red dot now one of them the black dot is agreed upon by the others the red and blue dots as the common reference point for all measurements now as I’ve depicted them here blue and red have an Associated velocity Vector shown here and as depicted red and blue are in the same frame of reference because they have the same velocities let’s check that if I roughly eyeball the length of this Vector it seems to be pretty similar to the length of this Vector so from this I could conclude that very likely blue and red have the same speed with respect to black but velocity is not just speed it’s not just the magnitude it’s also the direction and here I see that the directions align they Point parallel to each other and so I conclude from this that they have the same velocities therefore although blue and red are both moving they are moving in the same way with the same velocity they have the same state of motion and therefore they are in the same frame of reference now take a look at this one I’ve changed something here does this change alter the conclusion about blue blue and red do the Red Dot and the blue dot share the same or different frames of reference pause the video here look carefully at the image and then resume the video when you’re ready to hear the answer the answer is that they do not although their speeds are the same the lengths of those two arrows look pretty much identical the direction of the mo notion of the Red Dot relative to the Blue Dot and all measured with respect to the black dot has changed this means that they have different velocities and different velocities means different states of motion and therefore different frames of reference have now emerged here the blue frame of reference is no longer the same as the red frame of reference now I want you to consider the objects in this picture blue red and now a purple dot all of their velocities are measured with respect to the black dot as the reference point that hasn’t changed I want you to practice a little bit more and I want you to think about how many unique frames of reference you can identify in the above picture go ahead and pause the video here I’m not going to provide the answer here because I really want you to try to step out on a limb on this one but feel free to talk to me as the instructor outside of class or in class if you’re not confident in how to determine the answer to this question now let’s visit the concept of simultaneity simultaneity is a subset of events in which two events or more are said to be simultaneous that is to possess of this quality simultaneity if they are observed to occur at the same moment in time this seemingly straightforward definition of the concept should not fool you you have to think really hard about whether events are actually simultaneous and if there are multiple observers in different frames of reference for whom are those events simultaneous finally let’s look at the speed of light and let me be clear about the speed of light light it is the number of meters that light can travel once it’s been emitted by some kind of source in a certain amount of time that’s just the old definition of speed but light is special it’s special because the Michaelson Morley experiment tells us that no matter the state of motion of the Observer or the emitter of the light all parties will agree that when they measure the speed of that phenomenon in any frame of reference it always comes out to be the same number 2998 * 108 me per second at least an empty space now the history of the speed of light is interesting it can be cherry-picked through to take a look at what people try to do to measure the speed of this phenomenon because it is ridiculously fast now Galileo galile famously claims to have attempted to measure the speed of light by uncovering a lantern having an assistant on a distant Hill who in response to seeing the light from Galileo’s Lantern then uncovers one of their own and then G Alo upon seeing the assistance Lantern light records the time for the round trip taking into account human reaction time it turns out of course that light moves way too fast for this to work with 17th century technology even if Galileo used the most precise clocks of his day which he had invented water clocks there’s no way that even given 40 50 or 60 miles of distance between him and his assistant that that technology would have been sufficient especially with really slow human reaction times to in fact measure the speed of light so this was kind of a lost cause but a clever technique nonetheless and one which can successfully be used to measure the speed of sound another important person in the story of the measurement of the speed of light is Ol RoR now he would go on to use the period that is the time it takes to complete one cycle of Jupiter’s moon a which had been discovered by Galileo using the telescope and and by looking at its cycle of eclipses by Jupiter to then make the first reasonable determination that light travels in finite time he did this in about 1676 revisiting his data in a modern context suggests he shouldn’t have been as accurate as he was in measuring the speed of light but he actually got fairly close to the currently accepted value certainly impressive for its time uh impressively close to the currently accepted value of the speed of light but one could definitively conclude from his work that light does not travel instantaneously from place to place rather it takes a finite amount of time to cross space even if it does so very quickly now by the time of Albert Einstein’s Publications the speed of light had been established by multiple experimental methods to be within about 50 km/s of the Precision of today’s methods and that is is remarkable for such a large number representing such a incredibly high speed so let’s then take a look at the Modern speed of light and the number that is the currently accepted calibrated value of this speed today and I say that because the definition of things like the meter are based on the distance that light travels in a certain amount of time so based on the current definition of the meter and the second the speed of light is defined to be exactly 299,792,458 m/s or about 2998 * 10 to8 m/s A good rule of thumb something that will Aid you whether you’re thinking about how long signals will take to propagate in electronics or if you’re thinking about how long it will take for a light signal to propagate across some space for a communication system or something like that A good rule of thumb is that light travels roughly 1 foot in one billionth of a second that it goes that is it goes one foot per nanc that’s a handy little thing to remember for engineering purposes going forward now let’s begin to look at the consequences of the postulates of special relativ and I say special because there’s a more general theory of relativity a more general theory of space and time that Einstein would spend another decade working out after 1905 what makes the early theory of space and time that he developed special is that it focused on what are called inertial frames of reference those in which there are no net unbalanced forces now that doesn’t mean that accelerations can’t be present but it is a special case of a more general theory of reference frames space and time now under this special condition an object in motion will appear to all observers in all frames to have a constant velocity even if observers in different frames disagree on the magnitude and direction of the vector so let’s recall his postulates again in light of this special condition for the frames of reference that we’re talking about here postulate one is that the forms of the laws of physics are the same for all observers regardless of their state of relative motion that is regardless of the frame of reference in which they find themselves we’ve looked at the definition of the terminology frame of reference the second postulate is that the speed of light is the same for all observers regardless of of their frame of reference all observers no matter their relative state of motion when they measure the propagation speed of light signals will always find and this is based on experimental observation that light travels at the same speed in every frame of reference even if that frame of reference is moving with respect to the source of the light this is taken to be the thing that is invariant from one frame of reference to another frame of reference not time the speed of light now let’s look at some of the consequences from these postulates starting with the first postulate so the consequences of the first postulate are both straightforward and a little surprising so one of the conclusions you can draw from the first postulate assuming that it’s true is that all physical laws like Newton’s laws or Maxwell’s equations will all all have the same observed form in all inertial reference frames now this is pretty helpful actually because what it means is that regardless of our relative states of motion the basic laws of physics that we can uncover by doing experiments observations of the natural world are not dependent on your current state of motion the moon goes around the earth so from our perspective the moon appears to be moving but the law of gravity has been tested on the Moon by dropping objects there we see no difference between the law of gravity on the moon and the law of gravity on the earth despite the fact that we are definitely in relative motion to one another this has been tested more precisely than just dropping things on the moon but the basic conclusion is that this postulate holds and as a consequence of that basic laws of physics can be determined regard regardless of what your state of motion actually is but this consequence has a flip side it’s impossible based on determining the laws of physics by making observations in your frame of reference to determine whether or not you are actually in motion the analogy I like to make for this one is is being a little sleepy on a train if you’ve ever been on a light rail car or a real passenger train you’ve been a little tired you’re sitting on the car waiting at the station for the train to leave and another train is parked next to you you might doze off for a moment while sitting there looking at the other train and then you might wake up and during the time when you were slightly unconscious your train began to move with ever so slight an acceleration you started to gain some velocity and so when you wake up you’ve missed the fact that there was an acceleration in your frame of reference that caused you to start moving and you might look out the window and see the train next to you moving past you and draw the conclusion that the other train is pulling out of the station you conclude therefore that you’re in you’re in the rest frame with respect to the Earth your train is standing still because you feel no forces and the train next to you is moving but then suddenly you reach the end of the train next to you and you realize that your train is the one moving with respect to the ground and that other train was sitting still the whole time you had no way of knowing that you were actually the frame in motion with respect to the Earth because there were no cues and there’s no experiment you could have done in that 30 seconds while you’re passing the other train that would have definitively told you you were moving and the other train was not or that the other train was moving and you were not and that’s one of the consequences of the first postulate there’s no way to measure even the most fundamental statements about nature the laws of nature and figure out that you are moving and not something else so as a result of this postulate it has to be concluded that there is no such thing as an absolute state of rest or an absolute state of motion all motion is relative all Motion in nature is relative to a reference point you have to pick what that reference point is and depending which one you pick may change the degree of your state of motion or the state of motion of the other frame of reference all motion is relative as a consequence of this postulate there is no experiment that could be done if this postulate holds forever that would tell you that you were moving and something else wasn’t or vice versa now let’s look at the consequences of the second postulate the speed of light is the same for all observers regardless of their frame of reference now the consequences of the second postulate are typically more surprising to a general audience of individuals who start really thinking about this for the first time on their own and these conclusions tend to put most people well outside the comfort zone of typical Human Experience so let’s take a look at these so all observers agree that light moves at a fixed speed this is a singular and variant independent of states of relative motion now that’s already a bit freaky in the sense that you could be driving in a car at 70 M an hour and switch on your headlights and somebody on the side of the road standing still with respect to the Earth measures the speed with which the light from your headlights passes them and they measure 2.99 * 108 m/s exactly not 70 M hour faster exactly the speed that it would travel in empty space if it were emitted from rest and you in your frame of reference could get out on your hood and do some very careful experiment to measure how fast light is moving when it’s emitted from your headlamps and you would draw the same conclusion that the speed of the light is exactly that number from a few slides ago even though uh the person on the ground sees you and the source of your light is moving they still measure the same speed of light you measure that is freaky that some how light is immune to a state of motion of the emitting Source but that is an observational fact it may be freaky but it’s also reality and that means you need to rethink the universe at a fundamental level particularly rethink space and rethink time and so as a consequence of this observational fact of nature the belief that humans typically hold that say time or space or both are experienced in the same way by observers in different states of motion has to be completely abandoned if we are to hold the speed of light constant in all frames of reference you have to abandon the absolute nature of for instance time time passing will be experienced differently by observers in different frames of reference so as a result of this postulate there’s just no such thing as an absolute measure of time or an absolute measure of space I mean we already have to abandon the notion of an absolute frame of reference in space from the first postulate but in the second postulate we also find that we can’t hold on to this seemingly intuitive belief that time passes at the same rate for all people regardless of their state of motion measurements in one frame of reference regarding space and time distances need not agree with measurements in a different frame of reference but all observers will agree that light signals travel at a specific and fixed speed independent of the relative states of motion so really special relativity is not so much a theory of what is relative it’s a theory of what is invariant between observers in different frames of reference and it allows us to define a mathematical framework to figure out how to relate our observations so let’s take a look at the relative nature of time briefly using a variation on Einstein’s thought experiment or as he called them gunan experiments gunan from the German for thoughtful or mindful what freed Einstein to write down the postulates ultimately of what we now call special relativity was his ability to be able to abandon Newton’s old idea of absolute time that is time that passes the same way for all observers regardless of their state of motion it was this thing that was really a key moment for Einstein of insight a moment when he he relates that the damn kind of broke at that moment in his mind and he was freed to draw the conclusions that ultimately LED down the correct path to the correct description of nature so let’s take a look at a variation on the gonan experiment that he felt liberated him from the sort of tyranny of absolute time that had been passing down through the generations as an assumption that turned out not to be true so while riding on a street car in Burn Switzerland where he worked as a patent clerk Einstein began to think more carefully about what it meant to know the time by observing the Clock Tower so shown here on the right hand side of the slide is a picture of the burn Clock Tower and here you can see the the tram lines in the street that likely carried the street car on which he was writing at the time when he finally had one of his moments of insight into this question what does it mean to know the time by observing the clock tower well we’re going to do a modern version of his thought experiment because analog clocks are not as common as they were in his day time is the measure of distance if you want to think about it that that way between events that occur for example at the same spatial coordinates so imagine not a analog clock on the face of this clock tower but rather a large blinking light and when the light is on that marks a moment of time it’s an event it has a location in space and a location in time and then the light goes out and then it comes back on later in the same position in Space the gap between the two blinks is what we refer to as a duration of time and we could use that gap between these regular blinks of the light to define a standard unit of time whatever we choose that to be the second for instance now Einstein realized that the way you know that time is passing is you see these two events but to see these two events you need to receive light from the blinking light and light has to travel through space so if you’re on the street car and the street car is moving away from the clock tower the light from the clock tower has to travel from the tower to your eyes so you see the blink after it’s actually occurred but in your frame of reference in the street car it’s the arrival of the light that tells you that a moment in time an event has occurred and then you wait for the next blink to occur but by then the street cars moved a little further away so the light has to travel a little bit further and that takes a little bit longer and so in your frame of reference in the street car time appears to be slowing down and this is just using a Newtonian view of the Universe I haven’t even invoked the postulates of special relativity here this is just the simple fact that light has to travel across a distance and it does so in finite time and the time intervals are stretched by moving away from the clock tower because light has to catch up to you so even in Newton’s view of the universe time measurements cannot be absolute as a result of this so imagine two observers that are using a blinking light to measure time they agree that the blinking of this light is how they will Define their Standard Time units now one of The Observers is at rest on the ground with respect to the source of the light maybe standing right next to the blinking light and the other is on a super Train That’s racing away from the light source and it’s doing so at a ridiculous speed half the speed of light so the two observers agree to count how many blinks occur while the super train makes a journey of 2 million miles now I chose that because this is about how far light can travel in 10 seconds now on the ground The Observer at rest with respect to the blinking light counts 10 blinks during the journey each blink being 1 second apart but for the observer in motion not all of those 10 blinks will have had time to reach the super Trin by the time it arrives at its agreed upon destination it will have marked off fewer observed blinks from the light and thus an observer on the train would rightly claim that less time was required than the 10 blinks that the person on the ground saw to make the journey two observers disagree on how much time has passed using a common reference point so even in Newton’s view of space and time the notion of an absolute time measurement is just not correct now this thought experiment is essentially based on an optical effect you could even say it’s based on an optical illusion the transit time of light through space but nonetheless because it already using a Newtonian view of the universe disproves that there is such a thing as a notion of an absolute unit of time that passes the same way for everybody this completely then frees a thinker from abandoning the concept of absolute time as a necessary tenant of reality so the speed of light is the same for all observers regardless of their frame of reference and since space and time displacements are not experienced the same in frames with different relative states of motion even based on this optical illusion based thought experiment observers at rest looking at an object in a frame of reference that’s moving with respect to them will observe that that object is contracted in length along the direction of motion now we will firmly see that when we explore the Loren transformation for relating observations in one frame to observations in another frame but already you could have concluded that since it’s the speed of light that remains fixed not time or something else that you’re going to have to give something in this process and what you find out from all of this is that objects in motion from the perspective of people who are in the frame that’s agreed upon as being the rest frame will be observed to shorten along their direction of motion this is known as length contraction so hang on to that phrase because it will come up over and over and over again it refers to this phenomenon of spatial measurements from the perspective of observers at rest looking at the moving frame getting contracted in the moving frame now observers in motion relative to other observers will will also experience a slower passage of time it’s not an optical illusion that you need to use to explain this it’s a physical change in the experience of time itself no Optics required to explain the phenomenon it simply is a behavior of time that for objects in motion relative to other observers if they stop moving and then compare their clocks to people on the ground they’ll find out that they have experienced less passage of time than their colleagues who remained in what was agreed upon to be the original rest from frame this is known as time dilation that time slows down in a moving reference frame relative to a frame that’s at rest now it will be a lot easier to appreciate the degree of these consequences as we actually explore the postulates of Relativity in class and then in the next section of this class look directly at the Loren transformation which is the correct way to relate observations between frames of reference so so I want you to get these Notions of terminology down you don’t necessarily have to agree that this is what happens right now because I have done no math to prove to you that this is possible and I’ve certainly shown you no experimental results to tell you that this is what happens but for now look at the terminology and understand what a length contraction or a time dilation is so that we can carry that terminology forward with us so to review what we have done in this lecture we have learned about the following things we’ve learned about the transition in thinking that led from Galilean relativity to the special theory of relativity in 1905 we’ve learned about the postulates of special relativity which are the basis of the mathematics of the framework and further we’ve started looking at the consequences of those postulates from the fact that it’s impossible to tell from looking at the laws of physics in different reference frames that a given frame is in motion relative to any other all motion is therefore relative and also that different observers in different frames of reference while they’ll all agree that light moves at the same fixed speed regardless of their relative states of motion they will disagree on the lengths of objects and the durations of time that are passing in different frames these consequences will carry forward into the next section of the course a discussion of the Loren transformation and preview the conclusions that we’ll draw from the correct mathematics that relates observations from one frame of reference to another frame of [Music] reference in this lecture we will learn to appreciate the Gan transformation and it’s built in assumptions uh decent understanding of the past will help us to set the stage for the present we’ll learn a way to derive the form of the correct transformation between frames of reference respecting the postulates of special relativity and we’ll learn how to begin applying this transformation and see that it is in fact consistent with the postulates of special relativity it does end up being being entirely self-consistent and it gives us a basis for making predictions about the natural world predictions that can be tested the Galilean transformation was predicated on two assumptions and these assumptions may not have been made very clear when you were originally learned about this transformation for observers in inertial frames of reference that is frames of reference in which all observers agree that objects in motion are moving at constant velocities time is assumed to pass in the same way for all observers regardless of their state of motion and all observers agree that objects in each other’s frames are in states of constant motion and I’ve drawn down here an example graphic of a representation of an object in motion with some velocity Vector uh Illustrated here that I’ll use in a lot of the images going forward that will help us to think about these Transformations now let’s define two frames of reference that we can use as the archetypes for thinking about transformations of space and time information from one frame to another let’s denote one of these as frame s and we will always take frame s to be the thing that we call the rest frame now this is an arbitrary assignment you can choose one thing to be at rest and not another or vice versa but once you make that choice you need to stick with your choice you need to see that through to The Bitter End so for the purposes of illustrating the process of thinking about transformations of space and time information from frame to frame we’ll always take s to be the frame that is not in motion now in this frame of reference S we will imagine that they carry along with them a coordinate system uh like a framework of three lines that are at right angles to each other that they’ll use as reference markers for all spatial measurements and the coordinates from their cartisian coordinate system will be denoted X Y and Z when they describe object velocities they’ll notate them as using the letter U the letter V for velocity as you’ll see in a moment has a special place in relativity calculations and So to avoid confusing us as to what velocities we’re talking about we will use U to denote object velocities now we will Define a second frame s with a little Prime symbol next to it or S Prime that is moving relative to frame s at a velocity V so everything in that frame is moving all at once in the same direction at the same speed relative to S and in that frame they too have a little framework a little cartisian coordinate system framework of X Y and Z but they label their coordinates X Prime y Prime and Z Prime and when they measure object velocities they denote them as U Prime to be consistent with the notation of their coordinate system now we will do the following to simplify our thought process going forward it doesn’t have to be this way but we can set the problem up this way to make it easier for ourselves we will assume that they have arranged their coordinate axes so that they are always parallel to each other X is always parallel to X Prime even if x Prime is moving relative to X Y is paral parallel to Y Prime Z is parallel to Z Prime this makes it easier for us mathematically to relate things between the frames we can’t allow chaos to rign in all of this and we’re going to further simplify for the purposes of our discussion here that frame S Prime has a velocity V that is entirely and only along either X or X Prime it’s entirely parallel to X and xpre and has no component along y y Prime Z or Z Prime now in the Galilean picture of things in all frames time is absolute so that t is equal to T Prime the time measured in one frame frame s is equal to the time measured in another frame frame S Prime always that is the definition of absolute time now I’ve recovered the picture of our two little frames of reference here and our blue object in motion viewed from the perspective perhaps of one or the other frame with its velocity U or U Prime now this picture built from the postulates of the Galilean or Newtonian approach to space and time then allows us to define the equations transforming observations in one frame to observations in another frame for instance if we measure X Prime and T Prime and U Prime and frame S Prime these equations will allow us to figure out what the people in frame s would see and here are the equations you’ve probably seen them in this form or similar form in introductory physics because the motion of frame S Prime is entirely and only parallel to x and x Prime there’s a transformation between the X and X Prime coordinate system that is given by the first equation measurements in y are equal to measurements in y Prime and measurements in Z are equal to measurements in Z Prime and of course because of absolute time T is equal to T Prime now object velocities are related between the frames using the velocity transformation equation which if you use calculus and I would invite you to do this as a simple exercise you can prove from the first of the equations up here x = x Prime + VT you can prove that this equation is the addition of velocities derivable from the coordinate transforms using a little bit of calculus and a few minutes of of work on paper but basically the velocity observed for an object object in frame s is equal to its velocity in frame s prime plus v the velocity of frame S Prime with respect to S now there’s a problem here we know that the postulates of special relativity are more compatible with reality than the assumptions that were made to define the Galilean transformation in the first place and if you play around with a gilean transformation you can pretty quickly find out that it violates the postulates of special relativity so for example and this can be left as an exercise for the student especially because Maxwell’s equations are not something you get rigorous training on in introductory physics you can show that Maxwell’s equations their forms are not invariant under a Galilean transformation that would violate the first postulate of special relativity because if the equations if Maxwell’s equations have different forms that can be determined by experiment in different frames of reference that implies that it’s then possible to know whether or not you’re in the absolute rest frame for instance in the mechanical view of the universe in the frame of The Ether it’s a bit easier to see how the second postulate is violated by using a simple example you can imagine that the object in motion on the previous slides is a beam of light and it’s been emitted in The Ether frame which will take to be frame s the absolute rest frame and what you’ll find then is that that beam of light will have a very different speed in all other frames moving with respect to the ether frame the absolute rest frame or frame s in our notation here that violates the second postulate that the speed of light must be observed to be the same by all observers regardless of their relative states of motion so already the Galilean transformation is immediately shown to be at odds with the postulates of special relativity which again are based on observational evidence so here’s our picture again of these two inertial reference frames with a blue object being studied by both of The Observers in each of the reference frames S and S Prime and we want to find a transformation of X and X Prime Y and Y Prime v u and U Prime and T and T Prime between these frames um that gets something that’s compatible with the postulates of special relativity and we don’t have to change this picture to build up the correct transformation we just have to apply the postulat of special relativity in constructing the the transformation from one frame to the other and these new postulates the postulates of special relativity will enable us to arrive at a mathematics that’s consistent with observation so our goal is to figure out what is the correct transformation and we will continue to work with frames of reference wherein object velocities are observed to be constant that is inertial frames of reference that puts the special in special relativity now it must be true that in two inertial reference frames S and S Prime as depicted in the cartoon above that because the object in motion will be observed in either frame to have a constant velocity maybe a different magnitude but both frames of reference will agree yes we have each observed a constant velocity for the object that we’re studying it must therefore be true that X in the frame s is equal to the object velocity time time if it’s moving entirely along the X Direction and in the in the moving frame frame S Prime it must be true that X Prime is related to T Prime by The observed velocity of the object in that frame U Prime in order to further satisfy the first postulate of special relativity it must also be true that the transformation equations represent a linear transformation between the frames otherwise it can’t be true that all frames observe object velocities to be constant let me demonstrate this it’s important I think to start exercising your Calculus a little bit at this stage in the class so that you get a bit more comfortable with using calculus as a means to make predictions about the natural world so let’s begin by assuming and extremely generic form for the transformation between spatial coordinates in frame s and space in time coordinates in frame S Prime I’ve assumed that X is given by some unknown transformation with a spatial term and a temporal term each of these has coefficients I’ll talk about those more in a moment and each of the coordinates in the moving frame is raised to some unknown power for space it’s n and for time it’s m and similarly the time coordinate in frame s is related to the space and time coordinates in frame S Prime in a same way there are some new coefficients C and D that enter in here but again I’ve raised X Prime and T Prime to various Powers they could be two could be 10 could be 20 we don’t know now let me comment on these coefficients a b c and d are constants here we could always absorb some non-constant behavior in the coefficients into the function of X prime or t Prime that we’re using here now I’ve used a simple function just raising the uh space and time coordinates to a single power but you can draw the conclusion more generally that an arbitrary polinomial of X Prime and T Prime also won’t work to satisfy the requirement that all observers agree that the objects moving at a constant velocity regardless of their frame of reference in inertial reference frames so here’s my generic pair of transformation equations I don’t know what a b c and d are and I don’t know what n and M are but what I can do is I can recall that any velocity any object velocity like you know the object velocity along the X Direction in frame s uux or the object velocity along the uh X Prime axis in frame S Prime so U Prime X is defined by a derivative with respect to time that is ux is DxD t or U Prime X is DX Prime DT Prime that’s the definition of of velocity from its most foundational um aspects so what I would like to do to motivate that this has to be a linear transformation between the frames is to Simply take the above equations and turn them into statements about differentials of X and X Prime and T Prime and T rather than just statements about the coordinates themselves so this is where you can dust off your Calculus and see if you can arrive at the same answer that I get here but the bottom line is that the differential of X is related to differentials in X Prime and T Prime by this equation so the coefficients A and B remain unscathed but you wind up with this uh new power of X Prime and T Prime due to transforming this into a statement about differentials in space and time rather than just about space and time themselves and then similarly you get an equation that looks very much like that for DT differentials and time in the frame S as well okay so let’s hold those equations here for a moment and consider them so what I would like to do now is use these equations to relate The observed velocities in each frame of reference and so to do that I’m just going to take the ratio of the above two equations why because when I do that I get DX / by DT which on the left side is just the definition of the velocity of the object in frame s ux and on the right side I get something that’s a fair bit nastier than that but we will simplify it into something that looks a bit more familiar in a moment so notice that I have DX Prime and DT Prime both in the numerator and in the denominator of this ratio so what I can do next is I can divide the top and the bottom of the right hand side by DT Prime doing so eliminates DT Prime from the right hand terms in each part of the ratio and creates a DX Prime DT Prime in the left term of each part of the ratio and that should look very familiar because DX Prime DT Prime is by definition the velocity of the object in the prime frame in the S Prime frame and so we arrive at this final relationship that relates the observed velocity of the object in frame S Prime to The observed velocity in frame s but there’s a problem here unless Nal m = 1 the above equation will always leave a lingering space and time functional dependence on the right hand side which violates the first poti of special relativity the speed observed in frame s even if the speed in frame S Prime is constant will not be observed to be constant because it will depend on where in frame S Prime the object is at any given moment it will have a space and time time dependence that that is uh rather nonlinear and so we’re forced to conclude that in order to be compatible with the basic idea that we’re looking at inertial reference frames and relating object velocities uh in inertial frames of reference where there are no net forces that can cause changes in in the state of motion of the object we’re really forced to choose a linear transformation from frame to frame now that’s what we had in the Galilean transformation but we are still stuck with it even here in special relativity that’s a good thing because vastly simplifies the mathematics okay so we’re going to now begin to build up the mathematics of the transformation now that we’ve accepted that we need a linear transformation from frame S Prime to frame s for example we get a very simplified pair of equations x = a x Prime + BT Prime and T = CX Prime + DT Prime but we don’t know what these coefficients are they may be trivial they may be zeros or ones but we need to figure it out so now that we’ve established the linearity of the transformation we need to nail down a b c and d so we need to think of some special limiting cases of this picture where we can isolate these constants maybe one at a time or in small batches and in doing so figure out what they are in order to be compatible with the postulates of special relativity now this is a standard trick in algebra we have four unknowns in two equations we’re going to need four special cases to solve for all the unknowns and the postulates give us the framework to define those special cases so let’s pick special case number one where we take the moving object the blue ball and we pin it to the origin of the coordinate system of frame S Prime so in frame S Prime the object will always be located at 00 0 in the X Prime y Prime Z Prime coordinate system it’s moving along at the same same speed as the frame itself and so it’s observed to be at rest in frame S Prime in frame s however what we see is we see the blue ball pinned to the coordinate system of this Frame and they’re all moving together at a velocity V to the right along the x axis so in frame s it’s observed that the object is moving at V so U equals V in frame s now when we do this we have a simplifying situation for X Prime X Prime will always be zero because this thing is pinned to the origin of frame S Prime and so we can simplify the above equations to the following we have now that X is just equal to BT Prime in this special case T is just equal to DT Prime in this special case and then the velocities x equal U and 0 equal U Prime T Prime are the resulting equations from this special case so let’s take the first two equations the one for x and the one for T and divide them and then let’s use the third equation xal UT as a substitution to eliminate one of the unknowns so when we do this we wind up with x / T is equal to B over D go ahead and check this yourself and then from the velocity equation we get that x/ t is equal to U but in this special case the object speed is also the frame speed V so we wind up with x/ tal V and as a consequence of that we get the first constraint on our coefficients whatever B and D are their ratio is equal to V the velocity of frame S Prime now let’s choose a second special case and you might have guessed that this would be the next thing that we would do we we pin the blue object to the origin of the rest frame coordinate system we put it at 0000 0 in frame s so x y and z are 0 0 and zero and now we observe that blue object from frame S Prime now from the perspective of frame S Prime which is moving to the right at speed V the blue object appears to be falling further and further behind their frame its velocity appears to be negative V from the perspective of an observer in frame S Prime the moving frame so with that in mind and fixing X to Zero in the uh General equations up here on the right we can simplify the equation set to the following 0 = ax Prime + BT Prime uh the T equation doesn’t get affected by any of these choices we wind up with 0al u and X Prime = VT Prime substitut suting in for U Prime with Nega V now let’s employ the first and third equations namely this simplified first equation for the coordinate X and this x Prime equal Nega VT equation to further get the constraint on coefficients so if we do that we wind up with the first equation telling us that negative ax Prime equal BT Prime that’s the consequence of the first equation from the third equation we get that X Prime is negative VT T Prime and if we combine these two things together we find out that avt Prime equals BT Prime now T Prime entirely drops out of both sides using the substitution and we find out that V equal B over a go ahead and try this yourself I’m going through this a little bit fast but of course you can pause this at any time and work through the algebra on your own and so we arrive at our next batch of constraints now we already knew from the first special case that V equals b/ D from the second special case we also find out that V equal b/ a and if these two constraints are simultaneously true then it must be true that a equals D so now we have really constrained ourselves down so here’s the third case we’ll look at what if the object is a beam of light now this is the first time that we will definitively deploy one of the postulates of special relativity specifically the second postulate because if the object that’s observed to be moving in both frames of reference is a beam of light then by the second postulate of special relativity observers in both frames must observe the velocity of the object to be exactly C 2.9 98 * 10 8 m/s regardless of their relative motion so what happens to our equations as a result of this fact that according to observations and encoded in the second postulate all observers observe that the beam of light moves at the same speed regardless of their state of relative motion the equations simplify as follows the first two don’t really change at all but because the last two are statements about velocity and their relationship to space and time measurements it must be true that x equals CT but also X Prime equals c t Prime when the object in motion is a beam of light so let’s combine the first two equations the x and t equations and substitute using the uh information from the third equation at the bottom and when we do that this one gets a little nasty at first we wind up with x / T equals this horrible ratio over here not looking very promising so far but X overt is just C by the third equation and if we divide the top and bottom of the ratio on the right hand side by T Prime we wind up with X Prime over t Prime in both the numerator and the denominator and X Prime / T Prime is just C the speed of light so we wind up with the speed of light here the speed of light here and the speed of light Here and Now what we can further do is take the previous constraints relating b and d and a we can substitute those in and go through a simplification process and when we do that we find out that c is equal to B over the speed of light squared and we finally arrive at C = A * the velocity of frame S Prime relative to S over the speed of light squared so all of our constraints allow us to eliminate B eliminate C and eliminate D from the equations up here in the top right of the screen B is equal to AV C is equal to a * V over the speed of light squared and D is just equal to a itself so what we have done now with special case 1 2 and 3 is we’ve eliminated three of the unknown coefficients in favor of the fourth a and all we have to do is come up with one more constraint that allows us to figure out what a is well here’s our last case and the last case is a basic assumption about the Transformations first of all we chose that the transformation of x and t to X Prime and T Prime from the perspective of observation in frame S Prime being mapped onto observations in frame S have a certain form but because it shouldn’t matter which frame we pick to be the one that’s at rest and the one that’s in motion we should get the same transformation equations if we had started with frame S Prime being at rest and having frame s be the one that was in motion and the only thing that should change between observations in frame s and observations in frame S Prime is that the Rel relative velocity of the two frames changes sign that’s the only thing that should change when you alter the perspective of which one is at rest and which one is in motion and so as a consequence of that we should be able to eliminate the unknown a and figure out what it actually is so let’s start by writing down x and t in terms of the coefficient a and all the other things we’ve already sorted out so I’ve effectively just copied these two equations down here next let’s rearrange and rewrite these equations not as solutions for x and t but solutions for X Prime and T Prime as if we were trying to figure out what the person in frame S Prime would have seen if frame s was chosen to be the frame that was in motion now a lot of algebra is involved in this and I will leave it as an exercise to the viewer to try this out to practice their chops at algebraic manipulation in order to get what we want but the bottom line is that if you work this through you will find out that X Prime is given by this nasty function of x and t and t Prime is given by this equally nasty function of x and t now these equations tell us what should have been observed in frame S Prime given observations in frame s treating frame S as if it had been the frame in motion and frame S Prime as if it had been the frame at rest but all that should change when switching perspectives on the problem like this is you should get the same equation differing only by a minus sign on any term with v in it then we’re forced to say that a is equal to 1 over a * the quantity 1 – v^2 over c^2 theus1 and if you rearrange now and solve for a you find out that a must be equal to 1/ the < TK of 1 – V ^2 / c^2 this quantity shown here now this may not look pretty but this strange thing 1 over the < TK of 1 – v^2 over c^2 shows up all over the place in special relativity calculations and so it’s given a special name we don’t leave this as coefficient a it gets the symbol gamma the lowercase Greek letter gamma shown here in the lower right gamma is defined to be this strange Beast here one over the < TK of 1 – v^2 c^2 so let’s take a look at the final form of the Loren transformation the mathematical transformation that obeys all the postulates of special relativity if we are making observations in frame S Prime and relating them to frame s which is taken to be at rest then these equations here tell us what we want to know we measure things in frame S Prime and we get X Prime and T Prime Prime and then we combine them using these equations to get x and t the observations that should be made in frame s if on the other hand we’re making observations in frame s and want to convert them to observations in frame S Prime we flip the sign of v and change all the coordinates around and we get basically equations that look the same up to a minus sign on terms that just have V in front of them this number gamma always lurks out in front of everything and we see here another interesting thing that’s going on we see that in transforming space and time from one frame to another the space and time measurements in one frame get tangled up to compose the space and time measurements in the other frame in a sense space and time and special relativity are not separate entities we we treat them as separate in introductory physics but the lesson of special relativity is that we should have been thinking of this all as one framework SpaceTime not separate Frameworks of space and time this entire time now this multiplicative Factor gamma depends on the relative velocity of the two frames and as you’ll see it’s effectively a measure of the degree of the relativistic effects how much you need to take into account special relativity to solve a problem correctly between two frames of reference and again it’s given by this funny combination of velocity of the frames relative to each other and the speed of light one over the square root of the quantity 1 – v^2 over c^2 so let’s build a little bit of intuition about the meaning of the gamma Factor it appears everywhere in relativity calculations at least in special relativity it gets absorbed into other things and what is known as general relativity the general theory of space and time which we will only cover in the most shallow way in this course it’s largely un avoidable for all of the physics calculations you’re going to do going forward so let’s try to understand it a little bit better and to do that I I think we can build some intuition by playing around with the quantity gamma at various limits of its uh observable nature so for instance what is gamma for a frame S Prime that’s at rest with respect to frame s well we would expect to find that the two frames are the same since they are then in the same state of motion well we already know that if frame S Prime has a velocity of zero with respect to frame s that the terms with VT in front of them will vanish but what happens to gamma well if you plug in zero for V in the function gamma you find out that indeed for V equals z we observe that gamma is one in other words the multiplicative factor in front of either the space or the time coordinates when relating those to space and time coordinates in frame s all become coefficient of one in other words you’re in the same frame so you should get the same space and time measurements that’s good that’s what we would hope would happen now on the other hand what is gamma for a frame S Prime that achieves a velocity of exactly C the speed of light relative to S so this would be like imagining a frame of reference that’s pinned to a beam of light moving at the speed of light and it’s another very weird special case and it’s weird because what happens is that the gamma function takes on its biggest possible value Infinity you wind up with 1 over the square root of the quantity 1 – c^2 over c^ 2 well c^ 2 over c^2 is just 1 so you wind up with 1 over the < TK of 1 – 1 which is 1 over Z which is infinity zero goes into one an infinite number of times so that’s the upper limit for gamma so far as we know it’s impossible to travel faster than the speed of light there’s no observational evidence that anything does travel faster than the speed of light and so we are led to believe and special relativity encodes this that the fastest speed in the universe is that of light and so gamma takes the special value of infinity at that speed so as we can see gamma is a frame velocity dependent number and it has a well-defined range at the low end its smallest value it can take is one and at the high end the largest value it can take is infinity and it can take all numbers in between that depending on what V is I think it’s useful to graph this albeit perhaps in a way that’s not terribly familiar to you this is a plot of the value of gamma so the so-called gamma Factor on the y- AIS as a function of the frame relative velocity V on the x axis and so you can see here that I have chopped off the low end of the xaxis at about 10% the speed of light why because gamma has a value that’s so close to one that generally speaking you don’t have to worry about it being different from one now that’s not true in all cases and we’ll look at some of those cases going forward in the class but generally speaking if you are at about 10% or less the speed of light you do not expect to really observe what are called relativistic effects that is effects that distinguish an observation from what you expected from Galilean or Newtonian relativity above 10% of the speed of light however gamma can begin to take values that are appreciably distinct from one and you can see here that when you get to values that are about half the speed of light which occurs roughly here on this graph this is 1 * 10 8 this is 2 * 10 8 so this is about 1.5 x 10 E8 right here and as a result of that you can see we’ve now appreci appreciably started getting gamma factors that are above one by about 20% or so when we get to about 2/3 the speed of of Light 2 * 10 8 m/s we’ve achieved gamma factors that are about 50% bigger than 1 so 1.5 and as we begin to approach closer and closer and closer to the speed of light we see that the gamma function takes on increasingly larger and larger and larger and larger values spiking up to Infinity at exactly the speed of light this plot will help you to understand why it is that we just didn’t notice the these deviations from the Galilean or Newtonian view of the universe for most of the history of Science in humanity and that is because the laboratory experiments that we were effectively conducting as a species were all done at speeds that were far less than that of light and so we really never would have noticed these effects to begin with it was only when we began playing around with light and things that can

    move very close to the speed of light like subatomic particles that we began to get ourselves into trouble with the intuition we had built up on Human Experience prior to that but we see now that the postulates of special relativity predict that we should have expected deviations from that Newtonian or Galilean view of time where time is the constant between all observers it’s not it’s the speed of light we can see this effect encoded in the gamma function now as has been teased in the previous lecture on the basics of special relativity that is the postulates and their con consquences this theory of space and time has some consequences that can feel surprising to the average human being for instance objects in motion relative to what we consider to be a rest frame that is the frame we denote s will appear contracted along the direction of travel we can actually show this as a prediction of the Len’s transformation now that said to appreciate this particular effect even from the Loren transformation really requires you to think extremely carefully about what it has ever meant to measure the length of something I feel that discussion is best saved for class time as in class time we can get very handson with the concept before we start plunging into calculations where the language you would use to describe the recipe for attacking this particular question may not feel very natural because you haven’t really thought about what it means to measure the length of something especially the length of something that’s moving relative to you so instead in this lecture let me concentrate on frames in motion relative to the rest frame s which will also observe a passage of time that relative to S seems slowed this effect I labeled time dilation and we’re going to formally calculate it now and finally I’ll also look look at events that are simultaneous in one frame and show that that simultaneity is not guaranteed at all in another frame that’s moving relative to the first let’s explore time and simultaneity in this lecture so how does one consider the passage of time in different frames of reference well to measure time we have to define a clock of some kind a regular pattern of events that all happen for instance at the same reference point in in a frame so consider a clock that’s at rest in frame S Prime and it provides regular information so for instance pulses of light at different times T1 Prime T2 Prime T3 Prime and so forth always with regular intervals between them but that clock is always pulsing at the same position X Prime so X1 Prime equals X2 Prime equals X3 Prime whenever the time measurements are established and what we want to know is what’s the time between the pulses observed in the rest frame so that clock is in a frame that’s moving with respect to a frame that we agree is at rest we call it to be the rest frame what does a observer in the rest frame observe the time to do in the moving frame well again you want to relate time observations between the two frames but to do that you need to use the Loren transformation which takes space and time information from the moving frame and translates it into time information in the rest frame so we want to take the the pulse at T2 and transform its time into what the uh person in the rest frame measures they’re so-called T2 and we want to take the Pulse at T1 and transform that into the rest frames T1 and to do that we have to use this equation this comes from the Loren transformation now we have a simplifying fact here and that is that the clock in frame S Prime is always pulsing away at the same location X1 Prime equals X2 Prime so if we were to combine these two equations to calculate the duration of time between T2 and T1 we might do the following we might take T2 minus T1 and try to figure out what that is in terms of T2 Prime minus T1 Prime well because the clock pulses at the same location in frame S Prime X2 Prime and X1 Prime terms cancel out and we’re left with this equation which relates the durations and time observed in the two frames by a gamma factor and so I can write the two time durations delta T in frame s the rest frame and delta T Prime the the time duration and the moving frame and I can relate them and they’re related Again by a gamma factor and I find that if I take the ratio of the time duration observed in the rest frame and the time duration observed in the moving frame that they will differ they will not be a ratio of one and the ratio however will be given by the gamma factor which takes a value of one but only in the the special case that the two frames are at rest with respect to each other at any relative speed greater than that gamma takes a value that’s greater than one now until you get to very high speeds it’s not appreciably greater than one but nonetheless it’s not exactly one unless you’re at rest with respect to each other and so that we see now that in a frame that’s moving relative to another durations of time will always observe to be shorter than in the rest frame the duration of time observed in the rest frame is greater than the duration of time that’s observed in the moving frame for the same pair of events and the degree of dilation of time depends again on the ratio of V over C specifically through the gamma Factor time in the moving frame will appear to the rest frame to pass more slowly now another expected effect due to special relativity is that events that are simultaneous in one frame of reference may not necessarily be simultaneous in another now we already explored that a little bit even under classical velocity situations but we can revisit that idea here under the Loren transformation so for instance consider two events like pulses of Light which are observed to be simultaneous in frame S Prime the moving frame the events have coordinates X1 Prime T Prime and X2 Prime T Prime so what is the time between the events observed in the rest frame does the rest frame observe that they are also simultaneous that is also at exactly the same time well we can start by relating the times in the rest frame to the space and time measurements in the moving frame for T2 and T1 again we’re just writing down the Loren transformation here between observations in frame S Prime and the observations we want to establish in frame s now since the events are simultaneous in frame S Prime T1 Prime will be equal to T2 Prime so if I now calculate the duration of time that passes in frame s T2 minus T1 I find a very interesting fact that it’s not equal to zero it’s equal to this thing on the right hand side here which depends on the velocity of the frames relative to one another V quite directly in this case not just through the gamma Factor but gamma multiplied by V and what’s particularly interesting about this is this question whether two events are simultaneous in one frame and simultaneous in another frame of reference that’s in motion relative to it really picks at this interesting thing I mentioned earlier which is that space and time have to be treated as one framework space time and they can get tangled up in each other and what we see here is that because the events in frame S Prime are simultaneous but not necessarily at the same location in space in frame S Prime this creates a displac m in time between the two events in the rest frame in other words delta T in the rest frame is not necessarily equal to zero it’s equal to gamma V / c^2 times the spatial displacement of the two events in the moving frame Delta X Prime so we see that in the rest frame events cannot be observed to be simultaneous even when they are simultaneous in the moving frame unless those events happen at exactly the same position in space that is X2 Prime equal X1 Prime that’s a special case or unless the two frames are not in relative motion to each other that is V equals z in that case of course the two frames become indistinguishable and the whole discussion was moot to begin with but if the two frames are not the same frame of reference if the events occur at different locations in space in one frame but are otherwise simultaneous there they will not be viewed to be simultaneous in the other frame simultaneity of events could have been guaranteed in Galilean relativity even if they were at different um locations in space because of the absolute passage of time but since time is not absolute and special relativity doesn’t accept that as one of its postulates you find out that except under these very special conditions simultaneity cannot be guaranteed in another frame and again this is a really good example of how space and time are not Inseparable from one another in special relativity in trans forming observations in space in one frame you wind up with observations in time in another frame space and time get kind of Tangled Up in each other and going from frame to frame a spatial separation in S Prime becomes a temporal separation in frame s and I find this to be one of the more remarkable features of space and time as viewed through the lens of special relativity now one last question we can visit in all of this is whether or not it’s possible to recover classical physics from this view of the universe in other words are the Galilean Newton view of space and time and relative motion totally gone were they totally wrong this whole time it turns out the answer is not really after all the Galilean transformation did work in real computation for centuries before special relativity was needed right I mean people were able to relate observations in different frames of reference at relatively modest speed compared to those of light so one of the things that that should become evident from all of this and this is a general feature of a good theory of nature a good predictive description of nature that can be tested and even falsified um a good theory of nature describes all new phenomena but also it accounts for the existing confirmed observations the old observations the old phenomena that we had all that experience with from which we built intuition what generally seems to happen is that when you find out that your current understanding of nature is wrong you find out eventually through enough observation and experiment and mathematical work the correct description of Nature and you find out that your old observations were correct but in a more limited regime of nature in this case for instance low velocities so to recover the Galilean or Newtonian view we need only slow nature down from speeds close to that of light for example we can consider the special case of speeds between the frames of reference that are much much much much much much much less than the speed of light so that V over C for instance becomes a very tiny number approaching zero so let’s look at what happens to the gamma factor using something called the binomial expansion now I’m going to illustrate the binomial expansion for this specific function here but in general there is a general form for the binomial expansion and you can use a math reference on the web or a paper book to explore the binomial expansion more on your own free time so let’s begin by looking at the gamma function the gamma function was defined as 1 over theare < TK of 1 minus the quantity V ^2 over c^2 so what we have here is we have a function of a number v^2 over c^2 that’s bounded between 0o and 1 V over C can either be Zer for V equals 0 or 1 for V equal C and as a result of that gamma takes values between one and Infinity but this this parameter V over C that depends on the relative frame speed that thing is bounded between zero and one now one of the things that we can do is we can rewrite this gamma function as a more basic generic looking function by replacing V over C with a number alpha or in other words rewriting this in terms of alpha s which is v^2 c^2 and note that Alpha squ is a number that is less than or equal to 1 and its lowest value it can take is zero so we wind up with this more generic looking function 1 – Alpha 2 all raised to the- one2 power in other words 1 over the < TK of 1 – Alpha 2 now if we apply the binomial expansion to this more generic looking function with this condition on Alpha that it’s bounded between zero and one then we find that we can rewrite this function as a series expansion the first term in the series is just the number one the second term in the series depends on Alpha and it’s plus 1 12 Alpha squar now the binomial expansion allows you to keep adding terms that have higher and higher powers of alpha in them the next one will have a power of alpha to the 4th the one after that Alpha to the sixth and so forth with different coefficients in front of them now because Alpha is a number that cannot be greater than one as a result of that for the special case that Alpha is much much much less than one in other words as Alpha tends towards zero we see that indeed we recover the behavior that as these terms with alpha squar alpha to 4th Alpha to the 6th as they approach zero the only term left that really dominates in the sum is the leading term one and we see that gamma is approximately equal to one as Alpha gets closer and closer and closer to zero and when Alpha equals Zer we get gamma equals 1 which we know is the limiting case of gamma for velocities of zero speed so that makesense it’s approaching the limit in case when V equals z that’s what it means for Alpha to go to zero it means V is going to zero too so what happens to the luren transformation equations when we replace gamma in it with this binomially expanded version of gamma so let’s start with xal gamma * the quantity X Prime + VT Prime let’s substitute in the binomial expansion in terms of V over C being equal to Alpha all right so now we replace gamma with this thing 1 + 12 Alpha squ plus all terms with Alpha to the 4th and higher in them and in the limiting case that V is much much less than C that is as Alpha approaches zero these terms with alpha squar alpha to 4th Alpha to the 6 they contribute less and less and less and less to the sum until we’re left with just one in front of the sum X prime plus VT Prime so in the special case that the velocity we’re considering for the relative motion of the frames is much less than that of the speed of light we find that we recover x = x Prime + VT Prime which is the old Galilean relationship between x and x Prime and T Prime now similarly I can take the equation relating X Prime and T Prime to time in the rest frame and I can substitute for the gamma factor using this binomial expansion I can also notice that there is a v over C lurking here in front of the X Prime coordinate that’s another Alpha that sits in front of X Prime so if I write that all out here’s the binomial expansion of gamma here’s that Alpha that I’ve substituted in for the V over C that was lurking in front of X Prime and what you’ll notice is if I distribute this gamma to both the terms inside of this sum that the space term the X Prime term always has an alpha somewhere in front of it multiplying it you can’t escape it you don’t just get a bare number like one multiplying the X Prime coordinate whereas for the T Prime coordinate there is a term in the expansion that just goes as 1 * T Prime and so in the limit that the velocity is much much less than the speed of light all terms with Alpha in front of them vanish to zero and we’re left just with t Prime in other words in the low velocity limit T is equal to T Prime and we see that we have completely recovered the Galilean transformation and we’ve reconciled with classical physics in the limit of low velocities this is why time appeared to be absolute in the original formulation of mechanics it’s because when the velocities are much lower than the speed of light between two frames of reference you have a very hard time seeing these extremely subtle effects between clock measurements between the two frames but that is laid bare as a false perception of nature as you approach the speed of light in relative velocities between two frames of reference but we see that we can Recon the old picture of space and time with this modern and correct picture at least correct as regards observation of the natural world simply by considering the limit of of small velocities compared to that of the speed of light and we completely recover the old view the old view is nested inside the modern view as a limiting case so to review in this lecture we have learned the following things we’ve learned to appreciate the Galilean transformation and the assumptions upon which it’s built we’ve learned a way to derive the form of the correct transformation between frames of reference the so-called Loren transformation that is the modern way of relating observations in one frame to observations in another frame of reference and we’ve begun to see how you start to apply this transformation by asking questions like what are the events and in what frame are they defined and is anything the same for those events are space measurements the same are time measurements the same in a given frame of reference to simplify the questions that we’re trying to answer and then get the answers out of the luren transformation and we see that we have arrived not only at a transformation that’s consistent with the postulates of special relativity but which gives us a mathematical formulation for the intuition that we built off of the postulates that distance and time measurements are not going to be the same in different frames of reference even if all observers agree on on the speed of light as a constant of nature so while we see that all observers must agree that light moves at the same speed regardless of their relative motion nonetheless observers in different frames of reference will disagree on lengths of objects the durations of time that pass and the simultaneity of events events simultaneous in one frame will not necessarily be simultaneous in another frame of reference except under very particular conditions we’ve also seen how to recover classical physics from special Rel ity by allowing the velocity uh of the relative motion of the two frames of reference to drop far below the speed of light so that these Corrections from the the original Galilean transformation all vanish and leave behind the Galilean transformation with its Assumption of absolute time laid bare and we’ve seen how in that limit the Loren transformation exactly reproduces what were the original assumed relationships between space and time and velocities as encoded in the gy transformation we’ve recovered the past from the present and we can continue to use the present to build a foundation for making future predictions and that is precisely what we’re going to do in the next section of the [Music] course in this lecture we will learn the following things we will learn what is a muon we will learn how to use the muon as a laboratory for making predictions with the Loren transformation and finally we will learn how the muon was the first direct test of the validity of special relativity let’s begin by reminding ourselves one more time time about the Loren transformation if we make observations in a frame S Prime that we consider to be moving and we want to convert them into observations in a frame s we consider to be at rest then the equations in the top left of the slide will do just fine if on the other hand we have observations that are made in the rest frame s and we want to convert them into observations in the moving frame then all we have to do is change X to X Prime T to T Prime and V to negative V in the upper left equations and we get the necessary equations in the upper right this function gamma that appears in all of these equations is a function that depends on the relative speed between the two frames V and it has the form of 1 over the < TK of 1 – V ^2 over c^2 where C is the speed of light this can be expanded into a series representation of this function using the binomial expansion we’ve looked at this before and so we get an expansion that looks like 1 plus a half v^2 over c^2 plus terms that are higher order in V like V 4th V to the 6 and so forth and for sufficiently low velocities less than a few perent the speed of light we can usually neglect some or all of these higher order terms in the series expansion and get a simpler representation of the gamma function at low velocity now we’ve looked at some consequences of the Loren transformation and special relativity we’ve looked at length measurements and we’ve seen how they depend on frame of reference we’ve looked at time measurements and seen how they depend on the frame of reference and we’ve looked at the simultaneity of events in one frame and see that they are not necessarily guaranteed to be simultaneous in any other frame that’s moving with respect to the one in which they are simultaneous but is the relativity of time a real thing I ideally what we would want to do is take two twins put one in a spaceship that can accelerate very quickly up to speeds well in excess of half the speed of light that’s where the gamma function begins to take on values that are very much in excess of one and then we would let the twin travel out on a journey of maybe 10 or 20 years and then bring them back that’s going to take another 10 or 20 years and when the twin gets back from the journey at high speed we would compare the two twins and see how they have aged from the perspective of observers on Earth that would be a great experiment except that it is really not easy to construct a vessel not only they can hold humans for long duration space flights but also that can accelerate up to speeds that are appreciably close to that of light this is an engineering challenge that we as human beings have never really mastered instead what we need to do to test the claims of special relativity is to identify a laboratory where such speeds can be readily achieved but also one where there’s a natural clock of some kind a regular sequence of events whose time can be well predicted so that we can compare those things when they’re at rest to when they are in motion now tiny particles would be a great potential lab atory tiny particles have very small masses they’re very easy to accelerate and so it’s possible that something like the electron with a mass of 9.11 * 10- 31 kg could be ideal for investigating fast moving objects and maybe even the relativity of time you know for example the ground state energy of the hydrogen atom the lowest energy state that an electron can have when orbiting a single proton is 13.6 electron volts you can look back at the conversion factor for the electron volt look it up on the web whatever you want to do but you’ll find that this comes out to be about 2.19 * 10- 18 jewles of energy now if you translate that into a corresponding kinetic energy for the electron in that state of hydrogen what you’ll find is that you can calculate the speed of the electron that is represented if you pretend that the electron is orbiting like a little planet around the proton under the influence of the kulum force so let’s do that let’s imagine that the electron is orbiting the proton at the center of the hydrogen atom and let’s use the energy of this state to estimate the kinetic energy and therefore the velocity of this electron and if we do this using the classical kinetic energy 1 12 MV squ and rearranging it to solve for the speed of the electron we find that it should should come out to be about 2.2 * 10 6 m/s or thereabouts that’s already pretty fast on its own that’s about 1% of the speed of light without having to do anything exotic except maybe study the electron in a hydrogen atom now of course we’d like to get the electron up to faster speeds than that but if but if that’s how fast it’s already going in a hydrogen atom then you can imagine it’s probably not too hard to get it going faster in fact JJ Thompson who’s credited with the discovery of the elron um isolated them by ripping them off of parent metal atoms using the kulon force using a strong electric field and a large electric potential this has the effect of accelerating the electrons up to rather High kinetic energies for the tabletop experiments of his day representing tens of thousands of electron volts of energy in the electrons and that would equate to speeds roughly of the scale of 108 m/s perfect those are the speeds we want to investigate phenomena at so a particle like the electron would be easy to accelerate but there’s a problem the electron doesn’t do anything it’s an extremely stable particle in fact left on its own an electron will simply be for the rest of the history of the universe so far as we know so it doesn’t have any regular characterizable phenomena associated with it once you’ve isolated an electron it lacks a kind of clock that it carries along with it that we could use to see whether or not the passage of time is affected by the motion of the electron well are there any such clock-like phenomena in nature that are associated with very small particles the answer is yes radioactive decay of atomic nuclei is exactly an example of a natural clock that ticks all the time in nature whether we’re there to observe it or not and if we do observe it we can use it to measure the passage of time in a system so for example among her many discoveries two-time Nobel Prize winner Marie curri isolated the element polonium it is highly unstable and the natural isotope of polonium polonium 210 transforms spontaneously into a stable lead atom lead 206 after emitting energy in the form of radiation specifically what it does is it spontaneously ejects two protons and two neutrons from the polonium nucleus these two protons and two neutrons are bonded into the nucleus of a helium atom and this thing is known as an alpha particle we’ll return to alpha particles later in the course the bottom line is there’s some spontaneous phenomenon that happens with regular time intervals that we can use to actually track time in nature now polonium 210 has what is known as a halflife of 138 days but what does that phrase halflife mean it means that if I were to isolate 100 atoms of polonium 210 in a sealed container and have some way of looking at those atoms and Counting them every hour of every day if I were to wait 138 days from the time I seal the container and then look in the container on average I will find that after 138 days about 50 atoms of polonium 210 will remain in the container the container will also now be home to 50 lead 206 atoms they resulted from the spontaneous decay of the missing polonium 210 now if I further wait another 138 days from that moment and look in the container again on average I will find that I now have 25 polonium 210 atoms left half of the sample I had 138 days ago and correspondingly I’ll find 75 lead 206 atoms in the container a halflife is a regular interval of time and if you had some kind of equipment that could be used to establish the amount of a certain isotope present in a sample you will find that after one halflife after every halflife passes you’ll lose half of what was there the last time you looked so unstable radioactive elements have a reliable built-in clock a regular process that occurs at the same place that is the nucleus at regular time intervals however there’s a problem and in the historical context what I’m talking about here is is a problem in the early 1900s polonium and other radioactive elements were pretty hard to come by in the days when they were discovered and even in the the decades after that and even if you could isolate an appreciable sample of them how would you know precisely how to count the numbers of those things whether they’re at rest or whether they’re in motion and and not only that you got to put them in motion which means you need to accelerate them and they are thousands or tens of thousands or hundreds of thousands of times heavier than electrons particle accelerators that are capable of bringing ions up to speeds approaching that of light of any decent quality and control were decades away in the early 1900s they wouldn’t emerge until the 1930s 40s and 50s so it’s nice that we have these regular clocks like radioactive isotopes but you can actually do practical experiments of the variety we’re thinking about trying to do that is attempting to see whether their clocks slow as they are put into a state of motion relative to the observing frame if only we had a tiny particle that combined the lightness abundance and ease of acceleration of the electron with the regular instability of radioactive atoms and it’s into this part of our story enters the muon or for short the MU now the mathematical description of unstable nuclear behavior and of the strong binding of things in the nucleus took decades to work out but around the 1930s with some experience now with other forces in nature like electromagnetism it was hypothesized that the forces inside the nucleus that both bind it together so tightly but also occasionally allow it to catastrophically break apart that these forces were maybe of two different kinds and that they had particles like the particles of light that transmit electromagnetism that acted as intermediaries in the nucleus and transmitted these forces within the nucleus and so these intermediaries were given a generic class of name maisons from the Greek word MOS meaning intermediary and and by the 1930s or certainly the 1940s the hunt was on to find them now shown at the left here on this slide is an image that was taken by two physicists Anderson and netm and published in 1936 a previously unobserved electrically charged particle punches through the slab of lead that runs through the center of the photograph these are two different views taken from different angles of the same particle interactions at the same moment in time and the interactions are taking place in a lead Target that runs through the center of the picture roughly here in the picture now as this previously unidentified particle passes through the lead it knocks apart nuclei but in this process it barely loses any energy this was a really strange beast in its day it would come to be dubbed the muon or muon for short as the physicists of that day mistakenly thought that this has to be one of the sought-after nuclear force intermediaries I mean what else could it be this turned out to be a bit of a lack of imagination and experience on the part of physicists with the broader picture of nature a good lesson for all of us of course and this assumption turned out to be wrong the particle was real but its role in nature was not as originally assigned and that wouldn’t be fully understood until the 1940s and 1950s its electric charge however was pretty well determined from careful experimentation and it was found to be identical to that of the electron ne1 . 609 * 10-9 kums so it carries with it the same Elementary charge that the electron possesses its mass however was very unlike the electron it weighed in at 27 times the mass of the electron too light to be a proton too heavy to be an electron and crucially unlike the proton and unlike the electron it is also so unstable if you trap a muon nearly at rest and there are some fairly straightforward ways to do this on average you will find that it only lives about 2.2 microsc or 2.2 * 10- 6 seconds now let me make an important aside while we’re on the topic of unstable particles about unstable particles half- lives and the characteristic LIF time of an unstable particle the mathematics of unstable particles was developed in response to the discovery of radioactive decay of atomic nuclei and it’s it’s a fairly straightforward application of algebra and calculus and I find it’s instructive to run through it here consider the kinds of systems we’ve been talking about so far so for instance let’s imagine you have a system of n0 unstable objects you know like a 100 nuclei or particles like the muon and You observe them to have that number at time T and then you wait a little bit we consider some change in time t plus delta T at which point we then discover that the number of objects has decreased by negative DN now here DT and DN are differential units of time and number respectively and you find that the number of objects remaining after a time has passed DT since the original time measurement is the original number n0 minus DN but if you double the number of objects objects so if you start with 200 unstable objects for instance and wait the same amount of time you don’t find that the DN is the same size as it was before it gets bigger and if you triple the objects to 300 or quadruple them to 400 you again find that DN after the same DT is proportionally larger and it’s larger in proportion to the size of the starting sample there is some proportionality between the change of number of objects the change in time and the original number of objects and so we can express this observational relationship in a simple equation negative DN the change or decrease in the number of objects is equal to a constant which we have yet to determine we’ll denote that the with the Greek letter Lambda lowercase Lambda times the original number of objects n times DT the time that passed during which the time the number changed now you’ll notice that this is set up to look like a an equation of differentials and so one could actually integrate both sides of this equation you can put all the n’s and DNS on one side and all the constants in DT on the other side and then you could integrate the side with the number stuff on it from the original number n Zer to the final number n after waiting a Time T now you’ll notice that on the left hand side we have the integral of -1 Over N DN or 1 /x DX if that sounds more familiar to you and so you you should know from some experience with second semester calculus that the natural log of the argument n in this case will wind up being the answer to this integral on the right hand side we have a much simpler integral we’re just integrating a constant times DT from time Z to time T whenever we observe the system later and that’s a very simple integral you just wind up getting the time T back times Lambda and then you just evaluated at the end points so if you do that you should find that you get the following equation the natural log of the original number of objects minus the natural log of the final number of objects at time T is equal to Lambda * T minus 0 zero is the original time at which You observe the system and see that it has objects n zero in it well if you rearrange now and try to isolate the number of objects n at the later time t on the left side you wind up with this equation moving the logs and constants and signs around and so finally you can solve for n as a function of time and you find out that it’s exponential in nature if you start with a number of objects n0 the number of objects left after a Time T is given by E Theus Lambda T times the original number of objects n0 now let’s talk about this constant of proportionality which we’ve been calling Lambda so in order to satisfy the requirement that the total argument of the exponential function be dimensionless it must be true that Lambda has has units of inverse time one over time or one/ seconds per second Hertz in the units of oscillatory phenomena it’s convenient to therefore Define Lambda as one over some characteristic time which I’ll denote with the lowercase Greek letter Tao where too is known as the time constant of the phenomenon well what does it actually mean for t to reach to the time constant well if you allow enough time to pass that one time constant’s worth of time goes by you find that 63.2% of the original number of objects are gone for unstable particles this characteristic time is what is known as the lifetime of the particle and you can actually show using some math we’ll develop later in the course that mathematically to is also equal to the average time that an unstable particle exists so it has two meanings one it’s the time after which 6 63.2% of the original n0 objects have disappeared from the system and two it’s the average time that any randomly picked unstable particle will exist now where does the halflife come into all this well you can show that the halflife of an unstable particle which we could denote as t with a subscript 1/2 is directly related to the time constant tow by the following simple equation the halflife is equal to the time constant time the natural log of 2 so when we say quote the muon has a lifetime of 2.2 micr seconds unquote we’re referring to the time at which there is a 63.2% chance that any single muon has decayed vanished gone away from the original sample of muons now let’s talk about muons and observing them and their origins in the world around us muons are not naturally occurring in the same sense that atoms are naturally occurring atoms are generally speaking stable they stick around for a long time and they form large structures because they have a chance to bind to each other through chemical means which is just electromagnetism and action muons on the other hand are a bit stranger you have to make them and because they don’t live very long you have a very limited opportunity to study them once they come into existence now thankfully nature does make them all the time and it does so because the Earth is constantly bombarded by particles from outer space that are smash ing into the atmosphere at very high speed very high kinetic energy and these things are known as cosmic rays and when cosmic rays slam into the Earth’s atmosphere they result in a whole bunch of particle interactions that ultimately spray muons down onto the Earth among other things so they do this by smashing into nitrogen or oxygen nuclei having all kinds of nuclear reactions in the process that produce produce a whole bunch of other particles and I’m not going to worry about what those are right now but ultimately muons can result from this and the symbol mu with a minus sign next to it denotes the muon with its natural negative electric charge there are also positively charged muons and that’s a subject we’ll come to later in the course now Anderson and neet Meer who originally discovered the muon did so using Showers of particles or from cosmic ray so-called cosmic ray showers and they did so by putting detectors at different altitudes in the Earth’s atmosphere so for instance they did a bunch of experiments with a detector located on top of Pike’s Peak which is 4.3 kilometers above sea level uh and they did a bunch of experiments at home base at Caltech in Pasadena California which is roughly at at sea level and it turns out in the decades we’ve been studying cosmic ray interactions and muon production we’ve learned that most of the muons that are produced by cosmic rays are made roughly at a height of 15 km above the Earth’s surface surface that’s not the top of the atmosphere but it does correspond to the place where the density of the atmosphere gets big enough that these interactions of cosmic rays and nitrogen and oxygen molecules or nuclei uh get very high in probability and so we get a lot of muons that get produced at at that part of the atmosphere now based on the known instability of the muon one might expect that if one counts a certain number of muons at a high altitude say counting a number N1 then by the math of unstable particle decay in using the known lifetime of the muon when it’s nearly at rest that is to for the MU is roughly 2.2 micr seconds one could accurately predict the number of muons you should expect to see at a lower altitude N2 now at that lower altitude because particle Decay has had a chance to happen we expect fewer total muons to be found if we make a 100 muons or a thousand muons at 15 kilm above the Earth’s atmosphere and we go down a bunch of of kilometers we don’t expect to find uh the same number of muons we expect to find typically fewer all right now this is very interesting here let me show you this so here in the basement of Fondren science the physics department has a small experiment set up that allows us to capture Neons created in cosmic ray showers above the Earth trap them by trapping them in in atoms and a material in this device over here on the left and then after trapping them we can wait and see how long they stick around until they Decay so all of this equipment is designed to establish the time uh at which an a muon is trapped and then the time at which it then subsequently decays because it doesn’t live forever and if we take a look at the data here what we find is that when we trap these muons and hold them nearly at rest in our reference frame indeed we see an exponential falloff in the number of muons that survive after a certain amount of time as predicted by the theory of particle Decay and we can see that after about 2.2 micros that there’s a roughly 60 to 70% chance that any single muon will have already decayed exactly as previous experiments have determined so this is our own little way of caging muons using atoms to trap them then waiting to see them Decay and measuring the time between those two events in a frame that’s essentially at rest with respect to the muon and indeed this is how we figure out for instance that uh the muon lifetime is about 2.2 microsc this experiment alone here in the time it’s been operating which looks to be about 2,300 hours or so has trapped and observed the decay of about 1.6 million muons so just think about the sheer number of muons that must be raining down on the surface of the Earth all the time we’re capturing just a tiny slice of all of those they’re fantastic laboratory for looking at the little clocks that fundamental particles carry around with them so that we can study time um using the tiniest building blocks of the universe now the mu’s short lifetime should radically cut down its numbers as we go lower and lower into the atmosphere and in fact the the effect is quite stunning so let’s imagine we give the muon the best possible chance of making it to a low atmospheric height so close to the surface of the Earth now to do that we’re going to crank its velocity up to the fastest that anything that we know of can travel and that’s the speed of light so we’re going to set the speed of a muon that’s just been produced at 15 km above the surface of the Earth we’re going to set its velocity aimed straight at the surface of the Earth to the maximum it can be 2998 x 10 8 m/s so if you crunch the numbers you’ll find out that in one lifetime a muon can travel just 66 km not even a kilometer it doesn’t even go 1 15th of the way way down closer to the surface of the Earth but at this point it’s already had a 63.2% chance of decaying there’s a 63.2% chance that that muon won’t exist anymore by this point but let’s imagine it survives and it makes it two lifetimes after two lifetimes at at the speed of light it could have gone 1.3 km doubling the distance it’s traveled into the Earth’s atmosphere and now having made it a little a little more than 1/5th of the way into the Earth’s atmosphere but by this point it pays the immense Penal of having a probability of 86.4% of having already decayed 10 lifetimes will only bring a muon 6.6 km into the atmosphere that still leaves it about 8 km above the surface of the Earth but by then it has had a 99.995% chance of of decaying there’s really very little chance that muon really makes it this far and if you take it twice as far 20 lifetimes the probability is even smaller so the bottom line is that we don’t really expect if we produce a th000 muons at 15 km to find really any of them down at sea level so what actually is observed well shown at the right is some data it’s real data taken from an experiment that really can count muons at different altitudes and the graph shows the number of counts per minute versus the altitude where the measurements were taken and these measurements were taken by High School teachers who were involved in a program called quarknet this program engages teachers in K through 12 typically High School teachers in real physics research environments and this data is actually taken from an experiment they did that was reported in the article that’s listed in the footnote on this slide now what they found was that if the experiment was run 3.5 km above the surface of the Earth above sea level they found about 300 counts per minute of muons at that height now let’s use the Galilean and Newtonian Assumption of time that time passes at the same rate for all observers that is whether the muon is moving or not its clock and clocks on the ground tick at the same rate now that’s a total violation of the assumptions of special relativity and of course the conclusions that one would then draw from the Loren transformation but we can make a prediction using the Newtonian or Galilean idea and so we can basically estimate how many counts per minute we expect at half a kilometer which is roughly the lowest height where the teachers took data now what You observe is that at 3.5 km the number of counts is about 300 per minute 300 muons per minute passing through the detector and if we give the best case chance of all those muons making it down to half a kilometer above the surface of the Earth we find out that um we we should expect the yield to go as e to Theus y over C * to where C is the speed of light and to is the lifetime and so after a height change of just 3 km going from 3 and 1/2 km to half a kilometer here we expect to find at most about 3.2 counts per minute from muons that are produced uh at this altitude of 3 and a half kilomet is that what we actually observe and the answer is heck no in fact the teachers observed not three counts per minute but 100 counts per minute at both sea level and about half a kilometer above the surface of the Earth it’s pretty hard to tell the difference between those two sets of data so why would that be why would it be that the Galilean Newtonian prediction or at least its assumption that time is the same for all observers regardless of the state of motion uh would not get this experiment right it seems so simple we know the lifetime of the muon when it’s at rest you know the height difference between where you make the first and second measurements you just do some counting should be easy right and you don’t even get close to the right answer so why would that be well I think we already know the answer the answer is time dilation special relativity with its Loren transformation that’s supposed to be valid for all speeds up to that of light will help us to understand this so let’s relate what’s going on in the muons ref reference frame which we’ll call S Prime and what’s going on in the earth’s reference frame which we’ll call S so we choose the earth and the atmosphere to be at rest we choose the muon to be moving uh so it’s viewed as a moving object with respect to the uh the Earth and so so we can call that the moving frame now in the reference frame of the muon where it thinks it’s at rest its lifetime is 2.2 micros recall that this is the lifetime as observed when the muan is nearly or exactly at rest and that would be its proper lifetime when it’s exactly at rest the proper lifetime is measured in the frame where all events happen at the same location in space for the muon coming into existence and going out of existence all happen at the same place itself and so that’s the frame in which proper lifetime is defined that’s also how we measure the lifetime of the muons we stop it and we let it decay and we see how long that takes typically and so that’s the 2.2 microsc associated with how long the muon lives now the Loren transformation would predict that the time measured by an observer on the earth the time that’s passing in the muons frame of reference will be different from a person who would measure the time but ride along with the muon thinking the muon is the thing at rest the whole time so we can take observations in the frame of the earth observations of say the clock ticks in the muons frame at T2 and T1 and take the difference the delta T between those ticks and we can relate those to the spatial coordinates where all the events happen and the time measurements as observed in the frame of the muon the S Prime frame we’re just using the Loren transformation one more time here now all the events in the muons frame of reference being created in the upper atmosphere decaying later at a time T2 where the Earth is closer to the muon they all happen in the same place in the muons reference frame in other words X2 Prime is equal to X1 Prime therefore this equation simplifies and the time difference in the earth rest frame is relatable to the time difference in the muons frame of reference by a factor of gamma times the time difference in the muons frame of reference now the lifetime of the muon in its frame of reference is 2.2 micros so delta T prime or T2 Prime minus T1 Prime is 2.2 micros so special relativity would predict that from the perspective of an observer on the ground the muon would appear to live longer than would be expected if it were at rest as well this is completely in accordance with the observational evidence more muons many more muons are obs deserved to survive to a lower altitude than would be expected from classical physics and its Assumption of the absolute passage of time for all observers so the data told us that of the say 300 muons per minute observed at 3 and a half kilom roughly speaking 100 per minute of those survive around half a kilometer above the earth’s surface in the reference frame of the earth we can relate these numbers to the observed Decay time of the muons in their rest frame that is TOA the proper time and also the lifetime of the particle and the distance that they travel from the perspective of the earth and atmosphere rest frame y as well as the typical speed of muons so what we find is that taking the decay equation n = n0 * e to Theus T over gamma to then tells us that we can solve for the velocity and the gamma factor of the muons using the data we we know n and we know n0 from the data um we know why because we know the height difference that the teachers made the measurements at we can solve for this quantity gamma V which is related of course ultimately to the speed of the muons in the atmosphere relative to the Earth so if we do the algebra here and solve for gamma V we get the following equation now I’m going to leave it as an exercise to the viewer um we want to solve ultimately for either Gamma or V but since there each a function of the other we have to do some algebra to isolate one or the other and to help you along with this recall that the gamma factor is defined as the uh 1 over the < TK of 1 – v^2 over c^2 and and that means that that the velocity if you solve for that is equal to C * the < TK 1 – 1/ gamma 2 and from this you can take gamma * V and you can get a nice expression for that so gamma V can actually be written entirely in terms of of gamma which is interesting and if you use that trick you can uh get to isolated expressions for either Gamma or V from from this equation here on the right hand side so go ahead and try that yourself as an exercise but you should find the following things you should find that the gamma factor for these muons assuming that all the 300 that are created at or appear at an altitude of 3 and a half kilm then could be counted or not at .5 km above the Earth’s surface and you then based on that assumption estimate that the gamma factor is around 4.3 and if you solve for the velocity of these muons they are radically close to the speed of light they are 2.91 x 10 E8 meters per second in speed relative to the Earth in the atmosphere now from the person on the ground’s perspective that Journey from three and a half kilm to half a kilometer above the surface of the Earth takes about 10.3 micros which is way longer than one lifetime of a muon so it’s no wonder muons make a fantastic and early laboratory for tests of of special relativity nature is readily making lots of them per second in the upper atmosphere they can be measured using technology that was available in the early 1900s at least the first half of the 1900s they can be observed and to see when they Decay and how often they Decay and so forth and all of that together can be used to assess the validity or not of special relativity and of course what we find is that special relativity wins the day it is the correct description of space and time for inertial reference frames and it’s remarkable how well it actually works now of course the atmosphere is complicated the production of muons in the atmosphere is complicated if you really wanted to do a super thorough job of this you would have to do a detailed simulation of the interactions of cosmic rays in the Earth’s atmosphere producing muons at various Heights and then see how many you count at various Heights with and without special relativity if you do that we find that with special relativity we can exact produce the atmospheric data without special relativity we utterly fail to reproduce the atmospheric data it really is true that special relativity is the correct description of space and time and motion now I find it’s instructive to quickly take a look at this same situation but from the muons perspective that is if you could ride along with the muon at its ridiculous speed what Would You observe of course the mu’s frame of reference it’s at rest and the earth and the atmosphere are rushing toward it or in the case of the atmosphere past it so you and the muon come into existence very suddenly 3 kilomet above the final Earth observation place now that’s in the perspective that 3 km statement that’s made in the frame of an observer on the earth we’ll get to the distances in a second in the muon frame what you can say for sure as you come into existence the Earth is is far from you it’s racing toward you at a speed of v and it’s getting closer to you all the time and at some point you’ll go out of existence and the question is how far is the Earth and atmosphere going to move in the time between those two events coming into existence and going out of existence so the perspective of the earth Observer is on the left in this cartoon and the perspective of the muon Observer is shown on the right in this cartoon we don’t know the distance between the surface of the Earth and the muon in this picture we only know it from the original experiment in the earth rest frame but here we are confident that the Earth is rushing toward us at the opposite velocity that’s measured in the earth frame for the muon heading toward the earth now from the muons perspective of course it’s standing still in all events coming into existence and decaying they happen at the same location in its frame of reference therefore the time it typically is going to stick around is going to be 2.2 microc in its frame of reference it sees the earth below it when it comes into existence and it sees that that surface of the Earth rushing toward it at at a velocity of negative V so how far does the muon have to go to make it to its destination from its perspective and its reference frame now time is ticking away at whatever rate it goes at for the muon and ultimately it can measure time using its own lifetime which is about 2.2 microseconds nothing funny with time in its rest frame but of course the distance where the muon was created above the surface of the Earth is 3 km in the rest frame of the earth that’s the frame where the earth and its atmosphere appear to be at rest and that makes that distance 3 kmers the proper length or proper height above the surface of the Earth that is the longest distance that any reference frame would measure between where the muon is created and say the surface of the Earth the the muon on the other hand will see the earth atmosphere system as moving and therefore distances in that system contracted along the direction of flight and the length or height above the surface of the Earth that it will measure will be the proper length 3 km divided by gamma and this comes out to be about 23.3% of the proper length or .699 km 699 M that’s the distance that the muan perceiv between where it comes into existence and that final measuring point which was 3 km away in the frame of the Earth in the atmosphere so from the muons perspective we conclude that it observes that the distance it will travel is contracted compared to what observers on the Earth are seeing and that contraction factor is one over gamma from the mu’s perspective the distance between the place in the atmosphere where it came into existence and where It ultimately decays is greatly shortened requiring only a time of delta T Prime of about 2.4 micros to make the the the trip because the earth atmosphere system is contracted and moving relative to it and so you know given that it lives about 2.2 microsc in its reference frame it’s absolutely plausible that it could make it that full distance that people on Earth said it went it’s just that the people on Earth are confused because the distance is shorter than they claim from the muons perspective so while observers on Earth and an observer moving with the muon would disagree on the reason for the muon reaching the lower measurement point they all agree that it’s very likely to happen the Earth Observer argues that the reason it makes it is because time in the muons frame is passing more slowly than they claim because the muon is moving and so that takes longer to Decay as a result of that it’s able to cross the 3 km Gap even though it should have only live 2.2 micros because time has slowed down for it while it’s in motion the muon obser Observer says no our clocks are working just fine what’s going on is that because the Earth and atmosphere are rushing toward us they’re in motion and so they seem contracted along the direction of motion and as a result of that we don’t have to go that far to make it to say the surface of the Earth and we’re definitely going to make it in about 2.2 microseconds or so that’s why we made it so far now they’re both right even if they have different reasons for what happens they both observe the same outcome the muon makes it to the surface of the Earth but they disagree on the space and time reasons for that and that’s okay because the Lauren’s transformation allows them to relate their perspectives and put their measurements into the other person’s frame to see what’s going on and resolves the Paradox in that sense so to review in this lecture we have learned first of all what is a muon it’s a subatomic particle it’s about 200 times heavier than the El electron it’s about five times lighter than the proton and it has the same Elementary charge as the electron so it’s its own thing and it would take decades after it was originally discovered to finally fit it into the the the sort of final picture of nature that we’ve reached today the muon regardless of what it really is is an outstanding laboratory for testing predictions that are made using the Loren transformation specifically about whether or not muons given their very short lifetimes should be able to travel the vast distances from where they’re created in the Earth’s atmosphere to where they can be measured down on the surface of the Earth and in fact we find that muons in vast numbers make it from where they’re produced in the upper atmosphere to the ground but they’re not supposed to if time passes at the same rate for all observers and all frames of reference so it may seem weird that time doesn’t pass at the same rate when you’re moving but it’s the truth we have direct tests of this not only with muons but with many other systems as well and in many ways the muon wound up being the very first direct test of the validity of special relativity and it it held up against that that test beautifully to live another day and make more predictions which is what makes it such a spectacular theory of space and [Music] time in this lecture we will learn the following things we’ll Begin by learning what is the classical Doppler effect on an oscillatory phenomenon like a wave we’ll also learn about the effect of the motion of a light source on the characteristics of the light other than its speed and finally we will learn how to compute the so-called special relativistic Doppler effect on light and interpret the effect on observations of the world around us let’s begin by recalling oscillatory phenomena from introductory physics specifically let’s look back at something called simple harmonic motion this is a kind of repetitious motion that has a time and space structure that allows itself to be described using S or cosine functions of space and time so for example depicted in the graph at the bottom of the slide we have the vertical position of some object as a function of the horizontal position of the object and we see that the vertical position varies gently upward then downward then upward again and then repeats with the horizontal position and this motion of the y coordinate with with respect to the x coordinate lends itself to a direct description in this case using a sign function now a wave phenomenon such as a water wave or a sound wave can similarly be described using exactly this kind of mathematics a sound wave is a region of high compressed air followed by low compressed air followed by high compressed air and so forth It’s a so-called density wave in air a water wave is similar L an increase in the number of water molecules in one region of water and a decrease in the number of water molecules in another a rising and a falling of the surface of the water these wave phenomena are oscillatory in nature and can be exactly described using the same kind of s or cosine function approach that we apply on simple harmonic motion now the wave phenomena just like oscillatory phenomena have characteristic numbers that describe their spatial distribution there’s no one place where a wave is and where it is not for instance you could say that there’s more of the wave in this region of Y and less of the wave in this region of Y the wave is a structure that’s spread out in space and it has both a spatial structure and because it can move uh in time it has a Time structure as well we have to use use

    certain quantities to characterize the overall macroscopic shape of an oscillatory phenomenon or a wave and the wavelength denoted by the lowercase Greek letter Lambda is one such number for instance the wavelength of a wave like the one depicted here could be taken as the distance between crests of the wave the locations of the Maxima the maximum displacement from zero in the y direction or it could be taken as the zero displacement of the phenomena really picking any two similar points on the wave and drawing a line between those points horizontally will give you the wavelength now if we were to observe this phenomenon passing Us by by picking a reference point in space and just watching it Go by that point the time between Maxima or Minima passing the same spatial reference point is known as the period capital T of the wave this is the time between the same thing happening over and over and over again in the wave phenomenon now the inverse of the period one overt is the rate at which for instance Maxima passed that point and it’s known as the frequency and it can be denoted in one of a couple of ways for instance the lowercase letter f for frequency which equals 1 /t or it can be denoted using the Greek letter new which looks like a little curved V that’s also used often to describe the frequency of a wave and again that’s just equal to one over the period or 1/t frequency have units of per second or Hertz h r TZ the unit of frequency now the speed with which waves move in space during some unit of time is actually given by a very simple product of frequency and wavelength if you want to know the speed of a wave you just take the wavelength Lambda and you multiply it times the frequency f and of course for a light wave the speed with which all light waves move is known to be C the speed of light and so this will just be again the product of the wavelength of the light wave and the frequency of the light wave but that product will always yield C 2998 * 108 m/s regardless of whether an observer is in motion relative to the source of the light or not we know that already as one of the postulates of special relativity now summarizing again the gross properties of waves it’s helpful to pick a characteristic point on a wave and think about the repetition of those characteristic points as representative of the spatial or temporal distribution of the wave phenomenon so for instance we can think of waves of sound or waves of light as merely being represented by lines or planes so for instance uh the location of a line in two Dimensions or a plane in three dimensions could indicate a location in space of a maximum of the traveling wave using this picture at the right you might imagine that each of the locations in space marked by one of these red planes is a place where you would find a maximum of the wave having been sliced through by the plane this is a very common way to quickly and simply sketch a wave without having to draw the s or cosine function the distance between lines or planes is the wavelength that’s the sort of cartoon way you represent that particular feature of a wave in the image now such a line or plane would be referred to as a wave front and wave fronts can be used to characterize the location and space of a particular point on a wave and all of the repetitions of that point the frequency of such a phenomenon can be thought of as how many fronts per second are emitted by the source so if you think about one plane and then another plane and then a third plane being emitted by the source the distance and time between those planer emissions but would be the period of the wave and one over that would be the frequency the rate at which it emits wavefronts now this brings us to the so-called classical Doppler effect and I’m going to use sound waves to motivate this because most of you have probably at one point in your life or another actually experienced the Doppler effect with sound waves the Doppler effect occurs when the source emitting a wave is itself moving relative to an observer so if we’re talking about a sound wve here we’re talking about a listener someone who can receive the pressure changes in their ears and when that Observer is moving relative to the source the Doppler effect can occur and this is actually Illustrated in this cartoonish animation below a car starts emitting sound perhaps by the driver laying on the horn and the wave fronts are represented by those red circles so everywhere you can locate a point on a red circle would be a wave front of the sound waves and they’re emitted at rest uniformly in all directions but as the car accelerates forward the wave fronts in front of the car begin to pile up the sound waves get closer together the wavelength shrinks and behind the car we see the wave fronts spread out the wavelength gets bigger so in this example using sound waves we have the effect that in the direction of motion of the emitter the car honking its horn for instance and ahead of the Source the wave fronts are pressed together more densely shrinking the wavelength and thus increasing the frequency with which waves will strike our ears if we were to be standing on the backs side of this moving object while it’s moving away from us sort of against its direction of motion the wave fronts are more widely spread apart than they would normally be and this increases the wavelength and thus decreases the frequency with which these waves reach our ears so to human ears it’s the frequency of waves that determines what we call pitch high pitched sounds are also high frequency sounds the wavefronts are striking our ears more frequently and vice versa low pitched sounds are low frequency sounds the time between wave fronts hitting our ears is [Music] longer [Applause] [Music] now let’s think about the Doppler effect on light waves there is a classical Doppler effect on light waves but because time and space measurements are also relative to the frame in which you’re making the measurement there is a Rel relistic component that gets added to this kind of pitch shift even for light waves so it’s true that while all observers may agree on the speed of light we know that special relativity leads to the conclusion that space and time measurements May differ between observers in different reference frames now wavelength is a space measurement and frequency is a Time measurement so couched in that language it should come as no surprise then that while observers in relative motion all agree that a wave of light travels at C 2998 * 108 m/s regardless of what frame of reference they’re in observers in different frames will disagree on the wavelength and frequency of that light wave now to measure wavelength for example is to be able to simultaneously locate Ma Maxima on a wave think back to our discussion of measurement and how one measures distances on a moving object there are different ways you can do it but one of those ways is to simultaneously collocate points on the object in this case one maximum and then the next maximum but we know that simultaneity is a frame dependent statement and in moving frames of reference we know that the objects pinned to those frames appear contracted along the direction of motion from observers that are not in that frame and are for instance at rest with respect to that frame similarly to measure frequency is to be able to measure the time displacement between two events at the same location in space how often or how much time happens between wavefronts going by a single point and we know that from the perspective of a frame that’s at rest the time in the moving frame passes more slowly and these two frames would disagree on the amount of time between two wavefronts so the combination of the classical piling up of wavefronts or stretching out of wavefronts due to the motion of the source with these time or space effects that come from the special relativity postulates and the Loren transformation come together in what is known as the special relativistic Doppler effect and we will derive it here using the transformation applied on top of the classical Doppler effect calculation and we will discuss the implications of this phenomenon for observing the universe it has some extremely deep impacts on our ability to understand nature even distant parts of the universe where we have no physical access to moving objects so to motivate the derivation of the relativistic Doppler effect I’m going to start by talking about just the sort of classical Doppler effect and to do this what I want to do is have you imagine a light emitting device Illustrated here as a blue ball that is sending out wave fronts to the left along this coordinate axis and it’s doing so at regular intervals in its rest frame so for instance we might have a moment in time T1 Prime in in the frame of the source where it emits its first wavefront its first Maximum is emitted from a point say for instance 14 along this coordinate axis and then at a later time T2 prime it emits the second front so in the time between emitting the first front and the second front of course the first front has moved at the speed of light to the left and it’s now at 13 on the x-axis at the moment T2 Prime that the second front is emitted by the source and if these fronts are emitted regularly as would be true in a simple harmonic oscillatory or wave phenomenon then there will be a third front emitted at T3 Prime and a fourth front emitted at T4 Prime and the distance in time between these successive emissions of fronts will all be the same corresponding to the period of The Source Capital T Prime in the frame of the emitting source so T2 primeus T1 Prime will be the same as T3 Prime minus T2 Prime all of those intervals between neighboring wavefront emissions will be the period of the source as observed in the frame of the source this defines the period of the light wave thus we have a regular frequency in the source frame we can write that frequency in the source frame as F Prime = 1 / T Prime now that was done with the source sitting at rest along this coordinate axis and so in that P picture an observer sitting at zero on the coordinate axis would agree that the wave fronts all arrive with a time between them equal to what the source said it would be because the source and the Observer were at rest with respect to each other but now let’s imagine that the light emitting device that’s sending out those wave fronts at regular intervals and its own reference frame is moving with respect to the Observer at zero in the above coordinate axis so let the velocity of the source be plus v that is it’s moving away from the origin to the right and entirely along the X AIS treat The Observer is being at rest we’ll call that frame s and the source as being in motion we’ll call that frame S Prime let’s think about what will be the distance between wavefronts arriving at The Observer from the perspective of the source so we’re doing all of this from inside the source frame S Prime we will transform to the rest frame of the Observer later so here is our source it’s at location 12 along the x axis and it emits its first front front one at time T1 Prime and the wavefront moves toward the zero point on the X x axis at the speed of light C now the next time that the source emits a wavefront it has itelf move to the right from 12 to3 in the meantime the light wave front that it emitted the first front front one has moved at the speed of light to the left and in this particular example it winds up being at 10 on the coordinate axis so in the frame of reference of the source the distance between the wave fronts would have been Lambda Prime the wavelength and that’s depicted here on this cartoon showing where the original unstretched wavelength of this phenomenon would have been if the source had been at rest the source had been at 12 when it emitted front one front one is now at 10 and so the distance between those two would be Lambda Prime the wavelength of the phenomenon in the frame of the source but the source is now moved and so it’s at this location 13 that it emits its second front so by the time time emits the second wavefront at time T2 Prime the first wavefront is a distance of C * T2 primeus T1 prime or c times the period in the frame of the source and that’s just equal to Lambda Prime that’s the distance from where it was emitted but the source is now farther from the Observer by an amount of V * T2 Prime minus T1 prime or V * the period when it emits the second wavefront and again that’s depicted up here so this is the distance from the point of emission that front one has traveled which technically would have been one wavelength of the original emission but the source has moved back uh further along the x axis by a distance VT Prime V * the period and so the source will argue that as a result of this an observer who is looking at these wavefronts coming at them sitting at point zero should see the combined distance of Lambda prime plus VT Prime between the two wavefronts and and this will actually be the observed wavelength of the phenomenon according to the person uh riding with the source that will be what the Observer sitting at zero should see so this is all Illustrated above we take Lambda Prime and we add VT Prime and that’s going to give us the observed wavelength in the frame of the source so this is all Illustrated and we can write the equation down adding these two together and then we can rewrite this in terms of frequencies by remembering first of all that period is equal to 1 over frequency or frequency is equal to one over period and that the speed of light is equal to the product of wavelength and frequency similarly the wavelength uh for instance in the frame of the source will be the speed of light divided by the frequency in the frame of the source and the wavelength according to the Observer from the perspective of frame S Prime will be equal to the speed of light divided by the frequency according to the observ in that frame so we then find again all of this from the perspective of frame S Prime that the observed period at at zero should be 1 + V / C all divided by the frequency of emission fime so maybe pause here and try to work all this out for yourself but again keep in mind that we have not yet transformed this observation into the rest frame that is the frame in which the Observer is at rest right now we’re calculating The observed wavelength or period according to what the Observer should see if their uh time measurements were absolutely in agreement with the moving Source there’s this is the classical Doppler effect the stretching or compressing of wavelength with the motion of the source we haven’t yet included for instance time dilation or length contraction in all of this now we’re going to take that last step and to Aid Us in notation here we’re going to begin by defining a very convenient symbol and that is the lowercase Greek letter beta which is rather regularly used to represent the ratio of the speed of the frame for instance divided by the speed of light because speeds V never exceed the speed of light C and because speeds can never be any less than zero beta is a number that goes from zero to one zero for things that are at rest one for things that are moving exactly at the speed of light and can take all values in between in terms of beta the claimed period of the phenomenon as observed by The Observer in frame S Prime should be 1 plus beta divided by the frequency of emission but again so far all of this is in the S Prime frame this is what a person in S Prime follow following along with the source would argue is what the Observer should see the original frequency of emission from the perspective of the source f-prime which we can just call F with the subscript source and includes the relative speed of the source in The Observer V and the period and frequency with which a person in the source frame would expect the Observer to receive the wavefronts T Prime observed or you know correspondingly the observed frequency however if we now do the special relativity and use the Loren transformation and transform this stuff into the actual frame of reference of the Observer we know that there’s going to be another effect here and for instance we could summarize that by saying that it will be the relativity of time time in the source frame where all the missions happen at the same location is proper time the source always says that wavefronts are being emitted from its location in space and as a result of that that’s the frame where proper time will be observed but in any other frame moving relative to the source time dilation will be what is observed that is the passage of time on the moving object will appear to be slower than the observers on the moving object would claim and it’s given simply by taking the proper time and multiplying it by gamma so the time the period observed in the rest frame of the Observer will be gamma time s the period that the source claims The Observer should have seen according to the classical Doppler effect so we then finally arrive at a expression for the period of the phenomenon of the light wave as observed in the rest frame so we start by just saying that the period observed in the rest frame will be equal to gamma times the period that should have been observed from the perspective of the moving frame T Prime with the subscript obs we can substitute in with 1+ beta over the source frequency fime and we can do some algebraic gymnastics to sort of rewrite this in a more pleasant looking form we’ve got gamma and we’ve got beta of course gamma depends on beta gamma has v over C all squared inside of it that’s beta squared so it’s nice to try to rewrite this all either in terms of just beta or just gamma and so with a little algebraic gymnastics starting with writing gamma is 1 over the < TK of 1 – beta 2 you can then do a little work and show that a final neat looking expression for this is that the period observed in the rest frame is equal to the square root of the ratio of 1 + beta over 1 minus beta all times 1 over the source frequency the frequency of emission from the perspective of the source itself in its rest frame so we can transform this into an observation of course of the frequency in the rest frame by simply doing one over T observed and that just flips the stuff in the square root upside down and you wind up with this neat little relationship that the frequency of the light observed in the frame that’s not moving we’ll see the frequency uh as emitted in the source frame where the source is at rest shifted by an amount given by the square otk of 1 minus beta over 1+ beta so we’ve solved the problem now we’ve derived the special relativistic Doppler effect the shifting of the frequency due to relative motion between the source of emission and the Observer of the light by considering the situation where the source is moving away from The Observer now this special relativistic doler effect is a combination of two effects the classical Doppler effect of just the effect effect of the moving source that adds extra space between the wavefronts but in addition to that the dilation of time due to relative motion of the source and the Observer proper time and therefore proper frequency would be in the frame of the source this is modified by a gamma factor to go into any other reference frame so the special relativistic Doppler effect is a combination of the classical Doppler effect with the relativity of space and time measurements and you actually would expect from just Newtonian and Galilean relativity that there’s a Doppler effect on frequency and wavelength of light but the special relativistic addition to that actually makes the effect even more extreme than expected from Newtonian and Galilean mechanics and in fact what we see in the universe is what is predicted by special relativity and not just the old mechanical Galilean and Newtonian approach to motion now now for a source that’s moving toward an observer that is approaching the Observer while emitting wave fronts the sign of the Velocity is all that needs to be changed we go from having beta the velocity moving away to negative beta the velocity now moving toward the Observer and in fact you can do the work yourself but this formula up here for the source moving away from The Observer can be transformed into the case for the source moving toward the Observer by flipping the sign of beta so taking beta and turning it into negative beta and all that does is it takes the stuff under the square root and flips it upside down so now we have the square root of 1 plus beta divided 1 minus beta that whole thing times the frequency of emission in the source rest frame so I would recommend you practice this calculation by checking for yourself that this second equation for an approaching source is correct um but once you’ve convinced yourself of that the shortcut is a really easy thing to remember if you can remember one of these two formulas you can get the other one simply by changing the sign of beta not too bad so let’s talk about some expectations from the special relativistic Doppler shift for example if a light source is moving away from us or toward us what do we expect to happen to the frequency of its light so for a source that’s moving away from us at speed beta along our line of sight we expect to scale The Source frequency by the following quantity the square root of the ratio of 1 – beta over 1+ beta now if you play around with this a little bit you’ll notice that this thing is always less than or equal to one it’s exactly one when beta is zero and if beta is anything other than zero its value decreases from one the frequency therefore that we should observe should always be lower than in the source’s frame of reference owing to the stretching of its wavefronts combined with the dilation of time now because as I said beta is a number that’s inclusively bounded between 0 and one we are taking the ratio of a number less than one and a number greater than one for beta that’s anything other than zero now if instead the source and observer are moving toward each other then we scale The Source frequency by this quantity the square OT of 1+ beta / 1us beta and again if you play around with this you’ll find out that this is either always equal to one or greater than one this means that the observed frequency is always greater than what what is observed in the frame of the source since we’re dividing a number greater than or equal to one by a number that’s less than or equal to one so to summarize all of this for a source that’s moving away from The Observer of the light the frequency that the Observer sees will be lower than the frequency that’s observed in the rest frame of the source itself and similarly for a source that’s moving toward an observer The Observer will always see that the frequency is increased over the frequency as observed in the rest frame of the source of the light emissions these equations are all for frequency but we can very quickly derive the equations for wavelength using the fact that the speed of light is equal to wavelength time frequency so if we go through the brief algebra gymnastics for this we find that we get the following equations for The observed wavelength depending on whether the source is moving away from The Observer or if the source is moving toward an observer so as expected when a receding Source uh is in is present this gives us a lower frequency and a longer or greater wavelength so the frequency has gone down therefore the wavefronts are farther apart from each other because the wave is still traveling at the same speed C on the other hand when the source is approaching us we get the higher frequency which means the wavefronts are coming at us more often and that means a shorter wavelength will be observed for the phenomenon so let’s talk about the perception of light color due to the full relativistic Doppler shift so I’ve Rewritten here the equations for the weight wavelength that an observer sees depending on whether the source is moving away from the observer in which case the wavelength is stretched by the motion or if the source is moving toward an observer in which case the wavelength is compressed uh by the motion receding sources of light are said to Red shift compared to when they are at rest and that’s because longer wavelengths correspond to redder light than shorter wavelengths which correspond to Bluer light I’ve Illustrated this over here on the right using a spectrum and specifically I’ve isolated the visible or color spectrum of the electromagnetic frequency spectrum so for instance red light near the edge of where the human eye can detect the color red has a length of about 700 nanometers or 700 billions of a meter blue light or uh violet light which is at the other end of our ability to see comes in at around 400 nanometers or 400 billionths of a meter in length blue light has a shorter wavelength and thus a higher frequency than red light so if a source is moving away from us the wave fronts get stretched out and that would take something that’s Bluer and shift it more toward the red end of the light spectrum and conversely an approaching source is said to be blue shifted because this results in shorter wavelengths which corresponds to the Bluer end of the color spectrum now of course it’s possible that if you have an object that’s already say Violet in color and it’s moving toward you very rapidly at a significant fraction of the speed of light the shifting effect can be such that it actually shifts uh so blue that it goes outside the visible spectrum and then you’d have to look for it in ultraviolet or x-rays or other similar very short wavelength electromagnetic phenomena uh similarly if an object is already very red and on the edge of the ability of the human eye to see it and the source of the light is moving away from you appreciably quickly this can result in a red shift that puts it into the infrared or even microwave or radio depending on the speed of the object that’s emitting that light you can imagine therefore that this has some strong implications for measuring our place in the whole Cosmos for example without making physical contact with distant stars or galaxies which are collections of billions or trillions of stars it’s possible to actually determine whether or not those objects are receding away from Earth or approaching toward Earth based on the degree of the color shift of their atomic Spectra let’s take a look at an example of this through long centuries of observation of distant objects by astronomers and especially by breaking down the light from distant objects into their component colors the so-called color spectrum or atomic Spectrum astronomers have determined that the stuff that makes up everything out there is the same stuff that makes up everything down here on Earth and that is at least for the Luminous stuff the stuff that emits light or absorbs light that stuff is atoms and the atoms that are out there appear to be the same as the atoms that are down here on Earth iron has the same atomic spectrum whether it’s found on Earth or in the heart of of a star so as a result of that we can look at the light coming from distant objects figure out what atoms it’s made from and knowing the pattern of light each atom gives off determine whether or not first of all it’s composed of certain atoms and second of all whether it’s moving relative to us so here’s how you figure out the motion the Spectrum on the left over here on the slide is actually from our own star the sun the sun is not appreciably getting closer or farther away from us over the course of a day or a year we’re going around the Sun and our distance is changing slightly as we orbit the Sun every year but it’s not happening so fast that you get an appreciably different shift at least to the eye in the color of the Sun so we can consider the sun to be an atomic spectrum that represents a star at rest on the other hand on the right hand side is an atomic spectrum from a very distant so-called supercluster of galaxies that’s a cluster of galaxies of stars which are themselves uh clusters of stars and it’s named bas1 it’s not so important what it’s called but if you stare at this for a few moments you’ll notice that there’s an interesting similarity in the pattern of the light between our sun and the light that’s coming from all the stars that make up this distant supercluster of galaxies there’s a missing color line here in our sun and then there’s a a gap where there’s lots of colors and then there’s another missing line and then there’s another Gap and then there’s a missing line and then there’s a small Gap and there are two missing lines and if you look over on the right at the light from the super cluster you see that oh look there’s a line in the red that’s missing and then there’s that same Gap and then there’s a line in the yellow that’s missing and then there’s that same Gap and then there’s a line in the green that’s missing and then there’s that same small Gap and then the two dark lines that are next to each other it’s as if somebody took the pattern in our sun and shifted it toward the red end of the spectrum and this is exactly what we would expect from special relativistic Doppler effect if the supercluster is moving away from us thus stretching its emitted wavelengths of light longer from our perspective so these black gaps so-called absorption lines in the spect have the same pattern but in a slightly shifted location in the sun compared to this super cluster of galaxies and the fact that those missing colors are red shifted means the Galaxy supercluster is moving away from us that’s the conclusion from the special relativistic Doppler effect and you can actually then use measurement differences of the wavelengths between where the missing wavelengths are present in the Sun and where they’re present in the galaxy supercluster and using some astronomy you can actually estimate the relative velocity beta equal V / C with which the supercluster is receding from us this is incredible this allows us to measure velocities without having physical access to a material object all we have to do is look at the pattern of light that comes from its atoms and knowing that those patterns are the same patterns that should be found here in the atoms on earth look at the shifting of those patterns to determine the relative velocity of o to the distant object this kind of measurement is actually how we know that the universe as a whole is expanding so far as we’ve been able to determine all distant objects appear to be receding away from the earth as if carried along on the momentum of an initial explosion that set the whole universe in motion with all points expanding outward from every other point this implies the universe as a whole and on the largest distance scale is expanding with time so let’s review what we have learned in this lecture we’ve looked at the classical Doppler effect both in a cartoonish way and using the example of a moving Source emitting wavefronts along a horizontal axis we’ve then considered the effect of the motion of a light source where observers all agree that the light waves are moving at the same speed we’ve looked at the effect of the motion on characteristics of light other than that speed which isn’t changing the wavelength and the frequency of that light and by combining the classical Doppler shift with time dilation we’ve seen how to compute the special relativistic Doppler effect on light and we’ve even looked at ways that you can interpret that effect on observations and take observations of the natural world and use those to infer relative velocities on the grandest scales of the the [Music] cosmos third party another object from the perspective of those two frames and thinking about the velocity of that object as perceived in the two frames we’ll also learn how to properly add velocity of objects to frame velocities in special relativity let’s use a concrete example to motivate a kind of basic problem we can use going forward to think about the question of object velocities relative to moving frames of reference so the example I will pick for this is a non-copyright violating space wars recently in a globular cluster fairly nearby two ships were engaged in a chase the lead ship is moving away from the pursuing ship at a velocity given by the vector v the pursuing ship fires a projectile straight at the lead ship along the line of motion and at a velocity Vector U relative to the firing ship with what velocity does the lead ship observe the projectile to move now I’ve Illustrated this with a little graphic cartoon right here here we have the pursuing ship on the left the projectile it’s fired with the velocity of the projectile from its perspective drawn here in red the ship it’s pursuing over here on the right and the velocity of that ship being pursued relative to the the pursuing ship given by the vector v now in the Galilean or Newtonian view of space and time the answer to the question with what velocity does the lead ship observe the projectile to move is rather straightforward The observed velocity in its frame of reference U Vector Prime would be equal to the velocity of that ship with respect to the pursuing ship minus the velocity of the projectile with respect to the pursuing ship that would also turn out to be completely wrong when the velocities in this question approach velocities near that of light so for for instance if the projectile is actually a beam of light imagine a laser beam a laser cannon mounted on the front of the pursuing ship it turns on the cannon the beam is emitted this is a beam of pure light it should move at the speed of light if we plug that into this calculation we get all kinds of wrong answers here the lead ship doesn’t sh doesn’t see the laser beam approaching at the speed of light and we know that’s just not consistent with observation as encoded in the postulates of special relativity so what then is the correct addition of velocities in a problem like this and that’s the question we want to figure out in this lecture we can begin by writing down the Loren transformation equations treating the pursuing ship as the rest frame the lead ship as the moving frame and the projectile as an object to be located or studied in either frame the SpaceTime coordinates of that object in each frame are given as follows for example if we have the SpaceTime coordinates x and t in the rest frame we can get the space-time coordinates in the moving frame the S Prime frame using this version of the Loren transformation equations yielding X Prime and T Prime the location and the time at which the Lo is observed for the projectile in the perspective of the lead ship now we can write differentials of space and time using calculus DX Prime and DT Prime and this will allow us to work toward obtaining equations with velocities so for instance U Prime is the first derivative of x Prime with respect to T Prime after all that would be the velocity of the object as observed in the lead ship or moving frame U would be the first derivative of x with respect to T that’s the perspective of the projectile’s Velocity from the rest frame or the pursuing ship now if this particular step feels weirdly familiar to you in an earlier lecture I walked you through a brief example as to why the Loren transformation needs to be a linear transformation between moving frame coordinates and rest frame coordinates and we came dangerously close in that lecture to deriving the velocity transform albeit I was doing that for arbitrary Powers x to the n and T to the M for instance here of course it’s purely linear because it’s based on the Loren transformation and so if some of this feels awkwardly familiar you may flip back to the earlier lecture on the Loren transformation and have a look and see where the roots of this were planted so the differential of space in the moving frame DX Prime is going to be equal to gamma time the quantity dxus vdt and the differential of time in the moving frame is going to be equal to gamma * the quantity V / c^2 DX plus DT now we can take the ratio of DX Prime over DT Prime and this allows us to get the velocity U Prime of the projectile as observed in the moving frame or the frame of the lead ship substituting in with our differentials for DX Prime and DT Prime we arrive at this rather unpleasant looking equation but one of the nice things about this is that the leading gamma factors the 1 / < TK 1 – V ^2 over C sared terms they cancel out in both the numerator and the denominator and this leaves us with an equation that looks like this just in terms of the remaining differentials of DX and DT now if we divide the top and the Bottom by DT the little unit of time that we’re considering then we wind up with terms that go like DX over DT which is just U the velocity of the projectile in this case entirely along the x-axis and so this equation takes the following form which at the end of things doesn’t look horrible the velocity of the projectile as observed in the moving frame the frame of the lead ship is simply given by the velocity of the projectile as launched from the perspective of the rest frame the pursuing ship minus the velocity of the frame so the velocity difference between the lead ship and the pursuing ship divided by a quantity that goes like the Motion 1 minus UV over c^2 so we have arrived at a formula for combining the velocities of the moving frame with the velocity of the projectile as observed in the rest frame to allow us to compute The observed velocity of the projectile in the moving frame this equation is a substitution for the old galile and transformation addition of velocities equation and is correct from the perspective of special relativity so let’s plug in some numbers and see what we learn about projectile motion in the case where objects are also in relative motion to each other and observing that projectile as it moves and let’s begin by picking a low velocity situation where the ships are not really moving apart from each other all that fast I’ve decided to pick the lead ship having a velocity of just 1% the speed of light or 0.01 C and I’ve picked a projectile velocity that’s just three times bigger than that or 3% the speed of light 03c from the perspective of the firing ship the pursuing ship now from the above equation we learn that the lead ship observes the projectile approaching it at a speed of 0.02 C now if you stare at this for a moment and recall the Galilean velocity transformation you’ll note that this is exactly what we would have expected from the low speeed case where all the velocities of objects in the problem are are not really a large fraction of the speed of light although I’ve allowed them in this case to go up to a few perent the speed of light we actually get back exactly what would have been told To Us by the velocity transformation in Newtonian SL Galilean relativity that is that U Prime equals uus V now that doesn’t mean that this is exactly true at every decimal place there’s some decimal place where the Newtonian Galilean approximation uh to space and time and motion breaks down compared to the more accurate special relativistic calculation so let’s instead pick some bigger velocities let’s now assume that the lead ship is racing away from the pursuing ship at half the speed of light and then from the perspective of the pursuing ship it fires this projectile at 8/10 the speed of light8 C plugging those numbers in we find out that the lead ship observes the projectile to approach it at 1/ half the speed of light and if you stare at that again for a moment play around with the numbers on your own you’ll very quickly realize that this is definitely not what would have been predicted by the Newtonian or Galilean approach it’s not simply U minus V in this case now interestingly we can look at the case of when the lead ship is flying toward the pursuer so now we turn the lead ship around and we aim it back at the pursuing ship and flip its velocity Vector so that it’s moving at negative .5 C from compared to its original direction of motion in that case we see that the lead ship that’s now racing toward the projectile that’s been fired at it doesn’t observe that projectile to be moving in excess of the speed of light rather it observes it to be moving at 93% the speed of light and that’s again a distinction from what the Newtonian or Galilean approach would have yielded the old relativistic approach from Galilean relativity would have predicted that the lead ship observes the projectile to be approaching at a speed that is far in excess of the speed of light but we also know from the postulates of special relativity that one consequence is that nothing can move faster than the speed of light and so we see that that’s preserved here in the velocity transformation although the velocity of the ship is now aimed back at its pursuer and although the naive addition of velocities would give you something in excess of the speed of light the naive addition is not consistent with observ obervations of space and time and the speed of light and using the special relativistic transformation we see that while it’s true that the velocity of the projectile does appear to be larger than when the lead ship was racing away from it it does not exceed the speed of light but comes in at a pretty pretty fair fraction of the speed of light so let’s summarize what we’ve learned about adding velocities in special relativity keeping in mind that the cases that I’ve built these equations from All In involved an object velocity that was parallel or anti-parallel to the velocity of the frames if you have the velocity of the object in the rest frame and want to determine it in the moving frame then the left equation is what you want if on the other hand you have the object velocity in the moving frame and you want to determine it in the rest frame all that should change between the left equation and its corresponding equation on the right should be that you swap U and U Prime and you flip the sign of all terms that involve V or V cubed or something like that you take V and send it to negative V and in fact that’s the equation that’s written here on the right you can always derive these directly from the Loren transformation or you can memorize one of them and remember how to transform it into the other by swapping the object velocities and flipping the sign of the frame velocities I’ll leave it up to you as to what your best possible learning strategy is for this but know that if you memorize one of these you can figure out the other from Context and knowing how to trade the mathematics around now what if the object instead of having its velocity aligned parallel or anti-parallel to the frame velocities is moving in a direction that isn’t solely parallel or anti-parallel to V so you might be tempted to assume that the object velocity in for instance the y direction assuming that the frames are moving only along the X and X Prime axis you might be tempted to assume that the object velocity along the y direction and the Z direction as observed in either frame is the same since in the Loren transformation coordinates Y and Y Prime Z and Z Prime are equal to each other if all the motion is along x and x Prime and you’d be wrong you need to be very careful with these things why well think about it a second object velocity necessarily involves the time derivative of a coordinate is time absolute between two different frames of reference well we should feel pretty confident at this point that the answer is that it does not t does not equal T Prime in special relativity because a Time derivative is involved there’s going to be a dydt and there’s going to be a Dy Prime DT Prime and While y may be equal to Y Prime T is not equal to T Prime so let’s go through this I’m going to consider motion component along the Y AIS the frames are moving entirely along X so V in this is still directed entirely along the X and X Prime axes but I’m going to allow the velocity of the object to develop a component uy or uy prime along the Y and Y Prime axes respectively so let’s look at what the transformation of U Prime to U would be for the case of this component along the Y AIS and Y Prime axis so we know that in the rest frame U subscript Y is just dydt it’s the change in the y-coordinate with respect to to time as observed in the rest frame now it’s true that in the Loren transformation if the motion is entirely along x and x Prime that y does equal y Prime so we can replace Dy with Dy Prime and no harm no foul that’s mathematically allowed but if we’re going to substitute for DT with DT Prime we have to use the full glory of the differential form of the time equation in the Loren transformation and that means replacing DT with the quantity I show here in the denominator of this fraction now of course I can divide the top and the Bottom by DT so that I get a uy prime in the numerator and the denominator gamas don’t cancel out in this case though between the numerator and the denominator because y equals y Prime Y and Y Prime don’t depend on a gamma factor to correct between them and as a result It’s actually an easier derivation I feel than for the case of the object object motion component along the direction of travel of the the frame relative to the rest frame um but it it’s not perhaps quite is uh memorable looking now similarly if we have uy Prime and want uy all we have to do is swap uy Prime and uy in these equations and replace V with minus V and so the corresponding equation that tells us what the velocity component in the moving frame looks like given the velocity component in the rest frame uh will be the one I show here and by the way if there’s a component of motion along Z and Z Prime you can obtain a similar equation it has pretty much exactly the same form as the one shown here with uy replaced by u z and u y Prime replaced by u z Prime um you can very quickly write that equation down but I I just want to go through this because it’s important to recognize that while it’s true that y equal y Prime and Z equals z Prime when the motion is entirely along x and x Prime it is not true that uy is naively equal to uy Prime and that’s because a Time derivative is involved and time does not pass the same way in the two frames when one is moving relative to the other finally let’s take a look at one last special case and that is if the pursuing ship shoots a laser beam at the lead ship so what I’ve done is I’ve replaced the red projectile with a red squiggly line to represent an electromagnetic wave light being fired at the lead ship now the lead ship is still moving at a velocity V Vector with respect to the pursuing ship I’ve put everything along the horizontal axis here but now the velocity of the projectile is C because this is a beam of light and so it will always and forever move at exactly the speed of light so the speed of this projectile is now exactly 2998 * 10 8 m/s as viewed from all frames so if the pursuing ship had fired a weapon like this a laser beam a beam of light well we know that the second postulate of special relativity demands that all observers must see light moving at C regardless of their state of motion so does this velocity addition relationship capture that postulate in all of its full Glory well let’s find out let’s assume that the relative velocity of the lead ship to the pursuing ship is 1 12 C and that the projectile speed as viewed in the rest frame of the pursuing ship is C the speed of light well plugging these numbers in uh we can start from the equation where we have the relative velocity of the two frames and the speed of the projectile in the rest frame and we can get the speed of the projectile as observed in the moving frame so all I’ve done is I’ve replaced u in this equation with c because the projectile is a speed of is a beam of light that’s moving at the speed of light and if you do some algebra you can simplify this equation to C minus V all over the quantity 1 minus V / C and if you do a little bit more algebra you’ll find out that this is just equal to C minus V over the quantity 1 / C * C minus V and if you play with this one step further you find out that this is just equal to C the speed of light so in fact we see that V entirely drops out of this equation once the projectile is a light beam the value of V doesn’t matter at all the relative velocity between these two vessels can be any number and it won’t affect the outcome of the calculation V could have been a half C or negative a half c or8 c or 99999 C basically once U equals c v drops entirely out of the equation and we always recover that U Prime equals c as well the second postulate of special relativity is fully obeyed by this velocity transformation equation so to review in this lecture we have learned how to think about object velocities in different frames of reference and how to go from the coordinates of an object that’s in motion to its velocity in different frames we’ve then used that information to figure out how to properly add velocities of objects to frame velocities in special relativity we’ve looked at a couple of case studies of this and seen that everything seems to comport with the postulates of special relativity which themselves comport with observations of the natural [Music] world in this lecture we will learn the following things we will learn how to Define kinetic energy and momentum while incorporating special relativity we will learn about the nature of mass and the concept of intrinsic mass and we will learn about the relationship between energy momentum and Mass now let’s take a look back at Newton’s Second Law from the perspective of classical physics and in particular have a look at momentum or classical momentum in the context of this discussion so in introductory physics you are introduced to the concept of momentum roughly as follows historically it was observed that there appeared to be a conserved directional quantity associated with motion this quantity which we call Momentum is well defined in the classical domain of physics that is low velocities and large scales by the product of the mass of an object M and the velocity of the object U Vector so we arrive at the definition the so-called classical definition of momentum by taking the product of these two things M * U and that gives us P the momentum or linear momentum of that object now in a closed and isolated system perhaps with a whole bunch of different objects I equal 1 to n it is observed that this quantity overall is conserved that is the sum of all momenta of all objects in a closed and isolated system can be written as a singular number the total momentum and that total momentum remains constant no matter what happens inside that closed and isolated system now when a system is not closed and isolated for instance subject to some net external Force F then the full beauty of Newton’s second law is observed to be obeyed by the system that is that the net force acting on the constituents of the system is just given by the change in momentum of that system divided by the change in time or dpdt so Newton’s Second Law fals ma can actually be Rewritten in terms of momentum Concepts as just f equals dpdt now of course we need to bridge from classical physics to Modern physics and to do that I want you to start thinking a little bit about the laws of physics and their invariance under Transformations from one inertial frame of reference to another recall that one of the postulates of special relativity is that the laws of physics should not depend on what frame of reference you are measuring them in they should be the same for all frames of reference the consequence of that of course is that you can’t tell if you’re in an absolute state of motion but the benefit of that is it preserves the forms of the laws of physics for all observers regardless of whether or not they’re moving so if one subjects the classical momentum concept to consideration moving from one frame of reference to another imagine a second frame of reference observing an object moving at speed U Prime and that second frame of reference S Prime is moving at relative velocity V to the original frame s now imagine that this is all closed and isolated and in the rest frame the velocity of the object is U and in the moving frame it’s U Prime and and the conservation of momentum will hold and so for instance if we take the momentum observed in the rest frame for this object so p = m * U and we use the Galilean transformation from classical physics to move to what we observe in the moving frame we find that of course the moving frame will observe P Prime equals M * U Prime and we can relate the momentum in the moving frame and the momentum in the rest frame using a gilean velocity transformation changing U Prime to U minus V and then Distributing that inside the definition here so when we do that we find out that the momentum observed in the moving frame is related to the difference between the momentum observed in the rest frame and sort of the frame momentum itself M * the velocity of the moving frame now if we then consider changes in momentum in the moving frame with respect to Universal and absolute time so DT prime or DT it doesn’t matter which in the classical view of physics we just wind up taking the time derivative of momentum in the moving frame and if we distribute that time derivative to the two terms on the right hand side above we find out we have dudt and dvdt now since the moving frame is moving at a constant velocity relative to the rest frame dvdt is zero that is the moving frame is not accelerating with respect to the rest frame it’s moving at a constant velocity with respect to the rest frame so the second term is zero and we see that we recover exactly dpdt in the rest frame in other words dpdt in the moving frame is the same as dpdt in the rest frame this is Newton’s second law and so we find that this transformation in classical physics leaves the form of Newton’s Second Law invariant at least under Galilean transformations assuming that’s the correct trans transformation of space and time and velocity now this should all work in domains where the speeds are low compared to that of light but we know that the original definition of momentum was predicated on experiments and observations that were all done in that low velocity large scale regime of investigation that is sort of the human scale of speeds and sizes we also know that that wasn’t quite correct the Loren transformation not the Galilean transformation gives the correct way to define relationships between frames great well let’s just take the classical definition of momentum and apply the Loren transformation the correct transformation between frames so when we do this of course we find that the momentum is equal to mass time velocity and we want to view this in the moving frame where the momentum in the moving frame should be mass times the Velocity in the moving frame what well if we insert into this the relativistic transformation of velocities and special relativity we wind up with this nasty thing over here the mass time uus V over the quantity 1 – UV over c^2 that’s the thing we have to insert that contains the velocity of the object as observed in the rest frame and of course the relative velocity of the two frames all right well fine so let’s then transform this into a statement about differential so if I try to write the differential of of P Prime uh in terms of the differential of P the momentum in the rest frame if I do the calculus on this I wind up with this horrible looking thing here and then of course if I do DP Prime DT Prime which would be the change in momentum with respect to time in the moving frame that’s related to the change in momentum with respect to time in the rest frame by this horribly velocity dependent thing here this is bad why is this bad it’s bad because it totally violates the first postula of special relativity the forms of the laws of physics must be invariant across all inertial reference frames but here we see that one frame has that force is just equal to dpdt but in the other frame that very same law is horribly velocity dependent this is not good and rather than throwing the whole concept of momentum out the window what we should do is stop and ask ourselves did we really Define momentum the conserved quantity associated with degree of motion did we do that assignment correctly in the classical regime of physics did we just get the wrong definition is M * U too naive a definition of momentum given now what we know about space and time and invariance in special relativity now in order to come up with a more appropriate and physically correct definition of momentum that is relativistic momentum there are many many alternative approaches to finding the correct definition of momentum textbooks gloss over this because in many cases the framework for coming up with the exact form of this is not really approachable to students at the level uh of a student taking this course so I had to cherry-pick a methodology to motivate where the definition of relativistic momentum might come from and I prefer the method that comes from my colleague Dar aosta so let’s assume that the problem in the original definition of momentum was that of the definition of time used in the time derivative of space momentum was defined as mass time velocity of object U velocity is the derivative of space with respect to time so perhaps it’s that time definition that’s the flaw in the original definition of momentum after all that definition of time did not regard changes from frame to frame as having any appreciable effect on time DT was not necessarily invariant from frame to frame and in fact could have been the root cause of the problem we saw on the previous slide however there is in fact a Time unit that all observers regardless of their states of relative motion can agree on they can agree it exists and it can be measured the same way in a specific frame every time and that is proper time denoted with the letter to so if two events occur and those events are observed by all observers and all frames of reference all observers agree that proper time will be observed in a frame where the two events happen at the same spatial location that is the definition of the proper time it is the shortest time duration measured in any frame by any method of measuring time durations using two events now it’s always possible to find such a frame if you’re not in the frame where proper time is defined you could always accelerate yourself in such a way until you enter the frame where the regularly occurring events that will be used to define passage of time occur at the same place the time in any other frame is going to be given by the relationship between time in that frame and the proper time so in any other frame the time t for a frame moving at velocity V with respect to the proper time frame is simply given by gamma the gamma Factor associated with the motion of that frame relative to the proper time frame times the proper time duration to now we’re talking about inertial frames of reference moving in relative constant velocities with respect to one another and so as a result of that the gamma factors involved here will not be time dependent they are defined using constant velocities of objects or constant velocities of frames relative to one another or both so consider an object moving at velocity U with respect to the proper time frame that in it of itself that object would be a frame of reference that’s in relative motion to the frame in which proper time can be observed so let’s trade the old time derivative in the definition of momentum that is momentum equals mass time the first derivative of space with respect to time for the derivative with respect to proper time that is momentum will now be defined as mass times the first derivative of space with respect to proper time now we want to convert that into any other frame specifically into the frame where the momentum is being measured which may not be the proper time frame and to do that we just substitute for da with the relationship between it and DT and if you do that you’ll find that you now have math time the first derivative of space with respect to time time a factor of gamma so if this is a better definition of momentum one that preserves the second law from Isaac Newton under Transformations from frame to frame then we should be able to show that and the definition that we get from this exercise using proper time derivative instead of just the plain old time derivative is that the moment um of an object viewed from a reference frame is given by the gamma factor of that object relative to that frame times its mass times its velocity as observed in that frame now again I want to be careful here because the gamma factor that appears here is very specific it has to do with the gamma Factor associated with the velocity of that object viewed in the frame of reference the object itself could be viewed as a reference frame of course but because we’re going to start talking about transforming object velocities into other frames moving at speed V relative to the one where we measured it it’s extremely important to realize that there are suddenly going to become multiple gamma factors in your equations some of those gamma factors will relate to the observation of the object and the passage of time relative to its frame of reference and some of the gamma factors will be related to the ative motion of other frames of reference relative to the one in which you’re defining momentum and if that all seems confusing it is and the only way to get better at this is to practice practice practice so the gamma Factor here I’ve denoted especially with a subscript U to indicate that it is not the velocity of another reference frame V that appears in here but rather the velocity of the object itself U and so this gamma factor is defined as 1 < TK of 1 – U ^2 c^2 that’s what gamma with a subscript U is going to refer to now this redefinition of momentum can be demonstrated with a lot of algebraic pain to leave Newton’s Second Law invariant and in fact this is accepted to be now the correct definition of momentum I leave it to the viewer to go through the exercise sketched out on previous slide to transform

    the momentum of an object observed in one frame into another frame moving at velocity V with respect to that first observing frame and show that the form of Newton’s Second Law dpdt remains invariant from frame to frame now any good definition of momentum will hopefully respect the observations of the past that at low velocity the classical definition of momentum seemed to be good enough if special relativity is the more correct General framework for describing space and time then in some appropriate limit in this case low velocity of the object we should be able to recover the classical definition of momentum so let’s give this a try and I’m going to begin by writing the gamma factor for the moving object gamma with a subscript U as a binomial expansion and I’ve used this before in an earlier lecture so hopefully the rhythm of this will begin to look familiar the binomial expansion is very useful for carefully stepbystep exploring what happens when you send a parameter of the theory in this case the velocity of an object relative to that of light closer and closer to one of its limits so we’ll start by writing down gamma subscript U with its traditional definition of 1/ the < TK 1 – u^ c^2 and then we can use the binomial expansion approach to write it instead as a series of terms of increasing powers of the velocity over C so the first term is just one the second term is 12 u^2 over c^2 Etc after that you have terms of order U 4th over C 4th U 6 over C the 6 and so forth those terms matter when U over C is very close to one but when U over C is very close to zero those higher order terms really don’t matter so much compared to the lower order or leading terms in the expansion so now let’s write relativistic momentum using this series expansion of the gamma Factor so I have momentum is equal to gamma subscript U * muu which is now this series expansion * m * U and you’ll notice now that I have an extra U to multiply into the series expansion if I take M * U and distribute it to every term in this series expansion i w up with something that looks like this the leading order term now has a dependence on velocity but the subleading term has a dependence on velocity cubed over c^2 and then the terms after that are Velocity to the 5ifth over C to the 4th or velocity to the 7th over C to the 6th Etc and as U approaches zero that is as the velocity of the object gets much much much much much lower than the speed light essentially as its velocity is sent toward zero any terms that depend on U cubed over c^2 or higher in this expansion are going to vanish they’re going to approach zero much faster than that leading term of mu the leading term will dominate the series expansion as U over C gets very small so I can start from this expanded version of momentum using the binomial expansion and in the limit that the velocity is much much less than the speed speed of light only the first term in the series Will Survive the one that’s largest compared to the others as U over C goes to zero and that’s just m * U we have recovered the classical definition of momentum in the limit of velocities that are small compared to the velocity of light so we can proceed similarly now having had some measure of success with looking at momentum as the quantity the directional quantity of motion thinking about kinetic energy which which is the scalar or directionless quantity associated with motion that can also be conserved so let’s begin to think about kinetic energy in special relativity did we really have the right definition in the old days 1/ 12 mv^ s is that the relativistically correct definition of kinetic energy well we can start by looking at the relationship between external forces changes and states of motion work and kinetic energy when an external Force acts on an object and displaces it over some for instance straight line distance s Vector the action of accelerating this object under the influence of an external Force represents itself a unit of energy being imparted to the object and that energy is known as work work done by an external Force changes the kinetic energy of the object it was in a state of some kinetic energy maybe zero and then a force acted on it and accelerated it and now it’s in a different state of kinetic energy because its velocity has changed that means that the work done by the force has had some action in changing the kinetic energy of the object and according to the work kinetic energy theorem the change in the kinetic energy of an object is directly proportional to the work done by the external Force now the work done by the force on the object displacing it over for instance a linear distance s Vector can be written as the dot product of that external force and that displacement now I’m taking some shortcuts here with the form of the work equation this is for a constant magnitude Force displacing an object over a straight line distance that’s not the general form of the work equation and I will use the general form of the work equation in a moment so let’s assume a constant force acts on an object from the perspective of an observer in frame s and the of course the the form of that force and its relationship to the momentum of that object and the changes in momentum of that object will be given by Newton’s Second Law the force is equal to the change in relativistic momentum with respect to time this is now the correct definition of momentum in that frame and used in any other frame preserves the form of Newton’s Second Law which is FAL ma or FAL dpdt now let’s say the force acts over a small displacement a differential of a path DS vector and at any moment it’s related to the velocity of the object and the time over which the displacement occurs via the fact that the object velocity is the change in the path position divided by the change in time in that frame in other words U Vector is DS Vector DT we can write the differential of work the little bit of work done by that constant force over that little bit of displacement by thinking about the definition of work itself in a more general form that is the little bit of work done in displacing the particle over a little bit of path DS vector by a constant force f is given by the dotproduct of F and DS Vector Now by Newton’s Second Law this has to be equal to the first derivative of the relativistic momentum with respect to time that is what the force should be equal to and again that thing is dotted into DS Vector the little bit of displacement but we can replace DS Vector with its relationship to the instantaneous velocity of a particle under the action of this ex external Force DS is just going to be equal to U DT now to simplify this dot product I’d like to assume that the change in momentum is in the same direction as the force that’s applied on the object so the force is entirely directed in the direction of the displacement or the change in momentum or the change in velocity and as a result of that the dotproduct trivially becomes the product of the magnitudes of the two vectors to find the total work done by the force which is to be related to the total change in kinetic energy if I can find the total work being done by this Force I can absolutely relate that to Delta K the change in kinetic energy and perhaps arrive at the form for the kinetic energy what we’re going to do is we’re going to integrate both sides so by the work kinetic energy theorem the change in the kinetic energy of the object whatever equation that is is given by the work done by the force on the object and that is going to be the integral of this equation here the sum of all the little bits of work should add up to the total work and so that equates to taking the sum of all these little bits here and if I pull out all the con and all of this I’m going to wind up with the mass time the integral of U time the quantity D gamma u u the DTs have canceled out here in this dot product leaving us with just a differential of the gamma U * U well that doesn’t look like a very pleasant integral but there is a way that we can get this into a more pleasing form one that’s more easily solved I’m going to start by rewriting this relationship Delta k = w = m * the integral of the speed * the differential of gamma U * speed to get this into an easier to solve form we’re going to integrate by parts to get a final form for the integral this is using the trick that the for instance integral of UV is equal to UV minus the integral of V du so let’s make some identities between this more general form of the equation and the specific stuff that appears in the integral up here I’m going to identify U as being equal to U that’s straightforward I’m going to identify V as being equal to gamma * U when I do that I can then write U * V which I need here as gamma U * U ^2 and then I need V * du well V * du is just going to be equal to gamma sub U * U * du that’s pretty straightforward try this on your own this will help you dust off your integration by parts but you’ll find that the integral becomes the following the change in kinetic energy is given Now by substituting in using the integration by parts trick as M * gamma U * U ^2 evaluated at the end points of velocity the initial velocity UI and the final velocity UF minus the masstimes the integral of gamma U du again evaluated between the initial and final velocities and if you work through all this you’ll get an equation that looks something like this you have this first term M gamma U * U ^2 plus the second term which looks a bit nastier M c^2 * the < TK of 1 – u^2 over c^2 and we are to evaluate this at the end points of the motion so let’s do ourselves some favors here and assume that the initial speed of the object is zero that means that the initial kinetic energy must also be zero whatever the equation for kinetic energy is that’s got to be true the final speed will just set to be U some final speed U that we achieve and at that point the kinetic energy is K so substituting all this in we find out that the kinetic energy is equal to M * gamma U * U ^ 2 + mc ^ 2 * the inverse of gamma U minus mc^ 2 and rewriting this doing some algebraic gymnastics with the gamma factors and mc^ squ you’ll find that this can be simplified to this lovely little equation here the kinetic energy of a particle is simply given by the quantity of its gamma Factor minus1 * mc^ 2 now I’m going to let you show that last step on your own it’s good practice for the gamma Factor gymnastics that you’ll often have to do in these problems we find that the relativistic kinetic energy is just gamma U minus one all * m c^ 2 m is just the mass of the object C is just the speed of light and Gamma U is its gamma Factor relative to the frame in which the object is being observed you can use the binomial expansion trick once again and I encourage you to try this on your own in the limit that the velocity is much much less than the speed of light and you’ll find that the expression reduces to 12 mu^ 2 the classical definition of kinetic energy these quantities for momentum and kinetic energy have all the right behaviors they don’t look like what they looked like in their assumed classical forms they reduce to their classical forms in the appropriate limit and they leave laws of physics invariant where they can be applied now we’ve looked at momentum and we’ve looked at kinetic energy but what about the total energy of an object in special relativity In classical physics the total energy of an object was just its kinetic energy and if it wasn’t moving it was said to have no energy now that’s not entirely true if that object was being acted on by an external conservative Force it’s possible that that object could have some potential energy associated with it for instance if you raise a ball up in a gravitational field it has some now stored potential energy if you let the ball go it will be released and turned into kinetic energy but for a forcef free situation an object at rest really had no defined energy in classical physics is that still true well we can start by just simply noting that as before the total energy of a body in any system is composed of at least two parts a kinetic part describing the energy associated with its motion and a potential part describing any energy that is stored internally in the system and that could be released by some means now the total energy then is the sum of these two pieces so I will use capital E to denote total energy K to denote kinetic energy and youu to denote potential energy or stored energy we see that kinetic energy and special relativity is the difference of two pieces K is equal to gamma mc^ 2 minus mc^ 2 so if we rearrange the above total energy equation and then plug in this expression for kinetic energy we arrive at an interesting preliminary conclusion so if I take K and solve for that using the above equation I find that K is equal to the total energy minus the stored energy and if I substitute in with this equation I find that K is also equal to gamma u mc^ 2 minus mc^ 2 and by identifying and relating terms in these two equations for K I can draw the conclusion that the total energy of a object is given by gamma u mc^ 2 and the stored energy of an object even one that’s at rest is mc^ squ it’s mass times the speed of light squared so by this identity the total energy of an object in special relativity is given by gamma c^ 2 and in the limit that the object is at rest we see that the total energy becomes not zero but mc^ squar mass time the speed of light squared and we note that the same quantity mc^ 2 has been identified in the above exercise as a kind of energy stored somewhere in the object what’s particularly remarkable about this exercise is that by our own means we arrive at a conclusion that Albert Einstein too arrived at in his miracle year in 1905 it’s one of the most profound conclusions drawn from special relativity that mass is itself a form of stored energy and even when a body is not moving it’s to total energy is not zero but rather decreases to a minimum given by e equal mc² and this latter equation is one of the most famous in the history of science it is an equation that would lead to the development of nuclear weapons nuclear power plants the Pet Scan a non-invasive medical invention the particle collider and many other Technologies taken for granted feared or loved in the modern world for an indivisible fundamental particle for instance the electron is a pretty good example of this we’ve never seen that the electron is made of anything else one has to conclude then when when it’s at rest its energy is the result of some kind of intrinsic Mass a fundamental property of matter just like electric charge appears to be a fundamental property associated with matter now it’s it’s interesting to ask yourself well how much energy if I could find a way to convert it into some other form is contained in the mass of an object well consider the fact that a typical is human being has a mass somewhere in the realm of 60 kg and if by some means all of that could be converted to another form of energy like kinetic energy or chemical energy or radiation then the above equation tells us the energy in Jews that this represents e = m * c^2 which is 60 kg time something that’s about 9 * 10 16 M s/s squar this yields of total energy in Jews stored in your body in the form of mass energy is 5.4 * 10 the 18 jewles now for comparison the energy the little sliver of energy that reaches the Earth from the Sun every second a tiny bit of the total energy that the sun can Emit and yet the same energy that keeps our planet warm and hospitable to Life as We Know It that energy is 10 the 17 jewels the stored energy in the form of mass energy in your body is a factor of 10 more than that and if it even a fraction of it could be converted into some other form of energy it represents a terrifying amount of potential so let’s do an example of this sort of hidden energy of matter by considering the mass that’s lost by a uranium nucleus during fision the process of breaking that nucleus into pieces nuclear fion was itself first discovered by physicists and chemists OT Han and frit stman and this was done in Germany in December 193 38 if you know anything about your history this was the period of Nazi rule of Germany now the observation of nuclear fion especially the fact that the uranium nucleus was observed to split into nearly two equally massed Parts was a bit of a mystery and it was explained very quickly thereafter by physicist Lisa miter and her nephew OT frish the physics Community came to understand that what was going on here just in breaking up the nucleus of a uranium atom was the potential of a vast power that lays in the hearts of all unstable atoms to be Unleashed on humankind so consider the process shown at the left this little blue ball is supposed to represent a neutron one of the components of a nucleus they can be freed from the nucleus and fired at other nuclei a neutron striking a u235 nucleus will set off a chain of events that results sometimes and it breaking up into roughly equal Mass pieces a nucleus of the element Krypton Krypton 92 and a nucleus of the element barium barium 141 now the mass of the unsplit u235 nucleus is given in atomic mass units using this number and I’m keeping the Precision on purpose because small differences when it comes to mass energy matter a lot now the masses of the daughter nuclei Krypton 92 and barium 141 are 83. 798 Atomic units and 137.225 atomic units respectively now I should note that for purposes of conversion one atomic mass unit is given roughly as the mass of a proton 1. 16605 42 * 10us 27 kg now if you check the daughter masses do not add up to the parent mass mass is not conserved in this process it’s lost in the fision process and the amount of mass that is lost is roughly 14 Atomic units even accounting for the fact that three neutrons get produced in the fision process that only adds up to roughly three atomic mass units that’s still 10 atomic mass units or so of energy left over that could be converted into forms like kinetic energy or heat now since we’ve checked that the daughter masses don’t add up to the parent Mass we recognize that there’s missing mass energy here and that mass energy that’s missing is about 2.1 * 10- 9 jewles about a billionth of a jewel now that doesn’t sound like much but consider what’s going on in this cartoon at the left three neutrons have also been produced in this process three neutrons that are bullets that can be fired at other u235 nuclei that might be lingering nearby for instance if you highly enrich uranium to greater than 90% pure u235 it’s possible to set off a reaction of events that cannot be stopped and has catastrophic consequences this process can initiate what is known as a chain reaction as you multiply the fision process over and over and over again using these Neutron bullets that get produced from the first fion process so for instance the first split makes 3 to the 1 neutrons the second generation of splits makes 3 to the 2 neutrons because each of these neutrons goes on to split a uranium nucleus that produces three neutrons so that gives you nine the third generation gives you 3 to 3 or 27 a typical chain reaction in purified u235 can go something like at least 40 to 50 Generations before or this device will blow itself to Pieces that’s a multiplicative factor of about 3 to 45 or 3 * 10 the 21 so you’re taking the energy left over from one split and you’re multiplying it by about 10 to the 21 now those neutrons won’t all go on to split uranium nuclei some of them will be thermalized and will result in dumping thermal energy into the body of the material or into the surrounding air around it if the energy of those neutrons is converted to heat from collisions you’ll find that this level of multiplication is sufficient to explain the explosive yield of the very first uranium Atomic weapon code named little boy which was equivalent to about 13 to 18,000 tons of TR Nitro toine or TNT being dropped on a single City that’s 54 to 75 trillion jewles of energy that weapon devastated the Japanese city of Hiroshima at the end of World War II so we can see that a little bit of mass energy goes a long way and it can have positive applications in society you can have negative applications in society but all of this stems from the Revelation that energy and mass are not distinct from each other now in classical or Newtonian Galilean physics there is a relationship between momentum and kinetic energy we know that k equal p^2 2 M go ahead and try it yourself if you’ve never seen this before convince yourself that this is true in classical physics P = MV K = 12 mv^2 do the substitution there’s a relationship between kinetic energy and momentum now in the more correct description of space and time given by the special theory of relativity we have kinetic energy mass energy and momentum what is the correct relationship between these things let’s begin with the momentum equation that is momentum is equal to to gamma U * m * U let’s then insert a sort of clever multiplicative one multiply this equation by C over C which has the effect of just multiplying the equation by one but allows us to distribute the C in a useful way we can take the denominator 1/ C and move it to the left and Associate it with the velocity of the object U so we wind up with a term of U over C in this equation now we know that the equation for total energy has c^2 and Gamma u in it and Gamma U depends on u s over c^2 they’re related to each other so I recommend you you try squaring this above equation Square P which then squares this thing on the right hand side gamma U * m * U over C * C and when you do that you get this equation here now if you then use the fact that u^2 over c^2 can can be related to gamma by 1 – 1 / gamma ^ 2 you can then insert that and you find that P ^2 is equal to m^2 c^ 2 * the quantity gamma U ^ 2us 1 now if you stare at this for a moment you’ll notice that this equation has a piece in it that’s awkwardly close to e^2 e^2 the total energy would be given by gamma ^ 2 m^2 C 4 so multip both sides of this equation by c^2 we wind up with p^2 c^2 on the left this is going to be equal to m^2 C 4 * the quantity gamma ^ 2 – 1 if we then distribute the m^2 C 4th into the parentheses we wind up with this equation and we can identify the first piece here as e^2 and the second piece here as m^2 C to the 4th or the square of mass energy so putting it all together we find that energy and mass and momentum have a relationship to each other and it’s an elegant relationship between an object’s total energy its momentum and its mass energy in special relativity and that relationship is given by this quadratic equation e^2 = p^2 c^2 + m^2 C forth now this equation allows us to think about some cases of certain kinds of particles and one very interesting special case is to look at particles that have no intrinsic mass now the electron is a particle with intrinsic Mass the muon is another example of a particle with intrinsic Mass albeit 27 time that of the mass of the electron but we can ask ourselves what if there is a part particle out there in nature that has no intrinsic Mass can it exist and if it did exist what would its properties be well let’s take a look at that we can use these relationships to study this very special case now it will turn out that photons which are the particles involved in light have never been observed to have an intrinsic Mass they behave as if they have no mass at all so let’s go ahead and take that exact limiting case of m equals 0 and if we plug that into the energy momentum and mass energy relationship we find that we’re left with e^2 = p^2 c^ that is we can take the square root of this and say that the total energy of a massless particle is given by its momentum times the speed of light the total energy of a massless particle is entirely energy of motion in other words if such a particle could be stopped from moving you would have to interpret it as them ceasing to exist their total energy would suddenly become zero but of course that violates the conservation of energy you can’t just make energy go away without consequence so this implies that such particles can actually only be stopped when they’re removed from the natural world by being absorbed into another process now you might then feel emboldened by this and say aha well this is great uh I’m going to go ahead and figure out what e and P are for massless particles but then you very quickly run into a problem and that is that e depends directly on M and P depends directly on M as defined in special relativity and so you get no useful information from these equations from special relativity special relativity can’t give you otherwise useful information about what the total energy actually comes from and what the momentum actually comes from for such particles so what is it that defines energy and momentum of a common particle like a photon which so far as we know has no intrinsic Mass no mass energy well to answer that question we’re going to have to wait a little bit longer and see as we enter the next phase of this course so to conclude this lecture let’s look at what we have learned we have learned how to Define kinetic energy and momentum while in incorporating the principles of special relativity and in doing so we’ve learned something deep about the nature of mass and we’ve learned to appreciate that there is intrinsic mass in nature and that mass in general is associated with a kind of internal energy of all objects an object at rest does not have zero energy it has internal energy given by MC s we’ve also learned about the relationship between energy momentum and mass we’ve looked at some applications of the relativistic concept of energy momentum and mass and we’ve left ourselves with some questions that we can hopefully resolve by delving deeper into nature in the next phase of the course before we get started on this lecture on the first glimpses into the general theory of relativity I want to kind of put a little cautionary warning label at the beginning of this video in any textbook at the level of a course like this and certainly in this lecture video I don’t want you to walk away with a feeling of full confidence that you have completely ly understood the generalization of the theory of space and time that Einstein set in Motion in 1905 with what we call the special theory of relativity relativity is an extremely rich subject you can quite literally fill volumes on this particular bit of material and in fact I’m holding one of them in my hands now that I’ll show you later the general theory of relativity is fundamentally built on a rich and complex set of mathematics that students at the level of a course like this simply have never seen and cannot be expected to master in a week or a month or even three months without really first having had the full breadth of undergraduate mathematics now that said there are nuggets of of ideas and Mathematics that one can draw out of the general theory of relativity and use to motivate in the context of special relativity the implications of the grander theory of space and time since 1916 when Einstein first established the calculational framework the reliable calculational framework that set the stage for the general theory of relativity in all the work that would be done with it students have struggled with this material because it challenges many preconceived notions many Concepts that we walk into any standard science class cherishing so I want you to be a little bit forewarned first of all that the Nuggets that we will draw out of the general theory of relativity and analyze in the context of special relativity can have some stunning implications that either will challenge things you already believe to be true or which open your eyes to the grander scale of the cosmos that we inhabit now I I mentioned this book earlier this book is one of the seminal works in the field of physics on the whole of the general theory of space and time and it’s entitled gravitation its three co-authors are Charles misner Kip Thorne and John archabald Wheeler now all three of these individuals each in their own way are considered some of the brightest lights of 20th and 21st century theoretical physics and this book is expansive in its treatment of the subject I mean look first of all at how thick this book is and if you flip through this you will quickly see that most of us would be out of our depth in the level of mathematical rigor and notation and variety and subject matter that is minimally expected in order to follow along with a text like this certainly to its Bitter End all of this is to Simply point out that the general theory of relativity is complex and Rich and mathematically far beyond the scope of a course at this level now that said we can draw buets of ideas out of the general theory of relativity and we can put them in context in our own course experience for a course at a level like this one in modern physics now some of the names on here may seem familiar to you Kip Thorne for instance has become recently famous not only for winning the Nobel Prize in physics for one of his key bits of work on space and time energy and matter and the theory that links them together in general relativity but also because he has served as an advisor to film and TV including things like the movie Interstellar from 2014 which had some of the most advanced visualizations of physics based on the general theory of relativity in any movie that came before it or since then John Wheeler is another bright light in the field of theoretical physics he will feature brief briefly later on in this lecture in the context of an individual who could not only deal with the mathematics of this subject but elegantly communicate to an audience even at the level of our course the grandest sweeping summary of the general theory of relativity and its implications for energy matter space and time so with all of those caveats in mind let’s pluck some interesting nuggets of ideas out of the general theory of relativity place them in a local context and special relativity where we feel more comfortable with the mathematics albe it with the caveat that the mathematics required to really do this treatment is far beyond our grasp at this stage in a Physics course at the University level without the full breadth of undergraduate mathematics behind us quite yet let’s see what those nuggets tell us should be revealed about energy matter space and time and then let’s look at how those ideas have implications for the whole structure of the cosmos in which you and I live so with all of those things in mind let’s start digging into some of the basic ideas that motivated the general theory of relativity and take a look at some of those nuggets of ideas that we can couch in the picture of special relativity that we’re a bit more comfortable with at this stage of our engagement in physics in this lecture we will learn the following things we will learn about the transition and thinking from the special to the general theory of relativity this will by no means be comprehensive but merely a taste of some of the basic ideas that led Albert Einstein ultimately to construct this theory of space and time we’ll look at some implications of the general theory the of Relativity on specific physical phenomena and we’ll look at some of the large scale implications for space and time let’s talk a little bit about the transition from special to general relativity experiments on the speed of light in the first and second Decades of the 1900s continued to yield no disconfirming evidence for the postulat of special relativity now Einstein’s physics work in 1904 and 1905 and Beyond earned him the faculty position that he had so richly sought after his PhD and he was able to escape the job as a patent clerk in Burn Switzerland and finally take on the mantle of the academic position that he had hoped for after earning his PhD he was sure that the special Theory itself self could be generalized to a complete theory of space and time including he hoped an explanation of the nature of gravity itself the very prize that had eluded Isaac Newton all that Isaac Newton could establish was that his law of gravitation correctly described the behavior of gravity on all scales that could be observed at the time Einstein ambitiously pursued The elusive prize that Newton could not grasp and that was to finally unmask the nature of gravity itself now this work would take another decade of struggle Einstein would fail many times and in fact if you look at the record of Einstein’s work in the decade that followed his miracle year there were some serious missteps in papers that he had published based on his work beyond the special Theory of Relativity and had anyone been able to mount an experimental test of the claims of the general theory of relativity prior to 1916 Einstein might have been laughed off the scene of physics but it took more than 10 years to be able to make an observational test of one of the key predictions of the general theory of relativity and that ultimately saved Einstein buying him the time he needed needed to fully develop the mathematics behind the general theory put on firm ground theoretically speaking the predictions of the theory of general relativity and to finally publish key papers in 1915 and 1916 that are considered the first accurate and fairly complete treatment of the subject from Einstein himself now interestingly in order to lay the groundwork for the general theory of relativity Einstein required much of the advanced math that he had issued during his time in graduate school one of the things that offended many of the faculty that had him in his classes he was forced then to go and actually relearn subjects that he had actively avoided in some cases during his PhD education he benefited from a close network of friends who were outstanding physicist assistant mathematicians in their own right and through his network of friends he was able to build his own Foundation strong enough to eventually lead to his key insights and firm mathematical grounding of the general theory of space and time in this lecture we will explore some of the very basic ideas and tease the larger implications of the general theory it’s very difficult at this level as I have warned you before to give you the full treatment but I will do my best with the aid of the textbook that we use in class to attempt to communicate some of the key nuggets and frame them in the language of special relativity which we’ve developed more carefully over the last few weeks let’s begin with a tale of two masses we take for granted in introductory physics that Mass appears in a large number of equations but if you really boil it down mass as a concept appears in two distinct equations in introductory physics the rest of the equations that we use that involve Mass can be found to stem from these two laws of nature now what’s interesting and what may not have been pointed out in introductory physics was that the two equations do not necessarily have anything specific to do with one another as regards Mass itself the two equations in question are Newton’s second law of motion FAL ma where Mass appears as the multiplicative scalar factor in front of acceleration the constant of proportionality between what an external force of a general nature exerts on a body and the responding change in the state of motion the acceleration of the body but Mass also appears in an equation that describes the nature of a very specific fundamental Force Gravity the law of gravity states that the force of gravity between two bodies which we might label one and two is proportional to the product of their masses divided by the square of the distance between them now the M for mass that appears in Newton’s second law has to do with inertia the tendency of a body to resist changes in the state of motion and so it’s more honest to say that Newton’s second law is concerned with a mass concept we might label inertial mass the mass of a body that resists changes in state of motion but the m that appears in Newton’s law of gravitation has to do with the primary cause of the gravitational force between two bodies that have mass this doesn’t necessarily have anything to do with their tendency to resist changes in the state of motion it has all to do with the degree of the gravitational attraction between the two bodies this is more honestly referred to as gravitational Mass potentially distinguishing it from inertial Mass there is nothing in these two laws that says that these two quantities these two kinds of Mass have to be fundamentally the same and yet their equivalence the equivalence of inertial mass and gravitational Mass has been tested to a remarkable precision inertial mass and gravitational Mass appear to be one in the same let’s take a look at this by briefly stepping through some mathematics now couched in the language of inertial mass as potentially distinct from gravitational mass and revisit some conclusions we drew in introductory physics so a consequence of their equivalence is often taken for granted what if they weren’t equivalent well if they weren’t equivalent then we would rightly state that two bodies that are acting on one another through the gravitational force can have their degree of acceleration explained by Newton’s second law but without necessarily equating inertial mass and gravitational mass for example if we consider the Earth to be pulling on a body say you you jump off the surface of a table in an attempt to accelerate down to the floor and land on the ground the Earth is attracting you down toward its Center we can figure out the local degree of acceleration due to the gravitational force by taking the product of Newton’s gravitational constant G the mass of the Earth and dividing by the distance between U and the center of the earth squared now we multiply that acceleration which we often denote little G for gravitational acceleration times your gravitational mass and we set that equal to your inertial mass times your total acceleration now if we then solve for the acceleration due to gravity we find that this would be equal to this gravitational acceleration little G or Big G times the mass of the earth divided by the distance between U and the center of the earth squared times the ratio of of the gravitational mass and the inertial mass and if we were to substitute for little G 9.81 m/s squared here we we would conclude that if gravitational mass and inertial mass are not the same if for instance the gravitational Mass were 10% of the inertial mass that only 10% of your inertial mass has anything to do with causing the force of gravity well then we might conclude that your acceleration might be very different than a body that has more mass but we don’t observe that when two objects of different masses fall in a uniform gravitational field a field of uniform gravitational acceleration if you will all bodies even though they possess of different masses appear to fall at the same rate and so by I you can already draw the inference that gravitational and inertial mass are if not equal to each other very close to one another and as far as the limits of our ability to test this have taken us we’ve never seen a difference between gravitational and inertial mass they really seem to be one and the same within the limits of experimental methodology this leads us into one of the key insights that Albert Einstein had early on in the process of trying to generalize the ideas of space and time to include gravity this is summarized by the phrase the equivalence principle now I’ve pointed out that observationally there seems to be an equivalence of gravitational and inertial mass and this can lead you down the path as it did along with some thought experiments for Albert Einstein to a larger consequence and that is the principle of the equivalence of a system accelerated by a constant force or alternatively experiencing a constant gravitational field the principle of equivalence in the language of Relativity and space and time is about the equivalence of two different situations one in which a system is experiencing an external force of some kind that causes it to change its state of motion like a a rocket or something pushing on something else the equivalence of that system and a system that’s experiencing a gravitation acceleration that pins objects to a low point in the system and I will illustrate this Einstein observed early on in his thought process about all of this that due to the equivalence of these two kinds of mass inertial mass and gravitational Mass there’s really then no difference between being under the influence of a uniform and constant gravitational field or source of gravitational acceleration or instead being placed in a non-inertial reference frame one where there is an observed net force acting on all the parts of the system by the action of some other kind of external Force this picture illustrates the idea in a rather cartoonish but elegant way the scenario I like to have that goes along with these set of pictures is the following imagine you wake up and you find yourself in a room with no doors and no windows there’s no way to see past the walls of the room at all as I like to joke with people this is like the premise of the opening scene of some kind of cheap horror movie you push yourself up from the ground you feel gravity pulling you down and you have to work against gravity to raise yourself up that’s what it feels like to you now on the floor next to you was a red ball you leaned down down and you lift up the red ball and you hold it out roughly at arms length and level with your shoulder and you let the ball go and indeed You observe that the ball falls down to the floor of the room you then check your watch think about the average height of a human being measure roughly how long it takes that ball to make that drop from shoulder height to the ground and you’re relieved to find out that you seem to still at least be on earth albeit you have no other external information to tell you where you are because the ball appears to fall at a rate of acceleration consistent with g at the surface of the Earth 9.81 m/s squared but in reality in this opening scene of this cheap horror movie the camera zooms out and gets a view from outside of the enclosure in which you have woken up and reveals that you’re not on Earth but rather far from all planets and stars in empty outer space being accelerated upward from your perspective by a rocket that you can’t hear through the soundproof and vibration prooof walls of your little prison and that rocket is accelerating you Upward at 9.81 m/s squared so from your perspective in the sealed room you think that objects are falling down in a gravitational field or that you have to work against a gravit field to lift yourself off the floor but in reality what’s going on is the entire system is being pushed by an external Force experiencing an acceleration in One Direction of 9.81 m/s squared which gives you the illusion inside the room that you’re in a gravitational field even though you’re not how would you be able to tell the difference between these two situations a soundproof vibration prooof windowless doorless room with no external reference information to tell you that you’re moving or not in outer space and a gravitational field on Earth under the same conditions where yes you’re on the surface of a planet but you have no external information that tells you that a ball dropped in either of those two environments will look and behave the same way and it was this Insight or a variation on it that led Einstein to realize that a constant acceleration due to gravity is no different from taking a reference frame and accelerating it at a constant rate as a visual test of the equivalence principle let’s see if you can tell the difference between the following two situations a zero gravity environment and an environment that’s in freefall in a gravitational field take a look at the video on the left and the video on the right which one do you think is shot in a zero gravity environment which one do you think is shot in a freefall environment where a gravitational field is present the answer is that neither of these is in a zero gravitational field environment this may surprise you maybe you recognized somebody in one of the videos and said aha that person’s an astronaut therefore this video must have been shot in zero gravity but in fact both of these videos are depicting life in a locally inertial reference frame in freef fall in a gravitational field the video on the right is shot in something known as a reduced gravity flight an airplane that makes a parabolic Arc through the sky and briefly enters freef fall in the earth’s gravitational field close to the ground the video on the left is shot in the International Space Station the International Space Station may be far above the surface of the Earth but the acceleration due to gravity is actually quite strong in its orbital position however the International Space Station is orbiting the Earth every 90 minutes and as a result of this circular motion it’s actually in freef Fall constantly it’s just missing the Earth because it’s moving to the side every time it falls down a little bit it’s almost impossible to the human eye to tell the difference between life in a freefall frame of reference in a gravitational field and life in a zero gravity environment that’s no accident that’s the equivalence principle in action Einstein then defined the concept of a locally inertial frame by imagining not this situation I’ve described here but a system in which a person for instance is in freefall in an external gravitational field the concept of a locally inertial frame of reference is one in which all parts of the system are experiencing a constant acceleration due to gravity but because all parts are accelerated the same way it’s as if the system is entirely free of any external forces it’s as if everything is in an inertial reference frame with no external forces because all of you are accelerating at the same rate at the same time this is an incredible Insight it may not seem that impressive but it frees you very suddenly from thinking of gravity and acceleration of an entire reference frame of things as different things and it was this Insight that freed Einstein to think about gravity in a completely new way as another aspect of space and time the key idea here is that without external information in any of the situations I’ve just described being in free fall above the surface of the Earth um or being inside a sealed room with no windows and no doors that’s vibration resistant and sound resistant without external information there’s absolutely no experiment you could do in any of those situations that will tell you that the system is either away from a gravitational Source or simply in freefall in a gravitational field these all seem like inertial reference frames as a result of that now since there is no difference between gravitational acceleration and the ACT of changing a whole reference frame into a non-inertial reference frame you can analyze phenomena in a situation for instance where an inertial frame of reference is considered instantaneously inertial that is although it experiences overall some acceleration like taking a whole room stopping a rocket to the bottom of it and accelerating the entire room and all its contents up at 9.81 m/s squared you’re doing that equally and fairly to all parts of the system and so so at any moment in time all elements of the frame will have the same velocity now of course if you take an object off the wall of this frame of reference hold it out and drop it it will appear to fall down because once you let go of it it’s no longer part of the frame of reference it’s not bound to it in any way and so uh it will appear to fall down as if under the influence of an external gravitational field but of course what’s really happening is that the floor of the reference frame is being accelerated up toward the object now read from the bonds of the reference frame at 9.81 m/s squar perspective is everything it’s the relativity of whether you’re falling in a gravitational field or whether the floor is rushing up toward you at the same rate it’s the ambiguity in those perspectives that lead to the key insights that blossom into the general theory of relativity so in order to help us to picture this let’s consider reference frames in the same way that we’ve done the this in special relativity before let’s imagine a frame that’s we’re going to always take to be exactly and absolutely at rest we choose which frame that is and then we Define it as the rest frame it’s our choice to make it doesn’t matter which one we pick I’m going to choose this one with X and Y coordinates as the absolute rest frame now the frame over here with our friendly observer in it it’s labeled as having its own axes X Prime and Y Prime for instance uh and I’ve exaggerated the x-axis here only because I’m going to need some room on this as I start to play around with it but at first at Time Zero in our little thought experiment here the rest frame and this Frame are in the same state of motion the velocity of what will become the moving frame is instantaneously zero at time zero and so it to instantaneously at Time Zero is a rest frame but this Frame which I’ve labeled with prime notation ation is actually experiencing a net constant acceleration a and in the next instant of time its velocity changes from zero to something non zero in this case it goes to being a Teensy bit above zero a little differential of velocity DV above zero in the IAT or positive X Prime Direction so it was at rest same as the actual rest frame and an in later it is no longer instantaneously at rest instantaneously now it represents a moving frame S Prime at velocity DV relative to what we consider the actual rest frame but that acceleration continues to act and so in the next instant of time the velocity is increased again to twice DV and so now while at this instant of time it’s another inertial reference frame albeit with a different velocity relative to the rest frame it is the result of an acceleration that has been acting on the system the whole time from Time Zero to time one to time two at each instant in time this frame is inertial because it has a well-defined velocity at that moment with respect to the rest frame but overall we can clearly see that this is a non-inertial reference frame one that is experiencing a net acceleration and a person in that frame would conclude overall that there must be some external force acting on the system because they will see objects freed from their frame of reference to behave as if an external force is acting on them now let’s start to dig a little bit into some of the implications of the more General view of space and time now that we freed ourselves and allowed for the equivalence of a gravitational force to a frame that’s non-inertial experiencing an external acceleration by any means necessary Now using this imagery of frames that are instantaneously inertial but overall non-inertial frames of reference let’s analyze an observation of light that has been emitted during this period of slight accelerations of the frame of reference S Prime so let’s consider a light source this black dot here that’s pegged to the Y AIS of frame S Prime it’s fixed in that frame it’s bolted to the wall we can consider the y- AIS to be a wall in the frame of reference the x-axis is like the floor of the frame of reference the person is firmly rooted on the frame of reference and they’re experiencing only a slight acceleration it doesn’t totally knock them off their feet to be accelerated it’s a very gentle acceleration the speeds that we will consider always in these examples will be very much less than the speed of light this will help us to get at the implications of general relativity without having to dig into the full General mathematics of Relativity which is much harder so this is our situation at Time Zero we have our happy Observer they’re looking at this light source on the wall that could emit a pulse of light at any time and in fact at Time Zero we’re going to allow the light source to send out a wave of light so at that moment T equals z it emits a wavefront but at the same time the frame is accelerating it’s been accelerating and instantaneously at time zero its velocity happened to be zero so it was in our rest frame as we’ve defined it but at that moment Time Zero the light source pulses emits a wavefront and that wavefront of course being light is going to travel at a velocity of exactly C from the perspective of any observer in any frame of reference and it will travel from the left to the right from the uh above the origin of the S Prime coordinate system where the light source is pegged toward the Observer over here at some other coordinate along the horizontal axis in frame S Prime so we have a light wave a wavefront traveling at Sea released from its prison in the light Source at Time Zero now at some time later the light wave will cross the gap between the light source and the light Observer and the light Observer will see it but what will the Observer see because in that time that it took the light wave to cross the Gap the frame has changed its state of motion from zero to some velocity DV will the The Observer see the light wave as it was emitted from The Source or will they see something else let’s take stock of the key elements of this question from the picture that I showed you on the previous slide first the light was emitted originally from a source that was considered to be at rest that light was emitted with frequency F at the source F source and wavelength Lambda also defined at the rest frame of the source now the light source and observer remain a fixed distance apart the entire time in this question because they both accelerate together the system is experiencing only gentle accelerations not enough to knock the Observer away from their spot on the xais so they remain planted at their position the light source is bolted to the wall the whole thing is accelerating together at the same rate so their state of motion is changing instantaneously in the same way at every moment of time and so there is no change in the distance call it capital L between the source of the light and the Observer of the light light travels at C 2998 * 108 m/s no matter the frame of reference in which it was observed it was emitted in a frame that was at rest it will be observed in a frame that is moving but no matter the state of motion or the change of the state of motion of that frame of reference The Observer if they were measuring the speed of this wave is always going to say it moved at C the light will take a finite nonzero time to travel to the Observer from its source it has to cross a gap that’s going to take some time and by the time the Light reaches the Observer they will have entered a state of nonzero velocity from the cartoon on the previous page they went from zero to 0er plus a little bit due to their acceleration and I’ve called that little bit a differential of velocity DV therefore the light wave in the end will be observed in a frame of reference that is now moving with respect to the frame of the source which had been a rest frame what does this sound like this sounds an awful lot like a Doppler shift problem light being viewed in a frame that’s moving with respect to the original frame of emission this basic Insight then guides the math within the framework of special relativity that we can do to calculate just what the Observer will see so let’s do some very basic calculations with this building on top of all of the stuff that we’ve been looking at over the previous lectures and time in our course I have emphasized this before I’m going to codify it now I want us to assume that we are in an instantaneously inertial reference frame that is at any moment in time we have a definite velocity that’s well defined albe it changing from moment to moment to moment to moment we want the velocity of that frame to be greatly less than c not even close to the speed of light less than 1% the speed of light or even smaller and that’s so that we can have V over C which we’ve previously defined as this nice number beta to be much much less than one this is going to come in handy very quickly in this problem let’s assume again as I pointed out before that the distance from the light source to The Observer which is fixed this whole time is some length L because the light source travels in the same frame as the Observer L remains constant the whole time the light will take a time which will denote delta T T2 minus T1 T1 being the time of emission T2 being the time of observation and that’s going to be given by L / C light has to travel across a gap of length L it does so at a fixed speed C the speed of light the time that will take is L over C full stop now in that time delta T the frame of the Observer will have accelerated by an amount a up from rest to a velocity V and we can actually then analyze this using the very same equations of motion from introductory physics which are still valid here for the conditions that we’re we’re looking at there we could relate initial velocity to final velocity by considering the acceleration of a system and the time over which the acceleration acts this equation will do nicely the final velocity V will be equal to the initial velocity v subscript 0 or V not plus a term that’s the acceleration times the time that has passed over which the acceleration has acted now we can plug in some specifics here for us V will be equal to zero the initial velocity of our instantaneously inertial reference frame plus the acceleration time L over C which is delta T the time it takes for the light wave to get from the source to The Observer this then leads us to the conclusion that V is equal to a * L / C and if we transform this into a expression for beta we find that beta which is V / C is given by the acceleration times the distance divided by the speed of light squared now let’s take this information and let’s put it into the context of the Doppler shift the relativistic Doppler shift specifically the special relativistic Doppler shift so we’re going to treat the case of small velocities relative to light beta is a small number the Doppler shift of the light wave by the time the Observer sees it will be given simply by what we did before we take the frequency of the source we multiply it because the Observer is is in a frame that becomes a frame that’s moving away from where the source was we have to multiply by the square root of the quantity 1 minus beta / 1 plus beta this represents a lengthening ultimately of the wavelength of the light a red shift but we want to get acceleration of the frame the distance between the light source and the Observer and the speed of light into this equation we want to put these things from our picture into this equation and the way we can do that is by doing some binomial expansions of the numerator the square < TK of 1 minus beta and the denominator 1 over the < TK of 1 + beta well if you do those two things and multiply them do the binomial expansion of the < TK of 1 minus beta of 1 over theare < TK of 1 + beta multiply those together you find you get expansion products that look like this 1 minus a half beta plus terms that are higher order in in beta and 1 minus a half beta the same thing again plus again higher order term terms in beta which I’ve just left out but indicated that they’re supposed to be there from these three dots the product of these things is multiplied by the frequency at the source of emission now because we’re working in the case that beta is a number much much much much much smaller than one because V is much much much much much smaller than C we only have to keep the leading terms in all of this and if we multiply out this product and then only keep the the the leading terms in in beta we wind up finding out that this product is approximately equal to just 1 minus beta all times the frequency at the source of emission now we have an expression for beta in terms of the acceleration of the frame the distance between the source and the Observer and the speed of light and if we plug that in we get a final form of this approximate equation for the frequency that the Observer should see the observer in the moment after the light has been trans Ed they accelerate they get up to a a velocity V relative to where the source had been and then they observe the light they will see the frequency shifted by an amount of 1 minus the quantity Al over c^2 this represents a shortening of this represents a decrease in the frequency relative to the source or an increase in the wavelength of the light you can play around with this yourself and convince yourself that that’s the case but we basically conclude that the Observer who now at the moment of observation has been put into a new frame that’s not that in which the light source was at rest when that emission had originally occurred uh will now observe the light to appear shifted from its source frequency and in this case it’s a red shift if the Observer winner frame accelerating in the opposite direction in the direction from The Observer toward the source rather than from the direction of the source toward the Observer so a becomes minus a then the light would instead appear blue shifted shifted to smaller frequencies or shorter wavelengths so let’s think about light as viewed in an accelerating frame of reference we found by making this approximation that we have a frame of reference that’s all accelerated at once so that the light that was emitted at Time Zero is a observed by an observer in a frame that’s no longer at rest with respect to where the source of admission had been that the Observer will see a frequency as they accelerate to the right in the direction from The Source toward the Observer they’ll observe A reduced frequency of the light a lengthening of the wavelength but let’s dig back to the equivalence principle the equivalence principle states that there is no difference between an entire High frame of reference that’s all experiencing an acceleration due to some external force and a frame of reference that is merely experiencing an external gravitational acceleration as a result of the equivalence of these two things one is forced to conclude that the shifting of light must also occur in a gravitational field of acceleration in other words if the source of that acceleration is GRA gravity you know for instance a equal G * the mass of the Earth ID the radius of the Earth squ because you’re

    standing on the surface of the Earth that’s just 9.81 m/s Square the old little G from introductory physics and imagine instead we’re viewing light from a source above us we are Downstream in the gravitational field and there is a light source down on the ground below us sort of Upstream in the gravitational field we would conclude that the light as we observe it emitted up from the ground toward our eye must be shifted in this case a red shift in frequency this phenomenon is real it has been confirmed repeatedly by experiments over and over and over again and we’ll look at some of those through problem solving in the class it’s a real phenomenon it must be taken into account when you are dealing with electromagnetic radiation and gravitational fields and it’s known as the gravitational red shift or depending how the problem is set up gravitational blue shift of light in this example if we were laying on the ground looking up at a source that’s above us and looking at light that’s emitted down toward us because we are further Upstream in the gravitational field of acceleration we would see the light in that case as blue shifted it’s equivalent to switching around the acceleration sign now this very same phenomenon the red shifting or blue shifting of light merely because of its transmission in a field of gravitational acceleration has other implications including for the very nature of the passage of time in different parts of a gravitational field of acceleration so by implication from this previous example the Doppler shifting of Light by a gravitational field one can also predict that time itself will pass at different rates at different heights different locations in a uniform gravitational field we saw that the frequency of light in a gravitational field is altered depending on the degree of acceleration if I increase the acceleration of a frame of reference or equivalently increase the amount of gravitational acceleration a system experiences I will increase the Doppler shifting effect frequency of course looking back at the discussion of waves and the Doppler shift and other things related to waves frequency is a measure of the rate at which events happen the time between events effectively so consider observing time at a height zero above the surface of the Earth we’ll call that person the lower Observer somebody right at ground level looking at time passing Say by looking at pulses of light or ticks of a clock or something like that and instead a person who’s way higher up more uh Upstream in a gravitational field of acceleration a higher Observer also looking at their clock or their light pulsing or ticking away from our exploration of frequency and period we know that the frequency of a wave is given simply by one over the period of the wave you can think of that as the passage of time between regular spaced events so the period is just a difference in time it’s a delta T and so really frequency is another way of saying that we’re looking at one over a time difference between regularly spaced events in other words frequency is really probing time structure now if we were to be looking at the time between regular events at our higher altitude in the gravitational field uh this would be related to one over the frequency of events at that higher altitude and we already know how to relate those through the Doppler shift to the frequency of events at the lower altitude we just have to take this Doppler shifting uh equation again do the binomial expansion and we find out that we are just uh multiplying the time duration at the lower altitude by a quantity 1 minus beta and because we are exper exping a gravitational acceleration here height H above the surface of the Earth this is equivalent to 1 – G c^2 in this approximation 1 minus say 9.81 m/s squar times your height above the surface of the earth divided by the speed of light squared you take that quantity and you multiply it by the duration of time between regularly spaced events at lower altitude and you get the time at higher altitude and so as a consequence of this we expect time to pass more slowly for observers who are lower down in a gravitational field if you were to take this to some extreme imagine a person deep down in a gravitational field they might experience an hour but a person higher up in the gravitational field might observe that days weeks or months pass depending on the degree of difference of location in the gravitational field time that passes higher up in a gravitational field is always multiplied by a number whose value is less than or equal to one meaning that less time passes lower down in the field this is a real effect and this effect has been confirmed experimentally over and over and over again and it plays a major role in the operation of Key Systems to Modern existence such as the global positioning satellite or GPS system all modern navigation typically relies on a system of about 24 satellites each satellite orbits the earth twice per day so it’s moving very fast around the earth as a result of that these are not so-called geosynchronous or geostationary satellites that always sit above the same point on the surface of the Earth rather the GPS satellites orbit and they make about two rotations around the earth per day three satellites at any time are required to make a triangulation measurement on the surface of the Earth and they do this using very precise clocks that they carry along with them that have been synchronized to clocks on the ground and this system allows you to make position measurements on the surface of the Earth but the problem is first of all that those satellites are traveling actually very fast relative to the surface of the Earth so they experience a special relativistic time dilation observers on the ground would claim that their clocks are running a bit more slowly than an equivalent clock on the ground because they’re moving and people on the ground argue that they’re at rest so there’s AAL paral relativistic time dilation effect but in addition because humans who are down on the ground making these observations are lower in a gravitational field an observer on the GPS satellite would argue that well okay that’s true there’s a special relativistic effect but there’s also a gravitational effect a general relativistic effect because the clocks on the earth that we’re supposed to be synchronized to are lower down in a gravitational field than the clocks in orbit around the Earth and so for those clocks there’s a general gravitational slowing of time and these two factors must be taken into account in the modern GPS system and in fact any guideline document that you look at for engineering systems for the GPS network will warn you about these Corrections spell them out for you and tell you how to do them so that you can properly synchronize clocks taking into account all of these time effects between the ground and in orbit around the earth these are real effects with real consequences on things like basic day-to-day navigation and without the general and special theory of relativity we would never have understood these had we launched a GPS system before understanding space and time at this level we would have failed to construct a working GPS system now one other implication of general relativity and this can relatively kind of quickly looked at in a cartoonish Way by referring to our uh accelerated frame of reference our sealed vessel um this other effect that we’ll take a look at here is the deflection of Light by a gravitational field now this might seem novel to you but in reality the deflection of Light by a gravitational field the falling of light near the surface of the Earth was not a new idea in the time of Albert Einstein it was actually quickly realized within certainly decades or a century after the work of Isaac Newton had established the laws of mechanics and gravitation that since all objects regardless of their Mass fall at the same rate in a uniform gravitational field think of dropping a wated up balll of paper and a bowling ball at the same time from a few feet above the ground if you drop them so that their bottoms are starting at the same height they’ll hit the ground at the same time the mass of the paper and the mass of the bowling ball seem to play no role in the rate at which their velocity changes as they head toward the surface of the Earth well if Mass doesn’t matter for gravitational acceleration then even one might argue a massless phenomenon like light should fall in a gravitational field now the specific reason why this would happen was put on much firmer footing thanks to the equivalence principle and I’ll walk through an example of that argument here so consider the cartoon at the right we have our sealed vessel it’s sound and vibration proof no windows no doors no way of knowing whether you’re on Earth or far out in space away from all planets and stars now in reality this system is being accelerated upward by a rocket you can neither hear nor feel nor see and it’s doing so at 9.81 m/s squared constantly so you’re in this sealed room there’s a light source on one wall and and you can push a button and fire a wfront a pulse of light across the room so that it strikes the wall on the other side now at the moment that the pulse is emitted and that’s Illustrated here on the left the line connecting its location of emission points straight across the room to a point on the other side of the wall but by the time that the wave reaches the other wall and that’s Illustrated here on the right the wave freed from its connection to this frame of reference that in the meantime has changed its state of motion the light wave will travel on that absolute straight line but from the perspective of a person inside the vessel looking at where the light wave strikes the wall if they had very precise equipment or if the speed of light were much slower than an actually is then they would actually observe that the light wave strikes the wall at a point that’s lower than where it was emitted from so in an external frame of reference that light traveled on a real straight line but the frame moved up in the time during which it crossed the room from a perspective of an observer inside the frame who doesn’t know that any of this is going on they see the light wave strike the wall at a lower Point some vertical displacement below where it was expected to strike that is at the level of the emission source so the light wave reaches the wall but it does so in this case at a lower point now by the equivalence principle there is absolutely no difference between this frame of reference um being accelerated by a rocket or a similar sealed room that’s sitting on a planet experiencing a gravitational acceleration downward of 9.81 m/s and so because of the equivalent of an accelerated frame of reference and a frame that’s merely experiencing a gravitational acceleration light must also fall in a gravitational field because there’s no distinction between these two cases it turns out that this is actually generalizable to any body with mass bending the path of light and this is actually the key Insight that Albert Einstein’s general theory of space and time the general theory of relativity had that helped to distinguish it from Isaac Newton’s original theory of acceleration and gravitation in Newton’s Theory the deflection of say Starlight around a massive body like another star is smaller than the deflection predicted in general relativity which is supposed to be the more correct description of space and time and the way that energy and matter respond to space and time so in the general theory of relativity the degree of flection of light around a massive object by Falling in a gravitational field if you will is twice as big as predicted in Newton’s original mechanical Theory combined with his law of gravitation that’s a key distinction between the two ideas the general theory of relativity and the old theory of mechanics married to the law of gravitation it was that prediction that was tested in the late 19s and led to the confirmation that Einstein’s work was probably the correct description of space and time and energy and matter and this catapulted Einstein into Global Fame it also led to a host of other predictions for other interesting phenomena because light can be deflected by large masses we could imagine being able to see objects that shouldn’t be visible to us using larger rays of telescopes and looking out into the distant Sky we can look for cases where we see an a background object whose light has been bent around a foreground object allowing it to reach our telescope this so-called gravitational lensing allows astronomers not only to see objects that would otherwise be obscured behind other foreground objects things that that sit between us and the thing we want to look at but because the general theory of relativity gives very specific relationships between the amount of mass and the degree of the deflection of light one can use the deflection of light itself to measure the mass of objects with which you can never hope to have physical contact gravitational lensing is one of the many tools that general relativity gives to us as human beings to better understand the universe even parts of the universe that are very old very distant or both so as you can see the general theory of relativity has some fairly impressive large scale implications if you remember something back from your Calculus the second derivative of something with respect to something else tells you about the curvature of the system that you’re studying with the derivative now we’ve considered the fact that space and time are really part of a singular structure they really should be thought of as part of one four-dimensional framework which is called SpaceTime in special relativity we see that space we see that space measurements in one frame can turn into time measurements in another space and time are constantly getting traded for one another or Tangled Up in one another in calculations of motion from one inertial frame of reference to another there’s a link between space and time and that link comes from the fact that they’re really part of one interchangeable four-dimensional framework SpaceTime and it’s in this framework that matter and energy can be described to move and change so general relativity is really a theory of SpaceTime a general broad theory of space and time and ultimately it concludes that what we call the force of gravity is really due to the fact that mass and energy cause space and time to curve or in more colloquial language Bend or warp the second derivative is a sign of curvature and so it should have been a clue that since there’s no distinction between accelerating a frame of reference or subjecting that same frame of reference to an external gravitational field there must be no difference between curvature and gravity and in fact that’s one of the broad conclusions of the general theory of relativity energy and matter curve space and time and so other bits of matter or even light that travel past that object that’s bending SpaceTime will follow the curvature of SpaceTime and the result of this is that from our perspect perspective in three dimensions they appear to accelerate what is a ball doing when you hold it out at shoulder height and drop it it’s not being pulled down by the mass of the Earth rather it’s following a path in SpaceTime that’s curved due to the presence of the mass energy of the earth bending that space and time that is what gravity is that is what Einstein was able to achieve the very thing that Isaac Newton could not grasp the nature of gravity curvature of space and time space and time tell energy and matter how to move energy and matter tell Space in Time how to bend or curve or warp this elegant summary paraphrased from its author is a beautiful way of remembering the implications of the general theory of relativity writ large and it comes from the mind of luminary theoretical physicist John archabald Wheeler the universe is observed to expand in all directions at once and the more distant an object you view in the universe the faster it appears to be moving away from us this tells us that overall SpaceTime is curved now on the grandest scales the largest distances that we can reasonably observe in the universe the universe’s space itself appears to be very flat and smooth but just because space is flat and smooth overall doesn’t mean that SpaceTime is and the expansion of the universe is evidence that SpaceTime itself is curved the curvature of SpaceTime leads us to conclusions about the origin and the fate of the entirety of the universe and it tells us that that the universe as we know it now space and time and energy and matter was born 13.78 billion years ago in an event we have yet to fully understand but which is described by the phrase the Big Bang let’s review what we have learned in this lecture we’ve looked at the transition in thinking from the special to the general theory of relativity we’ve looked at some implications of the general theory of relativity on physical phenomena specifically we’ve considered what it means for light to travel in a gravitational field from a higher to a lower vantage point in that field we’ve concluded that light should Doppler shift either red shift or blue shift depending on the direction in the field That You observe it we’ve also concluded that light should be bent in its path of travel in a gravitational field and we’ve drawn all of these conclusions by using the equivalence principle to map behavior in an accelerated frame of reference onto a frame that’s experiencing an external gravitational acceleration we’ve then looked at some of the large scale implications for space and time the bending of distant Starlight around massive objects that intervene between us in the universe the use of the warping of space and time and the bending of light to infer the mass of objects that we can never hope to weigh by putting them on a scale and the overall implications for the nature of space time as a framework in which energy and matter play out the fact that energy and matter tell space and time how to curve and the curvature of space and time tells energy and matter how to move and how the overall curvature of SpaceTime indicates to us the origin and possible fate of the entire universe itself these Grand themes all stem from the elegant thing thinking of a brilliant physicist who accepted observational evidence from experiment about the nature of light thought deeply about the world around him learned the math necessary to describe the universe and in that elegant language spoke a volume about the cosmos that we are still reeling from today in this lecture we will learn the following things we’ll learn about the concept of temperature of a material body we’ll learn how to establish a scale and measure of temperature about the response of material bodies to changes in temperature and finally about heat energy as the underlying agent connected to changes in temperature there are many things that are left unsaid in the first two semesters of introductory physics we’re only able to cover a prescribed range of topics and that range can be described as follows motion Force the laws of motion relating force and acceleration to changes in state of motion energy momentum the conservation of energy and momentum non-conservative forces oscillatory motion and rotational motion that’s typically what we get covered in the first course in physics in the second course in physics we’re able to cover electric charge electric force electric Fields electric potential and electric currents and the combination of all those things into electric circuits and then we explore magnetic field and force and the basic behaviors of light such as geometric optics or interference and defraction now as a result of this in introductory physics there is essentially no time to discuss the laws of heat energy also known as thermodynamics but nonetheless thermodynamics is an essential Foundation of modern physics It ultimately was a branch of physics that helped to lead the way to Quantum Mechanics the theory of the very small and that is the next subject of this course so in this part of the course we will establish the second half of the foundations of modern physics the concept of temperature the concept of heat energy and some of the behaviors of heat energy we all have a fairly solid familiarity with the various Concepts associated with thermodynamics if you go outside on a day when it’s cold you feel like something is being pulled from your body on mass as if the world around you is hungry to take something away from you and keep it for itself and this feeling this sensation of the loss of something from our bodies where we have to trap it to keep it in is often what we call cold or the concept of a cold temperature of course the flip side of cold is hot there are environments there are situations where instead we feel like something is being put into our body and we want to get rid of it we might shed some clothing in order to help achieve this to help regulate our own sensation of temperature when the world around us is hotter than us we feel that P penetrating into our skin in a way that can be uncomfortable it causes us to sweat and so forth as a mechanism to try to maintain our own uh state of body temperature so cold and hot these ideas are familiar to us even if we cannot articulate the physical reasons why these situations exist now connected to these two things is also the concept of establishing a numerical measure of the degree of hotness or coldness of an environment so for example the average human being and this can vary by age and gender and a number of other factors is typically comfortable especially for intellectual work office work something like that in a temperature range between 70 to 75° fit now a human being experiencing an environment ment where the temperature is observed to be less than that number will often express a feeling of being cold chilly chilled needing to bundle up more to maintain their body warmth on the other hand a person who’s subjected to an environment above that range maybe 85° Fahrenheit instead of 74 Dees Fahrenheit will complain about sweating too much feeling too hot wanting to cool off in some way maybe by by drinking an iced beverage of some kind or maybe taking off a jacket if you’re in a work environment something like that we have a a concept of being able to measure the degree of heat or cold in the world around us including the heat of our own body taking our temperature to see if we have a fever is another concept that is pretty familiar in the human world now connected to these Sensations these experiences we have to come up with a series of critical issues and a plan in order for us to be able to quantitatively describe these scenarios of hot or cold we have a conception of hot and cold we have a conception of that we can measure these things somehow but we need to establish the basis for actually having that quantitative measure that quantitative description of these Concepts how do you know something is hot how do you know something is cold how do you measure that and how do you allow other people independently to establish the same scale of measure let us Begin by establishing that scale on which we can quantify those ideas like a room is too hot or a room is too cold let us then look at the origins of hot and cold and how the underlying concept is really tied to a fundamental concept called heat energy we will close with a relationship between heat energy of a body and its ability to radiate energy away but in this particular lecture we’re going to focus on temperature heat energy and the effects of heat energy not only on the temperature of a body but the structure of a material body let’s begin by establishing a measure of hot and cold now consider the world around you there are some phenomena in nature that appear to occur at very specific so-called thermal conditions that is to say if you could reproduce the environmental conditions under which a particular phenomenon occurs that phenomenon would occur repeatably reproducibly reliably so for example the freezing or boiling in of a body of water the only substance on Earth that can exist in solid liquid and gaseous States under Earth conditions is water it’s essential to Life as We Know It And because it’s able to coexist under a very narrow range of conditions as either a liquid a solid or a gas it makes an attractive phenomenon on which to establish a range of of behaviors that can be used to delineate a scale of temperature measure now that said of course there are materials other than water and they also change in response to temperature for example in the opening lecture video for this series I showed you the result of heating a Bim metallic strip now I’ll return to heating or cooling Metals later but we’ve observed already that two metals bonded into contact with each other will bend curve when exposed to a heat source and that’s [Music] because in this lecture we will learn the following things we’ll learn about the connection between temperature and the constituents of a material body we’ll learn about the precise nature and cause of heat energy and finally we’ll learn about the radiation of energy from a material body now matter is ultimately made from building blocks for example a liquid may be made of a large number of atoms or molecules the atomic theory would not really be accepted as a reliable description of nature until about 1905 but once one adapts the atomic theory as the correct description of material bodies one is then forced to conclude that the large scale macroscopic properties of a material object are somehow connected to the microscopic behaviors of the building blocks from which that material is constructed now the number of things that are used to construct a material body in the human world the max microscopic world is vast for example there’s the concept of the mole one mole is the number of atoms in a 12 G sample of carbon 12 now experimentally you can work it out and you’ll find that one mole’s worth of things anything at all grains of sand planets Stars atoms anything is given by a special number known as avagadro’s number and that number is 6 .02 * 10 23 of anything per unit mole one mole therefore is 6.02 * 10 23 things heat energy must have a connection to the behavior of the building blocks of matter after all if one is depositing a form of energy into a material body that energy must go somewhere and we must look to the constituents of the material body to figure out where energy might be going this helps answer the questions what is heat energy and where does it heat energy go exactly or where does it come from let’s look at an ideal gas as a laboratory for the connection of macroscopic Concepts such as the volume of a material or the temperature of a material and the pressure exerted by a material on its environment to microscopic IC Concepts like the position and velocity of an atom or molecule now we’re going to focus on ideal gases I’m going to start with a very simple simulation of an ideal gas this simulation is provided by the fet demonstration toolkit that’s available on the web and this is a simulator of an ideal gas system now to start I’m going to put one heavy particle of a gas just one atom or molecule of an ideal gas into the system where do the properties of gases like pressure and temperature come from well pressure is force per unit area and so the pressure exerted by an ideal gas on its container and in this case the container is represented by this box outlined here that pressure comes from the force of the Collision of the ideal gas particle with the walls of the container so for example we’ve injected one massive gas particle into the system and we see that it’s bouncing around the inside of the container it collides with the walls of the container and because this is an idealized system we treat it as having perfectly elastic collisions with the walls and the walls do not move and because of this this forces the momentum of the particle the component that’s perpendicular to the wall it collides with to reverse upon Collision so for instance the particle strikes the bottom wall and we see that its vertical component reverses it strikes the right wall and Its Right Moving component reverses to the left we see also that because momentum is conserved in this closed and isolated system that the total speed of the particle remains fixed even if its direction changes and that momentum Chang changes are conserved independently in every direction a collision with a wall to the left or to the right does not change the speed component that is vertical or parallel to that wall so the origin of the pressure of the gas is the force it exerts due to its momentum change on the walls of the container I could now instead inject more particles into the system so now let’s start by injecting 50 gas particles into this system we’ll give them a moment to spread out in the container and we see that while they all come in together as a clump because they didn’t all quite have the same velocity they start not only colliding with the walls of the container but with each other now an ideal gas will have elastic collisions with the walls of the container and with itself and we see that very quickly the gas particles have spread out fairly uniformly throughout the container and they continue to collide collisions will exchange momentum between colliding particles but on average we can see here that the particles are all moving with about the same speed some are moving a little faster some are moving a little slower but collisions level that out and we stare at this for a moment and see that these gas particles all appear to have some average amount of speed and a distribution of velocities that’s sort of spread around that average so slow moving particles can get struck and become fast moving particles fast moving particles can get struck and become slow moving particles but on average it seems like there’s a pretty typical consistent average speed on average we don’t see these particles getting much faster or much slower as a group while an ideal gas is truly an idealization there are many gases that are nearly ideal in nature for instance all of the noble gases es for example helium or argon they behave very much like ideal gases under many common conditions there are even many other substances that under a range of conditions can behave according to the idea of an ideal gas careful experimentation on systems that behave in this sort of Ideal manner have revealed that there is an empirical law that relates the macroscopic properties of a gas the number of moles of gas constituents given by the lowercase letter N the volume of the gas given by the capital letter V the temperature of the gas given by the capital letter T and the pressure exerted by that gas on its containing volume for instance the walls of the container that hold it and that’s denoted by the letter capital P this equation is known as the ideal gas law most students learn this in a chemistry course in either high school or college PV equals nrt the product of the pressure exerted by a gas and the volume of that gas is equal to the number of moles of that gas times a constant times the temperature of the gas now here this constant is denoted capital r it is known as the ideal gas constant and its value is 8.314 Jew per Kelvin per mole it’s named in honor of the French chemist enri Reno therefore the letter R but since a gas is made from small constituents albeit a very large number of them can we connect the microscopic properties of those constituents their positions in Space the changes in those positions in space with time can we connect those to this macroscopic statement about the aggregate behavior of the gas to connect the microscopic to the macroscopic let’s begin by doing what physicists and Chemists in the 1800s did and turned to classical physics after all Newton’s Laws of Motion were the only things that they knew to be reliable as describing nature so why wouldn’t you turn to the thing that had been working for a couple of hundred years already let’s begin with the concept of mass in this ideal gas let’s define the mle mass of a gas as capital M this is simply the mass for every mole of this gas it’s given by adding up avagadro’s number of individual constituent masses which will denote as Little M so if each atom or molecule that makes up an ideal gas each has an identical Mass Little M then the molar mass is that little M time avagadro’s number that gives us the mass per mole of this gas gas what about the volume of the gas well to keep things simple let’s consider a nice cubical space containing our ideal gas it has a fixed size it has sides all of length L and that means that the area of any side of the cubical space the Box in which we’re holding the gas is given by capital A equals the square of the length of any side and that also means that the volume is determined capital V by the cube of the length of any side now pressure is a bit more difficult pressure is the sum total of the force F total per unit area exerted by all gas constituents on the walls of the container at any moment in time an individual gas molecule will occasionally collide with a wall of the volume containing it that Collision will briefly exert a force that force on that area is the pressure now of course a gas is made from many constituents and so it’s the sum total of the average number of collisions per some unit of time that cause the pressure on the walls of a vessel how might we describe this using concepts of motion Newton’s laws and conservation laws all from classical physics well let’s begin by thinking about a single constituent each constituent has a velocity Vector at any moment in time with three components an X component a y component and a z component now since we’re considering an ideal gas we’re talking about elastic collisions between a constituent of mass m and for instance the wall of the container along the x-axis let’s only focus for now on the component of the motion of a gas molecule along the X AIS now during these collisions the wall doesn’t move and so its velocity before and after the Collision is zero and if you consider a single Collision along the x-axis between a gas molecule and the wall that it strikes and if you conserve kinetic energy and momentum as would be true of an elastic Collision then you find that the initial momentum of the gas molecule must be given by its mass time its original velocity in the X Direction and after the Collision conserving momentum and kinetic energy you’re forced to conclude that it has the same speed along the xaxis but it’s reversed the direction of its motion so the final momentum just after the collision with the wall will be negative M times its speed along the xais now a collision results in a change in momentum for the gas molecule and a change in momentum is what is known as an Impulse in introductory physics the impulse is just the difference between the final momentum and the initial momentum and in this case if you crunch the numbers you find out that if we knew the mass of a gas molecule or atom and we knew its velocity just before the Collision along the x axis that the impulse that results from this change in momentum is -2 M VX now if we knew the time over which the impulse occurs then we might compute the force that’s exerted by just this one constituent on the wall and we can do that by relating impulse time and force using Newton’s second law that the force is equal to the change in momentum divided by the change in time what is the time between collisions in one dimension well with a specific wall the time between collisions in one dimension is just the time between when the constituent strikes the wall for the first time bounces back horizontally across the Box strikes the opposite wall then bounces back along the x-axis to the First wall the one on whom we’re considering the force it’s that time the time between the collision with the wall striking the opposite wall and returning to the First wall the time between collisions will be given simply as twice the length of a single wall along the x- axis divided by the speed along the x-axis of that constituent 2 L over VX now the force of the gas constituent acting on the wall will be equal in magnitude but opposite in direction to the force that the wall exerts on the constituent the pressure is the force that the gas exerts on the container what we’ve computed is the force that the gas molecule has experienced by being acted on by the wall we can use Newton’s third law to relate what we have to what we want we want the force exerted on the wall by this constituent we have the force exerted on the constituent by the wall and they’re related in Newton’s third law by a minus sign and if we plug in the force that the constituent experiences because of the wall the minus signs cancel out and we’re just left with 2 * m * VX divided by the quantity 2L over VX and simplifying this we find that the force experienced by the wall due to this one Collision from this one gas molecule is mvx 2 divid L that is the mass of the gas molecule times its speed along the X Direction squared divided by the length of the wall along the X AIS but that’s just one gas molecule pressure is the sum of all such forces added up across all constituents in the ideal gas and then dividing by the area of the wall in question so what we really want is the total force exerted by all collisions by gas molecules on the wall in a given time and we want to divide that by the area of the wall which is just l^2 well the total force will be given by adding up the forces exerted by individual gas molecules with their individual velocities the component along the x-axis so for instance there’s there may be avagadro’s number worth of ideal gas constituents and so we have to look at each one in a Time window delta T during which these Collision should be considered that time window is given by 2L over V and we find that all we have to do is sum up M V1 x^2 over L Cub plus mv2 x^2 over L Cub plus mv3 x^2 over L Cub all the way up to the total number of molecules that make up this gas notice that every term in this sum has a common multiplicative factor of M the mass of the constituent divided by L cubed effectively the volume of the container so we can pull that out in front of the sum and then we just have to sum over this velocity squared of all of the gas molecules along the x-axis well the gas molecules are colliding with each other we looked at this in a simulation so they don’t all have the exact same horizontal speed at any given time but they do collide with each other and they do on average have the same speed over some unit of time so what we can do is we can approximate this sum by saying that we’re going to consider the fact that all of the gas molecules have on average the same horizontal component of velocity and that sum will just then be given by the total number of molecules capital M times the vx^ S average the average of the square of the X component of their velocities that number is one thing for all of the gas molecules even even if each of them has a slightly different horizontal component of speed because they’ve been colliding with each other and with the walls simplifying this one step further we can replace Big N the total number of gas molecules by avagadro’s number na times the number of moles of the gas little n that appeared in the ideal gas equation and that’s why we’re putting it in here so the final equation we get is that the pressure exerted by the gas on the wall is just on average given by the mass of each molecule or atom divided by the volume of this cubic container times the number of moles of the gas times avagadro’s number which tells you the number of things per mole times the average of the square of the X component of the Velocity well let’s see if we can relate that X component to the total speed of each gas molecule on average on average the X component of a constituent’s squared speed will simply be 1/3 of its total squared speed v^ s the speed of a single molecule will just be given using a variation of the Pythagorean theorem as the sum of the squares of the components vx^ 2 plus V y^2 plus vz^ 2 so on average we would expect after any number number of collisions that each of those components will be 1/3 of v^2 so if we plug that in we take our pressure equation which is just Rewritten here and we plug in the fact that vx2 average is really just 1/3 of the average of its total speed squared we finally arrive at a situation where we can begin to relate microscopic properties like the average speed squared of molecules and their masses to the large scale properties of the whole gas for instance multiplying this equation by the volume cancels out the v in the denominator of the microscopic equation we wind up with P * V is equal to the mass of each constituent times the number of moles of the constituents times avagadro’s number times the average speed squared divided 3 well we can simplify this further by remembering that we defined molar mass the mass per mole of the ideal gas and that’s just given by the mass of each constituent time avagadro’s number so that replaces M and na in the equation and we wind up with the molar mass times the number of moles times the average of the speed squared of a molecule divided by 3 well by the ideal gas law PV which is equal to this thing is also equal to nrt notice that the number of moles of gas appear on the right and left of this equation and cancel out and we can actually finally solve for the average speed of a single molecule in an ideal gas by rearranging this equation to isolate V average and when we do that we find out that this microscopic property of an individual gas molecule its average speed is given by a combination of the macroscopic properties of the gas the square root of three times the gas constant which is just a number times the temperature of the gas divided by the molar mass of that gas the microscopic has been connected to the macroscopic we see here that classical physics can give you some insights into how the individual constituents of a material have relationships with the macroscopic properties of that material that are easier to measure on the human scale we can take one final step and instead of looking at just the speed or average speed of an individual gas molecule we can consider the average kinetic energy of any single constituent of the gas system well that’s just going to be equal to 1/2 times the mass of a constituent times its average speed squared that’s the definition of the kinetic energy of a typical molecule in the gas now from the ideal gas relationship between average speed temperature molar mass and the gas constant we learn the following that the average kinetic energy of a single constituent in the gas which is given by 1 12 m v average squared can be instead related to the macroscopic properties of the gas2 * m time the quantity 3rt / the molar mass now this can be further simplified by replacing the molar mass instead with the mass per constituent times avagadro’s number which is also just a constant and we notice that the individual constituent masses vanish from this equation and we are left with the following that the average kinetic energy of a constituent of an ideal gas is given simply by a number three halves times another number the gas constant divided by avagadro’s number times a single variable the temperature of that gas now it turns out that R the gas constant divided by avagadro’s number is actually related to another fundamental constant of nature which is known as Bolton’s constant it’s written as a lowercase k with a subscript B and so in the end we find out that the average kinetic energy of a single constituent of a gas regard less of the masses of the constituents of that gas is simply given by three halves time the boltmon constant times the temperature of that gas this is a remarkable observation a fantastic relationship that something so tiny as the kinetic energy of a typical thing inside of a vast number of gas molecules is related to this singular macroscopic property temperature that we can can control easily in the macroscopic realm now boltzman’s constant is given here as 1381 * 10 -23 JW per Kelvin it’s a very tiny number which makes sense because the average kinetic energy of a constituent of a large number of gas molecules ought to be a very tiny number even for a standard temperature at room temperature for instance now when we measure the temperature of an ideal gas what this tells us is that we are actually measuring probing in a very direct way the average kinetic energy of its individual constituents and this tells us what heat energy is heat energy is determined by this thought process to be related to the average kinetic energy of constituents of a material body that is to say as one adds heat energy to a system this raises the average kinetic energy of the constituents adding heat Q raises T temperature and this proportionally results in an increase in the average kinetic energy where is the Heat Going the heat is going into the kinetic energy of the individual gas molecules if you want to remove heat from a system all you have to do is find a way to reduce the average kinetic energy of the constituent of that system this also allows us to finally understand that a system with no kinetic energy that is constituents that are holding perfectly still experiencing no collisions with the walls of their container or with each other because there’s no motion at all that is identified as being the lowest temperature that you can ever have zero average kinetic energy for your constituents is 0 Kelvin we finally have a physical understanding at the most basic microscopic levels of a large system as to what it means to achieve zero temperature zero temperature a state of zero heat energy is also a state of zero average kinetic energy for the constituents of that system so this raises an interesting question then how do you transfer heat energy either to or from a system well there are many ways to do this and I’m going to focus on three quite broad established mechanisms for transferring heat energy from a system because ultimately I only really want to focus on one of them so let’s consider cooling heating will just be the reverse of any of the things that I say here let’s begin with the mechanism of conduction conduction is when you place a second system perhaps at a lower temperature if we wish to cool the first system in physical contact with the first system think of two cubes of metal at different temperatures we want to cool one of those blocks of metal so we take another system that’s even cooler and we press them together so that their two faces of the material are physically touching each other at that interface at that cont space between the two materials collisions are going to begin occurring between the atoms or molecules of one system and the atoms or molecules of the other system this creates a an arena in which collisions occur transferring kinetic energy from one system on average to the other what you’ll find is that higher kinetic energy constituents are going to typically lose some kinetic energy to the slower moving constituents at the interface of the other system of course at the interface of the other system those constituents will then start having more collisions with the things inside the system and that’s how heat energy is transferred by conduction through a system it’s all collisions this decreases the temperature of the hot system and increases the temperature of the cold system until such time as the temperatures of the two systems reach a new equilibrium position T1 equal T2 this will will occur typically when the temperature of the hotter system is lowered down and the temperature of the cooler system is raised up and you finally reach a point where they both have the same temperature and they stop transferring heat energy they on average have the same kinetic energy for all their constituents no more transfer can occur then there is convection in convection you pass a fluid like a gas or a liquid across or around another system so if we want to take a system and cool it we might blow air over it or push water across it in some kind of current collisions at the boundary of your system between the constituents of your system and the constituents of the fluid will transfer kinetic energy on average to the fluid the fluid if it’s cooler will have um lower kinetic energy constituents and collisions will tend to favor increasing the kinetic energy of the cool systems uh constituents and this ultimately cools your target system system one by lowering the average kinetic energy of that system and finally there’s radiation radiation is a process by which constituents lose energy by giving it up in the form of radiation of light for instance you might be familiar with the fact that you can stretch your handout several centimet inches maybe even up to a few feet away from a a h hot cooking pan on the stove and even though you are not making physical contact with that and even though the air is very still in the room around you you feel something being transferred to your hand you would say that you can feel from a distance that the pan is hot well that’s because it’s radiating typically at the infrared and that infrared radiation which you can’t see with your eye but which you can feel with your skin uh will be absorbed by your skin radiation requires no physical contact between a system and the environment in fact if you took all the air out of the room and stuck your handout in that environment you would nonetheless feel heat being transferred to your hand by radiation electromagnetic radiation requires no medium to travel and so even evacuating the room of air will still lead to a cooling of the pan in this case by the radiation of infrared light now radiation has the effect of carrying kinetic energy away from a system and giving it to the environment large around it even without physical contact radiation is what I’m going to focus on for the rest of this lecture it’s an interesting phenomenon because it is an interface between mechanics and electromagnetism and you can already begin to see that since we got ourselves into trouble thinking about motion and the laws of electromagnetism and the laws of mechanics that a place like this heat energy and radiation is another similar interface of classical mechanical view of the universe with the electromagnetic laws of nature where inconsistencies may arise if you overly trust the mechanical laws of nature there’s a mathematical relationship that has been determined by experiment in the late 1800s and early 1900s between the energy that is emitted or absorbed by a heated material body and the temperature of that body this was determined empirically by ysep Stefan to be the following that the power radiated or absorbed by a body that that is to say the change in heat energy per unit change in time is given by the product of four numbers Sigma which is a constant of nature known as the Stefan boltzman constant whose value is 5.67 * 10-8 watts per meter squ per Kelvin to the 4th it’s not a bad number to remember because it’s got 5 6 7 8 in it I find that handy for remembering this number in a pinch now the stuff on boltzman constant is multiplied by another number which is this curly lowercase Greek Epsilon Epsilon is the emissivity of the surface of a body and it ranges between zero no emission and one perfect emission you can see that a body with zero emissivity will emit no power in the form of radiation because the right side of this equation will always be zero on the other hand a body with perfect emission will maximally emit radiation given by the product of the other numbers the Stefan boltzman constant the surface area a of the body and the temperature of the body raised to the fourth power note that all material bodies above 0 Kelvin radiate energy in the form of a electromagnetic radiation you and I sitting here right now at 98° F which is the typical human body temperature are radiating light away from our bodies we just can’t see it and we can play around and figure out what wavelength it is as an exercise in class A Perfect emitter with emissivity of one is also known as a black body it’s a very special kind of object it is a system that absorbs all incident radiation and it can subsequently re-emit its own radiation with perfect emissivity black bodies are a special laboratory for testing the interface of the laws of mechanics the movement of the constituents and the laws of motion that describe the allowed states of motion of that material at its smallest level and electromagnetic radiation the emission of light now before I show you an example of how classical physics when applied to the question of radiation got it wrong I want to Define for you a very useful concept and that is the power emitted per unit wavelength in a radiation situation this is known as the spectral Radiance now in a situation where an amount of energy say Delta Q is radiated Away by a body in some some period of time delta T it is actually fairly typical to ask the following question to really drill down into a question about the amount of energy within a certain range of wavelengths or frequencies of the emitted radiation in other words if I consider a range of the radiation with a minimum wavelength Lambda and a maximum wavelength that’s just a little bit higher than that Lambda plus Delta Lambda where Delta Lambda could be a very tiny amount how much energy per unit time is radiated by wavelengths in that range and asking this question is answered by a special kind of function known as the spectral Radiance now it’s often denoted by various letters I’m going to use the capital letter b and I’m going to make it a function of Lambda the wavelength explicitly to emphasize the fact that it is answering a question per unit wavelength this is the energy radiated per unit time per unit wavelength I could have also alternatively written B in terms of the frequency F because frequency and wavelength are related to the speed of light for electromagnetic radiation but I’m going to use b as a function of Lambda if you want to know the power radiated around a specific wavelength then you need to pick a small range around that wavelength and compute the product so for instance you might choose a specific Lambda and then because this is defined over a small range of Lambda to Lambda plus Delta Lambda you need to multiply the spectral Radiance which is a function of Lambda times the window around which you are trying to compute the amount of power radiated Delta Lambda and that will return the power emitted around that wavelength now that would be a sort of discret way of thinking about it if you have a well- defined continuous function a function of Lambda that varies continuously as Lambda representing this spectral Radiance B then you can just integrate you can use integral calculus in a Range to get the answer you desire so for example if I want to know how much power is emitted between two wavelengths Lambda 1 and Lambda 2 I can simply take the product of B and D Lambda and integrate that product from Lambda 1 to Lambda 2 and if B is a well-defined function I can do the integral it may not be pretty but I can get a function that answers the question and gives me the power radiated in that range of wavelengths now with that introduction in mind let’s take a look at a classical physics attempt to predict the amount of energy emitted per unit time about a given wavelength Lambda this was worked out in the early 1900s and answers the question how much power power is emitted in say uh the ultraviolet range around 240 nanm and some window around 240 nanometers how much power is emitted in the range of red light around say 740 nanometers in some window around that answering that question in little steps through the electromagnetic spectrum will give you a a picture of how power is distributed as a function of wavelength in the emitted r radiation now the classical version of this is known as the genes law and it’s from 1905 and so again you have to start from the spectral Radiance function the power per unit wavelength that is this quantity here in the Ry genes law 8 Pi * a the surface area of the object time C the speed of light times the boltson constant times the temperature of the object divided by Lambda to 4th and if you check the units of that particular fraction you’ll see that it is jewles per second per meter so per unit wavelength if you then want to know in a small window around the target wavelength Lambda how much power is emitted you need to multiply that by the size of the window and that will then answer the question about how much power is emitted around that wavelength in a window about the wavelength Lambda so for example this tells us that for say a spherical body that’s heated to uh a certain temperature T and that body has a certain surface area a the shorter the wavelength of the radiation you consider being emitted from the body the more and more power is radiated around that wavelength if true this would be a catastrophic feature of nature so for example consider a small sphere of metal or something like that you you make it out of a very good material and it’s got a surface area of just 1 M squared and it’s got an emissivity of one if you heat that to 6,000 Kelvin and just for reference a very modest small propane torch can easily heat something to 3,000 Kelvin you would emit about 10 to the 16 Watts that is Jews per second alone in dangerous ultraviolet radiation for instance with a wavelength of 250 NM that is easily lethal to a living organism to give you a point of reference you can buy easily on Amazon or at other online vendors a sanitizing wand A sanitizing wand emits 4 watts of radiation power in the form of ultraviolet specifically ultraviolet C which has a wavelength which kills bacteria now if it can kill bacteria it can do significant damage to other kinds of living cells including the cells of the human body body you should never expose your body to UVC if you can avoid it because it causes damage to DNA and this can lead to the formation of cancers 10 the 16th watts of UVC would be extremely dangerous if not lethal and all from a small heated sphere at 6,000 Kelvin well that seems ludicrous and it is ludicrous if you actually go and measure the amount of power emitted at a given wavelength it doesn’t shoot off to Infinity as Lambda goes to zero this is just not what is observed in reality and yet it is a byproduct of thinking of classical physics the marriage of Newton’s mechanics with electromagnetism let me show you a graph I don’t want you to worry too much about what the axes mean I’m going to describe them in an oversimplified manner the vertical axis tells you how much energy is emitted per unit time per unit area and per unit solid angle so at some chunk of uh angle space for a given frequency of radiation you’re considering so the frequencies are on the horizontal axis high frequency corresponds with short wavelength ultraviolet radiation would have a shorter wavelength x-rays would have a very short wavelength and so forth on the other hand long wavelengths are down here at low frequencies so infrared and red they tend to have very small frequencies and correspondingly very large wavelengths the blue curve which not only comports with reality but was predicted in a mathematical exercise by a physicist named Max plunk that one is what nature should look like and in fact is what nature does look like if you heat a black body to 5800 Kelvin and look at the so-called spectrum of emitted power for a given frequency the blue curve is what nature looks like this yellow dotted curve is the prediction of the Ry jees law and comes nowhere near reality it arguably maybe does an okay job for the very lowest frequencies the very longest wavelengths of radiation from a body maybe a human body would be accurately described by the Ry genes law but the sun on the other hand which has a temperature of about 5800 kin also behaves like a black body and is nowhere near described correctly by the Ry genes law now another physicist named Vil heline figured out in 1896 his own version of this prediction and that’s the pink curve and you’ll notice that ven’s law as it’s known does a pretty good job of describing the radiation at the highest frequencies but does an abysmal job of describing radiation at low frequencies Plank’s law however Nails it Max Plank’s law as he derived it in the early 1900s was the Cornerstone of the correct description of the radiation from heated matter so you can see here again a place where there’s a breakdown between classical thinking motivated by the things that we learn in introductory physics the things that are from The Familiar macroscopic World applied to the world of the very small in this case the individual constituents of a heated body of matter there’s a breakdown here and a breakdown is an opportunity to make sense of the correct laws of nature Max plunk figured it out even where V and Ry genes could not so to review in this lecture we have learned the following things we’ve learned about the connection between temperature and the constituents of a material body we’ve explored the precise nature and cause of heat energy the fact that heat energy is related to the average kinetic energy of the constituents of material like an ideal gas and that that is directly related to the temperature of the macroscopic body of that gas we’ve considered ways of transferring energy to and from objects and we’ve looked specifically at the emanation of electromagnetic energy in the form of light from a heated body we’ve looked at some of the laws that were either derived or determined to govern that kind of radiation of energy and we’ve seen that in places where classical physics mechanics Newton’s laws were combined with electromagnetism to predict the radiation from a

    heated body a special kind of body a black body is a total breakdown compared to reality in the next phase of the course we’re going to take this breakdown as a la launching point for a deeper understanding of nature we’re going to transition from the very fast to the very small and begin to explore the origins of quantum [Music] physics in this lecture we will learn the following things we will learn how the black body radiation Spectrum was finally understood we’ll learn about the possibility that’s implied by that solution that energy may come in discrete units we’ll learn about a phenomenon known as the photoelectric effect and we’ll learn how Albert Einstein resolved the puzzle of of the photoelectric effect in the last lecture we saw how the ra Gene power Spectrum prediction utterly failed to model nature correctly given a black body heated to a certain temperature T the rayy genes model predicted that more and more energy should be emitted in shorter and shorter wave lengths leading to some kind of natural catastrophe merely heating up a body to a few thousand de Kevin however matter heated to a temperature T simply does not radiate according to that prediction because if it did the effects would be catastrophic the shorter the wavelength and the more damaging the electromagnetic radiation the more of it would have been emitted from such a body as predicted by the Ry jees model it simply did not comport with reality this mismatch between reality and the prediction of classical physics has been called the ultraviolet catastrophe correspondingly the mismatch between reality and the prediction of ven’s model is known as the infrared catastrophy now historically this problem was not considered threatening or really so important that anyone truly panicked although at least one individual T did take this problem extremely seriously and that’s Max plunk you got to admit though ultraviolet catastrophe is a lovely and exciting name um it I should note that the term ultraviolet catastrophe actually doesn’t date to the exact period when the Ry jeans prediction or plun uh and his work were established but it actually appears to date to much later about 1911 and seems to have been coined by the physicist Paul Aron Fest now in the last lecture we saw that there was a model by a man named plunk Max plunk that did seem to have gotten the right answer so what was it that Max plunk did well he started from a mathematical model of a perfect black body a simple Model A a cavity fully enclosed on all sides except for a tiny hole in the cavity an ideal black body to remind you is one that absorbs all incident radiation on it and then it re-emits its own radiation with some Spectrum it’s got perfect emissivity so it maximally radiates given its other physical properties now once one hypothesizes that such a system exists one then has to apply the laws of physics to predict or describe that emitted radiation Spectrum the amount of energy emitted for instance per unit solid angle per unit time per unit wavelength and per unit area uh per unit many things but your bottom line is you’re attempting to predict how much of each wavelength interval of radiation is present in the emitted bulk of radiation now a cavity with a single small hole in it is actually a really good model for a perfect black body if you shoot radiation at the hole 100% of it incident on the hole will enter the cavity and be lost to the outside world that radiation is absorbed by the cavity now it then enters the cavity and it begins bouncing around inside the cavity striking the walls and therefore hitting the bits of matter that make up the walls of the cavity and fundamentally as we’ve seen in physics 2 matter is made from electric charges now as we also know as these electric charges get struck by radiation they’re going to begin to gain kinetic energy which will cause them to heat up the material surface of the cavity inside the cavity a hotter object emits radiation in a different way than a cooler object so again the question we want to boil this down to is what will that spectrum of emitted radiation due to the heating of the walls of the cavity from the incident radiation actually look like now we can boil the black body problem down to just a very simple collection of phenomena that we can conceptualize of using information from physics 2 uh this is a very simple model of an electromagnetic wave which would be what the radiation impinging on the surface of the cavity walls would look like it’s got an oscillating electric field and perpendicular to it it’s got an oscillating magnetic field and it’s traveling perpendicular to both of those fields this wave then strikes a charge in the wall of the cavity so for instance an electron the electron feels the electric and magnetic fields of the wave and it will respond to those by accelerating this is what we learned in physics 2 The Wave with its increasing and then decreasing electric field strength for example will cause an electron to accelerate more than less it will oscillate it will wave like a bit of matter in a rope that’s wiggled or in a chain That Shook or in a string that’s plucked the electron will oscillate so radiation enters the cavity with any number of possible frequencies or wavelengths that can compose that incident radiation and all of it is taken in by the cavity through the hole the electric charges that make up the matter in the walls of the cavity will either scatter they’ll be knocked off of their parent atoms for instance or maybe they’ll wiggle in response to the electromagnetic wave that strikes them and thus absorb some of the electromagnetic radiation as motion now absorbing an electromagnetic wave causes the charges to oscillate and an oscillating electric charge is a source of an electromagnetic wave so these newly oscillating electric charges can emit their own electromagnetic radiation this is the source of the emission spectrum from the black body so what will that reradiated energy look like when it escapes the cavity that radiation too will bounce around inside the cavity but some of it will make it out of the hole what will it look like and how much of each frequency is found in its power Spectrum well recall that the rayy genes model using a purely classical model of all of this system mechanics and electromagnetism via Maxwell’s equations deter that the Spectrum should look something like this that the energy emitted per unit time um taking into account the surface area and the whole uh viewing solid angle of the of the black body will basically go as the temperature of that body over the wavelength to the fourth of a particular wavelength of light that we’re considering as part of the outgoing Spectrum but as we can see as you decrease the wavelength that is increase the frequency of the radiation more and more and more power is emitted by the black body now a key assumption that lay underneath the building of the ra genes model was that all frequencies are possible for oscillating charges a charge stuck in an atom in the wall of this cavity model can oscillate at any frequency it likes all frequencies are possible and that led to the ra gen model let’s make a very simple model of a system where we can cause oscillations to occur in electric charges and then those oscillating electric charges in turn emit electromagnetic radiation radio waves or light that light then travels across a gap striking another electric charge and setting it into oscillatory motion to illustrate what I mean by this imagine we have the ability to wiggle an electric charge over here at a transmitter site and watch a sympathetic wiggle over here at a receiver site when an electromagnetic wave from the transmitter reaches the receiver to illustrate this let me start oscillating the electric charge on the left what you’re seeing here is the full electric field around that charge as it changes in time as the charge Moves In Space the changing electric field propagates out at the speed of light and causes an oscillatory pattern in space some places have strong electric Fields pointing in One Direction some places have weak electric Fields some places have electric fields that point in the opposite direction we can better see this by looking at the amplitude of the electromagnetic wave as a function of position away from the o oscillator and we see the rising and falling in time of the wave as it travels to where the receiver is the oscillator in this model is a charge that has been set in motion by radiation that was absorbed by the cavity walls the absorption of the radiation causes the charge to oscillate and the oscillating charge in turn emits its own electromagnetic radiation so we’re watching a charge that’s been set into oscillatory motion by external radiation emitting its own radiation here on the left and then causing another charge to oscillate over on the right that would in turn of course cause that secondary oscillation to generate its own radiation and you can see how the black body problem is a very complex interplay not only of mechanics but electromagnetism and getting the details of this right are essential to correctly predicting the radiation from a black body now the Fatal flaw that people like ra and genes made when constructing their prediction for the energy emitted per unit solid angle per unit time and per unit wavelength from say a black body was that they assumed that any oscillatory frequency was possible for the charges it seems a natural assumption electromagnetic waves originate on oscillating electric charges if I change the frequency and I can change it to anything I like in classical physics I expect a different kind of electromagnetic wave with its own frequency to be emitted and In classical physics I can pick any frequency I want anyone at all because in classical physics they’re all possible they’re all allowed and this was the Fatal flaw it turns out in the the Ry genes calculation of the black body spectrum they assumed that those oscillating charges in the walls of the cavity could emit any frequency of radiation they wanted as they sympathetically begin to oscillate having been struck by external radiation it turns out that this leads to the Ry Gene’s prediction of the power Spectrum which is utterly wrong the plun model on the other hand which arrived at the correct answer results in a power spectrum that looks like this it goes as 1 over Lambda to a power in this case Lambda to the 5th but there’s an overall multiplicative factor and that’s where the temperature dependence shows up it’s also where the where a wavelength dependence shows up as well and this extra piece has the effect of cutting off the power Spectrum at high frequencies in other words as you go to higher frequency you actually see there’s a turnover in the prediction of the model and it drops off to zero as you go to shorter and shorter wavelengths higher and higher frequencies you don’t emit more energy you wind up emitting less now what was the difference between the rayy Jee model and Plank’s effort to model the black body Spectrum well one key assumption was that plunk did not allow all frequencies to be possible for us oscillating charges and I’ll return to that assumption in a bit looking at some of the historical context of Plank’s own work to give you a better sense of what atoms and molecules actually do when they are struck by electromagnetic radiation let’s look at this simulation incorporating the modern understanding of the interaction of radiation and Mattern we have here a a water molecule two hydrogen’s bonded to one oxygen and we can shoot radiation at it let’s begin by shooting microwaves long wavelength electromagnetic waves somewhere between visible light and radio if we start shooting microwaves at the water molecule we see that many of the microwaves will pass through the water molecule but some of them will be absorbed and cause rotational motion of the molecule which then scatters the microwave this is in fact how a microwave oven Works microwaves at the right frequency will cause water molecules to rotate and collide with each other and kinetic energy is added to the system and as we know kinetic energy is related to the temperature of material if you add kinetic energy to the water molecules in a system you will heat it up let’s change the wavelength of the radiation to infrared we are now shooting much shorter wavelength light at the water molecule no longer are we able to make it rotate rather we are able to make it oscillate the hydrogen atoms that are bonded to the oxygen will occasionally be struck by an infrared Photon that then causes them to jiggle around a little bit before scattering off the photon if we shoot visible radiation which is even shorter wavelengths at our water molecule we see that it is effectively transparent to the visible light all the visible light all the visible light radiation is passing through the water molecule as if it’s not even there and that shouldn’t come as a surprise to us water is transparent to light so it makes sense that visible light should be able to make it through a body of water and we see that modeled here if we shorten the wavelength of the radiation even more to alter ultraviolet we see that this also tells us something about water that water doesn’t respond to this wavelength of radiation ultraviolet radiation passes through the water molecule essentially unscathed this kind of little simulation incorporates our modern understanding of electric charge chemicals bonding and the ways that energy can and cannot be absorbed and reiated by atoms and molecules we see that not all radiation causes a water molecule in this case to do anything only certain light frequencies or wavelengths have an effect on the charges of the water molecule and thus can cause them to vibrate oscillate or rotate in such a way that that might result in subsequent later reradiation of energy now as I showed you in the last lecture video this model accurately describes the shape of a black body Spectrum but it comes at one small cost Plank’s effort resulted in the need for a new physical constant which he labeled H and eventually came to be known as plunk constant it is related to the degree of the discretization of the oscillation of the charges in the cavity in other words not all frequencies of oscillation are allowed and H tells us something about the gap between allowed frequencies things in between in the gaps are not allowed this is known as the quantization of the oscillatory motion of charges in the cavity walls quantization coming from Quantum a Latin word for how much implying not an unlimited set of values that are possible for a system but rather a discreet well-defined and finite set of values that are are allowed for a system with no values in between the allowed ones now the reason that the Spectrum winds up cutting off at Short wavelengths or high frequency is that electromagnetic radiation as a consequence of Plank’s model requires a specific amount of energy to make a specific wavelength in other words if you want to make ultraviolet light you’ve got to put in a minimum amount of energy to do that if you want to make something with a shorter wavelength than ultraviolet light like xray Rays or gamma rays you have to put in even more energy and not all of those energies are possible inside the oscillating charges of the cavity walls so if you don’t have that energy you can’t make that wavelength and the Spectrum naturally Cuts itself off this implied also that the energy of the radiation is quantized and itself can come in units or packets now now this new constant Plank’s constant H ultimately had to be determined from experiment it wasn’t predicted by Plank’s model it was a parameter in the model that had to be determined and it has units of Jews time seconds which if you flip back to physics 1 and play around with those units a little bit you’ll realize that they correspond to units of angular momentum this actually has deep implications for the universe but we’re not going to get to them right now now its value was originally determined by Max plun by simply changing the value of H around in his calculations until at a specific temperature for a black body he had a value that yielded a shape for the black body spectrum that best described that particular heated black body now that’s how he did it and in fact by doing this by fitting the parameter to the data and determining the value of the parameter itself he came to within a few percent of the currently accepted value of plunk constant which is already a remarkable achievement but in science if you build a model by tuning it to existing experiment the true test of a model is where whether or not it correctly predicts new phenomena that have not yet been either explained or observed so plunk constant by itself being determined from the black body may just be tuned a mathematical model to the data to get the answer you wanted in the first place that’s the first step in describing nature but if you want to see whether or not you’ve learned something deep about nature you need to find the next thing that you can test by applying the same idea with the same constant and see if you get answers that are consistent with nature now the currently accepted value of Plank’s constant is 6. 626nightmarket time 10 -34 Jew seconds that is a number worth memorizing on par with the speed of light 2.98 * 10 8 m/s were the mass of the electron 9.11 * 10us 31 kg Plank’s constant is one of those fundamental numbers that when committed to memory can be busted out when you need it to do a quick calculation and can be very handy when doing things like engineering new systems like in electronics for instance now this constant is crucially important in the modern world I I can’t understate its value any more than I can understate the value of the speed of light its value is now the basis of the system international definition of the kilogram the definition of the kilogram used to be based on the size and mass of a platinum aridium bar that was kept under glass in France there are many flaws with that for instance if atoms of that bar flake off over time and you don’t notice it then over time your definition of the kilogram using that as a reference changes weights and measures are crucial to things like economies and standards and so forth and so you don’t want your definition of the kilogram drifting over time now so far as we know Plank’s constant is stable over vast periods of time certainly over many billions of years and so it was wise to redefine the kilogram using something that itself can be determined independently and as stable and it turned out that a particular way of measuring plunk constant lends itself to defining the kilogram and that change went into effect only in 2018 plunks constant also plays a fundamental in key role in all electronic devices certainly all modern micro Electronics those devices rely on the exact properties of semiconducting materials and semiconductors can be precisely engineered thanks to the quantization the discretization of radiation and matter and ultimately all of this stems in its scale size and control from the value of Plank’s constant now as I’ve hinted before Plank’s work had a consequence built into it that if true would radically change our view of radiation electromagnetic waves he realized in his paper on the subject that as part of the only way he could find to describe the black body Spectrum he was forced to assume that radiation had to come in quantized units whose sizes were controlled by the constant H and proportional by that constant to the frequency of the electromagnetic waves this equation relates the energy and the frequency of electromagnetic radiation E equals H Plank’s constant time F the frequency of the radiation and since fre fre quency and wavelength are related by the speed of light this also implies a relationship to the wavelength of that light let me give you some of the context of Max plunk and his work he concluded this effort in 1900 after many desperate years of working on the problem but he himself did not fully accept the implication of what his newly developed constant H implied and the consequences of his solution to the black body Spectrum problem basically his solution implied if correct that matter and energy can be quantized into discrete units and that units in between those are simply not realized in nature they’re forbidden by the system somehow by the parameters of the system now he assumed that this was all some kind of convenience math trick that he had played that it wasn’t really describing nature at a fundamental level and that someone else would come along really solve this problem using the correct description of Nature and one day explain why the trick worked if you look at some things that plank himself has said over the history of his own life from the year in which he published his black body Spectrum paper to decades later as he reflected on that period of his life you can gain some insights into his psychology as a scientist at the time and in the paper that he published in 1900 he States moreover it is necessary to interpret the total energy of a black body radiator not as a continuous infinitely divisible quantity but is a discrete quantity composed of an integral number of finite equal parts you can see here in sort of the tone and writing of his sentence that he finds something necessary to do but he doesn’t necessarily take away from that that it implies reality follows from this assumption the assumption that the total energy of a black body radiator is discretized and not continuous May merely be a mathematical assumption but nonetheless he found it necessary to make this assumption in order to interpret the data now many decades later in a letter that he wrote to RW wood he reflected back on this period and one famous quote from this letter is often repeated wherein he said the whole procedure was an act of Despair because a theoretical interpretation had to be found at any price no matter how high that may be and you get a real taste of his professional desperation where others had failed to describe the black body Spectrum plank was desperate to figure out what Avenue would lead to to the correct description he didn’t necessarily accept that the mathematical steps required to follow that Avenue implied anything about nature but it worked and he published it even if he didn’t fully embrace the implications of his own work now another famous quotation from Max plun whose Source I simply couldn’t track down but it is attributed to him by many other sources was that he was ready to sacrifice any of his previous convictions about physics in order to solve this problem now this last quote especially was motivated by another thing that plon had to do to solve this problem and that was to employ a statistical description of matter and radiation um many physicists found statistics distasteful because under the hood statistics tells you that you can’t know for sure the outcome of a particular system but you can know the probabilities of all possible outcomes even if you don’t know which one will be realized in the next experiment many physicists who believed that the Universe was deterministic that is that if you know exactly the initial conditions you can find the exact outcome of the system every time found the use of Statistics to describe nature distasteful distasteful doesn’t necessarily mean wrong and that’s why the hard work of the scientist is to use obser ations of nature to assess the assumptions that we have made in trying to describe and predict nature now as I said before the burden in science of a new idea Falls not on your ability to describe the things that came before but to explain the things that come after without changing any of the assumptions of the idea a truly successful Theory a theory that is not not only built on facts but predicts the existence of new ones is ultimately forged in the fire of experimental science married with mathematical effort this lands us on the subject of the photo electric effect now the photoelectric effect was known in the late 1800s but could not be described using what was known in the late 1800s it was observed by physicist Heinrich Herz now he was the first person to definitively demonstrate the existence of electromagnetic waves these had been a phenomenon predicted by Maxwell’s equations and in that same prediction captured the essence of light that light itself is an electromagnetic wave Herz realized that if you were going to test the prediction that electromagnetic waves are real independent of light you would need to demonstrate their existence by transmitting them from one place in a laboratory receiving them at another and showing that the wave induces an oscillating electric charge at the Target location so what he ultimately showed was that an oscillating charge at one place in a room a laboratory could induce an oscillating charge elsewhere in the room with no physical contact and this established the reality of electromagnetic waves Beyond light in fact you could think of this as the first radio transmission now he was also the first person to demonstrate an intrigu physical phenomenon the photoelectric effect light which is an electromagnetic wave at heart at least in the Maxwell view of nature um shown on a metal can liberate electrons from the metal so take a beam of light shine it on the surface of a metal look for an electric current and under the right conditions you will see an electric current develop in the metal now Maxwell’s equations predict that the intensity of a light beam an electromagnetic wave is proportional to the squared strength of its electric field that is e squared if e not is the base maximal electric electric field value of a particular wave now because of that prediction uh attempts were made to describe and predict and explain the observed features of the photoelectric effect so let me use an analogy combining mechanics and the laws of electrom magnetism Maxwell’s equations to attempt to predict the set of phenomena that you would expect to arise in the photoelectric effect think of the charges in a metal as a ball that’s stuck in a pond in a patch of lily pads or weeds what you want to do is Liberate the ball you would like to knock the ball out of the lily pads free it so that it floats over to the shore and you can get it cuz you don’t want to step in all of these weeds who knows what’s swimming around in this thing fine so you and your friends devise a sort of classical photo El electric effect experiment you get a bunch of empty buckets that you might have around to keep ice you know keep your beverages cool while you’re playing that day you empty out the buckets and you you carry them over to the shore of the the pond and uh one of you kneels down at the edge of the pond and starts using the bucket to push on the surface of the pond well this generates water waves so you’re pushing on the surface of the pond and the water waves are making the ball and the lily pads wiggle up and down but it’s not knocking the ball loose no no problem you’re at the limit of your strength but you’ve got lots of friends so your friends all also kneel down at the edge of the pond near you and they start pushing on the surface of the pond and you’re not very coordinated so these waves have different amplitudes at different times but eventually if you’re patient enough some waves will pass through the ball they’ll add up an amplitude constructively interfering and they’ll deliver enough energy to the ball to knock it out of the lily pads so the photoelectric effect in analogy to this ball stuck on a pond in a bunch of lily pads uh should be behaving as follows if you send in light waves even feeble light waves that don’t themselves have enough energy to liberate a charge from a metal if you send in enough of those light waves at the metal you will begin liberating charges the light wave amplitude should add up they go as the electric field squared of each wave and if you wait long enough you’ll start knocking electrons out of the metal that’s what people expected from the classical theory of mechanics and electromagnetism but what was actually observed in the close study of the photoelectric effect well what was observed was that the intensity of the light you shine on the metal has no effect on initiating the effect itself the photoelectric effect can’t be induced by simply cranking up and up and up and up the intensity of light if that light doesn’t already seem to have the ability to make a current flow in the metal we can simulate The observed photoelectric effect using this fet simulation that’s available on the web for example I can start by trying to shine long wavelength light onto a metal I’ve selected a copper plate which is located on the left side of the apparatus I have a representative light source at the top of the apparatus and as you can see I can control the intensity of the electromagnetic radiation or light that I can Shine On The Copper I’m going to go ahead and crank this red light source up to 100% of its intensity and as you can see there is no observed current in the graph on the right the graph shows on the y- AIS the electric current that’s observed in the system and on the horizontal axis the intensity of the light which is currently pegged at 100% even if you wait 1 minute 10 minutes 100 minutes you think you’re allowing the amplitude to build up and occasionally knock an elron out of the metal but you see nothing now instead if you change the wavelength or frequency of the light you maybe can see what happens to the effect do you make an electric current flow well if you start from a particular wavelength of light that doesn’t cause the photoelectric effect and then you change it gradually to a longer wavelength say start with red light and then change it to microwaves or radio you’ll also notice moving the intensity of the light up and down doesn’t cause the photo El electric effect to start but if you shorten the wavelength from the ineffective wavelength to something shorter higher frequency shorter wavelength at some point you’ll suddenly notice that electrons will will begin to flow through and off the metal you can induce the effect as you shorten the wavelength I’m going to begin to lower the wavelength of the light from Red at about 750 NMM down to Orange down to yellow and we still see that nothing is happening I’ve definitely switched to a shorter wavelength of electromagnetic radiation but we still see no current versus intensity on the graph I’m going to continue to shorten the wavelength of the light now we’re into the green we’re approaching light blue or blue now we’re going to the more richer blues and we’re heading toward Violet now I’m definitely down at the shortest visible wavelength range of light and yet the copper is doing nothing and I’m blasting it with 100% intensity but watch what happens when I push this simulation into into the ultraviolet very short wavelength radiation once I cross below a threshold wavelength or frequency for the radiation suddenly electrons begin to get shot off the copper by the light now over here you’ll notice that the current has gone up a little bit on the vertical axis I’m at 100% intensity and I’ve moved up about one tick mark on this axis now once you’ve set the photoelectric effect in motion you might hypothesize that if you crank down the intensity of the light to some sufficiently low level then the waves won’t be able to add up enough anymore and no more charge will flow even before the intensity gets to zero but what you find is that the electric current that you induce in the metal declines to zero as the intensity goes to zero and the electric curve current only goes to exactly zero when the intensity is also zero that is you switch the light off what I’m going to do now is I’m going to go ahead and lower the intensity of the light now you’ll notice that of course the current is decreasing as the intensity of the light decreases I’m still knocking electrons off but not as many and of course if I bring the intensity all the way down to zero then the photo El electric effect switches off there was no point in the intensity and current plot where the effect suddenly switched off before I got to zero intensity in fact if there’s even a little bit of intensity you’ll notice that electrons start boiling off of the copper not many but they come off some very fast and some very slow at a particular threshold wavelength and frequency the photo El electric effect simp simply begins raising and lowering the intensity of the light seems to have no effect on the maximum kinetic energy of an ejected electron even very weak intense light but with the correct wavelength or frequency will rapidly eject an electron occasionally with a high kinetic energy despite the fact that the intensity scales as the square of the electric field strength and shouldn’t more electric field produce more acceleration that’s what all of that stuff from kul’s law and physics 2 and Max equations says should be happening I can lower the intensity down even more down to just 1% of the source and yet nonetheless electrons will come shooting off of this thing with lots of kinetic energy it’s as if the kinetic energy of the ejected electrons from the copper have nothing to do with the intensity of the light but only to do with the wavelength or frequency of that light now I can bring the radiation up in intensity a little bit so that we can see a few more electrons boiling off the metal and what I’m going to do is I’m going to begin to increase the wavelength of the light just very gradually just a nudge at a time at some point we’re going to cross a threshold where the light simply does not have sufficient wavelength to induce the photoelectric effect and it seems that I’ve gotten to it at about 270 nanm now I can go ahead and crank up the intensity to 100% now that I’ve moved just past the wavelength threshold to induce the photo electric effect and yet again we see that intensity does not suddenly cause the photo El electric effect to switch on now you can see how frustrating this must have been for the physicists of the late 1800s this set of observational facts defied explanation using all the battl tested Notions of classical waves and the laws of electromagnetism does this sound familiar does this sound like the moment that led to special relativity because if it does you’re on the right Trail you’ve found a place where the theory of motion and the theory of electricity and magnetism which were largely developed on macroscopic things then CS a new microscopic phenomenon where it utterly fails to make accurate predictions and that smacks of opportunity so how was the photoelectric effect explained well it was our old friend Albert Einstein who cracked the photoelectric effect in one of his 1905 so-called miracle year Publications this was the year that catapulted him into at least physics academic Fame and allowed him to finally secure a faculty position after years of toil at the patent office in Burn Switzerland now to explain the phenomenon Einstein reached back to Plank’s 1900 paper on the black body Spectrum recall that a consequence of Plank’s solution to the problem desperate though the remedy may have been was that light has an energy that’s given not by the intensity of the electric field of the wave but rather by the frequency or wavelength of the light that is e is equal to H Plank’s constant time F the frequency of the light now since the speed of light is equal to the frequency times the wavelength one can substitute into this to get the corresponding relationship with wavelength shorten the wavelength increase the energy increase the frequency increase the energy those are the relationships between frequency and wavelength and the energy of a light packet a light quantum so Einstein Embraces the implication of Max Plank’s work that radiation can be quantized into discrete units and therefore a single unit of light is hypothesized to carry or cost to produce H * f for light of a certain frequency F so even if one unit of light of a certain frequency strikes an electron and therefore 4 strikes it with a certain amount of energy given by H * F The Liberation of the electron is immediately possible independent of the intensity of the light more light Quant striking more electrons per second means more electric current fewer light quanta striking fewer electrons per second means less current but if you have even one you will liberate a charge and that’s consistent with the observations of the photoelectric effect that once you make it happen it happens happens all the way down to even very low intensity until you switch off the light source so what are the equations describing the photoelectric effect that were worked up by Einstein in 1905 so he reasoned that it takes a certain minimum amount of energy to remove an electron from a metal a metal isn’t just going to give up its electrons without a fight I mean otherwise it would be really easy to uh just reach out and strip electrons off a metal but it takes energy so there’s some minimum amount of energy that’s required to liberate one charge from a metal and this is called the work function and it’s denoted by the lowercase Greek letter fi now if a Quantum of light with a given energy strikes the electron and has energy that exceeds the work function then it’s possible to transfer energy to the electron and remove it from the metal it can scatter the electron or even be fully absorbed by the electron the maximum amount of energy that can be transferred to the electron by such an interaction of matter and light is given by the following equation that the maximum energy that an electron can get when struck by a Quantum of light is given by the energy of the Quantum of light minus the work function it takes some energy to remove the electron if there’s extra energy left over after that it goes into the energy of the electron in motion and finally we arrive from Plank’s hypoth ois about the energy of these light quanta at the equation that the maximum energy you can transfer to an electron removing it from a metal is HF minus 5 now if HF is less than F the electron can’t receive sufficient energy to be removed from the metal HF must equal or exceed the work function in order to liberate an electron from a metal and metals of different kinds take different amounts of energy to remove charges from them now where does that energy go well it goes into the kinetic energy of the electron the electron will gain kinetic energy as a result of this interaction with a Quantum of light and so finally we arrive at the following equation that the energy we’re talking about here is really the maximum amount of kinetic energy that any given electron can receive in this Collision process and that’s going to be equal to Plank’s constant time the frequency of the light that struck the electron minus the work function of the metal the minimum energy required to remove the electron from the metal this ultimately leads to the birth of the concept of the photon and implies that light has both particle like and wav like aspects that need to be taken into account now the classical description of light from Maxwell’s equations imagines that light is an electromagnetic wave with an electric field that oscillates in time and space a magnetic field that oscillates perpendicular to the electric field in time and space and that the wave travels perpendicular to both the electric and magnetic fields each wave will have an energy per unit area given by this equation this is what I said before that the intensity of the radiation is proportional to the electric field squared this is all a description but Einstein’s special relativistic description of massless phenomena which light seems to be says that the energy of a massless phenomenon is equal to its momentum times the speed of light now recall that special relativity did not tell us where the momentum itself for light comes from but that thanks to Max plunk and Albert Einstein plunk quantizing radiation and the oscillations of matter in order to explain the black body spectrum and Einstein adopting the quantization of radiation in order in order to explain the photoelectric effect and doing so perfectly then leads to the following description of light interacting with matter that the energy of a light Quantum is equal to plunks constant times the frequency or plunks constant times the speed of light divided by the wavelength of the r radiation and we see that the energy of the light is related to the frequency and that the momentum is also related to the frequency or the wavelength the origin of the numerical value of a light quantum’s energy is wavelength and frequency those tune and control the energy and momentum of a light quantum now the wav likee aspects of light like defraction and interference oscillating charges making electromagnetic waves and electromagnetic waves then also sympathetically causing charges to oscillate these are all very wav likee things that had all been very well confirmed prior to the early 1900s but the black body problem and the photoelectric effect couldn’t be solved with those wav likee aspects you needed particle like aspects of light and these phenomena began to hint that those were needed light energy comes in units that energy is defined by frequency and wavelength and light is a massless phenomenon an electron is a massive particle likee thing that can travel through space light is a massless thing that travels through space and we see from the resolution of the black body problem and the photoelectric effect that light has these Quantum discreet behaviors in the same way that particulate matter has a Quantum or discreet nature now Einstein referred to these packets of light energy as light Quant and again that comes from the Latin Quantum meaning how much now in a letter in 1926 physical chemist Gilbert Lewis uh coined the more common term the one we use today Photon implying a Quantum of light from the Greek for light now in science it’s it’s not enough to describe a phenomenon it’s important that that description have testable consequences and that there is a test that could falsify the explanation and show that it’s wrong now if your explanation survives a test it lives another day and gets to make more predictions and over time if it keeps surviving it gets adopted as an accurate description of nature perhaps even as a law of nature you can can imagine that Einstein’s explanation was not readily accepted of course and much as plank had met his own work with serious scientific skepticism Einstein’s adoption of the quantum nature of radiation to explain the photoelectric effect with all of these interesting consequences was also met with serious scientific skepticism the American physicist Robert milikin who is one of the sort of few well-known American physicists in this early part of the 1900s and his famous especially to high school chemistry students for the oil drop experiment that established the fundamental unit of electric charge although that experiment is a whole fascinating story in and of itself um Milligan did not take the claims of Einstein’s explanation about the So-Cal you know the maximum kinetic energy of an ejected electron and so forth uh very seriously he wanted to test this claim to see if it was possible to refute Einstein’s explanation of the photoelectric effect now we’re going to do a reproduction of this famous experiment by milikin in our class uh but I’ll tease the conclusion of this and and and it’s the following that milikin in 1914 after careful experimentation confirmed Einstein’s description of the photoelectric effect all stemming from the quantum hypothesis of radiation in the end the photo El electric effect paper that appeared in 1905 during this amazingly productive year of work from Albert Einstein won the day and it’s no accident therefore that Einstein went on to receive a Nobel Prize in physics in 1921 interestingly it’s for this work that Einstein received the Nobel Prize in physics not for special relativity not for general relativity but for this Niche effect in experimental physics now Einstein had extended Plank’s work to an entirely separate space of experimental effort not the black body Spectrum where plank determined the value of his constant and while he made a satisfactory explanation of that Spectrum didn’t accept the implications of his own explanation Einstein embraced those implications and then predicted all the aspects of the photo electric effect not only correctly describing what was known of the phenomena but then leading to the experiments of milikin who confirmed that description as accurate fully in its mathematical formulation this set the stage for an entirely new other perspective on nature not the theory of space and time and the speed of light and gravity the theory of the very fast but the Quantum View of matter and radiation the correct theory of the very small so to review in this lecture we have learned the following things we’ve learned how the black body radiation Spectrum was finally understood and about the possibility implied by the this resolution that energy may come in discrete units we’ve also learned about the photoelectric effect and how Einstein resolved the puzzle of the photoelectric effect by embracing the conclusion of MOX plunk work on the black body radiation Spectrum applying them to the photoelectric effect to make predictions about that phenomenon which ultimately proved to be the correct description of Nature and that all of this has set the stage for a new view of radiation and matter [Music] in this lecture we will learn about the following things we will learn about the nature of a kind of radiation called x-rays we’ll learn a little bit about the production of X-rays and and finally we’ll look at the scattering of x-rays by matter and the implications for the nature of electromagnetic radiation now X-rays were discovered serendipitously in 1895 while Wilhelm renan was experimenting with what are then known as cathode rays and which we would now simply know as electrons he was using a device that would boil electrons off a metal using a very strong electric field and he observed some distance away from the apparatus that a special phosphorescent screen was glowing even though there should be no radiation from the experiment actually reaching the screen and so he became obsessed with trying to understand this phenomenon and after careful experimentation he decided that he had isolated a new kind of radiation that was here to for unknown and using the variable for an unknown quantity in math which is usually X he coined the term x-rays to describe these now one of the things that he observed during his experiments was that if he allowed the x-rays to pass through his hand it would cast a shadow on a screen behind the hand that showed only the bones of his hand and in fact this led to him attempting to make the first what we would Now call Medical x-ray in 1895 he used the hand of his then spouse anal VI and her hand famously is the first medical x-ray ever known to have been recorded in the history of science you can see here the dark areas that look very much like the bones of the hand uh the knuckles are up here she’s clearly wearing a ring or something around her finger here and the tips of the fingers are up here the thumb is off to the side uh in for Public Presentation uh ran made a much nicer version of this picture using a different hand and a different experimental apparatus but essentially this is the birth of Medical Imaging as we think of it now noninvasive Imaging using radiation or something else to see inside the body now we now know that xrays are a kind of electromagnetic radiation they’re a very short wavelength light you can’t see them with your eyes but if you have the right instrumentation which rgin did when he serendipitously discovered them uh you can induce a signal in something that can be seen with the eyes they have wavelengths that range at their smallest between 0.01 nanm all the way up to 10 nanm now as Ren discovered they easily penetrate common lowdensity materials think cardboard skin muscle most x-rays will pass through those undeflected unstopped now if you use more dense material between U and the source of x-rays then of course what he observed was that more of the x-rays are stopped so the light regions here are places where x-rays easily made it through the dark regions are places where many fewer x-rays penetrated through the hand in order to get to the Imaging device on the other side in this case photographic film so lead bone this is more dense than skin muscle paper cardboard and so it’s more likely to stop or scatter x-rays now you can imagine that these are insanely useful not just for practical applications but for all kinds of interesting studies of the natural world and they would themselves become a key object of study and ultimately would lead the way toward understanding more about the particle like aspects of light’s behaviors now let’s talk about Arthur Holly Compton and x-ray scattering experiments as I mentioned in the previous lecture in the late 1800s and early 1900s there were not many notable physicists from the United States now that’s as compared to the then European Powerhouse of both Education and Research that was long and well established across the Atlantic Ocean now one of the physicists who became very well known in the early 1900s was Ohio born Arthur Holly Compton interestingly his PhD thesis was in part on the reflection of x-rays after his PhD he received National research Council support and was then free to travel and do research abroad and he selected to conduct work at the then famous Cavendish laboratory in Cambridge England and he did this in 1919 now there he would experiment with very short wavelength flight including X-rays and gamma rays laying the groundwork for his eventual discovery of what is now known as the Compton effect he returned to the United states in the early 1920s and became faculty at Washington University in St Louis and it was there that he observed definitively and methodically now what we refer to as the Compton effect that x-ray Quant scattered by free electrons experience a lengthening of their wavelength after the scattery and that this lengthening is a strong function of the angle at which the light is scattered so how to explain this this is a very particle like picture of an x-ray striking a free electron causing the electron to scatter and itself being scattered now in classical electromagnetism a wave would come in it would start an electron oscillating the electron would oscillate sympathetically but this shouldn’t result in a change in the wavelength of the radiation the like waves on the surface of a pond it’ll make things on on the surface start to bounce up and down but the wave itself doesn’t change wavelength uh when it when it scatters through these things on the other hand uh Compton could only explain this phenomenon by analyzing this scattering process from a more particle like Viewpoint where the X-ray quanta have energy and momentum before and after the collision with the free electron and that because the energy and momentum of the quantum is changed the wavelength is changed and he came up with a precise mathematical formula to relate all of these changes so we can hypothesize as comped did based on Einstein’s 1905 photoelectric effect work which was itself based on Max Plank’s black body Spectrum work that the X-ray incident on the electron which I labeled here I for the purposes of the coming notation uh before scattering carries a total momentum that’s given by uh E equals PC for the incident moment momentum and energy this can be related through plunk relationship to the frequency of the the radiation so H plon constant time fi the initial frequency of the radiation and if we want to get wavelength into this to consider shifts in the wavelength of the scattered light then we can convert this into HC over Lambda I where Lambda I is the initial wavelength now this allows us to write the momentum of the incident x-ray as the initial momentum is plunks constant divided by the wavelength so e is most neatly equal to H * F but P the momentum of the quantum is most neatly related to the wavelength by H divided by the wavelength Lambda the scattering process then occurs and the final scattered light Quantum carries a different momentum P final equal H over Lambda final now the initial electron State we could take as being at rest and and so it has no velocity in the initial state but the final state it involves an electron that’s now been scattered at some angle fi that we’ll write down later now it has a total speed ufal and thus it has a total momentum now I’m going to be careful here I’m not going to assume that this is necessarily a slow moving electron and in fact in reality in the Compton scattering experiments these electrons come out with a whopping great amount of momentum put them very close to the speed of light so close that it is obviously safe to use the relativistic definition of momentum that is the gamma Factor that’s a function of the speed of the ejected electron times the mass of the electron times its speed now to analyze this as a scattering process involving the Collision of a xray with a stationary electron leading to a moving electron and a scattered uh light Quantum we need to only conserve total energy and then momentum in the X and Y Direction so X is here clearly labeled as the horizontal Direction positive to the right Y is the vertical Direction and it would be positive upward vertically so let’s go ahead and do this let’s start from conservation of momentum in the horizontal or X Direction This is a closed and isolated system so the initial momentum of the system that includes the X-ray and the stationary electron must be equal to the final momentum of the system that now involves the scattered light Quantum and the moving electron so if we substitute in with the equations for the initial and final momentum of the light Quantum and we put in the component of the velocity of the electron along the horizontal axis we wind up with this equation which removing the zero because the electron in the initial state is not moving simplifies to this equation here now let me comment on a few things first of all the initial momentum of the X-ray is entirely along the xaxis but only a part of its final momentum lies along the x-axis and so that’s given by HC over Lambda F its total final momentum times the cosine of the scattering angle Theta now there’s another angle in the problem it’s the angle between the horizontal and the electron that get scattered and that’s denoted fi and because of the picture up here we’re only considering the horizontal component of the electron’s momentum and that’s given by gamma m u cosine 5 so so far nothing exciting going on it’s just breaking down the kinematics of the X-ray and the final State light Quantum and the scattered electron all along the x-axis and that’s about as far as we can go right now without knowing things like five the scattering angle of the electron we need more equations and so we’re going to turn to conserving momentum in y so let’s go go ahead and write down the vertical conservation of momentum for the same problem I’ll proceed through this relatively quickly again the initial total momentum in the y direction must be equal to the final total momentum in the y direction there is no initial momentum in the y direction the X-ray is moving entirely along the x axis the unscattered electron has no velocity so the initial state is all zeros and the final state has two pieces the positive vertical component of the scattered light Quantum and the negative vertical component of the scattered electron and so we can consolidate the zeros on the left hand side and we wind up with this equation here now we have SS instead of cosiness for the two scattering angles in the problem now we could use this to solve for fi uh or at least s of fi where it’s already looking a bit nasty we can already see that this is going to be a bit of a lift in algebra let’s see if the conservation of total energy in the system holds any comfort for us in attempting to get at a singular equation that relates the initial wavelength the final wavelength and the scattering angle of the light Quantum well we’re going to conserve total energy total initial energy must be equal to Total final energy we can plug in the total energy of the initial x-ray HC over Lambda I now remember the total energy of the unmoving electron is not zero this is special relativity mass energy is internal energy and is therefore Just Energy so we have to put in the rest mass energy of the electron the final light Quantum has an energy HC over Lambda F and the final scattered electron has a total energy given by gamma time the mass of the electron time c^ 2 this involves both kinetic and internal mass energy now we can then uh just rewrite this equation without the conservation of energy stuff on the left and we arrive at this equation here relating initial and final energies nothing’s really simplified so there’s not a lot of comfort here it’s going to be an algebraic lift but these are the pieces that Compton would have worked with and in fact did work with in order to try to understand his scattering experiments from his experiment he would have known three things the incident x-ray wavelength Lambda I the scattered light wavelength Lambda F and the angle at which light is scattered Theta so the question is of course can we use algebra possibly pages of it in order to relate these things using this hypothesis of a particle like scattering process between light quanta and an electron and could we then make a prediction for the relationship between these three things well the answer is yes and I’m going to leave the lengthy algebra to the viewer or reader of all of this stuff here but basically we’re going to eventually find by working through all of this what Compton found and that is that the predicted relationship between the final and initial wavelength and the scattering angle is given by this very Nic looking equation here in fact what we find out is that from Compton’s analysis of this process it suggests that the difference in wavelengths after and before the scatter will depend only on the scattering angle of the light and some constants of nature H the mass of the electron and the speed of light Compton ultimately confirmed that this was a correct description of of these experiments by doing his own experiments and testing this idea now there are some implications from the Compton effect which is described in this formula an undeflected x-ray that is an x-ray that goes straight through the system with an angle theta equals 0 will experience no shift in wavelength the cosine of 0 is 1 1 minus 1 is zero there is no difference between the initial and final wavelength of the X-ray that doesn’t lead to a big surprise but more interesting perhaps is that if you have a completely deflected x-ray one whose scattering angle is 180° or Pi radians that is a so-called back scatter comes straight back at the source of the x-rays it will experience the maximal possible shift in wavelength and that corresponds from a energy perspective to the largest achievable kinetic energy for the electron that’s the most kinetic energy a scattered electron is ever going to get is when you have a perfect back scatter of the uh the x-rays as a result of losing energy to the electron and coming out at this 180° scattering Angle now Compton in the course of doing his experiments did observe scattered light at Angles other than those expected from Simply scattering off the electrons and from this he determined that some of the x-rays were scattering not off of Just electrons in the atoms but entire atoms themselves that is you could rework the algebra that would lead to the compon scattering formula not by putting in the mass of a scattered electron but by putting in the mass of an entire scattered uh Atomic nucleus or atom and if you do that you’ll find that scattering at the same angle leads to a much smaller wavelength shift because the mass of an atom is much bigger than the mass of an electron and that causes the wavelength shift to get much much smaller at the same angle but nonetheless you will see scattered light with an Al an entirely different set of wavelengths albeit at a lower rate at that same angle when sometimes the x-rays scatter off of whole atoms and not just electrons what this also implied was that for light with wavelengths or frequencies at the level of xrays which does span a large space of of of wavelength ranges scattering of the light behaves more like scattering particles off of other particles like bouncing tennis balls off of bowling balls or something like that rather than waves off of particles where waves would cause sympathetic oscillations in the particles but wouldn’t change the wavelength of the original wave so this flies in the face again of the purely wave hypothesis of light and it seems that under these conditions a much better and more accurate description of the way that light behaves is as if it behaves as a large collection of quanta more than as a collection of waves now this all has deeper implications that stem from the Compton effect let’s take everything from the last few lectures together into one coherent picture all together the black body problem the photo electric effect and Compton scattering Point toward a complex set of aspects of light Behavior light isn’t just a wave and it’s not just a unit or discrete thing like a particle under some conditions light behaves exactly according to classical maxweld equation Theory waves scattering off of or otherwise interacting with and causing oscillations in matter that behavior was well established by the late 1800s electromagnetic waves really can behave like waves but under different conditions suddenly one can observe that light behaves according more accurately according to a particle description a Quantum description that light is discretized in some way not continuous like a wave so in that case it’s better described as a collection of quanta the photon so many photons all acting together and that can be thought of as particles interacting with the particles that themselves compose matter electrons whole atoms and so forth so what ultimately resulted from all of this was that there are particle like aspects of light’s behavior that tend to correspond more often to when the wavelength of the light was very short that is very high frequency whereas the wav likee aspects of light’s Behavior seemed to manifest or correspond more when the wavelength is very long that is the radiation has very low frequency somewhere in that space of wavelengths and frequencies between very long and very short there’s a transition between these sets of behavior wav like and particle like but what defines short and long that’s a very arbitrary distinction something that’s hot to one person may actually be kind of chilly to another think about uh the way that offices are heated or air conditioned uh some people find the temperature in a typical office setting perfectly fine and acceptable some people have to put a blanket over themselves to stay warm because they view it as chilly okay but what how do we Define short and long to understand when the wave and when the particle like phenomena are applicable when it turns out that the answer has to do with the dimension D or size or scale of the system with which the light interacts so the longness or shortness of of wavelength or the highness or lowness of frequency is when compared to the size of the system with which the light interacts if the wavelength of the light is much much greater than the dimensions of the system think long wavelengths that are far in excess of the size of uh of atoms for instance then it turns out that wav likee Behavior rules the atom experiences light like a wave if the wavelength is much much smaller than the dimensions of the system then the system experiences light more like being crashed into by a particle where all the momentum and energy is transferred at once particle like Behavior rules now in the middle as the wavelength becomes comparable to the size of the system things get very complicated and you have to be extremely careful and have an accurate theory in order to actually predict what’s going to happen in that case so there are some extreme cases the wavelength is much smaller than the dimensions of the system the wavelength is much longer than the dimensions of the system those are easy to handle when absolutely particle-like Behavior or absolutely wavelike Behavior manifests in the middle things get dicey and in order to describe systems which have comparable sizes to the wavelength of light for instance you need the Right theory we don’t quite have it yet now let’s talk about the sizes of things as a primer for what’s to come in our thinking about the interactions of light and matter to probe the scales of things with sizes larger than a virus and we can see from this chart over here that a virus has a size scale that’s roughly 100 nanometers okay you you can find that it’s sufficient to use visible light bacteria have uh sizes of about one micron 1,000 nanometers in size red blood cells 10,000 nanometers or about 10 microns in size hair is about 100 microns or 100,000 nanometers in size ants are 10 the 6 nanometers baseballs are about 10 to the e8th nmet we’re in the realm of the macroscopic macroscopic here meaning larger than the wavelength of visible light so this helps us to understand a little bit about why it is is that we didn’t get ourselves into trouble with large scale descriptions of motion and radiation all Newton’s laws and Maxwell’s equations when we were dealing with things that had sizes that were much um smaller than the wavelengths of light that we were using to interact with them looking at a bacterium or a red blood cell or a hair follicle with a microscope is straightforward because the wavelength of visible light is much smaller than all of those things and so it simply scatters off of them and we can resolve the sizes of those structures quite easily when you have a wavelength that’s smaller than the structure you’re looking at you can resolve the features of that structure but to probe viruses and DNA or hemoglobin or macro molecules like glucose for instance you need x-rays you need to get down to sizes that are about the level of 1 to 5 to 10 nanometers or so uh in those cases you’re going to need something like x-rays if you want to resolve the structure of glucose hemoglobin DNA which are obviously essential to understanding modern biological functioning so x-rays are your friend when you want to probe structures that are smaller than bacteria x-rays will allow you to see if you have the right instrumentation to reveal them to the eye these sorts of distance scales but if you want to probe atoms and molecules you need to really push your x-rays you need to go down to the shortest x-ray lengths about 0.01 nanometer or so the scale of atoms is at the level of .1 nanm or 10us 10 m so x-rays can be comparable to or smaller than but not by much the size of an atom and so the particle like aspects of light begin to emerge naturally at this scale it’s no surprise that the behavior of x-rays which is an electromagnetic radiation became very particle likee when we started looking at them interacting with atomic systems like the atoms in metal and the electrons in those atoms those things turn out to have scales that are roughly comparable in size uh or a little bit bigger than the kinds of x-rays that were scattering off of them and so that’s when we got our solves into trouble when it came to the theory of light and how it’s supposed to behave when it interacts with matter it’s it’s when the size of the light got to be smaller than the size of the thing that we were smashing the light into and suddenly we needed a slightly different description of light in order to understand all that now just to tease things if you wanted to probe the nucleus of an atom there we’re talking about sizes at the level of 10 -5 M or or one femtometer and for that x-rays are just too big you’re not going to resolve things of the size of a nucleus of an atom using x-rays instead you need

    something with a wavelength that’s really short like gamma ray radiation or even other things that as we’ll learn turn out to have even smaller wavelengths than gamma rays let’s revisit the phenomenon of the interference of light we’ve looked at this in class in the context of the Michaelson Morley experiment what we saw in an in-class demonstration was that light that is forced to pass through a very narrow opening will defract you’ll get a pattern on a screen some distance away from the slit that the light passes through that shows light and dark spots the bright spots are where the waves have constructively interfered and their amplitudes have added up the dark spots are where the waves have been out of phase with each each other and destructively interfere black areas are places where the waves completely cancel each other out this computer simulation imagine sending in light waves toward a barrier that has two slits in it the light can defract through either slit the resulting wavefronts that come out on the far side of the barrier then interfere with each other and if we put a screen up here on the right side we could imagine Imaging this and seeing bright spots and dark spots and bright spots and dark spots and then fainter bright spots and so forth the pattern can be controlled by changing the geometry of this setup so for instance if I increase the separation between the slits and wait a few moments for the light pattern to catch up we’ll notice that as we take more data with the screen on the right the number of bright fringes has increased we now see that what was once faint on the outside is much brighter but nonetheless we have bright dark bright dark bright dark and so forth this is a wave Behavior how can we reconcile the particle nature of light and the wave nature of light in a phenomenon like this two slit light defraction instead of imagining waves of light coming into a system with two slits in the barrier let’s instead set up a situation where we can fire say photons that correspond to green light at a barrier with two slits in it one at a time if we do this we imagine sending in one Photon that Photon has to go through either the barrier on the left or the barrier on the right we don’t know which barrier it’s going to go through but we can look at the screen on the other side to see where it lands and slowly one Photon at a time as we look at the observing screen on the far side we see green dots the green dots indicate where the photon that we fired ultimately wound up on the screen one Photon at a time we’re building up an image on the far side of the screen now this is rather tedious we’d like to see if a pattern emerges in all of this so I’m going to speed this up and then when sufficient information has been received by the viewing screen on the other side I’ll comment on the pattern sufficient time has passed that we can begin to comment on the pattern we observe on the detector screen there are places where photons have clearly clumped after passing through the two slit process there are places where we find few or no photons on the detecting screen for instance those darker regions flank the bright region in the middle this is akin to the interference pattern that we saw when we were thinking about light as a wave traveling through this system and interfering with itself now single Photon by single Photon we’re building up a similar intensity pattern on the screen on the far side there are bright bands dark bands bright bands dark bands and so forth the same alternating pattern of high intensity and low intensity that we saw from the wave Behavior indeed it seems that the wave behavior is recovered in the limit of a large number of photons passing through the system this reconciles the wave and particle Behavior aspects of light in a single experiment and in fact this famous Young’s tolit experiment is one of the many ways that one can reconcile and understand these dual aspects of the existence of the phenomenon we call light in fact what seems to be true for the single Photon experiment is that we’re unable to predict with certainty where any single Photon will wind up striking the screen on the other side but the probability that a photon will strike in the middle is much higher than the prob ability that it will strike just to the right or just to the left of center and from that we can begin to build an understanding that the probability of where a photon goes on the screen seems to be somehow related to the intensity the amplitude squared of the light wave description of nature now in this lecture let’s review what we have learned we’ve learned about the nature of X-rays and we’ve seen a little bit about how to produce them by experimenting with cathode rays electron smashing into a Target we’ve then looked at the scattering of x-rays by matter and following Arthur Compton’s explanation of his scattering experiments have come to understand something about the nature of radiation with very short wavelengths from this we’ve seen the implications not only for the nature of electromagnetic radiation as having both particle like and weight like aspects under different conditions but something about the conditions themselves that trigger these different aspects of the behavior to be observed when the wavelength of the radiation is much smaller than the scale of the thing that it’s scattering off of then we see the particle like aspects of light’s Behavior emerge when the wavelength is much greater than the size of the thing off of which the radiation is scattering then we see the wavelike aspects of the r radiation emerge and in between there’s a transition a place where we lack a theory so far to actually understand how to calculate these are the foundations for what will happen next as we depart the comfortable world of radiation with its wavelike behavior and now its newly understood particle-like behavior and turn our eye from radiation to matter [Music] itself in this lecture we will learn the following things we’ll learn about the structure of the atom as it was known in the late 1800s and very early 1900s we’ll learn about how matter itself can have way aspects to Its Behavior we’ll learn about Lou de bry’s experimentally verified conjectures about the wave properties of matter and we’ll learn about how to conduct experiments that reveal the wave aspects of Matter’s Behavior to review let’s take a look at the things that describe the wave aspects and the particle aspects of electromagnetic radiation recall that the wave description of light was really the first and most formally developed part of the description of Its Behavior set and these are included in Maxwell’s equations now they describe a spatially and temporally distributed phenomenon you can boil the wave equations in Maxwell’s equations down to this set of equations describing space and time variations in electric and magnetic fields and these variations propagate at the speed of light in the material under configuration for Simplicity we can assume empty space or simply the vacuum and in that case the Solutions in empty space are the famous electromagnetic wave Solutions an oscillating electric and magnetic disturbance that travels at the speed of light perpendicular to the variations in electric and magnetic components and then of course there’s an energy per unit area of an electromagnetic wave in empty space there’s no one place where the energy is concentrated there’s more in some place and less than others and one can think about the energy density or the energy per unit area of a traveling electromagnetic wave phenomenon now the particle description of Light which emerged from evidence based on the black body radiation spectrum and the photoelectric effect these are descriptions of something that has definite energy and definite momentum at a definite location in space and time that’s what a particle is it’s a localized phenomena at a very specific place in SpaceTime whereas a wave is a spread out and distributed phenomenon that isn’t only in just one place in SpaceTime now from Albert Einstein’s work on the photoelectric effect which built upon Max plunk work with the black body Spectrum we have sort of a combined description of the particle like aspects of light’s Behavior set so for instance in special relativity we have massless phenomena whose energy and momentum are related by the speed of light eal P * C but unfortunately in special relativity we couldn’t Glimpse where the energy or the momentum of light came from we got nonsense answers from Pure special relativity however Max Plank’s work with the black body Spectrum revealed another relationship that the energy of a Quantum of light a photon is related to the frequency of the light e equal HF and the constant of proportionality is Plank’s famous constant we can plug in the relationship between wavelength and frequency for a light wave C equals F Lambda and we can get a relationship for instance between energy and frequency energy and wavelength and more interestingly between momentum and wavelength which we get by combining the energy and momentum relationships between special relativity and the black body Spectrum so we find that for a Quantum of light a localized packet like unit of light energy is related to The Wave properties of that same phenomenon by eal HF and momentum is related to The Wave Properties by P = H over Lambda let’s Model A Long wavelength interaction and a short wavelength interaction using a beaker sitting in a tank of water this identical Beaker is going to be smashed into by a wave under two different conditions on the left a long wavelength disturbance that’s much larger than the size of the beaker and on the right a short wavelength disturbance whose size is comparable to that of the beaker this will illustrate the difference between wav like phenomena and particle like phenomena the motion on the left as the wave develops in the tank will be more gradual with the beaker gently falling and then Rising whereas on the right when the short wave crashes into the beaker it’s almost as if it’s been struck by something small and fast moving look how violent the Collision on the right is compared to the one on the left we might argue the beaker on on the right has been struck by something more particle like whereas the beaker on the left which is bobbing around has been struck by something more wav likee this helps to illustrate why a wavelength phenomena short compared to the Target will exhibit particle like Behavior whereas a wavelength long compared to the Target will exhibit wav like Behavior but what if what’s so special about electromagnetic radiation why does electromagnetic radiation get to have all the fun of having particle likee and wav likee aspects to Its Behavior set what if matter electrons protons neutrons whole atoms also could exhibit wav likee behaviors they had been experienced primarily as particulate objects definite things with locations in space and time but maybe it was just because nobody had observed the wave aspects of the behavior up to a certain point such as in the early 1900s there were hints of course that something funny was going on with matter at the scale of the atomic size so for instance it had been long known since certainly the work of Anders enstrom in the in the early part of the 1800s that the emission Spectra of elements like Hydro gas or helium gas that they emitted only certain colors white light comes in a full rainbow and if you stare at White Light closely enough for instance from the Sun you’ll you’ll see that there are missing lines of color in its Spectrum but they’re hard to detect if you take a pure gas and you excite it so that it emits light it’s much easier to see that it’s only allowed to emit certain wavelength certain colors of light here I show you the hydrogen emission spectrum revealed using a optical disc a DVD or a CD ROM the many scattering surfaces for light on the surface of the disc will spread any light that strikes it out like a prism into a rainbow but we see an incomplete rainbow from hydrogen there’s a bright red line there’s a bright blue green or cyan line there’s a blue line and then there’s a fainter purple line which you can actually see part of down here reflected in the disc now this is a more classical way of laying out the emission spectrum of a gas like hydrogen’s flattening It All Out On A Plane here’s that red line the blue green line it’s very hard to see the faint dark blue line and then there’s a violet or purple line down here these are the long wavelength emissions these are the short wavelength emissions but there are big gaps in between these things and no two elements have the same spectral fingerprints excite helium excite Neon excite argon and you’ll get a very different pattern of colored lines out of each of those why why are atoms only allowed to emit certain kinds of light that was a source of curiosity in the 1800s that could not be resolved whole mathematical patterns were observed out of the relationships in wavelength or frequency between these colored lines but nobody could make sense of why these relationships existed and where they came from now to understand what’s going on with matter at size scales like that of the atom it’s very valuable to dig back a little bit into the history of the discovery and description of the atom as a real phenomenon in nature now of course thousands of years ago philosophers and mathematicians and perhaps what would now be considered Proto scientists and engineers thought deeply about matter and they argued endlessly about whether it was continuously distributed made of only a finite number of substances or Atomic in nature that is coming in small units that could be built up into the structures we experience in nature but that was a lot of argument without a lot of evidence and our modern understanding of how to understand the natural world and the scientific method reflects the reality that speculation is fine but the final Arbiter is observation of Nature and the testing of your claims the discovery of atoms as a real feature of nature or at least a potentially real feature of nature goes back to the early 1800s when chemist John Dalton discerned that not only elements have weight and that the weights are specific to each element in proportion to for instance the weight of hydrogen um but also that when you react one element with another element you’ll only get products from the reaction that completely use up the reactants if you have the right proportions of reactants for instance you might try reacting two things one to one but have an incomplete reaction react them in a ratio of 2: one and you completely eliminate all of the original reactants that went into the process that was something that Dalton characterized and it was a strong hint that the elements come in units and that those units have rules of comp combination that only allows certain proportions of them to completely react and disappear into other final products now it wouldn’t be until 1897 although speculation had preceded in the decades before this work that Joseph John or JJ Thompson would reveal the first component of what would come to be known as atams atams themselves were not completely firmly established as the correct description of nature in 1897 but Thompson found found out by experimenting on a kind of radiation known as cathode rays in his day that they actually possess of mass but they possess of a unit of mass that is about a thousand times less than that of hydrogen now this would imply that either there’s a lighter element than hydrogen or perhaps one had ripped something out of hydrogen and isolated it in the first place to be studied he observes that these cathode rays with their very tiny masses also possess of electric charge and can be made to for instance accelerate in electric Fields or bend in magnetic fields in 1905 based on the idea that this electron which composes the the cathode rays which is the identity of the COS the cathode rays is a piece of what we call atoms he proposed a model of the atom it imagine a central large positive charge with negative charges embedded in it and this was known as The Plum Pudding model because it look very much like a a British dessert known as a plum pudding where you have a whole bunch of raisins or other fruits embedded in sort of a uniform distribution of dough which is cooked up into a dessert so imagine that the raisins are the negative charges and the positive charges the dough and the negative charges are spread throughout the dough this was Thompson’s model of the atom now that may sound ludicrous and cartoonish but the beauty of science is that that’s a conjecture that can be tested for instance you might imagine trying to do experiments that verify whether or not the negative charge and the positive charge are uniformly spread out in something of the volume of an atom that’s an experiment that couldn’t necessarily be conducted at that moment in 1905 but it was certainly possible shortly thereafter now cherry-picking my way through the story I want to focus for a moment on Marie and Pierre c um now they among many other people came to understand that unstable elements or radioactive elements that emit radiation when they Decay away um results in a a new kind of radiation that had not yet up till that point been understood now they experimented on these radiations and it was finally Ernest Rutherford who classified them into the modern way that we usually talk about emission of radiation from unstable Atomic nuclei and those three classes of radiation are alpha beta and gamma for the first three letters of the Greek alphabet Alpha radiation would eventually be revealed to be a a whole hydrogen nucleus entirely ejected from a very heavy nucleus of a very heavy atom so this would be two protons and two neutrons bound together in a very stable little unit uh and it can be spat out of an unstable nucleus that spontaneously radioactively decays now Alpha radi ration is highly electrically charged it has plus two units of the elementary charge because of its two protons and that means that it can’t penetrate very far into material but it can get into material and it can dump a lot of energy along the way um as I mentioned Ernest Rutherford came up with this classification scheme he was another physicist who is considered to be one of the greatest experimentalists if not of his day perhaps even of all time uh working in conjunction with the physicist Hans Giger and Ernest Marsden he scattered Alpha radiation off metallic targets and he found out by looking at the scattering process that the plum pudding model of JJ Thompson did not describe what happened when you scattered uh alpha particles off of atomic nuclei the Thompson model would have postulated that because all the charges very spatially spread out the probability of striking any of the positive charge or any of the negative charge is extremely small and so for the most part you’d expect to find your Alpha radiation traveling through the atom lightly scattered but mostly coming out on the other side of the target but when Weatherford asked Giger and Marsden to look at what’s called back scattered alpha particles that is look for alpha particles that strike the metallic Target and then reflect almost exactly back at the original emitter of the alpha radiation they were surprised to find out that there are a significant number of alpha particles that bounce back off of the matal target as if they’re striking a huge Target of positive charge concentrated somewhere in the center of every atom and this in fact was a picture that Rutherford used to build his own model of the atom modifying JJ Thompson’s model and concentrating all the positive charge in each atom at the center of the atom this forms the first sort of planetary model of the atom as electrons orbiting a central tightly packed nucleus with a huge positive charge of course depending on the element in question but it was this picture that adequately described the back scattering process with its higher rate than expected from the Thompson model uh observed by Rutherford Giger and Marson this is now known as the Rutherford model of the atom and it would be further modified as more experiments were conducted on this system now how do we know the sizes of atoms well skipping ahead a little bit in the story of the atom you can look at the scattering of of x-rays for instance we looked at Compton scattering in a previous lecture uh but imagine scattering x-rays with slightly longer wavelengths than we would have been talking about when talking about Compton scattering here the X-ray is it turns out comparable in size to the atoms off of which it’s scattering you know with wavelengths of about .1 nanometer or so um smashing these x-rays into crystalline solids like table salt sodium chloride it was observed that specific patterns will appear in the scattered x-rays so for instance this image on the right is the very first x-ray defract made by Max von Lao Paul nipping and Walter Friedrich in 1912 now not long after Rutherford’s experiments revealed that the atom was composed of electrons with a tightly packed po positively charged nucleus now um Von Lao nipping and Friedrich noticed that there were bright spots where the x-rays tended to accumulate and dark regions where no scattered x-rays tended to be observed and this interestingly enough looked like a interference pattern that you would expect from light interfering and scattering in different ways off of a Target so using these interference patterns um and especially through the work of William Henry Bragg and William Lawrence Bragg the only Father and Son team to ever won the Nobel Prize in physics um they were able to explain the scattering of the the x-rays as being off of small objects albeit with comparable size to the x-rays in question and separations in space that were similarly comparably sized so William Henry Bragg and Lawrence Bragg did their own scattering experiments and Laurence brag in particular developed a model of the scattering process of scattering x-rays off of Regular layers of atoms in a crystall and solid that beautifully explained these patterns of light and dark that were observed at first by Von La nipping and Friedrich in 1912 and this actually led to the ability to determine the approximate size of atoms using these x-ray defraction patterns let’s take a look at the model that Lawrence Bragg developed because it will help us to understand how we can detect wave properties in general going forward let’s begin by modeling a crystal as a series of regularly arranged atoms layered in Planes we’ll come back to the separation between the planes later but they could be represented by some distance D which will appear later in this example let’s then imagine that we draw an incoming x-ray that scatters off of one particular particular atom in a plane at the top of the system now from the place where this Ray has been emitted the X-ray will strike an atom and Scatter off of it this will have a certain path length associated with it the default length that this x-ray had to travel during the scattering process we can imagine then that this x-ray came from the plane of emission shown here which makes a 90° angle to the original x-ray a second Ray emitted from very close by from the plane of emission which which also makes an angle of 90° with respect to that surface strikes another atom nearby missing the first one but hitting one in the layer below it that Ray also scatters and is detected at another point where the first x-ray is also detected photographic film or a camera or some system like that now because the second Ray did not strike the same atom as the first x-ray there’s going to be an extra bit of distance that the second x-ray has to travel before coming back to the plane where the first x-ray is also detected so we can imagine considering what that extra length is by drawing another line parallel to the line of emission the plane of emission that represents the extra distance that the X-ray would have to go that’s highlighted here in red this is the extra length that the X-ray the second one scattering off the second atom has to travel before it returns to the same location where the first x-ray also strikes a detection system on each side of the scatter off the second atom we have an extra length capital L that the X-ray had to travel and we can start doing some geometry to figure out how one relates that extra length L to the displacement D between atoms and the planes of the crystal notice that the angle between the black lines which are parallel to the plane of emission and the red lines here must also be 90° this is some geometry that you yourself could work through to verify but that Ray will always remain perpendicular to the plane of emission now the scattered x-rays will make an angle Theta with respect to the surface of the Crystal and if one works through the trigonometry and the geometry of the problem you’ll find that there is one interior angle inside the little Tri triangle whose hypotenuse is D and who each have a side of length L and and the similar angle is indicated here now we can relate the length L this is half the extra length the Ray has to travel to the distance D and the angle Theta of scattering by simply noting that in this triangle the S of theta is equal to l the opposite side divided by D the hypotenuse of the right triangle now let’s think about what’s going to happen if these two waves one scattered off of one atom on the surface of the Crystal and one scattered off of an atom in the next layer of the crystal meet at the same place on the detection screen at the same time one of these x-rays the first one for instance is a wave and it’s going to have crests and troughs just like any other electromagnetic wave now its partner x-ray that arrives at the same time will interfere constructively or destructively depending on the alignment of the second Ray with the first one let’s imagine we want to figure out what the condition is for completely constructive interference that is where the peaks of X-ray 1 line up with the peaks of X-ray 2 and the condition for that is that they be shifted relative to each other by exactly an integer number of wavelengths this is the condition for constructive interference the waves can be shifted in Distance by some distance 2 L with respect to each other but the condition is that that distance 2 L has to be an integer multiple of the x-rays wavelengths after scattering so n * Lambda so that is an integer number n * the wavelength of the X-ray Lambda meets the condition for constructive interference when n is an exact integer multiple of Lambda as I said the condition for constructive interference is that n * Lambda is some distance D and that’s the extra distance that the second x-ray has to travel and from our picture that’s twice L now we can relate this extra length L to the angle of scatter of the x-rays Theta using the trigonometric relationship derived earlier and that relationship was just that sin Theta the sign of the scattering angle equals L the side opposite that angle divided by D the hypotenuse of the triangle this allows us to solve for l in terms of D and sin Theta L is equal to D * sin Theta now plugging that into our Construction interference condition we find the following that if the second x-ray is shifted by an integer number of wavelengths with respect to the first n * Lambda then this will simply equal to 2D sin Theta constrained by the scattering requirements in the system for constructive interference and this condition this mathematical condition in order to obtain constructive interference is known as the brag condition as derived by Lawrence Bragg originally in thinking about this this x-ray scattering process so all one has to do is look at Angles where you see bright spots in the interference pattern and this will tell you given the wavelength of the x-rays what is the space separation of the planes of atoms in the crystal now in the specific case of the sodium chloride x-ray scattering that I hinted at earlier if you take regular crystals of sodium chloride and expose them to a beam of x-rays you can look to see where in scattering angle uh relative to the incident beam the bright spots and dark spots appear so for instance we have here an x-ray spectrometer the vertical axis is the number of x-rays per second that are detected and the horizontal axis is the angle with respect to the incident beam of x-rays now Theta here is the SC shattering angle with respect to say the surface of the material but this can be related via 2 Theta back to the original angle to the beam you’ll notice that there are in fact places where there are buildups of intensity of scattered x-rays so for instance just before 30° around 28° or so and just around 32° and then there’s another clump of Peaks over here uh there’s a clump just around 60 or so uh and so forth and then there’s another Clump over here there’s a very low bump and then a larger bump and you’ll notice that these bumps come with different intensities well what’s going on here is that a a copper emitter is being used to generate the x-rays and because of the properties of copper it generates two kinds of x-rays in the beam the so-called copper K Alpha line and the copper K beta line the K Alpha line has a wavelength of about .15 nanm and the the K beta line has a wavelength of about .14 nomer so they’re not exactly the same wavelength and that explains why the first bright Fringe in the X-ray has two peaks one from each of the K Alpha and K beta lines the second bright spot in the X-ray scatter has two peaks again one from the alpha and one from the beta line and so forth now if you take the Lawrence brag scattering approach and you relate the locations and angle space of bright spots constructive interference locations back to the size of the scattering uh distance between scatterers in the crystal lattice you can actually estimate the separation of the atoms or molecules that make up the crystal lattice and you find out that this comes in at about 28 nanm regardless of which of these x-ray lines you consider so we find out that the spacing of the scatterers inside a sodium chloride crystal is about the same scale as the X-ray wavelengths it’s only about a factor of two or so larger than the X-ray wavelengths that’s easy then for us to see the wave nature of the scattered x-rays emerge because they are a little bit bigger than but comparable in size to the things off of which they’re scattering uh it’s no wonder we don’t see strong Compton scattering here the particle nature of the x-rays is not in effect the wave nature of the x-rays because they’re large compared to the size of the things they’re scatter in off of is in effect uh but this is nice because it tells us roughly the scale of the size of the scattering objects and that comes in at about a fraction of a nanometer so this roughly tells us that the size of atoms or Atomic distance scales is at that level of about a fraction of an anomer now this tells us something about the sizes of atoms atoms come in at sizes around 10 Theus 10 m or so this unit is not in the system international but it’s known as the anstrom in honor of Anders anstrom the enstrom is about 10us 10 m and that roughly corresponds to the size of say a hydrogen atom or an atom that’s slightly larger than that now going back to atomic emission Spectra that is you know heating or ionizing a gas an elemental gas like hydrogen or helium or neon or something like that we get these patterns of light that come out you know it’s it’s as if only certain energies are per Ed for the electrons in an atom why would that be well in your mind you might start modeling the electron in orbit around the central nucleus of its parent atom as a string on a guitar a string on a guitar is confined at two ends uh it’s bolted down at two ends and tensioned and once you set the tension of a guitar string all the primary and secondary frequencies of its vibration are fixed and that’s how you you can tune the tension of a guitar string and get a specific note a note consists of a specific fundamental frequency and then a whole bunch of other frequencies layered on top of it with regular intervals and what determines the frequency is the length of the string and the tension of the string and that’s that basically says how many of each kind of standing wave with a certain wavelength can actually be found on a guitar string so perhaps like guitar strings confined at two ends electrons are wav likee and find themselves confined in a specific volume with only specific frequencies allowed that would certainly help explain why these patterns of light are so specific to each atom so we might draw in our mind a model of the atom as an electron confined to a volume like a spherical volume with a radius that’s about the size of an atom. one nanometer or so maybe it’s there that these wavelike properties of electrons which you could couldn’t really notice at larger scales clearly emerge and maybe that’s why Atomic Spectra have the properties that they have with these regularly spaced and in fact mathematically related colored lines this certainly would be consistent with observations of other phenomena like the black body cavity emitter where only certain vibrational frequencies of the walls of the cavity appeared to be allowed and that constrained the radiation that the cavity could emit so this isn’t totally alien the black body spectrum and atomic emission Spectra may be two aspects of the same behavior trying to tell us something about matter so if matter can be wavelike as well as particle like what is it that determines the wave properties of matter remember for light we had Maxwell’s equations they were built up from the careful study of the electric and magnetic forces and Fields and imer merged as wave equations that when solved in empty space told us that light was an electromagnetic wave an oscillatory phenomena with wav likee characteristics we have no wave equation for matter there is no first principles thing that we’ve experienced up through the end of the 1800s that tells us oh well of course there’s a wave equation for matter too so we don’t have a starting point for the wave properties of matter assuming they’re even real at all so in his 192 4 PhD thesis French physicist Louis de BR postulated postulated in the same way that Einstein postulated that the speed of light was the same for all observers that matter also has wave properties and not only that drawing from Plank’s relationship between energy and frequency for light and the relationship between momentum and wavelength that results from special relativity de Bry asserted the hypothesis that the very same facts would be true true for matter if it had wavelike properties so the energy of a piece of matter would be related to the frequency of the matter Wave by eal HF that’s a conjecture that the momentum of a piece of matter would be related to the wavelength of the corresponding matter Wave by H divided by the wavelength that’s a conjecture so how would one prove this recall Einstein made the conjecture based on the Michael morle experiment that the speed of light was the same for all observers regardless of the state of motion of the source of the light or the Observer of the light relative motion did not change the speed of light that could be tested by conducting experiments looking at the constancy of the speed of light with respect to motion now that conjecture along with the other postulate of Relativity had other predictive consequences for this description of space and time and those consequences were verified think about time dilation and the lifetime of the muon so how would one prove de bry’s conjecture well brag scattering offers the possibility to test this hypothesis we could for instance compute the matter wave properties of electrons and then we might try to find a system off of which we might scatter them and see if we can see the wave properties of electrons revealed by the scattering process all we have to do is find a scattering system whose size scale is slightly smaller than or roughly comparable to whatever the corresponding matter wavelengths of an electron would be so just as x-rays scattered from crystals allows the wave nature of x-rays to reveal to us the structure of the crystals once we know the structure of crystals themselves regular Arrangements of atoms we can then look at electron scattering and see see if it reveals any wave properties of electrons for instance interference well this is precisely what was done so consider the electron with its mass of 9.11 * 10- 31 kg now imagine accelerating it up to some momentum now we’re going to be fully relativistic here we’re going to use the correct definition of momentum because we might have to accelerate electrons to extremely high speeds to achieve the kinds of properties the wave properties we would need in order to see if those wave properties exist so we’re going to use the fully relativistic momentum equation the gamma factor of the electron times its mass times its velocity which we can set by accelerating the electron Now by De br’s postulates the momentum of an electron accelerated up to some speed U is going to be related to its matter wavelength by H over Lambda e so what momentum would we need to accelerate an electron 2 to probe the scale of a crystal whose spacing is going to be somewhere around the level of .1 nanm or so well we would ideally want to achieve an acceleration that gets our wav length down to something comparable to that about .1 NM now notice that momentum according to De bry’s postulate is inversely proportional to wavelength so if we want to get the wavelength down to something the size of 0.1 nanm we’ve got to get the momentum up high to some Target value now if you crunch the numbers on this this will require an electron momentum of about 7 * 10 -4 kg m/s that doesn’t really tell us much so for instance um if we used an accelerating electric potential difference a voltage to get our electrons up to this momentum what voltage would be needed to achieve that for an electron now I’m going to leave the math to you if you would like to play around with this but you need to make sure that you’re careful and use special relativity to answer these questions remember the relationship between um energy and the gamma Factor total energy and the gamma factor for an electron that’s written down here and remember also from special relativity that that can be related to the momentum and the rest Mass of the electron okay through this equation and keep in mind also the special relativistic definition of kinetic energy you’re going to need to combine all of these things to get the answer to the question what voltage would be needed to achieve this for an electron but it turns out that this corresponds this momentum corresponds to a gamma Factor that’s actually quite modest it’s only 1.03 that’s only a small fraction of the speed of light and that shouldn’t be hard to achieve for something as low mass as the electron that corresponds to a kinetic energy of about 2 * 10 -17 Jew um and if you remember your conversion of electron volts an electron volt is roughly 10 the19 Jew or so this isn’t many electron volts worth of of kinetic energy and so if you crunch the numbers and you relate the kinetic energy to the accelerating potential that would be required to achieve that for an electron with its one unit of Elementary charge you very quickly find out that this only requires about 150 volts that is no problem at all certainly in the days when this experiment was done uh and this experiment was done in 1927 achieving 150 volt electric potential difference for electrons was quite a trivial activity in that day so that scattering experiment was famously done by by two physicists Lester germer shown on the the right hand side of the photo and Clinton Davidson shown on the left and this is in fact um a a piece of the equipment of their scattering experiment with the electron emitter and the nickel Crystal that they used as a Target in 1927 uh to do the scattering and then they looked at the pattern of scattered electrons to see if any wave nature effects emerged and what’s the most obvious wave nature effects well if you see an interference pattern in the scattered locations of the electrons that is if you see places where there are intense locations where electrons scatter to and other dark regions where they don’t scatter to then you would have some evidence for the wave nature of electrons matter wave properties could in fact be real so just as an x-ray scattering if you scan over the scattering angle of the electrons from the Crystal and if wave properties manifest then constructive and destructive wave interference should occur at different angles for a fixed wavelength and thus a fixed momentum all right so this is an analogy to the X-ray scattering uh process of course that we looked at earlier with the brag scattering um so you could what you could do of course is you could uh set your voltage to accelerate the electrons to something specific to achieve a specific momentum for the incoming beam and then you could look at different angles of scattering relative to the beam to see if you see intense regions and less intense regions of scattering um in that case the brag scattering formula just applies uh if you want to see the nth bright Fringe of constructive interference the first the second the third and so forth then all you have to do knowing the the wavelength of the thing you’re scattering is look at a specific angle knowing the the size of the crystal the space in between the scatterers and the Crystal B and then the wavelength would simply be determined using de bry’s hypothesis using the momentum of the electron but actually instead of scanning over scattering angle it in fact when you can control very easily the momentum of the electrons then it’s actually easier to Simply vary the moment momentum of the Electron Beam and observe at a fixed angle Theta so don’t move around where you’re looking just observe at a fixed angle Theta and scan through voltage which changes the momentum of the beam and thus changes the degree of the wave properties of the beam as a function of voltage and as you scan over the voltage sometimes you’ll make the electrons have just the right wavelength to interfere totally constructively when they scatter and sometimes as you keep tuning the voltage around you’ll make them interfere spere totally destructively with each other and you’ll see no scattered electrons at that same angle Theta and this is what Davidson and germer did and here’s what they saw so this is the intensity of scattered electrons versus the square root of the voltage of their instrument and what you notice is that uh there is a place of course where there’s a bright intensity Peak and then it falls off to a minimum and then there’s another bright intensity Peak at a different voltage and then then it falls off to a minimum and so forth you see that there are these uh increases in electron intensity at a certain voltage and then you crank the voltage up a little bit more and the intensity decreases down to a minimum you keep cranking it it goes up to a maximum again we are seeing exactly what would have been predicted from brag scattering combined with the matter wave hypothesis this did not have to be this way but it turns out that matter also has wav likee properties that can be revealed under the right condition I conditions just to really drive this home in two Dimensions now scanning over scattering angle rather than fixing the scattering angle and scanning over electron momentum this is what an electron diffractogram looks like you see this pattern of bright and dark spots separated by gaps here we can very clearly see that electrons will intensely build up in the scattering process in some places and and not at all in other places with big gaps in between both vertically and horizontally there are very clearly bright spots and dark spots just like a laser beam that interferes with itself through passing through two slits for instance only waves can interfere with each other in this manner and in this case it’s because the Crystal and solids like nickel for instance off of which the electrons are scattered have structures that can accommodate an easily tuned electron momentum that yields a wavelength comparable to the size of the scattering uh system or a little bit larger and that’s easy to do with even modestly accelerated electrons on a metal Target so here’s what scattering and interference tell us about the true nature of both matter and electromagnetic radiation electromagnetic radiation already has a wave equation that describes its wave nature it comes from Maxwell’s equations so again we come back to this question well if matter can be revealed through experiment and observation to have wave properties under certain conditions then where’s the wave equation where’s the equivalent of the thing that comes from Maxwell’s equations that describes the wave properties of electrons protons neutrons whole atoms Etc where is it what is it you know electromagnetic fields and light propagating through empty space these are the solutions to Maxwell’s equations if we had an equivalent matter wave equation what will the solutions to the matter wave equation look like and these are all excellent questions and these are the questions that after these kinds of experiments had been done physicists really began to struggle with in the 1920s and into the 1930s now we’re going to get to the answer to this question very soon but we have some hints for ourselves already the solutions to the matter wave equation whatever they are whatever specific form they take for a very specific system an electron scattering off of a nickel Crystal an electron confined in a hydrogen atom whatever the solutions to the matter wave equation are going to be they’re going to be probabilistic in nature and we can already see see this revealed in the scattering intensity patterns from experiments like brag scattering the Davis and germer experiment and so forth the intensity of the scattering pattern seems to have everything to do with the probability of finding a particle at a certain location in space and time after the scattering process has occurred and that probability is controlled in some way by the original wave nature of the thing that experienced in this case the scattering phenomenon probability whatever our wave equation describes it’s going to be probabilistic in nature waves are a spread out spatial and temporal phenomenon there’s no one place where a wave is and where it is not there are many places where a wave can be and probability and the wave equation whatever it is are going to play a fundamental and deep role with one another in describing matter and radiation so let’s review in this lecture we have learned the following things we’ve learned about the structure of the atom as it was known in the late 1800s and very early 1900s cherry-picking our way through just a few scenes in the great story of the atom we’ve learned about how matter itself can have wave aspects to Its Behavior first hinted at although no one really understood this at the time by the nature of atomic Spectra and the black body Spectrum now it was Lou de Bry who conjectured that the same wave and momentum and energy descriptions that could be discerned from the black body spectrum and special relativity equally applied to matter like electrons that was a conjecture and that was experimentally verified using scattering experiments of matter off of other matter the the target had size scales that were comparable to the matter wavelength we were trying to assess and in fact tuning the beam of electrons to the right momentum to get the desired wavelength we we actually see that the wave properties manifest in the scattering experiment if electrons did not have wav likee aspects to their behavior we would not have seen the defrags that can be discerned from scattering electrons off of crystal in targets so that has also taught us how to conduct experiments both with light and with matter to reveal the wave aspects of Matter’s behavior and Compton scattering offers us a glimpse of how to reveal the particle aspects of the behavior of radiation and matter all we have to do is get the wavelength of the phenomenon to be much smaller than the size scale of the thing we’re shooting it at and the particle nature should manifest again these these ideas are going to play key roles going forward in everything we’re going to do with matter and [Music] radiation in this lecture we will learn the following things we’ll take a look at mechanical and electromagnetic wave equations to inspire our thinking about matter waves we’ll learn how to infer the nature of the wave equation for matter from an exercise involving the conservation of energy we’ll look at the meaning of the Waves described by the matter wave equation the so-called Schrodinger wave equation and finally we’ll look at the limits of absolute knowledge that are imposed by the wave nature of matter let’s take a peek at waves beginning with classical mechanics an introductory physics class would have taught you about oscillatory phenomena and a wave is just another kind of oscillatory phenomenon that can be described by time and space dependent functions so in introductory physics we learn that a Time varying oscillation along one dimension for instance a mass on the end of a spring that’s bouncing back and forth on a friction of the surface or up and down in a gravitational f field can be described as simple harmonic in nature and this allows us to write a mathematical function involving for instance the cosign of frequency and time and an offset from the amplitude being maximal at zero this is a typical equation you might see in introductory physics to describe an oscillatory phenomenon now here Omega is a special kind of frequency it’s known as the angular frequency and it’s given in terms of the period of oscillation which is a more familiar concept the period of oscillation often denoted by capital t is simply the time required for one cycle of the phenomenon to conclude the angular frequency is 2 pi divided the period and this essentially means that it’s 2 pi times the frequency of oscillation of the phenomenon angular frequency is the rate of angular displacement if we were to model the repetitive Behavior as going around a circle completing one cycle of the circle 2 pi radians as completing one cycle of the phenomenon now for all considerations here let’s set the phase angle the degree by which we would need to offset the cosine function to get the amplitude to match the initial conditions of our oscillator let’s set that phase angle to zero let’s set F to zero to simplify this equation if you then extend the phenomenon to two dimensions and imagine a long string for instance made from a bunch of tiny little masses each tiny little Mass bound to its neighbor as if by a little spring and we pluck the string that is we displace part of the string vertically then let it go and it bounces up and down and up and down the vibration of a string now we have a distortion in y That’s traveling along X in time and the solution to that problem looks something like this that the displacement in Y at any position X and at time T is is given by some initial y times the cosine of a spatial part K * X I’ll come back to K in a moment minus a temporal part Omega T which we’re already familiar with from the equation up here on a simple one-dimensional o oscillatory phenomenon now what is K well K in this context is known as the wave number and it’s defined by 2 pi the number of radians in a circle divided by the wavelength of the phenomenon so you can think of this as describing the number of cycles per unit distance in the phenomenon whereas the angular frequency is the number of cycles per unit time but these functions answer some question and if they’re the answers to a question what is the question well they are all solutions to a wave equation that is an equation that describes how changes in space relate to changes in time now the one-dimensional mechanical wave equation at least the one that tells you about vertical displacements and how they uh vary as a function of horizontal position and time is simply given by the second derivative with respect to time of the vertical amplitude Y and that’s equal to the uh constant squared times the second derivative with respect to space of the vertical displacement Y and Y of course is a function of x and t so if you try applying this wave equation to the solution on the previous slide you’ll see the following first of all the left hand side is the second derivative with respect to time of the vertical displacement y plugging in our function for the vertical displacement we would get uh this equation now the second derivative with respect to time of our description of the vertical displacement versus X and time taking one of the derivatives of this function results in us having to do the derivative twice first of the cosine function and then of the argument of the cosine function well the derivative of cosine is going to be the negative s function and the derivative of the argument is going to return a negative Omega a negative angular frequency multiplier and so we’ll be left with this Omega time the original amplitude y Sub 0 time a sign function of the original argument we have to take the time derivative of this one more time if we do that we want up with an additional factor of negative Omega out in front of the original cosine function and so at the end of this we wind up with an equation that’s just Nega Omega 2times the original function y of x and t now let’s handle the right hand side of the wave equation this is a constant term squared times the second derivative with respect to X position of the displacement y we plug in our function for y again now taking the first of the two spatial derivatives that we have here we wind up with a function that looks like this so the first derivative of cosine returns negative s and the derivative of the argument multiplied by that gives us a factor of K and so we wind up with this and we have to take the spatial derivative of this one more time and at the end of this we wind up with an equation that’s negative the constant squar time the wave number squar time the original function y of x and t Now setting these two things equal to each other as would be required by the wave equation we find out that the function y of x and t drops out of both sides of the equation leaving us with this simple relationship between the angular frequency squared the constant squared and the wave number squared and if we take the square root of all of this we see that we have Cal Omega over K and this is one of the velocities that’s present in mechanical waves the speed of the mechanical wave is given by the ratio of the angular frequency and the wave number now this is a very quick tool T of a solution to the wave equation and how you can see that it does solve the wave equation and how when you plug it in it returns a relationship between frequency squared speed squared and wave number which is related to wavelength squared a dedicated waves course would spend a lot more time on this motivating the derivation of the wave equation itself from a simple model of a vibrating string or something like that motivating how one sets up and solves that equation and then showing you what relationships emerge from Solutions under different conditions here I am merely trying to motivate some thought process about wave equations and the resulting relationships that can be derived from the application of those wave equations to their Solutions so sticking with mechanical waves for a moment let’s think a little bit about the energy that’s contained in that wave so again our model here is a mechanical Distortion of a physical medium and that Medium that I have in mind here might be a string made from many little bits of mass all hooked together as if by little Springs each with a spring constant and so forth so if we Model A String that way we can think about the string as having a total mass capital M and a total length capital L and the little bits of mass it’s made from are all equal in size and uniformly distributed along the length L and so this string has a uniform linear mass density given by the Greek letter mu which is mass / length Big M / by big l no matter what chunk of the string we look at every chunk will have the same mu because it’s a uniform distribution of mass and so we can always relate mu to the mass in that chunk and the length of that chunk if we then vibrate the string such that a given part of it at some time T in location X will have a small mass m and that mass will have a vertical velocity VY that velocity will uh oscillate transverse to its length just as the displacement oscillates transverse to its length that tiny little chunk of the string will have a length DX a differential of X and a little mass m that can be related to the length DX by the linear mass density so the Little M divided by the little DX would be mu because it’s a uniform distribution of mass so that means that m is equal to Mu DX in every place we see M we can replace it with this product and vice versa the kinetic energy of that little chunk in a moment of its motion uh as the string vibrates will be defined by its mass and its velocity at a given moment in time T So taking the classical definition of kinetic energy we’re thinking about a mechanical wave here so let’s think classically for a second we have the little bit of kinetic energy possessed of by that little bit of mass is going to be 1/2 time its mass which is Mu DX time its transverse velocity squared vy^ 2 well VY is just the Der I ative of the displacement in the Y Direction with respect to time Dy DT and we’re going to square that so we wind up if you plug in that derivative uh and and do that as we did on the previous uh page we wind up with this equation for the little bit of kinetic energy possessed of by that little bit of mass that makes up the string so this is the term for the little bit of kinetic energy possessed of by that little chunk of mass now because it’s hooked to its neighbors by springy things each mass is linked to the next by you could imagine a little spring with a spring constant um the potential energy stored in that same chunk of mass will depend on the elasticity of the string that is the stiffness of the little springs that you could imagine hold one chunk of Mass to the next so thinking of the string this way as concocted of a whole bunch of little masses M connected to their neighbors by little Springs with spring Constance Kappa then as an introductory Mechanics for oscillatory phenomena masses on a spring you can relate the angular frequency squared to the ratio of the spring constant and the mass that is to say the spring constant is related to the mass times the angular frequency squared for an oscillating Mass on the end of a spring with spring constant Kappa so the little bit of potential energy that’s stored at that location at X in that little mass m is just 1/2 time the spring constant times the displacement from equilibrium squared well that’s just going to be little chunk of potential energy held by that little bit of mass 1 12 * Kappa * the displacement squared we go ahead and substitute for Kappa with M Omega s and we can substitute for M with mu DX and then finally we can put in our equation for the displacement and that now involves the cosine and that whole thing is squared so we have kinetic energy we have potential energy let’s look at the total energy possessed of by this little bit of mass m so that little bit of mass m will have total energy de composed of kinetic and potential added together at any moment in time we have the expressions for those two things DK and du and you’ll notice that if you pull out all the multiplicative factors um you’ll be left with the same coefficients multiplying a s term squared and the same coefficients multiplying a cosine term squared so you wind up being able to pull all those multip itive factors 12 and Y and Omega squ and mu and DX out in front of a sum of a sin squ and a cosine squar and there’s a trigonometric identity that comes into play sin squ plus cosine squ is 1 and so the S and cosine functions vanish from the total energy of this little chunk and all that defines its little bit of energy that it possesses at any moment in time is that the total energy of that little chunk is constant it may be divided differently between kinetic and potential but the total energy of that chunk of mass that makes up that string that’s vibrating is constant and it’s given by this number here and again it depends on angular frequency linear mass density the length of the element the initial displacement of of any element of the string and so forth this is just the energy stored in this little piece of mass m at a location X in space and T in time now note that the total energy depends on the square of the angular frequency the presence of the Omega squ multiplier tells us something about the number of time derivatives or the product of the number of time derivatives that had been present in the original equation for total energy remember we had to square the time derivative of the wave function that solves this mechanical wave equation and that yielded an Omega squar term in all of this so you can see that there are shades of the number of derivatives uh left over as sort of vestigial elements of the energy equation for this little bit of mass m so here are the key takeaways from this look at mechanical waves the mechanical wave equation relates the second derivative with respect to space and the second derivative with respect to time you would derive its form in a dedicated class on waves but we don’t have time for that here nonetheless I want you to take away the big lessons from this now the solutions to the wave equation when acted

    upon by the derivatives in the wave equation yield squares of the angular frequency Omega and the wave number K recall we had an equation relating Omega squ k^ 2 and the speed of the wave squared the energy equation for the wave or a part of the wave is sensitive to the number of time or space derivatives in the underlying equation and these manifest as multipliers like Omega squar and one might think about the presence of the squares of these quantities like Omega or k as indicative of the underlying wave equation that you needed to have solved in order to get these Solutions in the first place now let’s take a quick look at waves in electromagnetism this is the next classical wave equation that was discovered in the history of physics and it’s derived from Maxwell’s equations for electric and magnetic fields so the wave equation that results describes the propagation of oscillating electric and magnetic fields in in empty space for instance although it’s not limited to only empty space and that wave equation can be written as follows that the speed of light in empty space squared times a spatial derivative squared minus a Time derivative squared all of this acting on an electric field Vector is equal to zero well again notice that like the mechanical wave equation we’ve got second derivatives in space and second derivatives in time all acting on a solution e Vector whose form we don’t necessarily know beforehand but if we solve the equation we find out that the solutions to the oscillating electric field components look very similar to the mechanical waves in that they have a vector amplitude instead of a scalar amplitude a cosine function a spatial piece and a temporal piece of the argument of the cosine function now I should note that yes there is an identical equation for magnetic fields that can be derived from Maxwell’s equations but you can think of it as a bit redundant it describes the action of the oscillating magnetic field but can be related through the mathematics of the solution to the electric field and so if you can remember the electric field wave equation which I’m not asking you to do but will come in handy in a dedicated course on electromagnetism later um you can very quickly work out what the form of the magnetic field wave equation is and relate the electric field to the magnetic field although they are in independent directions of one another the field strengths are not independent of each other um this is an interesting problem in that electromagnetic waves are two component waves they have an oscillating electric component and an oscillating magnetic component they have two kinds of information that are stored in the wave and this is a theme that we’ll return to later when we look at matter waves now applying the wave equation written here to the solution written here similarly yields quadratic multipliers of k^ squ uh and Omega squar so for instance we find that the speed of light squared will be equal to the angular frequency squared divided by the wave number squared and this latter relationship turns out to be a direct consequence of the massless nature of light that we learn from Plank’s relationship for energy and momentum and wavelength and so forth and the special relativistic relationship between energy and momentum for massless particles uh this allows its speed to be related directly to its frequency and wavelength with no other multip multiplicative or additive factors involved so let’s revisit that the relationship between frequency and wavelength through a wave can be directly related to the energy present in the radiation Quantum the photon the photon is the particle like aspect of light’s Behavior and the electromagnetic wave is the full wave description of light’s Behavior so for example from above and from our previous look at electromagnetic waves we know that the speed of light can be related to the wavelength and the frequency of light as follows wavelength Lambda time frequency F now to get angular quantities shoehorned into this equation like Omega and K what we can do is we can multiply Lambda F by a clever number one so I’m going to multiply by 2 pi ided 2 pi if I group The 2 pi in the numerator with f i get Omega the angular frequency 2 pi F if I group The 2 pi in the denominator with Lambda I get k k K is just 2 pi over Lambda so I wind up with the equation that the speed of light is Omega / K the angular frequency divided by the wave number now recalling that Plank’s relationship for the energy and frequency of light related by plunk constant is E equals HF we can play that same game and shoehorn angular quantities in here by multiplying HF by a clever number one 2 Pi / 2 pi grouping the 2 pi in the numerator with f the frequency in the numerator and taking H and dividing it by the remaining 2 pi in the denominator we wind up with this compact equation for the energy as related to the angular frequency H bar is to denote H over 2 pi and it is known as the reduced plunks constant remember that plunk constant has a value of 6. 626 * 10- 34 so all you do is take that that divide by 2 pi 2 pi is approximately 6 and so you wind up with a number that’s about 1 * 10- 34 um it’s very convenient for all of these angular wave Concepts to carry H bar around rather than H and 2 pi so it’s very nice to Define this reduced plx constant notationally as an H with a line through the vertical part of the H now the momentum of the quantum is given by P = H over Lambda and again if you insert a clever one in that you’ll find that this is equal to H bar * K the wave number what’s nice about this is it kind of puts e and P for an electromagnetic wave on equal footing e is HR Omega p is h k these are a little easier to remember than the H over Lambda and H * F thing at least I find them more convenient once you feel more comfortable with angular frequency and wave number as angular concepts of oscillatory motion recalling that the speed of light for our electromagnetic waves is given by Omega over K that is the angular frequency divided by the wave number if we substitute in for those two quantities with their energy and momentum Expressions we recover the special relativistic relationship for light as a massless phenomenon E equals P * the speed of light so we’ve exactly recovered the Einstein energy relation for light a massless phenomenon so let’s take an overview of the wave equation In classical mechanics and electromagnetism they involve second derivatives of both space and time so you see both uh d^2 dx^ 2 and d^2 dt^ 2 in these wave equations now we could have inferred that from the results of the wave equations as their application results in squares of time and spatial frequencies so for instance Omega squares and K squar appear in equations that result from the application of the wave equations the energy equations tell us the proportionality of frequency to wavelength as well as other useful information like that and all of this kind of leaves you wondering if we have a a mechanical wave equation and an electromagnetic wave equation where’s the matter wave equation where is the evidence for that from The History of Science up to the early 1900s that’s the problem since its presence could not be inferred directly from previous measurements in the same way that Maxwell’s equations were inferred from kul’s law gauss’s law and other things like that ampers law and ultimately when composed together in the form of Maxwell’s equations led to the wave equation for light and in the same sense that considering a string is a bunch of masses Bound by Springs that are uh you know uh tugged up and down and then caused to vibrate by being stretched will lead you to the wave equation where where is the exercise that leads to the wave equation for matter and that didn’t really exist up through the 1920s the early 1920s so the question I would put to you is is it possible given other equations that we could infer from what we know its form so if we know things about particles and waves like wavelength and frequency and energy and momentum and we know the relationships between those things can we figure out the wave equation using all the information we have from Atomic Spectra the black body radiation Spectrum the photoelectric effect compon scattering and all that other stuff can we figure that out so one can Glimpse the hints of the underlying but unseen matter wave equation um sort of like seeing a shadow cast on a wall by a complex object that’s out of your line of sight but whose Shadow is projected onto a wall giving you hints about the real shape of the thing you can’t see and we can get that glimpse of the shadow of the matter wave equation by considering the conservation of energy for a particle that’s acted upon by an external Force now such a particle in classical physics would have of course a kinetic and a potential energy with specific forms depending on the the force involved in the problem now conservation of energy would require no matter what that the total energy of that particle is going to be the sum of its kinetic and potential pieces now sticking to classical physics for a moment because employing special relativity to derive the rule of matter waves involves a whole skill set of mathematics that really can’t be expected of you at this stage of a University career we’re going to stick with purely classical low velocity matter uh even small matter like electrons we’re going to have to consider moving at relatively low velocities not very close to the speed of light now obviously that doesn’t cover the full domain of phenomena of small particles like that but it’ll get us going and it will allow us to solve a great number of of problems that are actually within our grasp once we figure out the matter wave equation so using classical physics we can write the kinetic energy as 12 mv^2 and we’re going to leave the potential energy unspecified I’m not going to worry about what the force is that’s acting on this let’s just say it has a potential energy U for now and leave it at that now we have relationships for matter waves between total energy and frequency and total momentum and wavelength but we don’t have moment mum in this equation so let’s get momentum into this equation and the way we do that is we multiply yet again by a clever number one so if we insert a number one in the kinetic energy equation that is just m / M then we get an m^2 V ^2 in the numerator and MV is just momentum so we wind up with momentum classical momentum squared in the numerator divided by twice the mass of the particle plus its potential energy U so we have our kinetic term now expressed in terms of momentum and we still have our potential energy term and they’re summed together to get the total energy e so let us now inject de br’s postulates into this equation that is eal HF which is equal to H bar Omega and P equal H over Lambda which is just equal to H bar k again employing all these nice angular quantities and if we do this we now obtain the shadow cast on the wall by the matter wave equation and that is H Omega equal h^ 2 k^2 2 m + U do you see it the single power of Omega on the left side indicates to us that a shadow is cast here by a single time derivative that’s acted on some solution to the underlying wave equation we don’t see the solution and we don’t see the wave equation but we see the result of applying those two things and that is a single power of Omega on the left side the k^ squ on the right hand side implies that there’s a second derivative with respect to space in the wave equation acting on the solutions to that equation whatever they may be so we have a single time derivative and a second space derivative that result in k^2 and Omega so let’s go ahead and take that equation with our hypotheses our hunches about what the underlying wave equations form might look like and let’s try inserting those hunches into this equation above assuming that an appropriate derivative has acted on an unknown solution to yield a single Omega or a k s so if we do that if we take our hypothesis about the number of derivatives acting on some unknown function that solves the wave equation yielding this relationship we wind up with the following equation on the left H Bar times the first time derivative of an unknown solution which I’m denoting with the Greek letter SII and it’s a function of space and time we’re only considering Motion in one Dimension right now on the right hand side we have h^ 2 m time the second derivative with respect to space of that same function s of x and t and of course we have the potential energy of the the matter wave still tacked on to the right hand side over here and I’ll return to that a little bit more later so this looks promising it has all the Hallmarks of a wave equation but it’s different from Maxwell’s equations or a mechanical wave equation in one key way it has a second derivative in space but only a single or first derivative in time this will have implications for the kinds of functions that can solve such equations and the solutions to this as I’ve said are denoted by the capital Greek letter S as a function of x and t in one dimension so let’s explore solutions to this equation and as we do this we’ll find that we are missing at least one key piece of the underlying equation we’ve guessed at the form of the object casting the shadow on the wall and we may have guessed incorrectly so let’s begin by guessing the form of the solutions to our equation and then plug them in and see if we recover our energy conservation statement so to simplify matters let’s consider for now free particles that is particles free from external forces uh and that is most simply expressed by setting U to zero the particle has no potential energy associated with it we’re only considering motion at a constant velocity so that’s a fixed kinetic energy which then relates to the total energy of the particle okay well to solve our wave equation we need a kind of function that when acted on by a derivative transmutes into another version of itself so for instance in the old wave equations the mechanical wave equation the electromagnetic wave equation we had second derivatives acting on the solutions signs and cosiness were great for that because after two derivatives they return to their original selves so that’s what we did for traditional waves we used signs and cosines so let’s take a guess let’s guess that s of x and t is just one of our mechanical wave Solutions a cosine of KX minus Omega T so we’re not doing anything original here we’re just taking mechanical waves getting inspired by them and blindly applying that idea here so let’s write down our wave equation that we’ve guessed H bar time the first derivative of our solution with respect to time h 2 m * the second space derivative of our solution let’s go ahead and plug our guess at the solution in all right so we plugged our function in on both sides now and then go ahead and work out the derivatives and you should find the following conclusion that we wind up with a positive sign function on the left now multiplied by Omega we get a minus sign from the derivative of the cosine but we also get a minus sign from the derivative of its argument with respect to time so this winds up being net positive on the left side but over here the two derivatives of the cosine uh yield an overall minus sign so we have a positive s function and a negative cosine function we we can’t cancel out the signs and cosiness on either side it doesn’t recover our original energy conservation expression it doesn’t work to solve the equation the left side and right sides don’t give us what we would have expected based on where we had derived this from which was the conservation of energy and if you try just a sign function it will similarly fail so what if instead we combine s and cosine functions what if we add together SS and cosiness because when you take the first derivative of something that’s a s plus a cosine you’ll wind up with a sign and a cosine in the result and similarly with the second derivative maybe a superposition an addition of s and cosine will do the trick all right let’s go ahead and try that now I’m trying the barest simplest superposition I’m assuming that they have the same multiplicative coefficient out in front a whatever that is and otherwise it’s a cosine of the same argument and a sign of the same argument all added together when you’re playing around with solving equations whose Solutions are not known to you a priori that is with prior knowledge beforehand guesses like this will get you through the process and you should always try to start with the simplest guess and work your way up in complexity so for instance it may be these coefficients aren’t supposed to be the same but don’t start by assuming that try assuming them and then work your way up to a more General set of solutions as you get more comfortable with solving the problem so we write down our guess at the wave equation again we plug in our new choice of the solution we work through the derivatives and we’ll get the following equations now what I want to do is I want to reshuffle the term order on the left side I want to get the cosine first and then the S second so I’m just going to move these terms around without changing anything about the equation and this is the final form of the equation I get I get get a a negative cosine and a positive sign and I get a negative cosine and a negative sign over here I can’t cancel these functions out I can’t recover the energy conservation expression we started from it’s a lot closer than we were with just cosine but it’s still no good we’ve got problems with the plus and minus signs and all of this it’s a mess and what’s the real problem we keep running into here well we keep generating minus signs from the single derivative of only the cosine on the left side the derivative of the sign gives us something positive but the derivative of the cosine gives us negative s function and that’s what’s really hurting us here our goal at this point is if we’re going to figure this out we’ve got to find a way to get rid of this minus sign we keep picking up and at this point it helps to remember that there are other kinds of numbers than real numbers in the world so everything I have done up till now is predicated on the assumption that these Solutions and perhaps even the wave equation itself can only be composed from real valued numbers you know like 1.1 or 2.3 or Pi or 75.6 those are all numbers that can manifest in the real world if somebody says look I’m you know I’m going to give you -76 it means that they’re going to take $76 away from you right that has real consequences negative numbers are are real things in the world around us but there are other kinds of numbers that don’t have physical meaning in the world around us and it’s important to remember that and they fall under at least one class of these numbers is a category known as imaginary numbers and in particular it’s helpful at this point to remember some of the behavior set of the archetypal imaginary number I which effectively serves the role of being the number one in the imaginary Ary number set so let’s pause for a moment and revisit imaginary numbers which presumably you have seen in some context prior to this course let’s recall that the imaginary number I is a special kind of number one but with no physical interpretation um so I is defined by the question what squared equals -1 and the answer to that is I and I’s value would be the square root of ne1 which is nonsensical if somebody told you you know give me I dollars you would not know what to do with that because you don’t know how to calculate the square root of a negative number and then turn that into a real dollar value that you then hand that person now this number can exist in a mathematical Universe where it has plenty of self-consistent rules that don’t violate any of the axioms of mathematics that you’re uh that you’re playing with and in fact I doesn’t violate any ma axioms of mathematics at all so it’s perfectly mathematically tenable even if it’s not physically realizable around us in the world its existence mathematically has consequences though so for example uh you know going back to the question that leads to I you can take I and multiply it by itself that’s just i^ 2 if we plug in for i^ 2 we have the sare < TK of -1 * the < TK of1 and by definition that has to yield ne1 and so i^2 = 1 I is the answer to that question you know the square of what number gives me1 but you’ll notice that I S has the ability to add a stray minus sign where none would have been present before and I is the number one in the imaginary number world it is the unit on which you can build all other numbers uh integers for instance in imaginary space so the presence of extraneous minus signs when trying to solve equations using functions as we’ve been doing with our guest at the matter wave equation could actually be an indicator of something that we are trying to use real valued numbers and solutions only but maybe the problem we’re trying to solve is too complicated to only admit real numbers that it requires the ability to store additional information that real numbers alone cannot accommodate those can be accommodated by complex numbers these are numbers that contain both real and imaginary components so for instance the complex number Z is made from two real numbers X and Y but Y is multiplied by the number I and so I Y is imaginary X is real this is a combination of a real and an imaginary part this is what is known as a complex number and it stores twice as much information as a single real number because it’s got this extra component over here here and if this is reminding you a lot of vectors like a vector Z being equal to a component along the x axis X and the Y component y that’s good because a lot of the basic ideas of vectors translate into complex numbers and give us some confidence about how we can get useful information out of complex numbers so let’s explore complex solutions to this equation so let’s start by trying a guess at a complex solution it’s got a real valued part A cosine and it’s got an imaginary part a i sin so all I’ve done is I’ve added the number I to the sign part of my solution so again here’s my guess at the wave equation I plug in my solution I take my derivatives and I get the following results now again I’ve got s and cosine out of order on the left side so if I shuffle them around to get cosine first and sin 2 and try to map that onto the right I see that I still have a problem I’ve got negative I cosine and negative cosine here I’ve got a sin and I’ve got negative a I sign over here I can’t just naively cancel these things out that doesn’t really work um so I have a problem I I still can’t get this to work out if I were to try to move an i or a negative I from the left side to the right side I still wind up with a stray eye where I don’t want one and and that isn’t going to work for this problem so it’s ridiculously close to working out we’re so temptingly close to solving this problem right now but something is still missing some salt is missing from our soup here that we used to try to mimic the the the recipe for the matter wave equation so let’s see if we can get one more opportunity to think about our assumptions by moving some minus signs around to see if we can find a clue that will resolve this little puzzle puzzle so I’m starting with the last equation from the previous slide here I haven’t done anything to it yet but what I’m going to do is I’m going to pull the minus signs on the right hand side out as an overall multiplier and then multiply both sides by -1 so I’m basically moving the minus sign here to the left side of the equation so this is the net effect of doing it notice all the terms over here on the right are now positive uh the minus sign that would have been on the right is now moved to the left and overall multiplies these terms and if I go ahead and do that if I multiply it through I see that I now have I cosine here and negative s here and I have cosine here and I sign here and that has a kind of weird Rhythm to it and that’s the Clue the difference between the function on the left hand side and the function on the right hand side boils down simp simply to a missing factor of I on the left if we had originally guessed that the wave equation was I times a Time derivative of the wave function all of this would work out we would actually get the left hand side being equal to the right hand side so if we had just done this at the beginning if we traded our original guess at the left hand side of the wave equation which was all predicated on our bias for real numbers H times the first time derivative if we traded that for negative itimes the first time derivative then we’d solve our problem and let me show you that so to wit let’s revisit our guess at the form of the wave equation I’m leaving the solution completely intact that’s still the same as what I had a moment ago and what I’ve done is I’ve changed the left hand side of our guess at the wave equation to be negative I times the first time derivative of s I’ve done nothing to the right hand side plug in the solutions do the derivatives play the game again where we Shuffle the terms on the left hand side to get cosine first and sin 2 move the minus sign on the right over to the left okay and distribute that into these terms now multiply the Nega I into the function the negative I multiplied by this I gives me * -1 which is 1 positive cosine the I multipli the S function cancels out the minus sign here and leaves me with positive I * s cosine I cosine I the left and the right sides level out and the function is now the same on both sides and it can be cancelled out mutually on both sides of the equation so Bazinga as a famous TV character might say at a moment of Revelation we’ve done it we’ve cracked the underlying form of the matter wave equation let’s prove that by seeing if it returns to us the energy conservation relation that we started from we’ve only shown here that we have a wave equation that when acting on our guess at the solution to a free particle wave Returns the solution with a bunch of multiplying coefficients and so we can cancel a function out of both sides but we don’t know that we’ve recovered the conservation of energy equation we started from so let’s see if this all works out let’s review what we did starting from the total energy of a free particle which is H Omega = h^2 k^2 2m we constructed a wave equation that had the right time and space derivatives in it to return these factors of Omega and k^2 we played with oscillatory Solutions and we learned that only complex functions will satisfy an equation like this and from this we inferred a missing imag inary number from our original guess at the equation we should have had a negative I lurking on the guess of the left side of our equation the whole time in order for this thing to have viable solutions for the free particle that work out the solution guesses that we made for free particles are of this form they’re complex they have a real part and an imaginary part and plugging them into the wave equation yields this relationship H Omega equal h^2 k^2 2m which was precisely the energy conservation equation for a free particle that we began with so it’s entirely a self-consistent exercise and that should be deeply mathematically satisfying even if you’re not completely comfortable with the process by which we arrived at that but I promise you that this is not the first time assuming this is the first time you’ve ever seen this kind of strategy done for solving an unknown equation with unknown Solutions this is not the first time you’ll bust out this trick in your life if you ever have to solve hard problems so this trick is actually useful even if it feels a little clunky at first and I I hope it conveys to you the sort of incredible exercise that must have been required originally to derive this wave equation in the 1920s we’re doing this with the benefit of a century of hindsight but our predecessors did not have the benefit of this much hindsight so while I am able to look at resources and come up with ways of explaining the way of equations form to you at this level of a Physics course the people that were involved in trying to write down the matter wave equation in the 1920s did not have the benefit of this much hindsight and so they were struggling with immense difficulty in a different mathematical landscape than we are in now so putting back the potential energy piece of this this is the full one-dimensional what is known as schinger wave equation we have another function that we can tack on to the right hand side here V of x and t which is known as the potential term that function acting on the wave function sigh uh would return the potential energy of the matter wave in this case you and this is one of the most important equations in history where I’ve added back the potential energy term to complete the equation for a particle in one dimension the one-dimensional shring or wave equation is one of the most revolutionary insights into the universe in the history of our species named after Ain Schrodinger the first person in 1926 to fully determine the form of the matter wave equation now he was doing this using a whole different set of mathematical algebraic and calculus constraints that is this is not how he derived this equation but this is sufficient at this level to motivate where an equation like this might come from he was using mathematical guidelines to infer the nature of the equation that we simply don’t have the mathematical foundation for at this stage of a Physics course to do now you might look at this equation and think well this is horrible I hate time derivatives I hate space derivatives and there’s two space derivatives and one time derivatives and these functions are awful and there’s imaginary numbers and okay that might seem daunting to you and and perhaps it is but there’s actually a much more difficult part to this equation and that is that the hard part of the shring or wave equation is the solving of the equation to specific situations the real challenge of this equation is not this equation itself although it doesn’t look very pleasant I know but rather in the finding of solutions the size of x and t to this equation given different potentials V and so forth now we’ve effectively solved the free particle case and we’ll explore the solutions to that for the rest of this lecture uh but if a potential energy term is present so in other words if there are forces in a problem and you can’t ignore them and you have to include them and those result in potential energy or changes in potential energy for a particle in a problem you have to completely rework this equation and find the correct solutions that satisfy the equation equation with the correct potential term added on and that is much more difficult and that is effectively what we’ll spend the rest of this course learning how to do for different situations that map on to the physical world now I should say that solving this equation for different situations is what has allowed us to understand semiconductors it is part of what allowed the revolution in microelectronics to happen in the first place solving this equation for Atomic system leads to an understanding of where chemistry comes from and specifically doing this on a grand scale is the heart of physical chemistry solving this equation for information systems is what results in Quantum Computing and Quantum information which is a hot subject these days and is one of the many technological frontiers of our species I cannot understate how important the shringer wave equation is for all the foundations of the world we live in today but also all the potential for the great discoveries of tomorrow [Music] in this lecture we will learn the following things we’ll learn about a classical model of the atom synthesizing two semesters of introductory physics with some of the concepts that we’ve been exploring in this course we’ll learn about how to impose specifically the matterwave hypothesis on on the atom and see if we can make some useful predictions about atoms using this structure the so-called bore model of the atom let’s briefly revisit the key observational features of the hydrogen atom at a macroscopic level atoms in general when excited by an ionizing electric potential emit not a continuous rainbow of colors but rather a discrete set of color colors the so-called atomic emission spectrum shown here is the atomic emission spectrum for the hydrogen atom it has a characteristically bright red line which can be seen in the image over here on the right it’s got a blue green or cyan line which can also be clearly seen in the image over here on the right it’s got a darker blue line and a violet line and those are a little bit harder to see you can more easily see the dark blue and the Violet line in this part of the image on the right now while the atomic emission spectrum can be readily revealed by ionizing a gas in a sealed tube and then passing the light through a system that will spread it out revealing the rainbow of colors that makes up any kind of light The Mystery of the atomic spectrum goes deep each atom has a characteristic Spectrum it’s Unique to each atom that we know of in nature hydrogen is different from Helium helium is different from lithium each each of them has this pattern that’s their own we do not understand the origin of this using classical Notions of energy and momentum and matter but at last we are ready to confront this last mystery left over from the 1800s using not only what was developed in the first two semesters of physics but what we’ve been learning in this course now let’s be more numerical about the hydrogen emission spectrum we have a red line a blue green or cyan line a dark blue line and a violet line these have Associated wavelengths for the photons that carry each of these colors of light to our eye the red line for instance has a wavelength of 656 nanm the cyan or blue green line has a wavelength of 486 nanm the dark blue line has a wavelength of 434 NM and the Violet line has a wavelength of 4 110 nanm for something we’ll do later in this lecture it’s worth noting these numbers down on a piece of paper go ahead and pause the video write these four numbers down noting the colors that go with each of them and let’s save that information for a little bit later now it was Johan Balmer who worked out the mathematical relationship between these lines in 1885 these are the lines of light from the atomic emission spectrum of hydrogen that are visible to the uned human eye there are of course ultraviolet and infrared radiations from ionized hydrogen we won’t talk about those here but they’re represented in other Spectra the Balmer spectrum is the one that spans the range of light wavelengths that are visible to the human eye now balber noted that the wavelength of each of these lines is given by a simple formula a constant btimes this ratio an integer n^ 2 / by that same integer n^ 2us 2^ 2 or 4 plugging in N = 3 4 5 6 Etc Balmer was able to show that there’s a clear mathematical relationship between these colored lines here the constant B was determined to be 364.5 nanometers and all one has to do to calculate the Balmer spectrum is know this number and this formula and use the integers greater than two and you can reproduce the wavelengths present in this picture but why why is there a clear mathematical relationship between these colored lines emitted from hydrogen and why are there similar mathematical relationships between the colored lines emitted from other atoms when ionized these are deep questions Mysteries left over from the 1800s that physics in its day could not explain now let’s recall the earlier models of the atom that we explored in a previous lecture Joseph John or JJ Thompson after discovering that cathode rays were really just electrons uh and had masses that were far smaller than the lightest known element at the time hydrogen constructed his Thompson model of the atom imagining that the electrons with their negative electric charges were embedded in a larger swath of positive charge spread out in space and if one were to fire particles through such atoms they would mostly miss the electrons which are very tiny and pass almost cleanly and undeflected through this diffuse positive electric charge Ernest Rutherford and his colleagues however revealed by scattering alpha particles off of thin metal foils that in fact sometimes the alpha particles would bounce almost directly back at the apparatus that had fired them at the metal in the first place and this implied that there was some kind of densely packed small core of positive charge at the heart of each atom surrounded by orbiting electrons as if planets going around a sun this model helped explain why while many of the alpha particles would pass through the thin foil relatively undeflected occasionally one of them would suffer a collision with his densely packed positively charged nucleus of the atom and suffer an immensely disruptive Collision some of which could send the the alpha particles coming almost straight back at the source from which they had been emitted now all of this was happening in the very late 1800s with Thompson’s work and the very early 1900s with Rutherford’s work and as you’ll see as we learned more about the atom as people thought more deeply about the implication of MOX plunk adoption of the quantization of energy to explain the black body problem and Albert Einstein’s adoption of that same notion to explain the photoelectric effect models of the atom changed rapidly in response to these ideas and this then led to the ability to conduct calculations making new predictions about the behavior of atoms but also new tests of how atoms should behave themselves this was a dynamic period in physics transitioning from the classical era of the previous three centur CES into now the modern era that we would still be living in the after effects of today now let’s consider the Rutherford atom but let’s simplify the calculations and only think about an electron going around a single proton so a hydrogen like atom but only in two Dimensions the electron is bound to the proton in a circular orbit by the kulum force in the same way that the Earth would be bound to the Sun by the gravitational force in our solar system so the electron would be orbiting the proton the proton would be the central Force emitter in this problem the electron would be responding to that Force the electric force the Kum force in this case so that force would be given here by this formula the Kum force exerted on the electron by the proton would be given by a constant 1 over 4 Pi * Epsilon KN and I’ll come back to that in a bit basically this is the permitivity of free space Epsilon KN it has something to do with how electric Fields can propagate through empty space uh the product of the charges of the electron which is negative the fundamental electric charge and the proton which is positive the fundamental electric charge and that divided by the distance squared between the electron and the proton now this R would be the orbital radius of this circular orbit now I want to note here that this unit Vector R hat carries all the the directional information of this Force Now by convention R hat points from the source of the force the proton to the recipient of the force the electron it is the sign of the electron’s negative electric charge that ultimately flips the direction of that vector and has the force pointing back toward the proton that is making it an attractive Force now according to Newton’s Second Law digging back to our first semester introductory physics the sum of all four on the electron will be simply summarized by its mass times its net acceleration well what acceleration is this electron experiencing as it orbits the proton the answer is a centripetal acceleration this ultimately is a center seeking Force which results in a center seeking acceleration changing constantly the direction of the electron’s Velocity Vector so that means that the the acceleration the net acceleration of this electron has a well- defined form it’s given by v^2 / R in magnitude and its direction Center seeking will point to the center of the circular motion which again is in the direction of negative R hat that is from the electron to the proton whereas our hat is defined as being from the proton to the electron so we have all the pieces we need to build a Rutherford model of the atom in two Dimensions using these ideas of a centrally compact positive charged nucleus and orbiting electrons so let’s go ahead and do that we can set the sum of the forces which is just the kolum force equal to the masstimes the centripetal acceleration now note that there is a negative R hat on the left side negative R hat negative R hat on the right side negative R hat drops out of this entire equation and we’re left with just this equation 1 4 Pi Epsilon KN * e^2 R is equal to the mass of the the electron times the velocity of the electron all squared now I’m leaving it in this funny form because this almost instantly lets us write down the classical kinetic energy of this electron that is 12 m v ^2 if I just multiply this equation by 1/2 I immediately get the kinetic energy of this electron going in orbit around the central proton 12 * 1 4i Epsilon * e^2 R now I’m going to leave this equation unsimplified I’m going to leave this one/ half sitting out in front to ease the next step and that is Computing the total energy of this electron you can go ahead and multiply this out if you want to but uh it’s convenient for what’s going to happen next to just leave it out that way to remind us that uh doing math with this thing is going to be relatively straightforward so let me rewrite the kinetic energy of the electron here at the top of the slide now the total energy is the sum of its potential and kinetic energies for that electron at a given orbital radius r that is to say the total energy of the electron is just its kinetic energy plus its potential energy well the only Force present is the kolon force and so that means it has an electric potential energy and that electric potential energy for the electron UE will be given by its charge negative e times the electric potential of the proton V with a subscript P well the electric potential of the proton is just going to be given by 1 4 Pi Epsilon KN time the charge of the proton divided by the distance between them so we wind up with this equation for the electric potential energy of the electron 1 4 Pi Epsilon * e^2 R and that allows us to then write the total energy e as follows it’s just our kinetic energy plus the potential energy which is a negative number and you see why I left the 1/2 here it was convenient because um uh I have 1/2 times a common multiplicative thing here and just subtracting off that common multiplicative thing here and so ultimately at the end of the day I get a negative number for the total energy of the electron and that’s okay it it just means that the uh potential energy of this particular electron outmatches its kinetic energy for this particular orbital radius R so this is the final expression that I get for the total classical energy of a 2d Rutherford atom that is just considering the electron going around a stationary proton in the center let’s write that equation down we’re going to pull it up later we’re going to need it again for something that happens later in this lecture but this is about as far as I’m going to go with this for the time being now how do we get modern Concepts like the fact that the electron is actually a wave and not a point-like particle into this thing well we see the problem already with a Rutherford model it’s not going to explain the hydrogen emission spectrum right because in the Rutherford model any orbital radius R is allowed you can put any R in there and you’ll get an e out now because any total energy is allowed for the electron this cannot explain the discrete energy spectrum of electrons in a hydrogen atom we already have a sense that quantization of some sort must be present in the atom right this is the atomic emission spectrum it Bears the Fingerprints of constrained system with only specifically allowed energies determined by those constraints now the de broi postulates enter to provide the crucial missing thing the clue that will help us to understand this whole problem this is the key step that ultimately leads to quantization so for example we know that the momentum of an electron matter wave is given by plunk constant divided by the wavelength of that matter wave we can can relate the particle like properties momentum to the wav like properties wavelength using the de BR postulate now classically and because we’re doing things at low velocity we’ll revisit this as a consequence of these choices later but we’re going to start off thinking classically we can write the momentum in terms of its speed and that’s just going to be the mass of the electron times its velocity or in this case the magnitude of its velocity its speed thinking purely classically so the question we want to answer is is every wavelength of the electron possible for our electron if it is then every momentum is allowed in the atom and we’re right back where we started again if every mum momentum is allowed every speed is allowed and if every speed is allowed every radius is allowed this has consequences in a system where there are relationships between things like speed and orbital radius speed and momentum and momentum and wavelength but maybe that’s the flaw maybe the problem here is that not every wavelength through a matter wave for our electron orbiting this Central proton is actually allowed we had a discussion in class about the Schrodinger wave equation and in that discussion I drew the real part of free particle wave functions or other kinds of wave functions up on the board and I invited you the class to discuss whether or not those wave functions made physical sense I mean it’s possible to write things down mathematically that don’t make sense physically and let’s revisit that discussion I will review the Sal points here as I go through some examples and let’s apply that discussion to the electron in orbit around the proton and see what conclusions we might draw now let’s start by thinking about a circular orbit a circular orbit is quite simply one that after one period repeats again and that means if we’re thinking about the electron as a wave at a moment in time so Frozen in space at a moment in time remember it’s not only going to be a one place its wave function is spread out over space and the space over which it’s spread out is the circumference of its orbit that’s its one dimension that it’s traveling along in this problem so whatever the wave function of this electron is it had better at least obey the basic mathematical principle that when it gets back to itself at the starting point of its orbital circumference it starts all over again from exactly where it began so let’s imagine the real part of the electron matter wave and we’re just going to make one up and it might describe the electron traveling along such a circumference of an orbit of radius r at a specific time zero so we’re going to freeze this wave function which it’s of course itself is not physical but we’re going to imagine it being frozen in time at a given moment and it might look something like this um if we pick a zero point on the orbital circumference which I’ve marked here in one dimension it might be that its wave is at a local maximum the real part of its wave function might be at a local maximum there then as we move along the circumference the wave function declines and then goes negative and then it comes back and it goes to zero declining to zero again and then it goes positive now at one circumference that is at 2 pi r we see that the function I’ve chosen here nicely comes back to where it started this seems to behave itself in the sense that the wave is one continuous wave that repeats nicely in space because again we froze in time it’s possible that this wave might be waving in time but we froze in time and so in space this thing had better meet itself when it gets back to its starting point and I’ll I’ll explain physically why that needs to be in a moment so this one seems to be a reasonable candidate wave function for describing our electron in orbit around a central proton it nicely repeats itself when it reaches 2 pi r that is z the Zero Point again on its circumference uh Its Behavior is very smooth and continuous at the boundary where the orbit then repeats where that is 2 pi which Cycles back to zero again on a circle now let’s take a look at a wave function that’s also plausible mathematically but has some undesirable physical properties so what about this matter wave for the same electron at the same radius R we Frozen it in time it’s a perfectly reasonable wave function right it’s got one wavelength here it looks wavy is this a good physical wave function for describing the electron well if we look at 2i R we see that where the wave function ends up when it gets back to its beginning again is not where it started now that doesn’t per se have any mathematical negative consequences I mean this is a perfectly allowed function I can write it down uh it has something called a jump discontinuity when it gets back to its start it jumps from this value right before 2 pi r back to its starting value at 2 pi r but but this has physical consequences so because it has this jump discontinuity at 2 pi and does not cycle back to where it started uh the jump discontinuity results in a first derivative at that point that is infinite that is the derivative of this wave function with respect to space DDX at that point 0 or 2 pi r has an infinite slope now because the first derivative with respect to space in the shring or wave equation plays the role of the thing that tells you the momentum of the particle the jump discontinuity means that we have an infinite momentum point in the wave function and a place of infinite momentum is physically forbidden it just doesn’t make any physical sense if this were the case the universe would have ended long ago if things like this were possible because there’d be a particle which would contain more energy than every other particle in the universe and that would have all kinds of terrible physical consequences so this kind of wave function is physically forbidden it may be mathematically allowed but it violates physical Notions of naturalness in the world around us because the jump discontinuity has a physical consequence that is infinite momentum what about this wave function what about this matter wave for the same electron again at the same radius R does this look to you like a good wave function go ahead and pause the video and stare at it for a moment if you drew the conclusion that yeah it’s a pretty good wave function you’re you’re on the right track I mean it’s got twice the number of wavelengths in 2i r that the first one did but it comes back to where it started at 2 pi r um in fact it differs exactly by a factor of two in wavelength from the first one and in fact all waves that satisfy the relationship that their wavelength is an integer multiple of the shortest continuous and complete wave you can write down the so-called fundamental if you will all harmonics of the fundamental wave of this electron will satisfy this condition that there’s no infinite momentum Point anywhere along the physical space the circumference where it would occupy in space and in fact none in between those integer multiples will work they’ll all have the same problem that the previous example had there will be a jump discontinuity when you get to 2i R this results in a place of infinite momentum it’s unphysical so whatever the wave function that describes the electron and orbit around the proton it must satisfy this condition in order to have physical meaning an integer number n times some fundamental wavelength Lambda is going to be equal to 2 pi r the only lambdas that will work will be those that satisfy this constraint that is 2 pi r / n equals Lambda now if we utilize the de BR postulate relating momentum and wavelength then we wind up with NH over P substituting in for Lambda equal 2i R and classically remembering that P is equal to MV this puts a constraint between the radius and the speed and the integer multiple in question here NH over MV = 2i R now you’ll notice that I can move the 2 pi over to the left side and then I’ll have H over 2 pi and that allows us then to substitute with the reduced plunks constant H bar reme remember that H bar is just h / 2 pi you get this a lot when you start switching to the angular quantities angular frequency and wave number and things like that the H bar is very convenient in those contexts so let’s go ahead and just absorb the 2 pi into the definition of har bar and we’ll arrive at the following equation for the speed of the electron and its relation to the radius of the orbit that is uh M * V * r equal NH that can be Rewritten to solve for the speed of the electron ve which is NH over Mr and in preparation for relating this back to energy Concepts I’m going to go ahead and square ve so I get ve^ squar which is just this thing on the right hand side here so from the matter wave hypothesis I have a relationship between V an integer multiple of the fundamental wavelength and the radius of the orbit that determined that wavel length the first place now V and R also appear in things like kinetic energy so you can already see that we have a new constraint to throw into energy equations that will lead us to perhaps some final understanding of why it is that the atomic spectrum is discretized now before we do that I want to talk a little bit about Neil’s B’s actual postulate it’s worth noting that the way that bore attack the this problem was to postulate that there was a quantization of angular momentum in the atom that is the electron was quantized in its orbit around the proton uh now he did this in 1913 and this was about 11 years prior to De br’s work which was in 1924 so bore asserted having I guess seen that quantization worked in other problems to explain things that had previously gone unexplained he asserted that since h and H bar the reduced plunks constant have units of angular momentum that is jewles time seconds it might be in an atom that the angular momentum L is a multiple an integer multiple of H bar that those would be the only kinds of angular momenta that would be allowed in an orbital system like a 2d Rutherford atom so the angular momentum of the electron can only come in multiples of H and for a circular orbit we can relate the uh constru straint and H bar directly to the angular momentum of a particle going in a circle and that’s just P * R which classically is MVR and so this leads to the equation MVR equal nhr from B’s assertion now later on De broi would explain the reason why this works and that’s based on what we just saw on the previous slide that is if there’s a constraint on the the structure of the electron wave function requiring that the uh radius the circumference of the orbit be related to integer multiples of a fundamental wavelength of the electron then if you go back to the previous slides so go back in the lecture video you’ll see that this exact same condition resulted from the matter wave consideration so this points to the fact that uh in 1913 when bour made this assertion this is quite a bold assertion really born out of the success of the ideas of quantization in the previous a decade or so uh based on Plank’s work and Einstein’s work and so forth um this was bore being very intellectually bold and it paid off because as you’ll see this model works extremely well so finally we can take the kinetic energy equation and we can eliminate the speed of the electron in that equation in favor of the quantization condition from the matter wave hypothesis so that is here’s our kinetic energy for for the electron and orbit around a proton that can be related to the generic kinetic energy 12 mv^ 2 but we have an expression for v^2 from the quantization condition from the matter wave hypothesis and that was determined earlier to be this so if we substitute that into the equation we find out that the kinetic energy from kul’s law is equal to this kinetic energy expression taking into account the quantization of the wave function of the electron that only certain wave shapes will be allowed for a given orbital radius R and some algebra will finally lead you to this expression for the allowed radi of an atom it’s actually quite remarkable the allowed orbital radius in this 2D model is simply given by the product of an integer 1 2 3 4 Etc squared that’s n times a product of a bunch of fundamental constants of nature notice there are no variables left you have the numbers four and Pi you have the constant of nature the permitivity of free space Epsilon KN whose value is given here 8.85 * 10-2 fads per meter you have the fundamental constant Plank’s constant the reduced version squared and a reminder that H bar is 1.05 * 10us 34 Jew seconds you have the mass of the electron 9.11 * 10- 31 kg and the fundamental Electric charge 1.62 * 10 -19 Kum the only thing that can vary in here is n and n is fixed to be an integer 1 2 3 4 5 Etc so the radius of the orbit in our hydrogen like atom is simply given by an integer squared times a number so what is that number well if we stick in n equals 1 and solve we arrive at what is known as the bore radius it’s the smallest orbit allowed in the hydrogen atom because of the imposition of the matter wave hypothesis or B’s angular momentum quantization condition which turn out to be equivalent the bore radius is just this thing here and if you calculate it out it’s about half an anstrom 5.3 * 10- 11 M which is to say that the smallest a hydrogen atom can ever be with its one electron and its one proton is is about one enstrom across and as as we saw from earlier discussions in the lectures in this course an anstrom is roughly the size scale of an atom and that’s no accident it’s imposed by the matter wave nature of the electron so we find in this model of the hydrogen atom based on a classical definition of kinetic energy and momentum but with matter wave quantization imposed we suddenly find that only a fundamental orbit and its harmonics are multiples of that orbit are allowed and this begins to look a lot more like the atom that gives rise to a quantized atomic spectrum but the question is can we see that Spectrum arise from this model well to answer this question let’s again consider the total classical energy of an electron orbiting a proton at radius R but again impose the condition that n * Lambda the matter wavelength of the electron equals 2i r that led to our writing of the bore radius and then all the allowed radi of our hydrogen atom n^2 * the bore radius so we have the total classical energy of this electron I’m just repeating the expression we wrote earlier and then I’m plugging in with the expression for r n^2 a and putting in the full definition of of a here the bore radius so you wind up if you play around with it a little bit getting an equation that looks like this you get a negative a bunch of numbers and constants * 1 n^ 2 that factor in front of 1/ n^2 is – 2.19 * 1018 Jew if you go ahead and punch in all the numbers here and calculate it in electron volts this is a much more familiar number this is the famous -3.6 electron volts that is the energy of the electron in the hydrogen atom in its lowest orbit and of course this also turns out to be the energy required to fully ionize an electron out of its parent hydrogen atom if you want to free that electron completely get it away from its proton from hydrogen and put it out at Infinity you have to put in 13.6 electron volts to liberate it so the energy of an allowed orbit of integer n corresponding to radius R = n^2 * the bore radius is given by this simple equation that the energy of that orbit that specific orbit is -3.6 EV * 1 / n^2 where n is 1 2 3 4 Etc any integer this is a remarkable fact just by imposing the matter wave hypothesis on this and requiring that the wave functions be well behaved when thinking about the wave function spread over the circumference of a circular orbit we’ve immediately arrived at a quantization condition for our model of the hydrogen atom but how good is this model in order to understand how the bore model of the atom will give us the kinds of quantized energy Spectra that would result in specific specific wavelengths of light being emitted by an excited atom let’s step back and take a look at the bore model for a moment schematically the picture on the left illustrates the classical drawing of what the bore model of the atom would look like it’s very similar to the picture that I sketched earlier with a single electron orbiting a single proton here the smallest gray Circle corresponds to the Nal 1 orbit the the smallest orbit that an electron can have around the proton at the center of the hydrogen atom and that corresponds to the bore radius the Nal 2 orbit is a multiple of 2^ 2ar or 4 * the bore radius and similarly the Nal 3 orbit is going to be a multiple of 3^ s or 9 * the bore radius electrons can only orbit at these allowed radi going in a circle around the central single proton that means that if an electron is struck by for instance electromagnetic radiation a photon it could be caused to jump into a larger orbit if the electron possesses of sufficient energy to give the energy to the electron needed to transition from one orbit to the next so for instance we might imagine that the electron started in the N equals 1 orbit of the hydrogen atom was struck by a photon of sufficient energy and was able to transition to the Nal 2 orbit of the atom maybe this resulted in a complete loss a total absorption of the photon that struck it or maybe the photon was scattered losing energy and changing its wavelength gaining uh wavelength in the process becoming longer in wavelength now the image here shows the opposite of that process an electron starts in for instance the Nal 3 orbit and then spontaneously falls down into the Nal 2 2 orbit but because conservation of energy has to hold the energy difference between the Nal 3 orbit and the nals 2 orbit must go

    someplace and in this case it would result in the emission of a photon so because the atom conserves energy in order to go to a wider orbit it must absorb the energy from someplace a photon with the right frequency and wavelength can do that to drop down to a lower orbit that is one characterized by a smaller integer n it must release energy and emitting a photon of a specific wavelength and frequency will do that too so let’s consider a transition that releases a photon emits a photon in the process from an orbit that’s marked by an integer n greater than M the orbit into which it falls so n is some integer m is some integer and N is greater than M the change in energy Del Delta e is going to be given by the final energy the energy of the state marked by the number M minus the energy of the initial State the orbit marked by the integer n well if we plug in the formula for the energy of any specific orbit in a hydrogen atom that’s going to give us an overall multiplicative factor of -3.6 electron volts and that’s going to be multiplied by the difference between two fractions 1 m^ 2us 1 / n^ 2 so for example for the transition from the Nal 2 orbit to the Nal 1 orbit or n = 2 m = 1 we find that Delta e is E1 minus E2 and that’s going to be given by 10.2 electron volts go ahead and work that out yourself for practice but you should find that that energy is 10.2 electron volts but but this energy must go somewhere and so this lost energy from the electron would go into the creation of a photon that then is emitted during the process and that Photon will have an energy given by H bar Omega the product of the reduced plunks constant and its angular frequency so let’s think about the photon wavelengths from electron transitions in hydrogen using this energy conservation idea and combined with the relationship between the frequency wavelength and energy of a photon we can then calculate the expected wavelengths of photons emitted from an ionized bore atom so an atom where for instance an electron starts out at infinity and comes down into one of the the low orbits or maybe starts just above and drops down to a slightly lower orbit now recall that the Balmer series The visible wavelengths of light emitted in the atomic emission spectrum of hydrogen involved an empirical relationship between wavelengths of emitted light from hydrogen given by the following formula where the integer n ranges between 3 4 5 6 and up and this integer here is fixed at two well this looks a lot like the kind of relationship you might derive from the bore model of the atom in the transition between uh say Nal 3 and n = 2 state so just to see if we’re at all matching reality let’s tabulate the energy of photons and the corresponding wavelengths of the photons that would result from transitions from the 3 to two State the 4 to 2 State the 5 to2 State and so forth and if you do that you find the following remarkable things that the wavelength of the photon emitted when the electron goes from the Nal 3 orbit to to the Nal 2 orbit is 656 NM and if that sounds familiar it should sound exactly like the red line in the Balmer series which has this wavelength if the electron instead started in the Nal 4 orbit and dropped to the Nal 2 orbit that results in a photon of wavelength 484 nanm which is blue green and is weirdly close to the blue green line in the Balmer series similarly 5 to2 results in a 432 nanm wavelength Photon that’s blue and 6 to2 results in a 409 nanm Photon that’s a violet and these are in fact to good accuracy the Balmer series lines now they differ a little bit from the numbers before and I’ll comment on that in a moment but overall the pattern is very well explained by the quantization of orbits in the atom due to the matter wave nature of the electron and thus the resulting quantization of angular momentum all Neil’s B’s conjecture in 1913 this is a remarkable fact the fact that just using a classical model of the atom combined with matter wave nature of the electron one can immediately reproduce a pattern in the world around you in this case the Balmer series of atomic emission spectrum lines this is incredible now that said if it is wise to revisit our model and compare that to what we might actually expect from a more realistic model of atoms after all atoms are not two-dimensional things they’re three-dimensional things at the very minimum and we haven’t included an extra dimension in our model we’ve only made a very good approximation to what we would expect real atoms to need to be more accurately described by but you have to admit it’s a pretty good model for what we were trying to accomplish it almost exactly repr produces the Balmer Spectrum which no previous model could do so the so-called bore Rutherford model of the atom which is what we constructed here has a few assumptions built into it the one of them is obvious it’s two-dimensional we said that outright a little bit less obvious although I hinted at it throughout this discussion is that we’ve modeled this atom as if the electron is free to move but the proton or substituting the proton with a whole nucleus with Z protons instead so 2 3 4 5 6 7 8 protons whatever you like we have the proton pinned and unmoving at the center of the atom but you know think about planets orbiting stars or planets orbiting other planets things of comparable size and mass orbiting each other one isn’t fixed while the other one goes around it rather they co-orbit a common Center and that Center is the center of mass of the system so a more accurate model would take into account the fact that the proton can also wobble in response to being tugged on by the electron via the kolon force now we’ve also obtained this model by combining a very classical picture of a planetary atom with very classical Notions of momentum kinetic energy and so forth with the matter wave idea that’s how we stitched quantum physics into this through the matter wave idea a more realistic model of the atom of course would be fully threedimensional from the start it would allow for the motion of both the electron and the proton and in fact if one puts that into this model one much more accurately captures the balm or Spectrum wavelengths they’re a little off from what’s predicted in this model but they’re almost exactly predicted by using a model where the proton can also wobble a little bit in it as it’s orbited by the electron and of course in reality we wouldn’t start from a fully classical picture we would try to exact L solve Schrodinger’s wave equation in three dimensions using as our potential acting on the wave function the kulum potential written here in full three-dimensional Glory so there’s a r Vector hidden in here that has X and Y and Z in it the truth is we are simply not ready at this stage to commit to more realism in describing the atom this was already a bit of an exhaustive exercise at the level of say coming out of introdu physics but I promise you that through the rest of this course we are going to build up a toolkit that would allow you to attack this problem in a later semester starting from the principles outlined in this course so let’s review in this lecture we have learned the following things we’ve learned about a way to develop a classical model of the atom from classical energy and force considerations we’ve done so in two dimensions we’ve then imposed quantum physics on this by sticking the matter wave hypothesis into the atom via the electron thinking about what wavelengths would be allowed for an orbit of a given radius R and then imposing that condition on the energy conservation that is derived from the classical model built in the first step this has allowed us to make predictions about the behavior of a hydrogen like atom in this model and we found that it matches remarkably well with OB observational evidence this is certainly a far more accurate description of nature than anything that has come before and this is the bore model of the atom which is a building block to a much larger picture of quantum physics the physics of the smallest things in the universe [Music] so let’s see what we have learned already from this most basic scenario a particle moving at constant speed free from external forces there is a wave equation which means that the solutions will not have definite localization okay these are waves they describe a phenomenon that’s not specifiable to any one location in space and that means they’re spread out and as a result of that and because they describe something that is oscillating we’re led to questions like what is it that’s oscillating what are the implications for measuring things like position or momentum of particles when fundamentally they’re waves and they’re not localizable to any one definite location in space at any one time uh I may not be able to know everything that I thought I could know about particles from matter waves which are really what matter is um electromagnetic waves are very iations in the strength of electric and magnetic fields and mechanical waves are variations in say the density of a medium or the displacement of a medium what’s oscillating in a matter wave we’ll come back to that now to better understand these Solutions we need to confront the mathematics of these complex functions a bit more closely and I don’t want you to be daunted by the presence of either complex numbers or complex functions they’re basically just a representation of information that becomes necessary when a problem has too much information to describe be described by only one class of number say real numbers it’s okay it just means that matter waves contain more information than real numbers alone can capture and there’s nothing scary about that all we have to do is become more comfortable with the language of complex numbers and how to get real values out of them because after all real numbers are the only things that are realized in the physical world it may be true that phenomena can be described by complex numbers and complex functions but when we make measurements of the natural world we don’t get the answer I back from it we get numbers like 5 or – 52 or 73.7 71 back from measurements those are all real valued and so regardless of the fact that the wave equation may be complex and its Solutions may be complex somehow we’ve got to get real numbers and only real numbers out of these things and to really understand that we need to take a look at complex numbers and a little bit of the algebra related to complex numbers but basically complex numbers just double the available amount of information you can store in a single number that’s all they do so our working solution to the free particle wave equation is of this form we’ve seen it a bunch of times now and it looks weirdly similar to that representative complex number Z I showed you earlier X+ i y it’s a complex structure with a real part which we could represent by X and an imaginary part which we could represent by y it looks very similar to a a simple complex number but as I’ve said observations of the natural world are conventionally Des described by real numbers not imaginary ones now that’s okay I mean we’ve already kind of hinted at the fact that complex numbers look a lot like vectors and we’re used to dealing with vectors with an X component and a y component and from those we’re comfortable summarizing the information content of a vector using the concept of its length or its magnitude a single real number you know for instance you might have a three-dimensional velocity with a VX a VY and a VZ component and that’s all very complicated but you’re very comfortable going look the speed of the particle is V where V is the square root of vx^ 2 + V y^2 + vz^ 2 a single valued real number that summarizes the overall thrust of the Velocity Vector so that’s not scary at all that’s something you’ve been doing since beginning introductory physics the question here is how does one get a single real number out of a complex one how do we get the measurable out of the complex function or number well you might just try you know your old friend the the square right Square the complex number and see if that gives you a real value but unfortunately it gives you a complex polinomial you wind up with a real number x^2 and a real number negative y^2 but a complex piece 2 ixy that’s the sum of the Cross terms of this square if only we could get rid of that cross term we’d be home free we would actually recover something that looks weirdly like the Pythagorean theorem with an X squ and A Y squ term this is almost a hypotenuse squared but it’s not real valued so this may be the h hypotenuse in some space but it’s not the hypotenuse in the real number space of measurement okay so that won’t work now instead to get a real number you need to do something like this and this is part of what defines the algebra of complex numbers you’re going to take Z and you’re going to multiply it by a special version of itself known as Zar uh this is just x + i y the original complex number times x minus i y and you’ll notice that when you do that distributed multiplication out and add all the terms together you wind up with x^2 plus y^2 and no cross terms well this looks weirdly like the Pythagorean theorem you’ve got an X component squared you’ve got a y component squared and this is somehow related to a sort of square of the complex number although this funny thing Zar is required so while that yields something more consistent with for instance your experience with the Pythagorean theorem about the length of a vector um but it does it with a complex number with real and imaginary components what is this thing Zar that we’ve employed to get away with this and the answer is that Zar is what is known as the complex conjugate of the complex number all you do to take the complex conjugate of a complex number is take all numbers I inside the number and replace them with negative I that’s it you’re going to send I to minus I you’re going to flip the sign of all the I’s and that is all Zar represents now to keep this kind of consistent with our instincts about vectors and lengths and magnitudes and things like that we have a shorthand notation for Z * Z Star Z time its own complex conjugate to indicate that it is the square of the real length the thing we would really measure as a consequence in nature if we described a problem using complex numbers and that is denoted by the magnitude or absolute value bars of Z all squared so the magnitude of z^ squ is defined as ZZ star so if you see this notation absolute value or magnitude of z^ squared in complex space that denotes the product of Z with its complex conjugate Z star that’s how you get the real valued length of a complex number or a complex function now another interesting thing about the free particle Solutions is that uh one can simp simplify the notation that we’ve been using to carry around these free particle Solutions and and that is the language of signs and cosiness and exponential functions so for instance um it’s really clunky to have to keep writing out these signs and cosiness in our free particle wave function solution to the wave equation it would be nice if we could compactify this notation somehow and Mathematics does offer us a more compact representation of the same information and will also give us some practice with imaginary numbers like like I um ultimately we will be able to summarize the free particle Solutions as a single exponential function rather than a sum of ss and cosiness to get there let’s consider a tailor expansion of the S function sin of X so the S of X tailor expanded into a series of terms becomes x – x Cub over 3 factorial + x 5 over 5 factorial Etc similarly the cosine function can be tailor expanded into the following 1 – x^2 2 factorial + x 4 4 factorial Etc notice that the S involves only the odd powers of X and the cosine involves only the even powers of X so x to the 0 is 1 x^2 X4 and so forth and the sums all have alternating pluses and minuses that are used to combine the terms together now recall that the tailor expansion of the exponential function e to the X looks like the following if you tailor expand e to the X you wind up with 1 + x + x^2 / 2 factorial Etc so if you stare at these three things for a second you’re dangerously close to being able to find some combination of s and cosine that when added together yields e to X but it’s not going to be real valued because the sign and cosine expansions have alternating plus and minus signs in front of their terms whereas the e to the X expansion is all sums and so we see a problem here we would like to use e to the X to represent some combination of s and cosine of x but we can’t do that because we have these stray minus signs on alternating terms that complicate our ability to use only real numbers to do this trick to make sign and cosine combine to get e to X well again leaving that expansion of e to the X up here let’s go back and revisit a little bit the use of the imaginary number I and the implications it might have for combining sign and cosine so note that while the expansion of e to the X involves the sum of a bunch of power of X and the S and cosine expansions have alternating sums and subtractions we might use this rule that when you see stray minus signs that they might be indicative of products of the imaginary number I we can crack the puzzle so let’s think creatively for a moment and let’s recall that i^2 = -1 and that allows us to then rewrite terms like x^2 which appears in the expansion of the cosine function as i^2 x^2 or in other words IX all sared so it’s as if we replaced the argument of the cosine function with uh I times the argument that we started with now in the S expansion we have odd numbered powers of of of X like X cubed for instance and uh that could be Rewritten as i^2 X Cub but that’s not very satisfying we have different powers of I and X in this but let’s keep in mind that if we have a term that looks instead like Nega IX cubed that can be Rewritten and you can practice this for yourself as I cubed x cubed which is just IX all cubed so with those things in mind let’s recall our free particle Solutions are of the form a * the cosine of an argument X plus I * the S of an argument X well if we stare at that for a second and we plug in the tailor expansions of cosine and S we would get this that we have a times for instance just keeping the first two terms in the tailor expans expansion 1 – x^2 / 2 factorial and we’re going to add to that I a time this expansion of sign keeping only the first two terms x – x Cub over 3 factorial now if we distribute the imaginary number I into the parentheses on the right hand side of this uh sum we can start employing the identities and relationships that I wrote up here so for instance iix cubed is just iix all cubed and x^2 is just IX all SAR so for instance I wind up with terms like this I have IX here which is fine we can leave that alone I have IX cubed and that can be replaced with positive IX all cubed and that’s done here now for the cosine I have 1 – x^2 / 2 factorial well x^2 can be replaced with IX all SAR and you’ll notice what’s happening we’re eating up the minus signs in algebra involving the number I so we wind up with a positive sum of these terms 1 + IX plus IX all 2 over 2 factorial plus IX all cubed over 3 factorial Etc if we were keeping more terms in the tailor expansion this thing here can simply be Rewritten as a * e to the iix the argument of the cosine and S was X but combining Li this way with a multiplicative I in front of the sign term we get to rewrite that sum as a e to the I * X the original argument of the S and cosine function so we’ve traded a real valued function for a complex function but it’s a much more compact notation than what we had before and this allows us to rewrite the free particle Solutions in this more compact form as a * e to the I * the quantity KX – Omega T and this is a little bit easier iier to carry around on a piece of paper than the sums of ss and cosin with the imaginary number I in only one of the two terms now what is the magnitude of our free particle solution and let’s keep in mind that we don’t know if the constant out in front of the function a is real or complex so let’s try to calculate the magnitude squared of the wave function of the free particle uh let’s do that so we’re trying to calculate the absolute value of s^ squ and remember in a complex space of functions or numbers that’s defined as s times its complex conjugate s star well what is that well s is just a * e to the I KX minus Omega T the complex conjugative s would involve changing I to negative I everywhere we see it but we don’t know if there’s an I hiding inside of the prefactor a that multiplies the exponential function so to be very careful about this in case the a is also a complex number we’re going to replace a with a star and I with negative I up here and that’s about as far as we can go with this if we now group terms together in the multiplication we have a * a star we have e i KX – Omega T and grouping the exponents together we have then I * KX Omega T these exponents completely cancel each other out to zero and we’re left with a term that’s just e to the 0 e to the 0 is 1 so this then simplifies to a * a star or just the magnitude of a s so the measure of the wave particle function for a free particle is just a real number the magnitude of a^ s but what is it that we’ve just evaluated what is this function that solves the wave equation and what is the meaning of its length these are the the questions that really racked people’s brains in the 1920s and 1930s this was a real intellectual struggle in confronting the wave nature of matter so one is forced to interpret these functions and their meaning there is no easy answer from first principles in nature about what the wave function is because it’s a complex function you don’t actually have any physical meaning to its real and imaginary Parts it’s only the magnitude of the wave function squared that has any physical meaning and so you have to lay an interpretation down as to what you think the underlying wave function is and what is waving it’s not energy because energy is a real thing it’s something else and I have to tell you that in the history of physics and you may have seen this in popular videos on quantum mechanics which often are Rife with misunderstandings of the underlying math and subject material it’s this contest of intellectual ideas that has caused the most most hand rubbing and consternation and some of the most bitter disagreements and strong opinions in the history of Science and it’s all been over a function whose Direct Value has no physical meaning because it’s based in part on imaginary numbers which themselves have no physical interpretation it’s only the real valued magnitude of the complex function or the complex numbers that have any physical meaning it’s not those numbers themselves it’s only the measure of their overall information content that has meaning in the physical world now the most practical interpretation one which has also been met with the most experimental success since Irvin Schrodinger first published his wave equation is that of a probabilistic meaning to the square of the wave function that is to say this thing the magnitude of s squared uh this amplitude squared of the wave function is interpreted as representing a probability per unit distance per unit time in one dimension in two Dimensions it’s per unit area and in three its per unit volume now to obtain raw probabilities one has to specify the exact conditions under which the free particle has been prepared for instance where was it starting from exactly and what was its momentum and things like that and then you can answer questions such as given that this is a matter wave and it’s not localized once it’s released to any one place in space what’s the probability of finding this particle between say 1 cm and 2 cm from the point of origin or what’s the probability of finding the particle a distance of 3 cm from the point of origin 1 second after it starts its Journey these are questions you can try to answer in the framework of the matter wave equation the Shing or wave equation and all the math that goes along with it we don’t have that framework available we’re going to develop that framework going forward and try to get answers to questions like this all right so that’s our goal we’re going to conclude clude our discussion of the implications of the wave nature of matter in this lecture and later lectures we’ll begin to think about specific problem statements and then how we use the shringer wave equation to attack those problem statements and interrogate the solutions to get answers that can be measured in a laboratory experiment the wave function itself is not directly accessible but its amplitude squared in different situations has physical consequences for measurement now that said because we’re mathematical beings that can imagine things that are not physically realizable in the world around us we can use some math and computer aids to try to visualize the wave function of our matter particle that’s free from external forces but to do this we have to concoct a space of the imaginary value of the wave function and the real value value of the wave function now these are not physical axes in space they don’t have physical extent remember that this is an oscillating probability probability itself is not physical but the probabilities of outcomes are physical and so it’s you have to be very careful to separate your visualization of the wave function from physical meaning which is only derivable from the square of the wave function the complex conjugate times the original wave function nonetheless because we are mathematical beings and we can think abstractly let’s attempt to visualize what the wave function of a free particle would look like without specifying how it was prepared uh in that case then it’s the solution that we’ve written down already and we can imagine thinking about the uh amplitude of the wave function along its imaginary axis and along its real axis so along its imaginary axis it’s a sign function whose amplitude starts out at zero goes to a maximum plunges to a minimum and returns to zero after one cycle and along the real valued axis of the wave function it starts off at maximum amplitude eventually goes through zero to a minimum back through zero to a maximum after one cycle of the matter wave and note that the maximum of the matter wave in the real value part of the wave function is achieved at the same location as the zero point of the imaginary part of the wave function which is what you would expect from a cosine and a s function combined together now of course if we construct this in 3D space with our imaginary axis our real axis and then the spatial location and physical space of the particle we wind up with a helical structure a helical surface that winds through imaginary and real space uh keeping in mind that we’re talking about the imaginary and real components of the wave function but at all points in space as we’ve seen the amplitude squared of this is a constant valued number that doesn’t depend on space and time and and so whatever this wave function is doing varying in its real and imaginary Parts in physical space it represents a constant probability density everywhere in space in time so there’s nothing waving in physical space in IM in the space of the wave function you have oscillation and that oscillation is related to the probability of finding the particle at that point in space at that moment in time but in physical space all you have is the magnitude squar of the wave function that’s the only physical thing that manifests in the measurable world now to close out this lecture let’s take a look at what it means to try to measure both the position and the momentum of a matter wave representation of a particle so here’s a real valued part of the wave function of a matter Wave It’s the cosine it starts at one goes to negative 1 returns to one after one cycle and you see I’ve got two two wavelengths represented in this picture I’ve ignored the complex part but it’s also waving at the same time we’ve just looked we’re looking now just at the physical position of the particle versus the value of the real component of the wave function the imaginary component of this wave also has an important role in what happens with the physical reality of the particle but it’s not shown here I just want to concentrate your energy now on thinking about what it means to measure momentum and position for a wave or at least a particle described as a wave now measuring the position of a free particle boils down to determining where it is along the x-axis so for instance I might do that by zooming in more and more on this wave and saying okay I’m localizing the particle more and more and more by spotting the little chunk of its wave function in the real valued component located at that point in space but measuring the momentum of the same particle boils down to a different observation measuring the momentum of the particle is related ated to determining the second derivative of this wave with respect to space that is determining the curvature of this wave that’s what the second derivative with respect to space tells you it tells you about the spatial curvature of the real part or the imaginary part of the wave function and it’s that curvature the degree to which the wave bends to move toward the next part of its cycle that determines momentum now it’s very easy to determine the momentum in this picture we clearly have two wavelengths we could sit down and easily determine from the information on this page uh what the wavelength of this wave is all right but we might be a little less certain about where it is because there’s a couple of cycles of its real valued part of its wave function here so maybe it’s here or maybe it’s here or maybe it’s here all right so knowing the momentum really well might preclude knowing the position really well but what if we really localize this particle to one specific place place in in position space all right so what we want to do is try to locate the particle more and more precisely by zooming in on the wave function to really localize the phenomenon to one narrow region of space and this is equivalent to identifying where it is in a range X and X Plus Delta X and then sending Delta X more and more toward zero to zoom way way way way in on a narrow slice of the wave all right but as we’ll see it’s going to become hard harder and harder to establish the curvature of the wave as we do this and thus the momentum of the wave is going to slip from our grasp now to help you with this exercise what I want you to do is really stare at the wave in this region right here where I’m indicating with the with the mouse cursor okay so really stare at the wave here right now you can clearly see that there’s well-defined curvature you could easily and readily determine the wavelength of this phenomenon how about now can you easily determine the wavelength to this phenomenon I’ve zoomed in localizing more in space where I want to see where the particle is but in doing so I’ve traded a lot of the curvature away in order to do that it’s it’s getting harder to determine the wavelength of this wave but you could still maybe do it you’ve got a peak over here and you can see how it’s declining there’s lots of curvature to determine the the momentum of this wave but how about now I’ve zoomed in even more stare at that are you confident you could determine the curv of that wave and you may be remembering the old wave but as you continue to stare can you determine the curvature of the wave well I messed with you a little bit while you were staring at the wave while I was daring you to think about the curvature of that line I did one more change to the wave I’m still zoomed way in on it but I changed the wavelength by 10% did you notice did you notice that the wavelength changed from the previous zoom in to the zoom in you’re looking at now an astute Observer might have noticed while they were staring at it that the grid behind here uh changed when I did that and that corresponds to a change in where I was zoomed in on the wave but the starting value and the ending value of the wave in this picture didn’t change the heights of the Waves where they enter the picture and exit the picture were concocted identically giving you the impression that you were confident that the wavelength was the same as the wave from before but it’s not I changed the wavelength by 10% but presented you with a similar Zoom in region and this is meant to confuse you on purpose to show you that the more you close in on the wave function the harder and harder and harder it’s going to be to determine the curvature of the wave is this line straight is it bending gently how much is it bending you don’t have infinite resolution available to you in the universe you’re going to hit a limit at some point and it’s going to get extremely hard to determine if this is a straight line or not a straight line and if it’s not a straight line you’re going to struggle with determining exactly what its radius of curvature is and that struggle is reflected in a loss of control over your knowledge of the momentum of the particle knowing the position too well comes at the cost of knowing the momentum so let me repeat that statement one more time when you’re dealing with matter waves knowing the position very well comes at the cost of knowing the momentum with any Precision knowing the momentum very well comes at the cost of knowing the position with any Precision that I reflected in my ear statement about being zoomed out looking at many cycles of the wave you’re very confident when you’re zoomed out that you know the wavelength of this phenomenon but because there are many places where the particle is likely to be and less likely to be represented by the changing amplitude of the wave in real space you’re getting kind of confused about where it might actually be is it more at one of the Maxima or more at the other maximum or more at the third maximum or the fourth maximum or the fifth maximum gaining confidence in momentum comes comes at the cost of confidence and precision and it was the physicist verer Heisenberg who worked out the mathematics of this particular issue in 1927 now the real way to do this of course is to take the wave equation and to work through the forier transform which tells you something about the information content of the wave in position and frequency or momentum space that’s a little above the ability of a course like this to work out although you are welcome to look into it on your own if you’re comfortable with uh integrals and derivatives at a high level at least at the level of say Cal 2 and Cal 3 um Heisenberg codified the relationship between the certainty or uncertainty of our knowledge in momentum and the uncertainty of our knowledge in position in what is known as the Heisenberg uncertainty principle and it’s a very definitive statement albeit an inequality it says that the uncertainty in the knowledge of momentum Delta P times the uncertainty in the knowledge of position Delta X must always be greater than or equal to H the reduced plunks constant divided two why is it that we don’t worry about knowing how fast our car is moving while also knowing its position on the road we don’t freak out about that like if we’re going to stare at the speedometer for a moment we’re suddenly going to look up and realize we’re in New York City whereas we were in Dallas at the beginning of our glance down at the speedometer that doesn’t happen in the real world you don’t increase your confidence in your current velocity and thus your current momentum and then suddenly look up and realize you’re on the Moon I mean this is essentially what we’re talking about here with tiny matter waves right is that once you become very confident you know where the particle is you suddenly lose all confidence about its momentum and vice versa well it’s no wonder we didn’t notice it h bar over2 is a number that is approximately 10us 35 Jew seconds that’s an insanely small number it’s no wonder we didn’t notice this before and that it would only manifest at the scale of things tiny like atoms or electrons or the nucleus of the atom or things like that but this statement holds for matter waves no matter what situation you’re in you cannot know the position and the momentum at the same time with infinite precision and you can see that if you did try to know one of them with infinite Precision that is Delta X exactly equal to zero so you want to know exactly the position of a matter wave so you specify an experiment that lets you get infinite Precision no uncertainty on the position you completely lose control of the momentum the uncertainty on the momentum blows up to Infinity in order to hold this as a constant that’s the only way to satisfy this inequality is if Delta P blows up to Infinity as Delta X goes to zero this is a limit imposed by the wave nature of matter it’s unavoidable you cannot know this pair of variables X and P with any simultaneously perfect Precision now now of course the why of this is buried deeply in things like the fora transform and in the algebra of matrices that is collections of numbers in multiple Dimensions which is another form of language that can be used to derive quantum mechanics which is where we are essentially at now that’s above the pay grade of this particular class but I just want to say that because you are going to encounter quantum mechanics again in a dedicated higher level course than this one and I want you to understand that I’m having to wave my hands quite a bit at this level in order to motivate this nonetheless you will have a second crack at this where you’ll begin to see the wise of all of this where is this coming from why H over two uh why this particular product of momentum and position are there other products of things that similarly in pairs are uncertain when you know one you don’t know the other and and vice versa these are excellent questions and I don’t expect you to be satisfied with this right now but this is where we can get in a course at this level level after two semesters of introductory physics so let’s review what we have learned in this lecture we’ve learned about mechanical and electromagnetic wave equations and from that we’ve learned how to infer the nature of the wave equation for matter and this has given us some ability to get at the meaning of the Waves described by the matter wave equation albeit by interpreting what’s going on based on our experience with the natural world the wave equation involves complex numbers and the solution to the wave equations involve complex functions we have to get real numbers out of this thing if we want to map it onto the real world and the only way to do this is for instance to calculate the amplitude squar of the wave function in doing that however we lose any ability to understand or map the physicality of the wave function itself onto the real world it’s only the amplitude of the wave function that has implications for the real world so the wave function describes oscillating probabilities and it’s the the amplitude squared that tells us the probability per unit distance per unit time for something to be true in the shringer wave equation describing a matter wave involving either no forces or some forces but the wave nature of matter ultimately imposes a limit of absolute knowledge on our ability to understand the world around us what we learn from exploring the wave part of the wave function of the matter waves is that there’s a limit to our knowledge if we know the position of this wave very well we lose control over its momentum if we get control over its momentum at a high degree we lose our confidence in information about the position of the particle any longer these pair of variables are related to each other in their uncertainty by the Heisenberg uncertainty principle and fundamentally this imposes a limit of absolute knowledge on what we can know about a system of particles at any given moment in time by making measurements these are the foundations of quantum mechanics that we will build on going forward and we will spend the rest of this course essentially applying quantum mechanics and special relativity to problems involving the very small things in the universe like atoms and individual subatomic particles to make predictions about the natural world and understand phenomena like atoms and the behavior of particles trapped in systems like you would find for instance in semiconductors these are all basic applications that are at our fingertips now that we have a foundational equation that we can solve in order to understand the outcomes of these particular [Music] situations in this lecture we will learn the following things we will learn about the postulates of quantum mechanics the inviable tenets that are the foundations of this branch of physics we’ll also learn about some guidelines that you can employ for wave functions so that you can learn to solve the Schrodinger wave equation we’ll learn about classical analoges of quantum systems that we might want to model building on what we know already about a classical system but employing that in the Schrodinger wave equation and finally we’ll learn about a specific archetypal model a Quantum model of a bound particle known as the particle in a box or infinite Square well model and we will solve it using the Schrodinger wave equation let me remind you first about the one-dimensional Schrodinger wave equation which I will represent using a Shand going forward swe much easier to carry that around than shringer wave equation the schinger wave equation has a Time dependent statement on the left and on the right it has a spatially dependent statement about the wave function and finally it has a portion here that describes the action of an external force on the particle or system represented by the wave function the above is the one-dimensional schinger wave equation and generally speaking it allows for solutions that vary in space and time and it also allows for forces represented by the underlying potential that gives rise to the force that varies in Space in Time this is very complex so to utilize this equation we will need to do the following first we will represent physical situations with a model and what that usually boils down to because the time piece on the left and the space piece in the middle are essentially fixed by the form of the equation is varying the form of the potential V this describes how the system constrains particles described by the wave function now this effort may involve simplifying assumptions in the aid of creating a simple model of the force or forces that can act on the particle and these choices these simplifying assumptions have consequences that I’ll talk about later we will Define the base rules of quantum mechanics what are the inviolable tenants of problem solving in quantum mechanics that if untrue mean the fundamental dissolution of quantum mechanics we’ll also Define some guidelines for how to write down wave functions that will work to solve the shring or wave equation for instance in a specific situation now these guidelines may be violable depending on how you approximate physical situations but don’t represent of fundamental failure of quantum mechanics if violated in other words poor assumptions on the part of the problem solver the physicist are not to be held against the fundamental framework of quantum mechanics so what are the inviable tenets of quantum mechanics well these are known as the postulates of quantum mechanics and I’m going to warn you at the beginning that I am glossing over some of the Elegance of these postulates in favor of a bit more wordiness because we don’t have the mathematical foundations quite yet in order to take advantage of the more elegant and direct way of stating these postulates so what are the postulates of quantum mechanics well the first one is that at each specific time the state of a system that is for instance a particle or collection of particles can be entirely represented by a space of functions that are related to the wave function s now while sigh depends on a finite number of things like spatial position along the horizontal space axis and time the space of functions that can be related to the wave function and can fully represent the possible state of a system can be infinite in dimension now for our purposes we will concentrate just on the wave function rather than on this larger notion of a space of functions that can describe a system a more advanced course will concentrate rather on that space of functions which has all kinds of properties and rules associated with it it’s called a Hilbert space and it’s named after mathematical physicist David Hilbert the second postulate is that every observable quantity of a system for instance a measurement of momentum or energy will be represented mathematically by the action of an operator on the state of the system now I’ll elaborate more on on this a little bit later but think back to how I waved my hands and derived the shring or wave equation for example the total energy is measured in that equation by a Time derivative acting on the wave function and as you’ll see other actions of other derivatives effectively represent operators that measure quantities of the system these would be the outcomes of doing experiments and finally the only possible results of a measurement of an observable are related to characteristic numbers known as igen values of those

    By Amjad Izhar
    Contact: amjad.izhar@gmail.com
    https://amjadizhar.blog