This is one of the webpages of Libarid A. Maljian at the Department of Physics at CSLA at NJIT.
New Jersey Institute of Technology
College of Science and Liberal Arts
Department of Physics
Introductory Astronomy and Cosmology, Section 101
Fall 2024
First Examination lecture notes
Introduction to the Universe: Our Location in Space and Time
A star system is a star with
several planets orbiting that star and moons orbiting those planets. The name of our home star system is the Solar
System. There is only one star in our
Solar System: the Sun. We live on planet
Earth, the third planet orbiting the Sun.
A galaxy is a collection of billions of star systems held together by
gravity. The name of our home galaxy is
the Milky Way Galaxy. There
are roughly one hundred billion star systems that together compose the Milky
Way Galaxy, and the Solar System is just one of these one hundred
billion star systems within the Milky Way Galaxy. Galactic groups contain only a few dozen
galaxies, while galactic clusters contain between several hundred and several
thousand galaxies. Our Milky Way Galaxy
is not a member of a galactic cluster; our Milky Way Galaxy is a member of a
galactic group. The name of our home
galactic group is the Local Galactic Group (or just the Local Group for
short). The Local Galactic Group is
composed of a few dozen galaxies, although most of them are small
galaxies. There are only three major galaxies
in the Local Galactic Group: our Milky Way Galaxy, the Andromeda Galaxy, and
the Triangulum Galaxy. Galactic
superclusters are enormous organizations composed of between tens of thousands
and hundreds of thousands of galaxies.
The name of our home galactic supercluster is the Laniakea
Galactic Supercluster, and our Local Group is a small galactic group on the
outskirts of the Laniakea Galactic Supercluster. Cosmic filaments are colossal organizations
composed of several galactic superclusters, and hence cosmic filaments contain
between a few hundred thousand and a few million galaxies. The name of our home cosmic filament is the
Perseus-Pisces-Sculptor-Hercules Cosmic Filament, and our Laniakea
Galactic Supercluster is one of several galactic superclusters within the
Perseus-Pisces-Sculptor-Hercules Cosmic Filament. The observable universe contains hundreds of
thousands, perhaps millions, of cosmic filaments. Since each cosmic filament contains between a
few hundred thousand and a few million galaxies, the entire observable universe
contains roughly one hundred billion galaxies. Assuming that each galaxy contains on average
one hundred billion star systems just like our Milky Way Galaxy, then there are roughly ten sextillion star systems in the
observable universe. (Please refer to
the following multiplication table, where each number is one thousand times the
previous number: one, one thousand, one million, one billion, one trillion, one
quadrillion, one quintillion, one sextillion, one septillion, one octillion,
one nonillion, one decillion. Caution:
this multiplication table is only correct in American English. These same words are used
for different numbers in British English.)
We can summarize our location in the universe with our cosmic
address. Whenever anyone asks for our
mailing address, we provide a list of larger and larger organizations wherein
we reside. After our
name comes a house number, then a street/avenue/road/boulevard (which is a
collection of houses), then a municipality (which is a collection of
streets/avenues/roads/boulevards), then a county (which is a collection of
municipalities), then a state/province (which is a collection of counties), and
then a country (which is a collection of states/provinces). If we were to continue, we
would then provide a continent (which is a collection of countries), then a
planet (which is a collection of continents), then a star system (which is a
collection of planets orbiting a star), then a galaxy (which is a collection of
star systems), then a galactic group or a galactic cluster (which is a
collection of galaxies), then a galactic supercluster (which is a collection of
galactic groups and galactic clusters), then a cosmic filament (which is a
collection of galactic superclusters), and finally a universe (which is a
collection of cosmic filaments).
Every person we have ever met or ever will meet and every person we have
ever heard of or will ever hear of has the same cosmic address starting with
planet Earth followed by the Solar System, the Milky Way Galaxy, the Local
Galactic Group, the Laniakea Galactic Supercluster,
the Perseus-Pisces-Sculptor-Hercules Cosmic Filament, and the observable
universe.
One light-year is the
distance that light travels in a time of one year. We must never forget that a light-year is a
length of distance, not a duration of time.
This is easy to forget, since a year is
a duration of time. Nevertheless, a
light-year is not a duration of time; a light-year is a length of
distance. How far is one
light-year? Light travels roughly three
hundred thousand kilometers every second through the vacuum of outer
space. This is extraordinarily fast by
human standards. Since three hundred
thousand kilometers is the distance light travels in one second, we multiply
this by sixty to find how far light travels in one minute, since there are sixty
seconds in one minute. We multiply this
result by another sixty to obtain how far light travels in one hour, since
there are sixty minutes in one hour. We
multiply this result by twenty-four to find how far light travels in one day,
since there are twenty-four hours in one day.
Finally, we multiply this result by 365.25 to obtain how far light
travels in one year, since there are 365.25 days in one year. The final result of
this calculation is that one light-year is roughly 9.5 trillion kilometers. This is close enough to ten trillion
kilometers that throughout this course we will assume that one light-year is
roughly ten trillion kilometers as a satisfactory approximation. Our universe is so enormous that it actually
takes years for light to travel from one star system to a neighboring star
system. It is for this reason that
astronomers consider telescopes to be time machines. When we observe a star that is one hundred
light-years distant for example, we see that star as it appeared one hundred years
ago (one hundred years in the past), since it took that long for its light to
travel from that star to us. The only
way to know how that star actually appears at this moment is to wait another
one hundred years for that light to travel to us. We see a star one thousand light-years
distant as it appeared one thousand years ago (one thousand years in the past),
since it took that long for its light to travel from that star to us. The only way to know how that star actually
appears at this moment is to wait another one thousand years for that light to
travel to us. Not only are telescopes
time machines, but the human eye is itself also a time machine. If we look at the Sun (which we must never do
since that would cause permanent blindness), we see the Sun as it appeared
roughly eight minutes ago (roughly eight minutes in the past), since it takes
that long for light to travel from the Sun to the Earth. The only way to know how our Sun actually
appears at this moment is to wait another eight minutes for that light to
travel to us. If we look at the Moon, we
see it as it appeared roughly one second ago (roughly one second in the past),
since it takes that long for light to travel from the Moon to the Earth. The only way to know how our Moon actually appears
at this moment is to wait another second for that light to travel to us. When we look at the tables and chairs around
us or even these words we are reading, we see those tables and chairs and these
words as they appeared a few nanoseconds ago (a few nanoseconds in the past),
since it takes that long for light to travel from the tables and chairs and
even these words to our eyes. Obviously,
these time delays are so tiny in everyday life that we do not notice them at
all, but they are real nevertheless. The universe is so enormous
that it takes a few years for light to travel from one star system to a
neighboring star system, it often takes millions of years for light to travel
from one galaxy to a neighboring galaxy, and it takes billions of years for
light to travel from one side of the observable universe to the other side of
the observable universe. For
example, our Milky Way Galaxy has a diameter of roughly one hundred thousand
light-years, and the Andromeda Galaxy is more than two million light-years from
our Milky Way Galaxy.
The universe is roughly
fourteen billion years old. This is
unimaginably old by human standards. The
human mind can comprehend seconds, minutes, hours, days, weeks, months, and
years of time. Ten years is called a
decade. Ten decades (which is one
hundred years) is called a century, which is roughly how long most humans
live. Ten centuries (which is one
hundred decades or one thousand years) is called a
millennium. Ten millennia is almost
twice as long as all of recorded human history, but it
is still minuscule compared to the age of the universe. One hundred millennia is almost how long our
species Homo sapiens has existed, and
one thousand millennia (which is one million years) is almost how long our
genus Homo has existed. Ten million years is almost how long our
family of hominids has existed. One
hundred million years ago, there were no hominids; dinosaurs roamed the Earth. One billion years is approaching the age of
our planet Earth, but this is still not near the age of the universe. Ten billion years is roughly the age of our
Milky Way Galaxy, and this is finally approaching the fourteen-billion-year age
of the entire universe. We can gain a
greater appreciation for the age of the universe through the cosmic
calendar. In the cosmic calendar, we
pretend that the entire history of the universe fits into one calendar
year. In other words, when we use the
cosmic calendar we pretend that the creation of the universe occurred at the
very beginning of the calendar year on January 01st at
midnight, and we pretend that the present day is at the very end of the
calendar year on December 31st immediately before midnight. If the creation of the universe occurred on
January 01st at midnight, we must wait until September
before the Earth forms! We must then
wait another month (October) before the most primitive unicellular
microorganisms appear on Earth! We must
wait another month (November) before multicellular but still microscopic
organisms appear on Earth! We must wait
another month (mid-December) before macroscopic but still invertebrate animals
appear on Earth! The most primitive
vertebrate animals (fishes) appear on roughly December 20th,
amphibians appear on roughly December 22nd, and reptiles appear on roughly
December 24th. The age of reptiles,
commonly known as the age of dinosaurs, lasts from roughly December 25th to roughly December 30th, when the age of mammals
begins. Hominids do not appear until
December 31st at roughly 05:30 p.m., and Homo sapiens do not appear until
December 31st at roughly 11:52 p.m.!
Finally, all of recorded human history lasts for roughly fifteen
seconds, beginning on December 31st at roughly
11:59:45 p.m.! Compared with the
recorded history of our species and even compared with the entire history of
our species (unrecorded and recorded), the universe is unimaginably old.
Introduction to the Sky
Every point on the surface of
the Earth can be labeled with the Geographic
Coordinate System, a pair of coordinates (two numbers) called latitude and
longitude. Lines of latitude are
parallel to one another. The line of
latitude at 0° is commonly called the equator, but we must call this line of
latitude the Terrestrial Equator in this course, for reasons we will make clear
shortly. The Terrestrial Equator divides
the surface of the Earth into two hemispheres: the northern hemisphere and the
southern hemisphere. Latitudes in the
northern hemisphere are measured in degrees north, and
latitudes in the southern hemisphere are measured in degrees south. The Earth’s rotational axis (or axis of
rotation) pierces the Earth at two points.
One of these points is at 90°N latitude and is
commonly called the north pole, but we must call this line of latitude the
North Terrestrial Pole in this course, for reasons we will make clear
shortly. The other point pierced by the
Earth’s rotational axis (or axis of rotation) is at 90°S
latitude which is commonly called the south pole, but we must call this line of
latitude the South Terrestrial Pole in this course, for reasons we will make
clear shortly. Lines of longitude are
not parallel to one another; they all begin together at the North Terrestrial
Pole, they spread apart from one another until they are furthest apart from one
another at the Terrestrial Equator, and they all converge back together at the
South Terrestrial Pole. Lines of
longitude are measured from one hundred and eighty
degrees west to one hundred and eighty degrees east. Every point on the surface of the Earth has a
unique latitude and longitude with only two exceptions. Although the North Terrestrial Pole has the
unique latitude 90°N, the North Terrestrial Pole has
an undefined longitude. Although the
South Terrestrial Pole has the unique latitude 90°S,
the South Terrestrial Pole has an undefined longitude. These are the only two points that suffer
from this tragedy; every other point on the surface of the Earth has a unique
latitude and a unique longitude.
The Celestial Sphere is a
giant imaginary sphere that does not exist physically but is
nevertheless an essential concept in observational astronomy. For the purposes of observational astronomy,
we assume all astronomical objects in the sky (such as stars and galaxies) are
located on this Celestial Sphere, and we assume the Earth is at the center of
this Celestial Sphere. From wherever we
happen to be located on the Earth, the sky we see above us is only half of the
Celestial Sphere, since the other half of the Celestial Sphere is below us
where we see the ground instead. The
horizon is a giant imaginary circle that separates the ground from the
sky. Above the horizon, we see one-half
of the Celestial Sphere; below the horizon, we see the ground that prevents us
from seeing the other half of the Celestial Sphere.
All astronomical objects
(such as stars and galaxies) can be labeled with the
Horizon Coordinate System, a pair of coordinates (two numbers) called altitude
and azimuth. Altitude is the angle above
or below the horizon. Lines of altitude
are parallel to one another. The line of
altitude at 0° is the horizon, which divides the Celestial Sphere into two
hemispheres: the positive-altitude hemisphere and the negative-altitude
hemisphere. Altitudes in the
positive-altitude hemisphere are measured in positive
degrees, and altitudes in the negative-altitude hemisphere are measured in
negative degrees. Astronomical objects
below the horizon have negative altitudes, which means we cannot see them since
the ground is in the way. Astronomical
objects above the horizon have positive altitudes which means we can see them
in the sky, unless it is raining or cloudy!
The point on the sky directly on top of us is +90°
altitude and is called the zenith; the point directly opposite the zenith is −90°
altitude and is called the nadir. In
colloquial English, the zenith of anything is its climax or apex or pinnacle,
such as the apex of a civilization or the pinnacle of a person’s career. Also in colloquial English, the nadir of
anything is its rock bottom, such as the rock bottom of a classroom’s
performance. Lines of azimuth are not
parallel to one another; they all begin together at the zenith, they spread
apart from one another until they are furthest apart from one another at the
horizon, and they all converge back together at the nadir. Lines of azimuth are
measured from 0° to 360°, starting at 0° for directly north. The azimuth is 45° for directly northeast,
90° for directly east, 135° for directly southeast, 180° for directly south,
225° for directly southwest, 270° for directly west, 315° for directly
northwest, and 360° for directly north (which is actually back to 0° for
directly north). We will call the point
on the horizon directly north the North Point, the point on the horizon
directly east we will call the East Point, the point on the horizon directly
south we will call the South Point, and the point on
the horizon directly west we will call the West Point. Every astronomical object on the Celestial
Sphere has a unique altitude and azimuth
with only two exceptions. Although the
zenith has the unique altitude +90°, the zenith has an
undefined azimuth. Although the nadir
has the unique altitude −90°, the nadir has an undefined azimuth. These are the only two points that suffer
from this tragedy; every other point on the Celestial Sphere has a unique
altitude and a unique azimuth.
Although the Horizon
Coordinate System is commonly used by the general public, the Horizon
Coordinate System is unsatisfactory for observational astronomy for a couple of
reasons. Firstly, the Horizon
Coordinates (the altitude and the azimuth) of a particular astronomical object
depends upon where we happen to be located on the Earth. An astronomical object may have a positive
altitude from where we happen to be located on the Earth for example, but that
same astronomical object will have a negative altitude from someone else’s
different location on the Earth. The
azimuth will also be different. The problem
with the Horizon Coordinate System is even worse than this however. Even if we remain fixed at one particular
location on the Earth, the Horizon Coordinates (the altitude and the azimuth)
of astronomical objects will still not remain fixed because the Earth is
continuously rotating from west to east.
This continuous rotation causes astronomical objects to appear to cross
the eastern horizon (rising) and then later cross the western horizon
(setting). The Celestial Meridian is a
giant imaginary circle that is perpendicular to the horizon. The Celestial Meridian begins at the North
Point, runs through the zenith, runs through the South Point, runs though the
nadir, and ends back at the North Point.
As the Earth rotates from west to east, astronomical objects appear to
cross the eastern horizon (rising), and they appear to have a higher and higher
altitude until they reach their highest altitude when they cross the Celestial
Meridian (culminating). As the Earth
continues to rotate, astronomical objects then appear to have a lower and lower
altitude until they cross the western horizon (setting), and they eventually
rise crossing the eastern horizon again.
In summary, altitudes and azimuths both continuously change as a result of the Earth’s continuous rotation. This daily apparent motion is most obvious
for the Sun. The moment when the Sun
crosses the eastern horizon is called sunrise, the moment when the Sun crosses
the Celestial Meridian above the horizon is called noon, the moment when the
Sun crosses the western horizon is called sunset, and the moment when the Sun
crosses the Celestial Meridian again but below the horizon is called
midnight. The first half of the day before the Sun crosses the Celestial
Meridian at noon is called “before meridian” or “ante meridiem” which is
abbreviated “a.m.,” and the second half of the day after the Sun has crossed the Celestial Meridian at noon is called
“afternoon” or “after meridian” or “post meridiem” which is abbreviated “p.m.”
Since the Horizon Coordinates
of astronomical objects continuously change as the Earth rotates, we need another
pair of coordinates that remains fixed even though the Earth is rotating. The most important such coordinate system is
the Equatorial Coordinate System, a pair of coordinates (two numbers) called
declination and right ascension. Lines
of declination are projections of lines of latitude onto the Celestial
Sphere. Since lines of latitude are
parallel to one another, lines of declination are also parallel to one
another. The projection of the
Terrestrial Equator at 0° latitude onto the Celestial Sphere is 0°
declination. This is
called the Celestial Equator. We
now realize why we must never simply use the word equator in this course. There are two equators! The Earth’s equator is 0° latitude and is called the Terrestrial Equator, while the Celestial
Sphere’s equator (the sky’s equator) is 0° declination and is called the
Celestial Equator. The Celestial Equator
divides the Celestial Sphere into two hemispheres: the positive-declination
hemisphere and the negative-declination hemisphere. Declinations in the positive-declination
hemisphere are measured in positive degrees, while
declinations in the negative-declination hemisphere are measured in negative
degrees. The Earth’s rotational axis (or
axis of rotation) pierces the Celestial Sphere at two points. One of these points is at +90°
declination and is called the North Celestial Pole. We now realize why we must never simply use
the term north pole in this course.
There are two north poles! The
Earth’s north pole is 90°N
latitude and is called the North Terrestrial Pole, while the Celestial Sphere’s
north pole (the sky’s north pole) is +90° declination and is called the North
Celestial Pole. The other point on the
Celestial Sphere pierced by the Earth’s rotational axis (or axis of rotation)
is at −90° declination and is called the South
Celestial Pole. We now realize why we
must never simply use the term south pole in this course. There are two south poles! The Earth’s south pole
is 90°S latitude and is called the South Terrestrial
Pole, while the Celestial Sphere’s south pole (the sky’s south pole) is −90°
declination and is called the South Celestial Pole. Lines of right ascension on the Celestial
Sphere are analogous to lines of longitude on Earth. Just as lines of longitude are not parallel
to one another, lines of right ascension are also not parallel to one
another. Lines of right ascension all
begin together at the North Celestial Pole, they spread apart from one another
until they are furthest apart from one another at the Celestial Equator, and
they all converge back together at the South Celestial Pole. Right ascension is measured in
right-ascension hour-angles, running from 00h to 24h. We will clearly define the line of right
ascension at 00h shortly. Each right-ascension hour-angle
is actually 15° of right ascension, since 360° divided by 24h
equals 15° of right ascension per 1h of
right ascension. Every point on the
Celestial Sphere has a unique declination and right ascension with only two
exceptions. Although the North Celestial
Pole has the unique declination +90°, the North
Celestial Pole has an undefined right ascension. Although the South Celestial Pole has the
unique declination −90°, the South Celestial Pole has an undefined right
ascension. These are the only two points
that suffer from this tragedy; every other point on the Celestial Sphere has a
unique declination and a unique right ascension.
Notice that any spherical
coordinate system we impose upon a sphere always results in two points on the
sphere with undefined coordinates. These
are called the coordinate singularities of the
coordinate system we have imposed. For
example, the North Terrestrial Pole and South Terrestrial Pole are the
coordinate singularities of the Geographic Coordinate System, since the North
Terrestrial Pole and the South Terrestrial Pole have undefined longitudes. As another example, the zenith and the nadir
are the coordinate singularities of the Horizon Coordinate System, since the
zenith and the nadir have undefined azimuths.
As yet another example, the North Celestial
Pole and the South Celestial Pole are the coordinate singularities of the
Equatorial Coordinate System, since the North Celestial Pole and the South
Celestial Pole have undefined right ascensions.
As the Earth rotates, the
Horizon Coordinates of astronomical objects (the altitude and the azimuth)
continuously change, but the Equatorial Coordinates (the declination and the
right ascension) of all astronomical objects remain fixed. Actually, this is not
exactly correct. As we will
discuss later in the course, the Earth’s rotational axis (or axis of rotation)
is slowly precessing and nutating, and these motions
cause changes in Equatorial Coordinates.
In addition, stars and galaxies are also physically moving through the
universe; these physical motions cause additional changes in Equatorial
Coordinates. Nevertheless, all these
changes are sufficiently small that our naked eye does not notice them. For the purposes of this discussion, we will
assume that the Equatorial Coordinates of astronomical objects remain fixed as
a satisfactory naked-eye approximation.
Suppose there happens to be
an astronomical object (such as a star or galaxy) at either the North Celestial
Pole or the South Celestial Pole. Such
an object will actually have fixed Horizon Coordinates even though the Earth is
continuously rotating. It just so
happens that there is a star almost exactly at the North Celestial Pole. Therefore, this star appears to remain fixed as everything else in the sky appears to circle around
it. This star has several names: α
(alpha) Ursae Minoris,
Polaris, the Pole Star, or the North Star.
There is no star almost exactly at the South Celestial Pole, but if
there were such a star, then we would name it the South Star. The North Star always has an azimuth of 0°
since it is directly north, and the South Star (if there were one) always has
an azimuth of 180° since it is directly south.
The altitude of the North Star is always equal to our latitude on Earth,
while the altitude of the South Star (if there were one) is always equal to the
opposite of our latitude on Earth. For
example, if we lived at 40°N latitude, then the
Horizon Coordinates of the North Star would be +40° altitude and 0° azimuth,
while the Horizon Coordinates of the South Star (if there were one) would be −40°
altitude and 180° azimuth. As another
example, if we lived at 60°S latitude, then the
Horizon Coordinates of the North Star would be −60° altitude and 0°
azimuth, while the Horizon Coordinates of the South Star (if there were one)
would be +60° altitude and 180° azimuth.
If we happen to live in the northern hemisphere, all astronomical
objects (such as stars and galaxies) close enough to the North Star at the North
Celestial Pole would never set; they would just appear to circle around the
North Celestial Pole as the Earth rotates.
Also from the northern hemisphere, all
astronomical objects (such as stars and galaxies) close enough to the South
Star at the South Celestial Pole would never rise; they would just circle
around the South Celestial Pole as the Earth rotates. All of these astronomical objects are called circumpolar, since they appear to circle around
the celestial poles. The Latin root circum- means around in words such as circumference, circumscribe,
or circumvent for example; hence, the word circumpolar literally means around
the pole. If we happen to live in the
southern hemisphere, all astronomical objects (such as stars and galaxies)
close enough to the South Star at the South Celestial Pole would never set;
they would just appear to circle around the South Celestial Pole as the Earth
rotates. Also
from the southern hemisphere, all astronomical objects (such as stars and
galaxies) close enough to the North Star at the North Celestial Pole would
never rise; they would just circle around the North Celestial Pole as the Earth
rotates. If we happen to live at the
North Terrestrial Pole, our latitude would be 90°N,
but this means that the North Celestial Pole is at +90°
altitude which means it is at the zenith.
Also from the North Terrestrial Pole, the South Celestial Pole is at −90°
altitude which means it is at the nadir. As the Earth rotates, the entire sky would
appear to be circumpolar, with half of the sky never setting and the other half
of the sky never rising. This stands to
reason. At the North
Terrestrial Pole, there is no east for anything to rise from, nor is there west
for anything to set to. At the
North Terrestrial Pole, all directions are south! If we happen to live at the South Terrestrial
Pole, our latitude would be 90°S, but this means that
the North Celestial Pole is at −90° altitude which means it is at the
nadir. Also from the South Terrestrial
Pole, the South Celestial Pole is at +90° altitude which
means it is at the zenith. As the
Earth rotates, the entire sky would appear to be circumpolar, with half of the
sky never setting and the other half of the sky never rising. This stands to reason. At the South Terrestrial
Pole, there is no east for anything to rise from, nor is there west for
anything to set to. At the South
Terrestrial Pole, all directions are north!
Also, note that the half of the sky that would never set at the North
Terrestrial Pole would be the half of the sky that would never rise at the South
Terrestrial Pole. Conversely, the half
of the sky that would never rise at the North Terrestrial Pole would be the
half of the sky that would never set at the South Terrestrial Pole. If we happen to live at the Terrestrial
Equator, our latitude would be 0°, but this means that the North Celestial Pole
is at 0° altitude, which means it is at the North Point on the horizon. Also from the Terrestrial Equator, the South
Celestial Pole is at 0° altitude, which means it is at
the South Point on the horizon. As the
Earth rotates, nothing in the entire sky would appear to be circumpolar; all
astronomical objects appear to rise and set.
Also, the Terrestrial Equator is the only
location on the Earth where the three giant circles on the Celestial Sphere we
have discussed (the horizon, the Celestial Meridian, and the Celestial Equator)
are all perpendicular to one another.
There is a common
misconception that the North Star is the brightest star in the sky. This is false. In fact, the North Star is among the dimmest
stars in the sky; the North Star is so dim that it is barely visible to our
naked eye. The importance of the North
Star is its location in the sky, not its brightness. As we discussed, the North Star is almost
exactly at the North Celestial Pole.
Therefore, the North Star appears to remain fixed
as everything else in the sky appears to circle around it. Since the North Star remains fixed, the
cardinal directions (north, south, east, and west) are determined by first
finding the North Star. If we stand
facing the direction of the North Star, then we are facing north. To our left would be west, to our right would
be east, and south would be behind us.
This is the only way to determine the cardinal directions. It is a common misconception that we use a
magnetic compass to determine the cardinal directions. This is false. As we will discuss later in the course, a
magnetic compass points toward magnetic north, which is different from cardinal
north (true north, toward the North Terrestrial Pole). Admittedly, the direction of magnetic north
is somewhat close to the direction of cardinal north (true north, toward the
North Terrestrial Pole). Nevertheless,
we strictly emphasize the difference between magnetic north and cardinal north
in this course. Again, we do not use a
magnetic compass to determine the cardinal directions (north, south, east, and
west). The only way to determine the
cardinal directions (north, south, east, and west) is by finding the North
Star, which is difficult since it is one of the dimmest stars in the sky,
barely visible with the naked eye.
For thousands of years,
humans have looked up into the sky and observed that the stars appear to remain
fixed relative to one another while continuously rising and setting. Many cultures formed pictures from groups of
stars in the sky and named them constellations.
However, the modern definition of a constellation is a region of the
Celestial Sphere defined by a boundary.
This modern definition of a constellation is analogous to the definition
of a country, which is a region of a continent on Earth defined by a boundary
(the country’s borders). Just as the
Earth is divided into roughly two hundred countries each defined by its own
boundary (its own borders), the entire Celestial Sphere is divided into
eighty-eight modern constellations, each defined by its own boundary. Any astronomical object (whether it is a
star, a galaxy, a planet, or even the Sun or the Moon) is
considered to be within a certain constellation if it is within the
boundary that defines that constellation.
An asterism is a group of stars in the sky that is not one of the
eighty-eight strictly defined modern constellations.
There are several prominent
circumpolar constellations. The
constellation Ursa Major (the big bear) includes
seven bright stars that form the Big Dipper asterism. Two of the stars in the Big Dipper asterism can be used to find the North Star, also known as α
(alpha) Ursae Minoris or
Polaris or the Pole Star. The North Star
is within the constellation Ursa Minor (the little
bear), which includes the Little Dipper asterism. Three other stars in the Big Dipper asterism can be used to find Arcturus, the brightest star in the
constellation Boötes (the shepherd). The constellation Cassiopeia (the queen of Aethiopia) has five bright stars shaped like the letter
W. There are several prominent summer
constellations. The constellation Cygnus
(the swan) includes the Northern Cross asterism. The brightest star in the constellation
Cygnus (the swan) is Deneb. The
brightest star in the constellation Lyra (the harp) is Vega. The brightest star in the constellation
Aquila (the eagle) is Altair. The Summer
Triangle asterism is formed by connecting Vega, Deneb,
and Altair. There are several prominent
winter constellations. The constellation
Orion (the hunter) includes seven bright stars, with Betelgeuse and Rigel among
them. The sword of Orion is actually the
Orion Nebula, which we will discuss later in the course. The brightest star in the constellation Canis Major (the big dog) is Sirius, the dog
star. The brightest star in the
constellation Canis Minor (the little dog) is
Procyon. The Winter Triangle asterism is formed by connecting Betelgeuse, Sirius, and
Procyon. The brightest star in the
constellation Auriga (the charioteer) is Capella. The two brightest stars in the constellation
Perseus (the hero) are Mirfak and Algol.
For thousands of years,
humans have looked up into the sky and observed that the Sun appears to wander
around the Celestial Sphere. The giant
circle that the Sun appears to wander around is called
the ecliptic, and it takes the Sun one year to complete one journey around the
ecliptic. The constellations along the
ecliptic are called the zodiac constellations. If we arbitrarily decide to begin with the
constellation Aquarius (the water bearer), next comes the constellation Pisces
(the fish) followed by the constellation Aries (the ram). Next comes the constellation Taurus (the
bull). The brightest star in the
constellation Taurus (the bull) is Aldebaran, and the
Pleiades is a star cluster within the constellation Taurus (the bull). Next comes the constellation Gemini (the
twins) with the two bright stars Pollux and Castor. Next comes the constellation Cancer (the
crab) followed by the constellation Leo (the lion). The brightest star in the constellation Leo
(the lion) is Regulus. Next comes the constellation Virgo (the
virgin). The brightest star in the
constellation Virgo (the virgin) is Spica.
Next comes the constellation Libra (the scales) followed by the
constellation Scorpius (the scorpion).
The brightest star in the constellation Scorpius (the scorpion) is
Antares. Next comes the constellation
Ophiuchus (the serpent bearer) followed by the constellation Sagittarius (the
centaur archer) followed by the constellation Capricornus (the sea goat)
followed by the constellation Aquarius (the water bearer), which is where we
arbitrarily decided to begin our journey around the ecliptic. Again, it takes the Sun one year to complete
one journey around the ecliptic. As the
Sun journeys around the ecliptic, it appears to wander from within one zodiac
constellation to another, spending approximately one month within each of these
zodiac constellations.
The ecliptic intersects the
Celestial Equator at two points called the equinoxes: the vernal equinox (or
the spring equinox) is where the Sun appears to be on the ecliptic on roughly
March 21st every year, and the autumn equinox is where
the Sun appears to be on the ecliptic on roughly September 21st every
year. Both equinoxes are 0° declination,
since they are on the Celestial Equator.
Astronomers have agreed to define 00h
right ascension as the line of right ascension that intersects the vernal
equinox (spring equinox). The autumn
equinox is therefore 12h right ascension
since it is on the opposite side of the Celestial Sphere from the vernal
equinox (spring equinox). The furthest
angle from the Celestial Equator the Sun ever wanders along the ecliptic is
roughly 23˝°. This occurs at two points
halfway between the equinoxes called the solstices. The summer solstice is where the Sun appears
to be on the ecliptic on roughly June 21st every year,
and the winter solstice is where the Sun appears to be on the ecliptic on
roughly December 21st every year. The
summer solstice is 06h right ascension
since it is halfway from the vernal equinox (spring equinox) at 00h to the autumn equinox at 12h. The winter solstice is 18h
right ascension since it is halfway from the autumn equinox at 12h to the vernal equinox at 24h
(which is the same as 00h since 24h is all the way around the Celestial Sphere
back to 00h). The summer solstice is where the Sun has a
positive maximum declination of +23˝°, and the winter
solstice is where the Sun has a negative maximum declination of −23˝°. In summary, the Equatorial
Coordinates of the vernal equinox (spring equinox) where the Sun appears on the
ecliptic on roughly March 21st every year is 0° declination and 00h right ascension, the Equatorial Coordinates
of the summer solstice where the Sun appears on the ecliptic on roughly June
21st every year is +23˝° declination and 06h
right ascension, the Equatorial Coordinates of the autumn equinox where the Sun
appears on the ecliptic on roughly September 21st every year is 0° declination
and 12h right ascension, and the
Equatorial Coordinates of the winter solstice where the Sun appears on the
ecliptic on roughly December 21st every year is −23˝° declination and 18h right ascension. When the Sun is at the vernal equinox (spring
equinox) with 0° declination, it appears to have an increasing declination as
it journeys from the winter solstice in the negative-declination hemisphere of
the Celestial Sphere toward the summer solstice in the positive-declination hemisphere
of the Celestial Sphere. When the Sun is
at the autumn equinox with 0° declination, it appears to have a decreasing
declination as it journeys from the summer solstice in the positive-declination
hemisphere of the Celestial Sphere toward the winter solstice in the
negative-declination hemisphere of the Celestial Sphere.
The Wave Theory of Light
A wave is a propagating
(traveling) disturbance. Therefore, a
wave requires a medium through which to propagate, since we cannot have a
propagating disturbance if there is nothing to disturb! A transverse wave is a wave where the
direction of the disturbance is perpendicular to the direction of propagation,
while a longitudinal wave is a wave where the direction of the disturbance is
parallel and antiparallel to the direction of propagation. Light is a real-life example of a transverse
wave, while sound is a real-life example of a longitudinal wave. A wave can have a component of its
disturbance perpendicular to the direction of propagation and another component
parallel and antiparallel to the direction of propagation. In other words, a wave can be both transverse
and longitudinal. Water waves that we
see in the ocean are a real-life example of a wave that is both transverse and
longitudinal. The maximum magnitude of a
wave’s disturbance is called the wave amplitude, and
these amplitudes occur at what are called crests (maximum positive disturbance)
and troughs (maximum negative disturbance).
The wave height is the amount of disturbance between a crest and a
trough. In other words, the wave height
is double the wave amplitude (or the wave amplitude is half of the wave
height). The distance from one crest to
the next crest (which is also the distance from one trough to the next trough) is called the wavelength of the wave and is always given the
symbol λ (the lowercase Greek letter lambda). Caution: the word wavelength is misleading,
since it may lead us to conclude that the wavelength is the length of the
entire wave, which it is not. The
wavelength of a wave is the length of only one cycle of the wave. The frequency of a wave is the number of
crests passing a point every second as well as the number of troughs passing a
point every second. We may also
interpret the frequency of a wave as how many cycles or
oscillations or vibrations the wave executes every second. In other words, the frequency of a wave is
how frequently the wave is vibrating
or oscillating, which is why it is called the frequency! A high-frequency wave is oscillating
(vibrating) many cycles every second, while a low-frequency wave is oscillating
(vibrating) a small number of cycles every second. We will use the symbol f for frequency, and its units are cycles per second or vibrations
per second or oscillations per second.
This unit is called a hertz with the
abbreviation Hz, named for the German physicist Heinrich Hertz. Again, one hertz (Hz) is one cycle per second
or one vibration per second or one oscillation per second. A kilohertz is one thousand hertz or one
thousand cycles per second, since the metric prefix kilo- always means
thousand. For example, one kilometer is
one thousand meters, and one kilogram is one thousand grams. A megahertz is one million hertz or one
million cycles per second, since the metric prefix mega- always means
million. On the amplitude-modulation
radio band (AM radio), the radio-station numbers are kilohertz, while on the
frequency-modulation radio band (FM radio), the radio-station numbers are
megahertz.
The speed of a wave is a
function of the properties of the medium through which the wave
propagates. For example, the speed of
sound is some speed through gases such as air, a faster speed through liquids,
and an even faster speed through solids.
The speed of sound through air is not even fixed; the speed of sound
through air actually changes as the temperature of the air changes. As another example, the speed of light is
some speed through gases such as air, a slower speed through liquids such as
water, and an even slower speed through solids such as glass. The speed of any wave with wavelength λ
and frequency f is determined by the
equation v = f λ, where v is the
speed (the velocity) of propagation of the wave. If we solve this equation for the frequency,
we deduce that f = v / λ. Therefore, frequency and wavelength are
inversely proportional to each other.
Waves with higher frequencies have shorter wavelengths, while waves with
lower frequencies have longer wavelengths.
In addition to the properties of the medium of propagation, the speed of
a wave is almost always also a function of the wave’s
own wavelength or frequency thus resulting in dispersion, as we will discuss
shortly.
The amplitude of any wave
determines its energy. In particular,
the energy of a wave is directly proportional to the square of its
amplitude. Therefore, a wave with a
larger amplitude has more energy, while a wave with a smaller amplitude has
less energy. For example, the amplitude
of a sound wave determines its loudness.
A sound wave with a larger amplitude is more loud,
while a sound wave with a smaller amplitude is less loud (more quiet). As another example, the amplitude of a light
wave determines its brightness. A light
wave with a larger amplitude is more bright, while a
light wave with a smaller amplitude is less bright (more dim). The frequency of a wave is difficult to
interpret physically, and so we will interpret the frequency of a wave on a
case-by-case basis. For example, the
frequency of a sound wave is its pitch, meaning that a sound wave with a higher
frequency has a higher pitch while a sound wave with a lower frequency has a
lower pitch. As another example, the
frequency of a visible light wave is its color.
In particular, a visible light wave with a high frequency is blue or
violet, a visible light wave with a low frequency is red or orange, and a
visible light wave with an intermediate frequency is yellow or green. In order starting from the lowest frequency
(which is also the longest wavelength), the colors of visible light are red,
orange, yellow, green, blue, indigo, and violet at the highest frequency (which
is also the shortest wavelength). This
is why the colors of the rainbow are in this order; a rainbow reveals the
correct sequence of colors as determined by either the frequency or the
wavelength. We can memorize this
sequence of colors with a mnemonic.
Using the letters r for red, o for orange, y for yellow, g for green, b
for blue, i for indigo, and v for violet, we
construct a fanciful name of an imaginary person: Roy G. Biv.
Whereas the frequency and the
wavelength of a wave are constrained to one another through the equation v = f
λ, no universal equation constrains the amplitude with the frequency. Therefore, a wave can have a large amplitude
and a high frequency, a wave can have a large amplitude and a low frequency, a
wave can have a small amplitude and a high frequency, and a wave can have a
small amplitude and a low frequency. In
other words, all of these combinations are physically possible. For example, a sound wave
with a large amplitude and a high frequency is a loud high-pitch sound, a sound
wave with a large amplitude and a low frequency is a loud low-pitch sound, a
sound wave with a small amplitude and a high frequency is a quiet high-pitch
sound, and a sound wave with a small amplitude and a low frequency is a quiet
low-pitch sound. As another example, a visible light wave with a large amplitude and
a high frequency is bright blue, a visible light wave with a large amplitude
and a low frequency is bright red, a visible light wave with a small amplitude
and a high frequency is dim blue, and a visible light wave with a small
amplitude and a low frequency is dim red.
A wave that is a propagating
(traveling) disturbance through a material medium (a medium composed of atoms) is called a mechanical wave.
Sound waves and water waves are real-life examples of mechanical
waves. We will call a wave that is a
propagating disturbance through an abstract field medium a field wave. Light waves and gravitational waves are
real-life examples of field waves. Light
waves are propagating disturbances through the electromagnetic field created by
charges, and gravitational waves are propagating disturbances through the
gravitational field created by masses.
Since light waves are propagating disturbances through the
electromagnetic field, then light is actually an electromagnetic wave. The Electromagnetic Spectrum is a list of all
the different types of electromagnetic waves in order as determined by either
the frequency or the wavelength.
Starting with the lowest frequencies (which are also the longest
wavelengths), we have radio waves, microwaves, infrared, visible light (the
only type of electromagnetic wave our eyes can see), ultraviolet, X-rays, and
gamma rays at the highest frequencies (which are also the shortest
wavelengths). All of these are
electromagnetic waves. Therefore, all of
them may be regarded as different forms of light. They all propagate at the same speed of light
through the vacuum of outer space for example.
We now realize that whenever we use the word light in colloquial
English, we probably mean to use the term visible light, since this is the only
type of light that our eyes can actually see.
The visible light band of the Electromagnetic Spectrum is actually quite
narrow. Nevertheless, the visible light
band of the Electromagnetic Spectrum can be subdivided. In order, the subcategories of the visible
light band of the Electromagnetic Spectrum starting at the lowest frequency
(which is also the longest wavelength) are red, orange, yellow, green, blue,
indigo, and violet at the highest frequency (which is also the shortest
wavelength). We now realize why
electromagnetic waves with slightly lower frequencies (or with slightly longer
wavelengths) than visible light are called infrared,
since their frequencies (or wavelengths) are just beyond red visible
light. In other words, infrared light is
more red than red! We also realize why
electromagnetic waves with slightly higher frequencies (or with slightly
shorter wavelengths) than visible light are called
ultraviolet, since their frequencies (or wavelengths) are just beyond violet
visible light. In other words,
ultraviolet light is more purple than purple!
A wave detector detects a
frequency shift if the source of the wave is moving or if the
detector of the wave is moving or if both are moving. This is called the
Doppler-Fizeau effect or the Doppler-Fizeau shift, named for the Austrian physicist and
mathematician Christian Doppler and the French physicist Hippolyte
Fizeau, both of whom mathematically derived the
existence of this effect. According to
the Doppler-Fizeau effect, a wave detector detects a
higher-frequency shift if there is advancing motion (the source moves toward
the detector, the detector moves toward the source, or both). Conversely, a wave detector detects a
lower-frequency shift if there is receding motion (the source moves away from
the detector, the detector moves away from the source, or both). For example, the human ear is a detector of
sound waves. Since the frequency of a
sound wave is its pitch, our ears hear higher pitches as a
police/ambulance/firetruck siren for example moves towards us, and our ears
hear lower pitches as a police/ambulance/firetruck siren for example moves away
from us. Although the human eye is a
detector of light waves, the Doppler-Fizeau effect
for light is too small to be noticed by the human
eye. This is because the magnitude of
the Doppler-Fizeau shift for any wave is a function
of the speeds of the source and the detector as compared with the propagation
speed of the wave itself. The vacuum
speed of light is nearly one million times faster than the speed of sound through
air. Since the vacuum speed of light is
so extraordinarily fast, most objects in the universe move slowly as compared
with this incredibly fast speed, resulting in tiny Doppler-Fizeau
shifts for light. The speed of sound
through air is nearly one million times slower than the vacuum speed of light;
thus, police/ambulance/firetruck sirens for example can move at appreciable
fractions of the speed of sound, resulting in Doppler-Fizeau
shifts for sound that the human ear actually notices. Although the Doppler-Fizeau
effect for light is too small for the human eye to notice, astronomers can
experimentally measure tiny Doppler-Fizeau shifts of
the light from astronomical objects such as stars and galaxies. Our instruments detect higher-frequency
shifts (which are also shorter-wavelength shifts) of light from stars and
galaxies that move towards us.
Astronomers use the word blueshift for
higher-frequency shifts (or shorter-wavelength shifts) of any type of
electromagnetic wave. Conversely, our
instruments detect lower-frequency shifts (which are also longer-wavelength
shifts) of light from stars and galaxies that move away from us. Astronomers use the word redshift for
lower-frequency shifts (or longer wavelength shifts) of any type of
electromagnetic wave. By measuring these
blueshifts and redshifts, astronomers can determine
not only the direction of motion of stars and galaxies (whether toward us or
away from us) but in addition their speed of motion (whether fast or
slow). We can even determine the rotational
speed of a star using the Doppler-Fizeau effect, as
we will discuss shortly.
Consider any wave propagating
in a certain medium that encounters a second medium. This wave is called
the incident wave. At the boundary
between the two media, a part of the wave will bounce back into the first
medium while the rest of the wave will be transmitted
into the second medium. The wave that
bounces back into the first medium is called the
reflected wave, while the wave that is transmitted into the second medium is
called the refracted wave. We will make
clear the meanings of the words reflection and refraction shortly. Any line perpendicular to the boundary
between the two media is called the normal to the
boundary, since the word normal in physics and engineering means
perpendicular. The angle between the
incident wave and the normal is called the angle of
incidence with the symbol θ1. The angle between the reflected wave and the
normal is called the angle of reflection with the
symbol θ3. The angle between the refracted wave and the
normal is called the angle of refraction with the
symbol θ2. The Law of Reflection states that θ1 = θ3
in all cases. In other words, the angle
of incidence is equal to the angle of reflection in all cases for all
waves. Reflection is the bouncing of a
part of a wave off of another medium with no change in
angle with respect to the normal. The
Law of Refraction states sin(θ1)/v1 =
sin(θ2)/v2, where v1 is
the speed of the wave in the first medium and v2 is the speed of the
refracted wave in the second medium. Refraction
is the bending of a wave due to a change in speed of the wave. According to the Law of Refraction, the angle
of refraction θ2 is smaller than the
angle of incidence θ1 if the
transmitted speed v2
is slower than the incident speed v1, while the angle of refraction θ2 is larger than the angle of incidence θ1 if the transmitted speed v2 is
faster than the incident speed v1. More
plainly, a wave is refracted (bent) toward
the normal if the transmitted wave propagates slower than the incident wave, while
a wave is refracted (bent) away from
the normal if the transmitted wave propagates faster than the incident wave.
When two or more waves of the
same nature (the same type) occupy the same space at the same time, they add
together to become a combined wave. This
is called interference. Caution: sometimes addition is indeed an
addition, but sometimes addition is actually a subtraction. For example, positive five added with
positive three yields positive eight, and negative five added with negative three
yields negative eight. However, positive
five added with negative three yields positive two. Notice that two is actually the difference
between five and three. Again, whereas
sometimes addition is indeed an addition, sometimes addition is actually a
subtraction. When two waves of the same
nature (the same type) occupy the same space at the same time, crest and crest
may meet, trough and trough may meet, or crest and trough may meet. The crests of a wave are locations of maximum
positive disturbance, and the troughs of a wave are locations of maximum
negative disturbance. When crest and
crest meet, the waves interfere with one another to become a combined wave with
an even greater positive disturbance, since adding positive numbers together
yields even greater positive numbers.
When trough and trough meet, the waves interfere with one another to
become a combined wave with an even greater negative disturbance, since adding
negative numbers together yields even greater negative numbers (in magnitude). Either of these scenarios is
called constructive interference, since the combined wave has a larger
amplitude than the individual waves that interfered to form the combined
wave. When crest and trough meet, the
waves interfere with one another to become a combined wave with a smaller
disturbance, since adding a positive number with a negative number actually
results in subtraction, such as adding positive five with negative three to
yield positive two. This scenario is called destructive interference, since the combined wave
has a smaller amplitude than some of the individual waves that interfered to
form the combined wave. Any type of
interference between these two extremes is called
mixed interference.
Consider many waves with many
different wavelengths (or frequencies) that interfere with one another to form
a combined wave. The speed of the
individual waves is called the phase speed, while the
speed of the combined wave is called the group speed. Suppose all the
individual waves move at the same speed.
So, these individual waves will all move together. The resulting combined wave will then move at
the same speed as the individual waves.
In other words, the group speed and the phase are equal to each
other. Now suppose instead that the
individual waves all move at different speeds.
The combined wave will then spread as faster-moving waves pull out ahead
of slower-moving waves that lag behind.
This is called dispersion. The verb to disperse means to spread
out. The combined wave spreads because
the individual waves all move at different speeds. The combined wave will also move at a
different speed. In other words,
dispersion is the spreading of a wave because the phase speed and the group
speed are different from each other.
When the individual waves all move at the same speed, they will all move
together, resulting in the combined wave also moving at that same speed. The combined wave does not spread, since all the individual waves move together. This is called no
dispersion. A rainbow results from the
dispersion of light within water or glass.
White light is a combination of individual light waves with different
wavelengths or frequencies (different colors).
When white light propagates within water or glass, the different colors
all propagate at different speeds. This
also causes these colors to refract (to bend) at different angles
which separates the individual colors from each other, thus forming a
rainbow.
Diffraction is the bending of
a wave, but this is a different bending than the refraction of a wave. Refraction is the bending of a wave due to a
change in speed of the wave, but diffraction is the bending of a wave without
involving a change in speed of the wave.
Diffraction is the bending of a wave around obstacles. However, if the wavelength of a wave is small
compared with the size of the obstacles, this diffractive bending will be
negligible, and the wave will seem to propagate in straight lines. This is why light from a flashlight or even
more so from a laser pointer appears to travel a straight path. The wavelength of visible light waves is so
small compared with the sizes of everyday obstacles around us that the
diffraction of visible light is not noticeable.
However, the wavelength of sound waves is not small compared with the
sizes of everyday obstacles around us, such as hallways and doorways. Thus, the diffraction of sound is quite
severe. This is why we can hear someone
speaking who is nevertheless standing around a corner. The wavelength of sound is not small compared
to the size of the hallway; therefore, the sound diffracts around the corner
instead of traveling a simple straight path.
As we discussed, a transverse
wave is a wave where the direction of the disturbance is perpendicular to the
direction of propagation. The
polarization of a transverse wave is the orientation and the motion of its
perpendicular disturbance. If the
disturbance is a vertical vibration, a horizontal vibration, or a diagonal
vibration, we say that the wave is linearly polarized. If the disturbance traces out the shape of an
ellipse, we say that the wave is elliptically polarized. A special case of an elliptically polarized
wave is a circularly polarized wave, since a circular shape is a special case
of an elliptical shape. We may interpret
an elliptically polarized wave as the interference (the combination) of two
appropriately linearly polarized waves.
Conversely, we may interpret a linearly polarized wave as the
interference (the combination) of two appropriately elliptically polarized
waves. Since light is a transverse wave,
light waves can be linearly polarized or elliptically polarized. A random combination of a large number of
light waves each with its own particular polarization results in unpolarized light.
In particular, light from the Sun (sunlight) is unpolarized
light.
Quantum Physics: Photons, Atoms, and Spectra
According to the wave theory
of light, radio, microwaves, infrared, visible light, ultraviolet, X-rays, and
gamma rays are electromagnetic waves.
However, according to the quantum theory of light, radio, microwaves,
infrared, visible light, ultraviolet, X-rays, and gamma rays are composed of
particles called photons. In other
words, a photon is a particle of the electromagnetic field. The energy of a photon is
determined by the equation Ephoton = h f, where h is the Planck constant, one of the
fundamental physical constants of the universe.
The Planck constant is named for the German
physicist Max Planck, one of the grandfathers of Quantum Mechanics, the correct
theory of molecules, atoms, and subatomic particles. According to this equation Ephoton
= h f, the energy of a photon is
directly proportional to the frequency.
Therefore, higher frequencies of light are composed of higher-energy
photons, and lower frequencies of light are composed of lower-energy
photons. Therefore, the Electromagnetic
Spectrum starting with the lowest photon energy is radio, microwaves, infrared,
visible light, ultraviolet, X-rays, and gamma rays at the highest photon energy. Notice that ultraviolet photons have greater
energy than visible light photons; this is why ultraviolet causes suntans and
sunburns. Also
notice that X-ray photons have even greater energy, so much so that they
penetrate most substances. This is why X-rays
are used to take X-rays! The subcategories of the visible light band
of the Electromagnetic Spectrum starting at the lowest photon energy are red,
orange, yellow, green, blue, indigo, and violet at the highest photon energy.
Since every blue photon in
the universe has more energy than every red photon in the universe, if we had
blue light composed of more photons and red light composed of fewer photons,
the blue light would certainly be brighter than the red light. If we had red light and blue light each
composed of the same number of photons, the blue light would still be brighter
than the red light, since each blue photon composing the blue light has more
energy than each red photon composing the red light. How could red light ever be equally as bright
as blue light? In this case, the red
light must be composed of significantly more photons than the blue light to
ensure that the total energy of the red light were equal to the total energy of
the blue light, even though each red photon actually has less energy than each
blue photon. Furthermore, if every blue
photon in the universe has more energy than every red photon in the universe,
could red light ever be even brighter than blue light? In this case, the bright red light must be
composed of many many more photons than the dim blue light to ensure that the
total energy of the red light is greater than the total energy of the blue
light even though each red photon actually has less energy than each blue
photon. In summary, the total energy of
light is equal to the number of photons multiplied by the energy of each
photon. This reveals a connection
between the wave theory of light and the quantum theory of light. According to the wave theory of light, the
total energy of light is directly proportional to the square of its
amplitude. According to the quantum
theory of light, the total energy of light is directly proportional to the
product of the number of photons and the frequency (since the energy of each
photon is directly proportional to the frequency through the equation Ephoton
= h f). We conclude the following connection between
the wave theory of light and the quantum theory of light: the square of the
amplitude of a light wave must be directly proportional to the product of the
number of photons that composes the light and the frequency of the light.
All materials in the universe
(such as solids, liquids, and gases) are composed of atoms. Atoms are composed of even smaller
particles. The center of the atom is called the nucleus, since the center of anything is often
called its nucleus. For example, the
center of a biological cell is called the cellular
nucleus, and the center of an entire galaxy is called the galactic
nucleus. The center of an atom is more properly called the atomic nucleus, but we will
often simply refer to it as the nucleus.
Surrounding the atomic nucleus are electrons. The atomic nucleus is positively charged, and
electrons are negatively charged. Like
charges repel, and unlike charges attract.
In other words, positive and positive repel, negative and negative
repel, and positive and negative attract.
It is the attraction between the positive nucleus and
the surrounding negative electrons that holds the atom together. The atomic nucleus is composed of even smaller
particles: protons and neutrons. Protons
are positively charged. In fact, it is
because of the protons that the entire atomic nucleus has a positive charge. Neutrons have zero electrical charge. In other words, neutrons are neutral. This is why they are called
neutrons!
The number of protons in the
nucleus is the single most important number of the atom. The number of protons in the nucleus is so
important that it is called the atomic number. The atomic
number, which is always the number of protons in the nucleus, is so important
that an atom is named solely based on its atomic
number. For example, every atom in the universe with twelve
protons in its nucleus is considered to be a magnesium
atom. As another example, every atom in the universe with seven
protons in its nucleus is considered to be a nitrogen
atom. We are not saying that the number
of neutrons is irrelevant, nor are we saying that the number of electrons is
irrelevant. The neutrons and the
electrons are quite important. We are saying
that the atomic number is always the number of protons, and the name of an atom
is based only upon its atomic number, the number of
protons.
Consider an atom where the
number of electrons balances the number of protons. Since protons are positive and electrons are
negative, the atom is neutral overall.
Now suppose we add extra electrons to the atom. Since electrons are negative, the atom will no
longer be neutral overall; the atom will now be negative overall. Suppose instead that we removed electrons
from the atom in the first place. Now
the atom will be positive overall. A
charged atom is called an ion. Therefore, changing the
number of electrons results in ions. For
example, consider the sodium atom with the symbol Na. The atomic number of sodium is eleven,
meaning that every sodium atom in the universe has eleven protons. We will make this clear with a subscript
before the atom’s symbol as follows: 11Na. If the sodium atom were neutral, it would
have eleven electrons as well, but suppose we add three more electrons. Since electrons are negative, we now have an
ion with a charge of negative three. We
write the charge as a superscript after the symbol of the atom as follows: 11Na3–. Even though
the charge is read negative three, the superscript is
written in the strange way 3–. As another example, consider the aluminum
atom with the symbol Al. The atomic
number of aluminum is thirteen, meaning that every aluminum atom in the
universe has thirteen protons. We make
this clear with a subscript before the atom’s symbol as follows: 13Al. If
the aluminum atom were neutral, it would have thirteen electrons as well, but
suppose we remove two of its electrons.
We now have an ion with a charge of positive two. We write the charge as a superscript after
the symbol of the atom as follows: 13Al2+. Even though the charge is
read positive two, the superscript is written in the strange way
2+. A positive ion is
called a cation, and a negative ion is called an anion. In other words, an anion (a negative ion) has
extra electrons, while a cation (a positive ion) is deficient (has lost)
electrons.
If we change the number of
neutrons, we do not get ions, since
neutrons are neutral. So, adding or
removing neutrons does not change the charge at all. If we change the number of neutrons, what we
are changing is the mass of the atom.
The atomic mass of an atom is
the number of protons plus the number of neutrons. We do not include the electrons when
calculating the mass of the atom because an electron is almost two thousand
times less massive than a proton or a neutron.
Thus, electrons contribute a minuscule amount to the mass of an
atom. A proton and a neutron have
roughly equal amounts of mass, which is why we count them equally. When we change the number of neutrons, we are
changing the atomic mass of the atom.
Two atoms with the same atomic number but different atomic mass are called isotopes. Therefore, changing the number of neutrons
results in isotopes. For example,
consider the carbon atom with the symbol C.
The atomic number of carbon is six, meaning that every carbon atom in
the universe has six protons. We make
this clear with a subscript before the symbol of the atom as follows: 6C.
However, carbon has three isotopes: carbon-twelve, carbon-thirteen, and
carbon-fourteen. An isotope is named based on its atomic mass. Thus, the numbers twelve, thirteen, and
fourteen are the atomic masses of these isotopes of carbon. We make this clear with a superscript before
the symbol of the atom as follows: for carbon-twelve, for carbon-thirteen, and for carbon-fourteen. Notice that carbon always has six protons,
but the carbon-fourteen isotope has eight neutrons, since six plus eight equals
fourteen. The carbon-thirteen isotope
has seven neutrons, since six plus seven equals thirteen. The carbon-twelve isotope has six neutrons, since six
plus six equals twelve. As another example, consider the oxygen atom with the symbol
O. The atomic number of oxygen is eight,
meaning that every oxygen atom in the universe has eight protons. We make this clear with a subscript before
the symbol of the atom as follows: 8O. However, oxygen has three isotopes:
oxygen-sixteen, oxygen-seventeen, and oxygen-eighteen. An isotope is named
based on its atomic mass. Thus, the
numbers sixteen, seventeen, and eighteen are the atomic masses of these
isotopes of oxygen. We make this clear
with a superscript before the symbol of the atom as follows: for oxygen-sixteen, for oxygen-seventeen, and for oxygen-eighteen. Notice that oxygen always has eight protons,
but the oxygen-eighteen isotope has ten neutrons, since eight plus ten equals
eighteen. The oxygen-seventeen isotope
has nine neutrons, since eight plus nine equals seventeen. The oxygen-sixteen isotope has eight neutrons, since
eight plus eight equals sixteen.
Let
us apply everything we have discussed about atoms to the following
examples. Consider the neon atom with
the symbol Ne. Now suppose we write 2–. This neon atom has ten protons, eleven
neutrons, twelve electrons, an atomic number of ten, an atomic mass of
twenty-one, and a charge of negative two.
As another example, consider the boron atom with the symbol B. (There are borons
in this class!) Now suppose we write 3+. This boron atom has five protons, four
neutrons, two electrons, an atomic number of five, an
atomic mass of nine, and a charge of positive three.
The two most important atoms
in this course are hydrogen and helium, since most of the atoms in the universe
are hydrogen atoms, and helium atoms are the second most abundant atom in the
universe. The symbol for the hydrogen
atom is H. The atomic number of hydrogen
is one, meaning that every hydrogen atom in the universe has one proton in its
nucleus. We make this clear with a
subscript before the symbol of the atom as follows: 1H. However, hydrogen has three isotopes:
hydrogen-one which is written ,
hydrogen-two which is written ,
and hydrogen-three which is written . Hydrogen is so important that these three
isotopes have additional names besides hydrogen-one, hydrogen-two, and
hydrogen-three. Hydrogen-one is also called protium.
It is also called ordinary hydrogen, since most
of the hydrogen atoms in the universe are this isotope. Hydrogen-two is also called
deuterium. It is also
called heavy hydrogen, since it is twice as massive as ordinary
hydrogen. (When an oxygen atom
chemically bonds to two ordinary hydrogen atoms, the result is a molecule of
ordinary water. When an oxygen atom
chemically bonds to two heavy hydrogen atoms, the result is a molecule of heavy
water.) Hydrogen-three is also called tritium.
The names protium, deuterium, and tritium are derived from the names of the nuclei of these
atoms. The atomic number of hydrogen is
one, meaning that every hydrogen atom in the universe has one proton in its
nucleus. This means that the
hydrogen-one isotope (or protium or ordinary
hydrogen) has no neutrons in its nucleus, since one plus
zero equals one. In other words, its
nucleus is a single proton all by itself.
This is the simplest nucleus in the universe. Since the nucleus is a proton, when we put an
electron around it to build the entire atom, we name the entire atom protium, since its nucleus is a proton. The hydrogen-two isotope (or deuterium or
heavy hydrogen) has one neutron in its nucleus, since one plus one equals
two. In other words, its nucleus is a
proton and a neutron stuck to each other.
A proton and a neutron stuck to each other is called
a deuteron. Since the nucleus is a
deuteron, when we put an electron around it to build the entire atom, we name
the entire atom deuterium, since its nucleus is a deuteron. The hydrogen-three isotope (or tritium) has
two neutrons in its nucleus, since one plus two equals three. In other words, its nucleus is a proton and
two neutrons all stuck to one another. A
proton and two neutrons all stuck to one another is called a triton. Since the nucleus is a triton, when we put an
electron around it to build the entire atom, we name the entire atom tritium,
since its nucleus is a triton. The
helium atom with the symbol He has an atomic number of two, meaning that every
helium atom in the universe has two protons in its nucleus. We make this clear with a subscript before
the symbol of the atom as follows: 2He. Most of the helium atoms in the universe are
the helium-four isotope which is written . Helium-four is also called
ordinary helium, since most of the helium atoms in the universe are this isotope. The nucleus of helium-four is composed of two
protons and two neutrons, since two plus two equals four. In other words, the nucleus of helium-four is
two protons and two neutrons all stuck to one another. Two protons and two neutrons all stuck to one
another is called an alpha particle. To
summarize, the nucleus of the protium atom is a
proton, the nucleus of the deuterium atom is a deuteron, the nucleus of the
tritium atom is a triton, and the nucleus of the ordinary helium atom is an
alpha particle.
Electrons do not orbit an atomic nucleus like planets orbit the Sun.
In fact, the electrons do not orbit at all; they exist in abstract
quantum-mechanical states that we will not discuss deeply in this course. We simply state that there are definite
energy levels within an atom. Some
levels are at lower energies, and other levels are at higher energies. If an electron wishes to change its energy
from a lower level to a higher level, it must absorb a photon, a particle of
light. However, not any photon will
accomplish this transition. The energy
of the photon absorbed must be exactly equal to the difference in energy
between the two levels. If an electron
wishes to change its energy from a higher level to a lower level, it must emit
(spit out) a photon, but not any photon will accomplish this transition. The energy of the photon emitted must be
exactly equal to the difference in energy between the two levels. Therefore, an atom can only absorb or emit
photons of very specific energies (or very specific frequencies or very
specific wavelengths). The list of all
the allowed photon energies (or frequencies or wavelengths) an atom is permitted to absorb is called the absorption spectrum of
the atom, and the list of all the allowed photon energies (or frequencies or
wavelengths) an atom is permitted to emit is called the emission spectrum of
the atom. The study and measurement of
spectra is called spectroscopy. Since different atoms have different energy
levels, every atom has its own unique spectrum, different from the spectra of
all other atoms. Therefore, the spectrum
of an atom is rather like its fingerprint, enabling us to
uniquely identify an atom. A
spectacular application of spectroscopy was the discovery of the Sun’s
composition, the discovery of which atoms compose the Sun. In the early nineteenth century (early 1800s), the Bavarian physicist Joseph von Fraunhofer discovered missing wavelengths in the Sun’s
light. These absorption lines within
sunlight were later called Fraunhofer
lines in his honor. By measuring the
wavelengths of these absorption lines and consulting a table of absorption
spectra, we can determine which atoms absorbed these missing wavelengths and
thus determine the composition of the Sun.
We discover that the Sun is composed of all the atoms across the entire
the Periodic Table of Elements, but not in equal amounts. Only two atoms account for close to one
hundred percent of the Sun’s mass; all the other atoms on the Periodic Table of
Elements account for only a tiny fraction (tiny percentage) of the Sun’s
mass. What are these two elements that
account for close to one hundred percent of the Sun’s mass? We discover from the Fraunhofer
lines in sunlight that hydrogen atoms account for roughly three-quarters
(roughly seventy-five percent) of the Sun’s mass. What about the remaining roughly one-quarter
(roughly twenty-five percent) of the Sun’s mass? During the nineteenth century (the 1800s), the wavelengths of the remaining absorption lines
were not found in any atom’s tabulated absorption
spectrum! Evidently, roughly one-quarter
(roughly twenty-five percent) of the Sun’s mass is composed of a new atom never
before discovered! This newly-discovered atom was called helium, named for Helios
the personification of the Sun in ancient Greek mythology. In the early twentieth century (early 1900s), helium was discovered on Earth as the product of
certain nuclear reactions, and today helium is in common use, in blimps and in
party balloons for example.
Nevertheless, helium was first discovered from
its absorption lines in the Sun’s light!
What is temperature? What do we mean when we say something is
hot? What do we mean when we say
something is cold? The temperature of an
object is a measure of the average energy of the atoms that compose that
object. In this course, we may assume
that the average energy of atoms corresponds to their average speed. In other words, the atoms of a hotter object are
moving relatively faster, whereas the atoms of a cooler object are moving relatively
slower. There are two scales of
temperature in common use: degrees fahrenheit and
degrees celsius.
However, neither degrees fahrenheit nor
degrees celsius are acceptable units of temperature,
since zero is in the wrong place in both of these scales. What do we mean by saying zero is in the
wrong place? If the temperature of an
object is a measure of the average speed of its atoms, then the coldest
possible temperature of our universe is the temperature at which all the atoms
of an object completely stop moving.
After all, there is no slower speed than not moving at all! The temperature at which all atoms completely
stop moving is commonly called the absolute zero of
temperature. However, this absolute zero
of temperature is not zero degrees fahrenheit nor is
it zero degrees celsius. Atoms are still moving at zero degrees fahrenheit, and atoms are still moving at zero degrees celsius. There are
negative temperatures on both of these scales (commonly called temperatures
below zero) where the atoms move slower still.
The absolute zero of temperature when all atoms completely stop moving
is exactly negative 273.15 degrees celsius or exactly
negative 459.67 degrees fahrenheit. A correct unit of temperature must assign the
number zero to the absolute zero of temperature. The simplest way to correct degrees celsius is to add 273.15 to all degrees celsius. Since absolute zero is negative 273.15
degrees celsius, then adding 273.15 would yield zero,
and all other temperatures would become positive. The simplest way to correct degrees fahrenheit is to add 459.67 to all degrees fahrenheit. Since
absolute zero is negative 459.67 degrees fahrenheit,
then adding 459.67 would yield zero, and all other temperatures would become
positive. When we correct the celsius scale by adding 273.15 to
all degrees celsius, the resulting correct units of
temperature are called kelvins. When we
correct the fahrenheit scale
by adding 459.67 to all degrees fahrenheit, the
resulting correct units of temperature are called rankines. To summarize, absolute zero temperature is
negative 273.15 degrees celsius or negative 459.67
degrees fahrenheit using
incorrect temperature scales, but absolute zero temperature is zero kelvins or
zero rankines using correct units of temperature. We will use kelvins throughout this
course. Although kelvins are a correct
unit of temperature, it is somewhat difficult growing accustomed to kelvins. For example, most humans consider 280 kelvins
to be uncomfortably cold, most humans consider 300
kelvins to be a comfortable room temperature, and most humans consider 320
kelvins to be uncomfortably hot.
The Third Law of
Thermodynamics states that it is impossible to cool an object to absolute zero
temperature in a finite number of processes.
It follows that every object in the universe has a temperature that is
warmer than absolute zero. Therefore,
every object in the universe has its atoms moving at some average speed. Since atoms are composed of protons,
neutrons, and electrons and since protons and electrons are charged, every
object in the universe radiates electromagnetic waves from its moving
atoms. The neutrons also contribute to
these electromagnetic waves. Although
neutrons are neutral, they still have some electromagnetic properties. The amount of energy radiated from a hot,
dense object often follows the blackbody spectrum, which is a continuous
spectrum with its primary radiation within a band of the Electromagnetic
Spectrum determined by the temperature of the object. In particular, hotter temperatures correspond
to higher photon energies (which are also at higher frequencies and shorter
wavelengths), while cooler temperatures correspond to lower photon energies
(which are also at lower frequencies and longer wavelengths). In other words, a hot, dense object’s primary
radiation is displaced as its temperature
changes. This is
called the Wien displacement law, named for the German physicist Wilhelm
Wien who discovered this relationship between a hot, dense object’s primary
radiation and the object’s temperature.
More precisely, the Wien displacement law states that the wavelength of
a hot, dense object’s primary radiation is inversely proportional to its
temperature, assuming we measure temperature with correct units such as kelvins
or rankines.
At extremely cold temperatures (close to absolute zero), objects radiate
primarily in the microwave band of the Electromagnetic Spectrum. At a few hundred kelvins (such as room
temperatures), objects radiate primarily in the infrared band of the
Electromagnetic Spectrum. At roughly one
or two thousand kelvins, objects radiate primarily red visible light. At roughly three or four thousand kelvins,
objects radiate primarily orange visible light.
At roughly five or six thousand kelvins, objects radiate primarily
yellow visible light. At roughly ten
thousand kelvins, objects radiate primarily blue visible light. At hundreds of thousands of kelvins, objects
radiate primarily in the ultraviolet band of the Electromagnetic Spectrum. At a couple million kelvins, objects radiate
primarily in the X-ray band of the Electromagnetic Spectrum. At tens of millions of kelvins, objects
radiate primarily in the gamma-ray band of the Electromagnetic Spectrum. Notice how hotter temperatures displace the
primary radiation to higher and higher photon energies (which are also higher
and higher frequencies and shorter and shorter wavelengths), while cooler
temperatures displace the primary radiation to lower and lower photon energies
(which are also lower and lower frequencies and longer and longer
wavelengths). We can demonstrate this
Wien displacement law by heating metal. A metal that is sufficiently hot radiates red visible
light. As the metal is
made even hotter, it radiates orange visible light. If the metal is made
hotter still, it radiates yellow visible light.
We can also demonstrate this Wien displacement law with a flame on a
stovetop. At the lowest setting, the
flame is red. At a slightly higher
setting, the flame is orange. At an even
higher setting, the flame is yellow, and the hottest part of the flame is
blue. The Sun is a yellow star, and from
that yellow color we can correctly estimate that the
surface temperature of the Sun is roughly six thousand kelvins. Stars throughout the universe that are red in
color are cooler than our Sun, stars that are blue in color are hotter than our
Sun, and stars that are yellow in color are roughly the same temperature as our
Sun. We must emphasize that we are
referring to the color that an object radiates because it is hot enough to be
radiating that color. Many objects have
various different colors even though they are all at room temperature, such as
red roses, yellow paint, green grass, and blue jeans. These objects are not radiating these colors;
these objects are reflecting these colors while absorbing all other
colors. We must be careful to make a
distinction between the color of an object simply
because it is reflecting that color versus the color of an object because it is
actually hot enough to be radiating that color.
A red pen is at room temperature, while a piece of charcoal glowing red
is at roughly one or two thousand kelvins of temperature!
We may summarize our
discussion of spectroscopy with the three Kirchhoff laws of spectroscopy, named
for the German physicist Gustav Kirchhoff.
The first Kirchhoff law of spectroscopy states that a dense object (such
as a solid, a liquid, or even a sufficiently dense gas) will radiate (or emit)
light at all wavelengths (or frequencies or photon energies). This is a continuous spectrum. The second Kirchhoff law of spectroscopy
states that a tenuous (low density) gas that is
sufficiently hot will radiate (or emit) light at particular wavelengths (or
frequencies or photon energies) that are unique to the particular gas. This is an emission spectrum. The third Kirchhoff law of spectroscopy
states that a tenuous (low density) gas that is sufficiently cool that is
subjected to a continuous spectrum will remove light at particular wavelengths
(or frequencies or photon energies) that are unique to the particular gas. This is an absorption spectrum. Generally, continuous spectra enable us to
determine the temperature of an object, whereas emission spectra and absorption
spectra enable us to determine the composition of an object.
Observational Astronomy
Since light is a wave, light
must obey the Law of Reflection θ1 = θ3 and the Law of Refraction sin(θ1)/v1 =
sin(θ2)/v2. A device that reflects light is called a mirror. A
device that refracts light is called a lens. Since most metals reflect light very well, we
can construct a mirror by coating a piece of glass with a metal (often
aluminum) and polishing the metal. Any
piece of glass may be regarded as a lens, since light will refract (bend) as it
is transmitted from the air into the glass and will refract (bend) again as it
is transmitted from within the glass back into the air. In the following discussion of mirrors and
lenses, we will assume that we are in the paraxial approximation. In this approximation, all light rays
incident upon a mirror or lens must be near the symmetry axis of the mirror or
lens. We can guarantee that we are in
the paraxial approximation by constructing the mirror or lens with a large
radius of curvature.
A curved mirror with its
center of curvature and its focal point facing toward the light incident upon
it is called a concave mirror. In this case, we say that the focal point is
in front of the mirror. A curved mirror
with its center of curvature and its focal point facing away from the light
incident upon it is called a convex mirror. In this case, we say that the focal point is
behind the mirror. In the paraxial
approximation, light incident upon a concave mirror will reflect and converge
at the focal point that is in front of the mirror. For this reason, a concave mirror is also called a converging mirror. Also in the paraxial approximation, light
incident upon a convex mirror will reflect and diverge away from the focal
point that is behind the mirror. For
this reason, a convex mirror is also called a
diverging mirror. A curved lens that is
thicker in its middle than it is at its edge is called
a convex lens. A curved lens that is
thinner in its middle than it is at its edge is called
a concave lens. In the paraxial
approximation, light incident upon a convex lens will refract and converge at
the focal point that is on the opposite side of the lens as the incident rays. For this reason, a convex lens is also called a converging lens. Also in the paraxial approximation, light
incident upon a concave lens will refract and diverge away from the focal point
that is on the same side of the lens as the incident rays. For this reason, a concave lens is also called a diverging lens. Notice that mirrors and lenses are completely
opposite in character. A concave mirror
is converging, but a concave lens is diverging.
A convex mirror is diverging, but a convex lens is converging. Even the geometry of convergence or
divergence (as the case may be) is opposite in character. In particular, light rays converge to a focus
on the same side as the incident rays for a converging mirror, but light rays
converge to a focus on the opposite side as the incident rays for a converging
lens. Also, light
rays diverge away from a focus on the opposite side as the incident rays for a
diverging mirror, but light rays diverge away from a focus on the same side as
the incident rays for a diverging lens.
A telescope is a device that
collects light from a large distant object.
We must never confuse a telescope with a microscope, which is a device
that collects light from a small nearby object.
When we define a telescope as a device that collects light from a large
distant object, we mean any type of light whatsoever. In other words, a telescope is a device that
collects from a large distant object electromagnetic waves (composed of
photons) from any band whatsoever across the Electromagnetic Spectrum. A telescope that collects radio waves is called a radio telescope.
A telescope that collects infrared is called an
infrared telescope. A telescope that
collects ultraviolet is called an ultraviolet
telescope. A telescope that collects
X-rays is called an X-ray telescope. A telescope that collects gamma rays is called a gamma-ray telescope. Caution: a telescope that collects visible
light is called an optical
telescope. We now realize that whenever
we use the word telescope in colloquial English, we probably mean to use the
term optical telescope, since there are other types of telescopes that collect
other forms of light that our eyes cannot see.
Whereas the optical telescope was invented more than four hundred years
ago, it has only been in recent decades that other types of telescopes have
been built thus giving astronomers a more complete understanding of the
universe by collecting light from across the entire Electromagnetic Spectrum
from astronomical objects such as stars and galaxies.
Optical telescopes are often divided into two categories: refracting telescopes
(or just refractors for short) and reflecting telescopes (or just reflectors
for short). Refracting telescopes use
lenses as their primary optical instruments, while reflecting telescopes use
mirrors as their primary optical instruments.
Refracting telescopes and reflecting telescopes each have their own
particular advantages and disadvantages, but astronomers have concluded in
recent decades that the advantages of reflectors far outweigh their
disadvantages and that the disadvantages of refractors far outweigh their
advantages. For example, dispersion
causes different colors of light to refract through a lens by different angles,
causing the final image to appear blurred with color. This is called
chromatic aberration. Refracting telescopes
suffer from chromatic aberration, since they use lenses as their primary
optical instruments. However, reflecting
telescopes do not suffer from chromatic aberration, since they use mirrors as
their primary optical instruments.
Mirrors reflect light, and the Law of Reflection states that the angle
of incidence is equal to the angle of reflection in all cases regardless of
color. As another example of an
advantage of reflectors and disadvantage of refractors, any small imperfection
within the glass composing a lens may render the entire refracting telescope
useless, since light must refract through the glass that composes the
lens. However, any imperfection within
the glass supporting a mirror is irrelevant to the operation of a reflecting
telescope, since light will not refract through the glass at all but will
instead reflect off of the metal (often aluminum) that
is coated on the surface of the glass.
Certainly, any imperfections within the metal coating will affect the
operation of the reflecting telescope, but such imperfections are much easier
to correct than imperfections within the glass of a lens. Since astronomers have concluded in recent
decades that reflectors are superior to refractors, all of the large optical
telescopes built in recent decades have been and continue to be reflecting
telescopes. Nevertheless, the first
optical telescopes ever built were small refractors. To build a primitive refracting telescope,
all that is required is two lenses with different focal lengths. The lens with the smaller focal length is placed closer to the eye; this is called the ocular lens,
commonly known as the eyepiece. The lens
with the larger focal length is placed further from the eye; this is called the
objective lens, since the light from the object we are observing will first
pass through that lens before passing through the ocular lens (the eyepiece)
and then finally into our eye. The two
lenses (objective and ocular) must be aligned with
each other so that they share the same symmetry axis. The distance between the two lenses must be
the sum of the two focal lengths of the lenses, and the magnification of the
resulting image when looking through the telescope is equal to the ratio of the
two focal lengths of the lenses. For
example, suppose we wish to build a small refracting telescope from two lenses,
one with a three-inch focal length and the other with a twelve-inch focal
length. The focal length of the ocular
lens is three inches, and the focal length of the objective lens is twelve inches. The distance between these two lenses must be
fifteen inches, since the sum of three and twelve is fifteen. (The word sum means addition. In this example, twelve plus three equals
fifteen.) Finally, everything observed
through this telescope will be magnified four times,
appearing to be four times larger or four times closer, since the ratio of
twelve to three is four. (The word ratio
means division. In this example, twelve
divided by three equals four.) If we
looked through this telescope the wrong way around, everything would appear to
be four times smaller or four times further, since the magnification would now
be one-fourth instead of four.
It is commonly known that the image of an astronomical object is magnified by a telescope. More precisely, the resolving power of a
telescope is the maximum extent to which the image formed by the telescope can
separate (or resolve) two distant objects.
For any telescope, diffraction (the bending of a wave around obstacles)
limits the resolving power of the telescope.
Hence, the resolving power of a telescope is directly
related to the size of the telescope’s primary optical instrument,
whether mirror or lens as the case may be.
A larger telescope (larger mirror or lens) will result in greater
resolving power (commonly known as higher magnification),
while a smaller telescope (smaller mirror or lens) will result in poorer
resolving power (commonly known as lower magnification). A telescope that achieves its maximum
resolving power as constrained by diffraction is said
to be at its own diffraction limit.
Any telescope on planet Earth
is called a ground-based telescope, while any
telescope in outer space (usually orbiting the Earth) is called a space
telescope. The Earth’s atmosphere is the
single most important factor that causes ground-based optical telescopes to
operate far below their maximum resolving power, far below their diffraction
limit. This is for several reasons. Firstly, light pollution is light from human
activities (such as city lights and highway lights) that adds brightness to the
night sky thus preventing astronomers from observing dim stars and
galaxies. Secondly, even in the absence
of light pollution, the Earth’s atmosphere continuously refracts incoming light
from outer space. This is why stars
appear to twinkle; the atmosphere’s continuous refraction of light is so severe
that even our naked eyes observe stars appearing to twinkle as a result! Note however that the planets in our Solar System
do not appear to twinkle in the sky. The
planets in our Solar System are hundreds of thousands of times closer to us
than the nearest stars besides our Sun, causing these planets to appear to have
a perceptible (noticeable) size in the sky.
Although this perceptible (noticeable) size is still small and although
the Earth’s atmosphere still continuously refracts the incoming light from
these planets, the blurring is nevertheless smaller than their perceptible
(noticeable) size in the sky, thus eliminating the twinkling effect, causing
these planets to appear steady (non-twinkling) in the sky. An extreme example of this lack of twinkling
is our Moon. The Moon is by far the
closest astronomical object in the entire universe. The Moon is so close to us that its size in
the sky is unmistakable. Although the
Earth’s atmosphere still continuously refracts the incoming light from our
Moon, the blurring is much smaller than its unmistakable size in the sky. Thus, the image of the Moon in the sky
appears to remain steady. Astronomers
use the word seeing to describe whether atmospheric conditions happen to be
favorable or unfavorable for astronomical observing. If atmospheric conditions happen to be
particularly steady thus minimizing blurring, astronomers say that the seeing
is good or excellent. If atmospheric
conditions happen to cause severe blurring, astronomers say that the seeing is
poor or bad. Furthermore, the Earth’s
atmosphere is opaque to certain wavelengths of light. In particular, the Earth’s atmosphere is
opaque to X-rays, as we will discuss later in the course. As a result, a ground-based X-ray telescope
would not collect any X-rays from stars or galaxies at all! All X-ray telescopes must therefore be space
telescopes. Similar difficulties arise
with infrared telescopes and ultraviolet telescopes, since certain gases within
the Earth’s atmosphere absorb far ultraviolet, while
other gases within the Earth’s atmosphere absorb infrared, as we will discuss
later in the course. For all of these
reasons, the National Aeronautics and Space Administration (NASA) has launched
several space telescopes, each one covering a different band of the
Electromagnetic Spectrum. These
telescopes are called the NASA Great Space
Observatories, since astronomers have gained a more complete understanding of
the universe using these space telescopes.
The Hubble telescope is the great optical space telescope (placed in
Earth orbit in 1990 and is still in operation).
The Compton observatory was the great gamma-ray space telescope (in
operation from 1991 to 2000). The
Compton observatory was replaced by the Swift
observatory (placed in Earth orbit in 2004 and is still in operation) and the
Fermi telescope (placed in Earth orbit in 2008 and is still in operation). The Chandra observatory is the great X-ray
space telescope (placed in Earth orbit in 1999 as is still in operation). The Spitzer telescope was the great infrared
space telescope (in operation from 2003 to 2020). The Spitzer telescope was
replaced by the Webb telescope (placed in solar orbit in 2022 and is still in
operation). The Cosmic Background
Explorer was the great microwave space telescope (in operation from 1989 to
1993). The Cosmic
Background Explorer was replaced by the Wilkinson telescope (in operation from
2001 to 2010). The Galaxy
Evolution Explorer was the great ultraviolet space telescope (in operation from
2003 to 2013). Although these space
telescopes have given astronomers a more complete understanding of the
universe, it is nevertheless expensive and dangerous to launch and to service
space telescopes. Therefore, astronomers
continue to build and use ground-based telescopes. There are many ground-based optical
telescopes much larger than the Hubble telescope for example. In order to minimize the negative effects of
the Earth’s atmosphere, ground-based astronomical observatories should always be built at astronomically optimal
locations. Firstly, ground-based
astronomical observatories should be built far from
cities to minimize light pollution and thus maximize the darkness of the night
sky. Secondly, ground-based astronomical
observatories should be built at high elevations, such
as on mountaintops. By operating
significantly above sea level, light from outer space will propagate through
less of the Earth’s atmosphere, thus reducing its negative effects. Thirdly, ground-based astronomical
observatories should be built in arid (dry) locations,
such as in deserts, to further reduce the negative effects of the Earth’s
atmosphere. In summary, ground-based
astronomical observatories should be built at locations that are dark (far from
cities) and high (on mountaintops) and dry (in
deserts).
In recent decades,
extraordinary technologies have been developed
enabling ground-based telescopes to match the quality of observations of space
telescopes. For example, instead of
using photographic film, photoelectric detectors together with computers permit
astronomers to digitally subtract the negative effects
of the Earth’s atmosphere and even to digitally correct for any imperfections
in the optical devices composing the telescope.
In particular, the charged-coupled device (CCD) is a small photoelectric
detector that can detect light (photons) with extraordinary efficiency and
sensitivity. The manufacture of these
charged-coupled devices (CCDs) eventually became so inexpensive that they are now integrated into every mobile telephone as a digital
camera, although these digital cameras have nowhere nearly the same sensitivity
as the charged-coupled devices (CCDs) used with telescopes. Adaptive optics is another extraordinary
technology that corrects for the continuous refraction of light caused by the
Earth’s atmosphere. This technology uses
deformable mirrors that continuously adjust their shapes based upon how light has been blurred by the Earth’s atmosphere. Astronomers use laser beams to continuously
probe the motion of the air to determine how the shape of the deformable mirror
is to be adjusted, thus forming images that have
eliminated the blurring of the atmosphere.
Astronomers use telescopes to
make a wide variety of measurements to study the universe. Astrometry is the precise measurement of the
position and the motion of astronomical objects as they appear on the Celestial
Sphere (as they appear in the sky). As
we discussed, this reveals not only the physical motion of stars and galaxies
through the universe but also the slow precession and nutation of the Earth’s
own axis of rotation. Photometry is the
measurement of the apparent brightness of astronomical objects (the brightness
as they appear in the sky). For
thousands of years before the invention of the telescope, photometric
measurements were made with the human eye, which can
modestly discriminate between somewhat brighter stars and somewhat dimmer
stars. Even after the telescope was invented, the human eye together with telescopes
continued to make photometric measurements for a few centuries. Photographic film together with telescopes
began to be used for photometric measurements roughly one century ago, and in
recent decades charged-coupled devices (CCDs) together
with telescopes have made very precise photometric measurements of
extraordinarily dim stars and galaxies.
Spectroscopy is the measurement of spectra, the measurement of wavelengths
(or frequencies or photon energies) within the light from astronomical
objects. As we discussed, the
measurement of either emission spectra or absorption spectra can reveal the
composition of an astronomical object, and the measurement of the primary
wavelength of a continuous spectrum can reveal the temperature of an
astronomical object. Spectroscopy also
reveals the motion of an astronomical object.
In particular, shifts in the wavelengths (or the
frequencies or the photon energies) of spectral lines (whether absorption or
emission) within the light of an astronomical object reveals not only the
direction of motion of the astronomical object (whether toward us or away from
us) but in addition its speed of motion (whether fast or slow), as determined
by the Doppler-Fizeau effect. Spectroscopy can even reveal the rotational
speed of a star. As a star rotates, one
side of it moves toward us, while the other side of the star moves away from
us. Hence, the light from one side of
the star will be blueshifted, while the light from
the other side of the star will be redshifted, again as determined by the
Doppler-Fizeau effect. As a result, any particular spectral line
(whether absorption or emission) will actually be broadened over a range of
wavelengths (or frequencies or photon energies). If the star is rotating faster, the blueshift and the redshift from either side of the star
will be more severe, resulting in more widely broadened spectral lines. If the star is rotating slower, the blueshift and the redshift from either side of the star
will be less severe, resulting in more narrowly broadened spectral lines. Therefore, the width of broadened spectral
lines (either absorption or emission) reveals the rotational speed of the star (whether
fast or slow). Polarimetry is the
measurement of the polarization of the light from astronomical objects. Interferometry is the use of more than one
telescope to measure the interference (constructive, destructive, or mixed)
among the light collected by the telescopes from the same astronomical
object. Interferometry increases the
resolving power of each telescope tremendously.
In principle, interferometry permits two telescopes to
together have the same resolving power as a single telescope with a size
equal to the distance between the two individual telescopes. For example, two telescopes on opposite sides
of planet Earth used together as a single interferometer would
in principle have the same resolving power as a single telescope the
size of planet Earth! In recent decades,
many astronomical observatories use not one telescope but a pair of telescopes
working together as a single interferometer.
However, the total amount of light collected by the two telescopes would
still be determined by the size of the two individual telescopes combined, not
the effective size of the entire interferometer. For this reason, ideally an interferometer
should be not just two telescopes but many telescopes working together to form
a single interferometer to increase the light gathering power of the entire
interferometer. A radio interferometer
for example often consists of dozens of radio telescopes all working together
as a single interferometer.
History of Astronomy from Ancient Astronomy to the Beginnings of Modern Astronomy
For thousands of years before
the invention of the telescope, humans looked up into the sky and tracked the
motion of the Sun, the motion of the stars, and the motion of a band of milk
wrapped around the entire sky they called the milky way. For thousands of years, humans watched the
Sun (during the daytime) rise in the eastern horizon and set in the western
horizon, and they watched the stars and the milky way
(during the nighttime) also rise in the eastern horizon and set in the western
horizon. Therefore, humans concluded
that the Earth is at the center of the universe, and everything else in the
universe moves around the Earth. The
first person to question whether or not this is actually the case was
Aristarchus of Samos, an ancient Greek astronomer who lived twenty-three
centuries ago. Using geometry, he
attempted to calculate the sizes of the Sun and the Moon relative to the size
of the Earth. He calculated that the
Moon is smaller than the Earth, and he calculated that the Sun is larger than
the Earth. Today, we know that
Aristarchus’s numerical results were significantly incorrect, since he made
some false assumptions in his calculations.
Nevertheless, we know today that the Moon is indeed smaller than the
Earth, and we know today that the Sun is indeed larger than the Earth. In other words, Aristarchus’s results may not
have been quantitatively correct, but
they were at least qualitatively
correct. Aristarchus then reasoned that
it made no sense for the Sun to move around the Earth if it was larger than the
Earth; he argued that it made more sense for the Earth
to move around the Sun. Aristarchus’s
Greek contemporaries persuaded him that the Earth could not be moving, since we
would then see stars appear to suffer parallax if the Earth were moving. Parallax is the apparent motion of an object
when in actuality the observer is moving.
Since the ancient Greeks did not see stars suffer parallax, they
continued to believe that the Earth is not moving and that everything else in
the universe, including the Sun, moves around the Earth. Today, we of course know that Aristarchus was
correct; the Earth does move around the Sun, but his Greek contemporaries were
also correct: stars must appear to suffer parallax due to the Earth’s motion
around the Sun. No one at the time
realized how incredibly distant stars truly are. Stars are so distant that our naked eyes
cannot observe the tiny parallax they appear to suffer. (The further distant a star, the smaller the
parallax.) These tiny parallax angles were finally measured during the Modern Ages thanks to the
invention of the telescope, but ancient humans did not have the ability to
measure these tiny angles. Since the
naked eye cannot see the parallax of stars, ancient humans continued to believe
that the Earth is at the center of the universe, and that everything else in
the universe moves around the Earth.
For thousands of years,
humans looked up into the sky and observed that stars appeared to remain fixed
relative to one another while continuously rising and setting. However, ancient humans also noticed seven
objects that do not remain fixed relative to the stars or even to each
another. These seven objects appeared to
wander around the sky. The Greek word
for wanderer transliterates to the English word planet. The seven planets (wanderers) of ancient
astronomy are the Sun, the Moon, Mercury, Venus, Mars, Jupiter, and
Saturn. Today, we know that the Sun and
the Moon are not truly planets, but the meaning of the word planet in ancient
astronomy was wanderer, and the Sun and the Moon do indeed appear to wander
around the sky. Also
note that ancient humans did not understand that the Earth is itself a
planet. The reason for this is obvious:
we look up to see the planets, but we look down to see the Earth! We will use the term ancient planets for
these seven objects so that we will not confuse them with the modern and
correct meaning of the word planet. For
thousands of years, humans observed only these seven ancient planets: the Sun,
the Moon, Mercury, Venus, Mars, Jupiter, and Saturn. For this reason, the number seven has
attained tremendous importance in many different cultures to the present
day. For example, seven is considered to be a lucky number, even to the present day. In addition, we continue to use to the
present day a calendar with seven days in a week, and each of these days is named for one of the ancient planets. This is most obvious for Sunday
which means day of the Sun, Monday which means day of the Moon, and
Saturday which means day of Saturn.
Thursday means day of Thor, who was the northern European mythological
analogue to Jupiter in ancient Roman mythology.
Friday means day of Frea, who was the northern
European mythological analogue to Venus in ancient Roman mythology. Tuesday means day of Týr,
who was the northern European mythological analogue to Mars in ancient Roman
mythology. Wednesday means day of Odin,
who was the northern European mythological analogue to Mercury in ancient Roman
mythology. Today we know that other
objects appear to wander around the sky, such as Uranus, Neptune, Pluto, Eris,
Ceres, and Vesta for example, but these objects are
too dim to be seen with the naked eye; they were not
discovered until the Modern Ages after the telescope was invented. Actually, it is possible to see Uranus with
the naked eye under ideal conditions. If
ancient humans had discovered Uranus, then there would have been eight ancient
planets instead of seven. As a result,
eight would have been considered a lucky number
instead of seven. Furthermore, we would
be using a calendar with eight days in a week instead of seven, and the eighth
day of the week would most certainly be called Uranusday!
For thousands of years,
humans noticed that Mercury, Venus, Mars, Jupiter, and Saturn at times appear
to slow down and stop and then turn around and move retrograde (backwards from
their usual motion) until slowing down and stopping again before continuing
with their original motion. Ancient
humans also noticed that the Sun and the Moon never move retrograde. The Greco-Roman astronomer Claudius Ptolemy
who lived nineteen centuries ago formulated a model of the universe to explain
these motions. Ptolemy’s model of the
universe was a geocentric model, meaning that it placed the Earth at the center
of the universe, as all humans believed at the time. Anyone who believes that the Earth is the
center of the universe is called a geocentrist. The words geocentric and geocentrist
are derived from the Greek root geo- for Gaia, the
personification of the Earth in ancient Greek mythology. Many other words are
derived from the same Greek root, such as geology, geologist, and
geography for example. According to
Ptolemy’s geocentric model, the Earth is at the center of the universe, and the
Moon moves along a simple circle around the Earth, since the Moon never appears
to move retrograde. Next comes Mercury
and Venus in this model. In order to
explain their occasional retrograde motions, Ptolemy claimed that they must be
moving around small circles (called epicycles) while at the same time moving
around large circles (called deferents) around the
Earth. Next comes the Sun, which
according to Ptolemy moves along a simple circle around the Earth, since the
Sun never appears to move retrograde.
Next comes Mars, Jupiter, and Saturn, which Ptolemy claimed must be
moving around small epicycles while at the same time moving around large deferents around the Earth to explain their occasional
retrograde motions. Finally, Ptolemy
claimed that the stars and the milky way were very far
from the Earth and fixed relative to one another. Although we know today that Ptolemy’s
geocentric model of the universe is not correct, this model nevertheless
predicted the motions of the ancient planets with fair reliability. Therefore, humans believed quite strongly in
Ptolemy’s geocentric model of the universe.
For the rest of the history of the Western Roman Empire, humans accepted
Ptolemy’s geocentric model of the universe.
Even after the Western Roman Empire crumbled, Europeans during the Middle Ages continued to believe in Ptolemy’s geocentric
model of the universe. Truth be told,
Middle-Age Europeans believed in Ptolemy’s geocentric model not
only because of its fair reliability but also fearing punishment from
the Catholic Church which had adopted this model as part of its doctrine.
Many historians agree that
the Modern Ages of human history begins roughly five centuries ago due to the
dramatic political, economic, social, artistic, religious, and scientific
changes that occurred. Artistically, the
geniuses of the Age of Renaissance began to create magnificent paintings,
sculptures, architecture, music, and literature. Politically, several long-term wars finally
concluded, including the Hundred Year War, the Wars of the Roses, the
Byzantine-Ottoman Wars, and the Granada War.
The adventurers of the Age of Exploration discovered and explored the
American continents, leading directly to major European powers expanding their
empires into the American continents.
The global trade that emerged led directly to the establishment of
modern banks. Societally, the invention
of the printing press resulted in the proliferation of books and hence
dramatically increased literacy.
Religiously, the leaders of the Protestant Reformation questioned the
doctrines and the authority of the Catholic Church. Mathematics and the sciences, astronomy in
particular, played an important role in these revolutionary changes in human
history.
Modern astronomy begins with
the work of the Polish astronomer Nicolaus Copernicus who lived five centuries
ago. Copernicus formulated a simpler
model of the universe than Ptolemy’s geocentric model. Copernicus’s model was a heliocentric model,
meaning that it placed the Sun at the center of the universe. Anyone who believes that the Sun is at the
center of the universe is called a heliocentrist. The words heliocentric and heliocentrist are derived from the
Greek root helio- for Helios, the personification of
the Sun in ancient Greek mythology. Many
other words are derived from the same Greek root, such
as helium, helioseismology, helioseismologist, and
helium for example. According to
Copernicus, Mercury and Venus move around the Sun along simple circles. Next comes the Earth, which Copernicus
claimed was a planet that also moves around the Sun along a simple circle. Next comes Mars, Jupiter, and Saturn, which
Copernicus claimed also move around the Sun along simple circles. If Mercury, Venus, Earth, Mars, Jupiter, and
Saturn all move around the Sun along simple circles, then how did the
heliocentric Copernicus model explain occasional retrograde motions? Copernicus claimed that whenever the Earth
moved passed another planet, it would appear as if the planet moved backwards
relative to the Earth when in fact the planet’s motion did not actually
change. Copernicus’s heliocentric model
of the universe is certainly simpler than Ptolemy’s geocentric model of the
universe. Nevertheless, Copernicus’s
heliocentric model did not predict the motions of the planets in the sky any
more reliably than Ptolemy’s geocentric model.
Therefore, Europeans continued to believe that the Earth is at the
center of the universe. Again truth be told, Europeans continued to believe in
Ptolemy’s geocentric model not only because of its fair reliability but also
fearing punishment from the Catholic Church.
Copernicus himself waited until he was dying from natural causes before
publishing his heliocentric model.
For thousands of years before
the invention of the telescope, humans built ancient observatories that used
large objects to point into the sky to track the motions of stars and ancient
planets. We will use the term ancient
observatory so as not to cause confusion with modern observatories, which use
telescopes. Examples of ancient
observatories include Stonehenge in England and the pyramids in Egypt and
Mexico. The Danish astronomer Tycho Brahe who lived more than four centuries ago built
such an observatory and spent decades of his life tracking the motions of the
ancient planets. He collected so much
data that he hired the German mathematician Johannes Kepler to analyze the
data. Tycho
Brahe died shortly thereafter, and Kepler then proceeded to use Brahe’s
measurements to attempt to prove with certainty that Copernicus was correct,
that the Earth together with the other planets do indeed move around the
Sun. Kepler did not succeed until he
abandoned the assumption that the planets move along simple circles. After abandoning this assumption, Kepler used
Brahe’s measurements to show with superb accuracy that the planets do indeed
move around the Sun. Moreover, he
formulated what we today call Kepler’s three laws of planetary motion: the Law
of Ellipses, the Law of Equal Areas, and the Law of Periods.
According to Kepler’s first
law, the Law of Ellipses, the planets (including the Earth) move around the Sun
along orbits that are ellipses. An
ellipse is an elongated circle with a major axis that is longer than and
perpendicular to its minor axis. Every
ellipse has two focal points (or foci) on its major axis; each focus is the
same distance from the center of the ellipse.
Half of the entire major axis of any ellipse is called its semi-major
axis, always denoted a; half of the
entire minor axis of any ellipse is called its semi-minor axis, always denoted b.
The Latin root semi- means half.
For example, a semicircle is half of a full circle, and a semiformal
event is halfway between a formal event and an informal event. Not only is the orbit of a planet around the
Sun an ellipse, but the Sun is not at the center of the ellipse; the Sun is at
one of the foci of the ellipse. (There
is nothing at the other focus.) Since
the Sun is at one of the foci of the elliptical orbit, there is only one point
on the elliptical orbit where the planet is closest to the Sun, called the
perihelion of the planet’s orbit. Also, there is only one point on the elliptical orbit where
the planet is furthest from the Sun, called the aphelion of the planet’s
orbit. The distance from the Sun to a
planet’s perihelion is called the perihelion distance
and is denoted rperihelion;
the distance from the Sun to a planet’s aphelion is called the aphelion
distance and is denoted raphelion. Notice that the sum of the perihelion
distance and the aphelion distance is equal to the entire major axis of the
orbit, which is twice the semi-major axis of the orbit. In other words, rperihelion + raphelion
= 2a. The time it takes a planet to move one
complete orbit around the Sun is called the orbital
period of the planet, denoted P. Notice that the orbital period of any planet
is the time it takes that planet to move from its perihelion all the way around
its orbit, returning to its perihelion.
The orbital period is also the time it takes the planet to move from its
aphelion all the way around its orbit, returning to its aphelion. According to Kepler’s second law, the Law of
Equal Areas, planets sweep out equal areas in equal times. It follows from this that a planet moves
faster while closer to the Sun (and fastest in fact at its perihelion) and
moves slower while further from the Sun (and slowest in fact at its
aphelion). According to Kepler’s third
law, the Law of Periods, the square of the orbital periods of all the planets
(including the Earth) around the Sun are all directly proportional to the cube
of the semi-major axes of the orbits of all the planets (including the Earth)
around the Sun. The orbital period of
the Earth around the Sun is one Earth-year (1 yr). The semi-major axis of the Earth’s orbit around
the Sun is called one astronomical unit (1 au), which
we know today is roughly equal to 150 million kilometers. Assuming we agree to measure the orbital
parameters of all planets around the Sun in terms of the Earth’s orbital parameters,
we may write Kepler’s third law as P2 = a3, where P
must be measured in Earth-years and a
must be measured in astronomical units.
We may apply Kepler’s laws to perform simple orbit calculations. For example, consider a hypothetical planet
orbiting the Sun that is six astronomical units from the Sun at its perihelion
and twelve astronomical units from the Sun at its aphelion. In other words, we are given that rperihelion
= 6 au and raphelion = 12 au. Thus, the entire major axis of the orbit is
eighteen astronomical units, since six plus twelve equals eighteen. Therefore, the semi-major axis of the orbit
is nine astronomical units, since half of eighteen equals nine. In other words, a = 9 au.
Since Kepler’s third law states P2 = a3, we have now discovered that P2 = 729,
since the cube of nine equals seven hundred and twenty-nine. Finally, we conclude that the orbital period
of this hypothetical planet around the Sun is twenty-seven Earth-years, since
the square-root of seven hundred and twenty-nine
equals twenty-seven. In other words, P = 27 yr for
this hypothetical planet. Although
Kepler deduced these three laws from Brahe’s measurements, he could not explain
why any of these laws are true. Although
Kepler was a brilliant mathematician, he was nevertheless an empiricist, not a
physicist. An empiricist makes
observations and deduces mathematical relationships that describe those
observations but has no answers for deeper questions about those mathematical
relationships. In particular, an
empiricist cannot explain why those mathematical relationships are true. A physicist on the other hand seeks and
discovers answers to these deeper questions.
In particular, a physicist seeks and discovers reasons why certain
mathematical relationships that describe the universe are true. Since Kepler was an empiricist but not a
physicist, he could not explain why his own laws of planetary motion are
true. It would be Isaac Newton who would
offer explanations for not just these three laws but for far more than Kepler or anyone else could ever dream any single person
could ever accomplish. All these
thousands of years of astronomical history are leading up to the person who
would become the first true physicist, Isaac Newton.
The Italian astronomer
Galileo Galilei read about a new invention: the (optical) telescope. He built his own telescope after reading
about this new invention, and in the year 1609 he
became the first person to ever make telescopic observations of the sky. His discoveries were breathtaking. Galileo Galilei discovered mountains and
craters on the Moon. Galileo Galilei
discovered sunspots on the Sun. (Never ever observe the Sun through a telescope. Never ever observe
the Sun through binoculars. Never ever observe the Sun even with the naked eye. Solar observations done incorrectly causes
permanent blindness.) Galileo Galilei
discovered four moons orbiting around Jupiter that were later
named the Galilean moons in his honor.
(Today we know that Jupiter has roughly one hundred moons. Only four of them are large enough to be
visible through a primitive telescope.)
Galileo Galilei discovered rings around Saturn. (His telescope was too primitive to see that
they are actually rings. He speculated
that they were moons around Saturn.)
Galileo Galilei discovered phases of Venus analogous to the phases of
the Moon, including full, half, crescent (less than half), and gibbous (more
than half). Galileo Galilei discovered
that the milky way is not in fact milk; the milky way is innumerable stars
sufficiently crowded together in the sky that with the naked eye all of their
light blends together so as to appear to be milk. Only a telescope can produce enough
magnification to reveal all of these magnificent discoveries. The phases of Venus can only be correctly explained
if Venus moves around the Sun, not the Earth.
Moreover, the discovery of four moons orbiting around Jupiter revealed
that Jupiter is the center of its own mini-universe, further proving that the
Earth is not the center of everything.
For all of these discoveries and for his formulation of the scientific
method, we will regard Galileo Galilei as one of the grandfathers of
logical/mathematical/scientific reasoning.
Other grandfathers of logical/mathematical/scientific reasoning include
the British philosopher Francis Bacon, the French mathematician René Descartes,
and the British philosopher John Locke.
The Newtonian Model of the Universe
The British mathematician and
physicist Isaac Newton was among the most brilliant persons who have ever lived. Isaac Newton discovered calculus (advanced
mathematics) and became the first physicist when he discovered three universal
laws of motion and the law of universal gravitation. Physics is the search for, the study of, and
the application of the mathematical equations that describe the universe; these
mathematical equations are called the laws of
physics. Isaac Newton accomplished all
of these achievements between the 1670s and the 1680s and published them in his textbook Philosophić Naturalis
Principia Mathematica (Mathematical
Principles of Natural Philosophy) or the Principia for short. In this
textbook, Isaac Newton presented what is today called
the Newtonian model of the universe. For
all of these reasons, we will regard Isaac Newton as the true father of
logical/mathematical/scientific reasoning.
We begin our discussion of the Newtonian model of the universe with
Newton’s three universal laws of motion: the Law of Inertia, the Law of
Acceleration, and the Law of Action-Reaction.
A force is a push or a
pull. For most of human history, humans
believed that force causes motion.
Sadly, most humans even today still believe that force causes motion,
but this misunderstanding is forgivable.
Often in our daily experience, objects seem to move only while we push
or pull them, and objects seem to stop moving when we stop pushing or pulling
them. We are tempted to conclude that
force does indeed cause motion; nevertheless, this conclusion is false. Consider the following experiment. If we give an object a brief push or pull and
observe carefully, we notice that the object continues to move even after our
brief push or pull has ended; the object continues moving even when we are no
longer pushing or pulling it! Of course,
objects often slow down and eventually stop moving when we have stopped pushing
or pulling them, but if we observe carefully we discover that
it is other forces that cause the object to slow down and eventually stop
moving. Friction is often the
force that is responsible for slowing down and eventually stopping objects from
moving. If we give a brief push or pull
on an object that happens to be on a smooth surface, we notice that the object
continues to move for a longer distance and a longer time after our brief push
or pull has ended, since there is less friction from the smooth surface to slow
down the object. Bowling lanes are so
smooth that a bowling ball travels the entire length of the bowling lane while
slowing down only a small amount; in fact, the bowling ball only stops moving
because it strikes the pins or the back wall at the very end of the bowling
lane. Notice that the bowling ball
continues moving along the entire bowling lane even though nothing is pushing
or pulling it in the direction that it moves.
We are compelled to conclude that if there were zero forces (no pushing
or pulling), an object would continue to move without slowing down at all, and
certainly without speeding up and certainly without changing the direction that
it moves. We conclude that force does
not cause motion, since we have carefully observed that objects actually
continue to move in the absence of force.
This is Newton’s first law of motion, the Law of Inertia. Force does not cause motion, since an object
moves in a straight line at a constant speed when there is zero force pushing
or pulling the object. Certainly, an
object at rest (not moving) will remain at rest without forces pushing or
pulling the object, but an object in motion will remain in motion along a
straight line at a constant speed without forces pushing or pulling the object.
If force does not cause
motion, this begs the following question: what does force do? What does pushing or pulling an object
accomplish? This
question is answered by Newton’s second law of motion, the Law of Acceleration. According to this law, force (pushing or
pulling) causes changes in the motion
of an object. This change may be
speeding up the object, slowing down the object, changing the direction the
object moves, or combinations of these changes.
Physicists use the word acceleration
for the rate at which an object’s motion changes, but we must always remember
that this could be any change in motion whatsoever. An object that is speeding up is said to be accelerating,
but an object that is slowing down is also said to be accelerating. (In colloquial
English, we would use the word decelerating instead.) Moreover, an object that is neither speeding
up nor slowing down but only changing the direction that it moves is also said to be accelerating. Newton’s second law of motion is written
mathematically as ,
where is the net force (the sum of all the forces)
acting on the object, is the acceleration (the rate of any change in
motion whatsoever) of the object, and m
is the mass of the object. Notice that
the net force is directly proportional to the acceleration, meaning that
stronger forces cause greater accelerations and weaker forces cause smaller
accelerations. If we solve this equation
for the acceleration, we conclude ,
and we see that the acceleration is inversely proportional to the mass of the
object. In other words, larger-mass
objects suffer smaller accelerations from a force (a push or a pull), while
smaller-mass objects suffer larger accelerations from
a force (a push or a pull). This stands
to reason. For example, a baseball
struck with a baseball bat will suffer a large acceleration since the baseball
has a small mass, but a car struck with the same baseball bat will suffer a
small acceleration since the car has a large mass.
According to Newton’s third
law of motion, the Law of Action-Reaction, if one object exerts a force on
another object, then the second object must also exert a force on the first
object that is equal in magnitude but opposite in direction. In some circumstances, this law is
intuitive. For example, if we lean
against a wall, we are obviously exerting a force (pushing) the wall. Therefore, the wall must also exert a force
on us, and this is the force that prevents us from falling
over as we lean against the wall.
However, there are other circumstances when Newton’s third law of motion
is counterintuitive. For example, if a
truck strikes a pedestrian, the pedestrian will be killed
with body parts everywhere while the truck barely suffers a scratch. Did the truck exert a greater force on the
pedestrian or did the pedestrian exert a greater force on the truck? According to Newton’s third law of motion,
the Law of Action-Reaction, the answer is neither. The force that the truck exerted on the
pedestrian was equal (in magnitude) to the force that the pedestrian exerted on
the truck, but how can this be the case if the pedestrian was killed with body
parts everywhere while the truck barely suffered a scratch? The resolution of this paradox is as
follows. The pedestrian has a very small
mass (as compared with the truck, which has a much larger mass) which caused
the pedestrian to suffer a very large acceleration (as compared with the truck,
which suffers a very small acceleration).
When a pedestrian is killed by a moving car or truck, they
were killed not only by the force from the car or truck; the pedestrian
was also killed by their own small mass, resulting in a large acceleration.
The Newtonian model of the
universe is Newton’s three universal laws of motion together with Newton’s law
of universal gravitation. According to
Newton’s law of universal gravitation, everything in the universe attracts
everything else in the universe. This is
difficult to believe. We do not observe tables and chairs and humans and mobile telephones
attracting each other; we only observe planets, moons, and stars causing
attractions. (Actually, we do observe humans and mobile telephones
attracting each other quite strongly, but this strong attraction is not
gravitational!) Nevertheless, Newton’s
law of universal gravitation is correct: everything in the universe attracts
everything else in the universe. We do
not notice tables and chairs and humans and mobile
telephones attracting each another gravitationally because gravity is by far by
far by far by far the weakest force in the entire universe. Gravity is so weak that we never notice
gravitational attractions among tables and chairs and humans and mobile
telephones. We only notice gravitational
attractions from gargantuan objects, such as planets, moons, and stars. Even in this case, gravity is noticeably
weak. Every time we walk up a staircase,
we are effortlessly defying planet Earth’s gravitational attraction! Newton’s law of universal gravitation can be written in mathematical form. Consider any two objects whatsoever, one with
mass m1
and the other with mass m2,
and suppose the distance between these two objects is r. The gravitational force
(attraction) between these two objects is directly proportional to the product
of their masses and is inversely proportional to the square of the distance
between them. Mathematically, Newton’s
law of universal gravitation is written ,
where Fgravitational
is the gravitational force (attraction) between the two objects. Also, the symbol G is called Newton’s gravitational
constant of the universe, which is an example of a fundamental God-given
constant of the universe. Within the
mathematical equations that describe the universe (the laws of physics), there
are certain fixed constants (fixed numbers).
These fixed constants (fixed numbers) are called the fundamental
God-given constants of the universe.
Most physicists agree that the three most fundamental God-given
constants of the universe are the vacuum speed of light (always written with
the symbol c), the Planck constant
(always written with the symbol h),
and Newton’s gravitational constant of the universe (always written with the
symbol G). Each and every one
of the fundamental God-given constants of the universe has an absolutely fixed
value, having that same value everywhere in the universe and everywhen in the universe.
(The word everywhen means at all times in the
past, present, and future.) Newton’s gravitational
constant of the universe is roughly equal to 6.67×10–11
assuming that we agree to measure all masses in kilograms, all distances in
meters, and all times in seconds. This
number 6.67×10–11 is incredibly small: 10–3 is one
thousandth, 10–6 is one millionth, 10–9 is one
billionth, and 10–11 is even smaller than one billionth. We now realize why gravity is by far by far
by far by far the weakest force in the entire universe. Whenever we calculate a gravitational force
(attraction), we must multiply by Newton’s gravitational constant of the
universe G, which is so incredibly
small that the resulting gravitational attraction is incredibly weak. This is why we never notice gravitational
attractions among tables and chairs and humans and mobile telephones. The only hope we have of ever feeling
gravitational attractions is from gargantuan objects, such as planets, moons,
and stars, and even in this case gravity is noticeably
weak.
According to Newton’s law of
universal gravitation, the gravitational force (attraction) between any two
objects in the universe is directly proportional to the product of their
masses. For example, if we double both
masses, the gravitational force (attraction) strengthens by a factor of four,
since the product of two and two is four.
(The word product means multiplication.)
As another example, if we triple both masses, the gravitational force
(attraction) strengthens by a factor of nine, since the product of three and
three is nine. As yet another example,
if we double one mass and triple the other mass, the gravitational force
(attraction) strengthens by a factor of six, since the product of two and three
is six. If we double only one of the
masses, the gravitational force (attraction) strengthens by a factor of two, since
the product of two and one is two. If we
triple only one of the masses, the gravitational force (attraction) strengthens
by a factor of three, since the product of three and one is three. Also according to Newton’s law of universal
gravitation, the gravitational force (attraction) between any two objects in
the universe is inversely proportional to the square of the distance between
them. This means that increasing the distance between two
objects weakens the gravitational
force (attraction) between them, while decreasing
the distance between two objects strengthens
the gravitational force (attraction) between them. This stands to reason; we expect the
attraction between objects to be stronger when they are closer together, and we
expect the attraction between objects to be weaker when they are further
apart. For example, if we triple the
distance between two objects, the gravitational force (attraction) weakens by a
factor of nine, since three squared equals nine. As another example, if we quadruple the
distance between two objects, the gravitational force (attraction) weakens by a
factor of sixteen, since four squared equals sixteen. As yet another
example, if we double the distance between two objects, the gravitational force
(attraction) weakens by a factor of four, since two squared equals four. If we cut the distance between two objects
down to one-third, the gravitational force (attraction) strengthens by a factor
of nine. If we cut the distance between
two objects down to one-fourth, the gravitational force (attraction)
strengthens by a factor of sixteen. If
we cut the distance between two objects down to one-half, the gravitational
force (attraction) strengthens by a factor of four. We can put all of this together with the
following amusing example: if we double one mass, octuple
the other mass, and quadruple the distance, the gravitational force
(attraction) does not strengthen or weaken; it remains the same strength!
Isaac Newton combined his
three universal laws of motion with his law of universal gravitation, and using
calculus (which he also discovered) he proceeded to mathematically explain
(what was believed at the time to be) everything that had ever been observed in
the universe. This model of the universe
is called the Newtonian model of the universe, and
physicists regard it as the first mathematically correct description of the
universe. We now discuss some of the
greatest achievements of the Newtonian model of the universe. Firstly, Newton explained why Kepler’s three
laws of planetary motion are true, but he went even beyond this. Newton generalized Kepler’s three laws of
planetary motion, showing mathematically that Kepler’s own formulation of his
own planetary laws was not precisely correct.
According to Kepler’s first law
as Kepler formulated it, the orbits of the planets around the Sun are
ellipses. According to
Newton, it is not just planets orbiting the Sun that should have elliptical
orbits. If gravitation is indeed
universal, then the orbit of anything around anything else (such as a moon
orbiting a planet) should also be an ellipse, but Newton went even beyond
this. He proved mathematically that the
orbit could be a circle, an ellipse, a parabola, or a hyperbola. The total energy of the two attracting
objects determines the shape of the orbit.
If the total energy of the two attracting objects is sufficiently large,
then the two objects will not remain bound to each other; they will escape from
each other’s gravitational attraction.
The orbit in this case will be a parabola or a hyperbola. Hence, parabolic orbits and hyperbolic orbits
are called unbound
orbits. However, if the total energy
of the two attracting objects is not this large, then the two objects will
remain bound to each other; they cannot escape from each other’s gravitational
attraction. The orbit in this case will
be a circle or an ellipse. Hence,
circular orbits and elliptical orbits are called bound orbits. Circles, ellipses, parabolae,
and hyperbolae are all conic sections, the intersection of a cone and a
plane. Conic sections are
classified using a variable called the eccentricity. A circle is a conic section with an
eccentricity equal to zero, an ellipse is a conic section with an eccentricity
anywhere between zero and one, a parabola is a conic section with an
eccentricity equal to one, and a hyperbola is a conic section with an
eccentricity anywhere greater than one.
In summary, whereas Kepler’s formulation of his own first law states
that the orbit of a planet around the Sun is an ellipse, Newton’s formulation
of Kepler’s first law states that the orbit of anything around anything else is
a conic section.
According to Kepler’s second
law as Kepler formulated it, planets sweep out equal areas in equal times,
since a planet moves faster while closer to the Sun (fastest at its perihelion)
and moves slower while further from the Sun (slowest at its aphelion). According to Newton, it is
not just planets orbiting the Sun that should sweep out equal areas in equal
times. If gravitation is indeed
universal, then anything orbiting anything else (such as a moon orbiting a
planet) should also sweep out equal areas in equal times, but Newton went even
beyond this. He proved mathematically
that equal areas are swept in equal times because of
the Conservation of Angular Momentum, another law of physics that he
discovered. Ice skaters spin faster when
they pull their arms in, and ice skaters spin slower when they pull their arms
out. This ensures that the angular
momentum of the ice skater remains conserved (remains constant). Rather like ice skaters who
spin faster when they pull their arms in, planets orbiting a star (or moons
orbiting a planet) speed up as they move closer to their attractor, and rather
like ice skaters who spin slower when they pull their arms out, planets
orbiting a star (or moons orbiting a planet) slow down as they move further
from their attractor. This
continuously changing speed keeps the angular momentum conserved, and Newton
proved mathematically that this causes equal areas to be
swept in equal times. In summary,
whereas Kepler’s formulation of his own second law states that planets sweep
out equal areas in equal times, Newton’s formulation of Kepler’s second law
states that the angular momentum of anything orbiting anything else must remain
conserved.
According to Kepler’s third
law as Kepler formulated it, the square of the orbital periods of all the
planets around the Sun are directly proportional to the cube of the semi-major
axes of the orbits of all the planets around the Sun. According to Newton, it is not just planets
orbiting the Sun where this direct proportionality should be true. If gravitation is indeed universal, then the
square of the orbital periods should always be directly proportional to the
cube of the semi-major axes of the orbits.
This direct proportionality should also be true for moons orbiting a
planet for example, but Newton went even beyond this. He proved mathematically
that this direct proportionality actually states , where P is
the orbital period, π is roughly
equal to 3.141 592 653 589 793 238 462 643 383 279 502 884 197 169 399 375 105 820 974 944 592, G is
Newton’s gravitational constant of the universe, M is the total mass, and a
is the semi-major axis of the orbit. With Newton’s formulation of Kepler’s third
law, the units of a need not be
astronomical units, and the units of P
need not be Earth-years. All that is
required is for the units of P, a, M,
and G to all be consistent with one
another. For example, we may use G = 6.67×10–11 if we
agree to measure P, a, and M in seconds, meters, and kilograms, respectively. Using this equation, astronomers have
calculated the mass of the Sun from the orbital parameters of anything orbiting
the Sun, such as planets, asteroids, and comets. Using this equation, astronomers have
calculated the mass of the Earth from the orbital parameters of anything
orbiting the Earth, such as the Moon and artificial satellites. Using this equation, astronomers have
calculated the mass of Jupiter from the orbital parameters of the moons
orbiting Jupiter. In fact, it is not an
exaggeration to say that the only way astronomers can accurately calculate the
mass of any object in the universe (such as a star, planet, or moon) is to use
this equation. In
summary, whereas Kepler’s formulation of his own third law states that P2 = a3 for
the planets around the Sun where P
must be measured in Earth-years and a
must be measured in astronomical units, Newton’s formulation of Kepler’s third
law states that for anything in
orbit around anything else, where P, a, M,
and G may be measured in any units
that are consistent with one another.
For most of human history,
humans believed that heavier objects fall faster than lighter objects. Sadly, most humans even today still believe
that heavier objects fall faster than lighter objects. In actuality, everything, no matter how heavy
or how light, falls toward the Earth with the same acceleration near the
surface of the Earth: 9.8 meters per second per second downward. Caution: this is only the case when all
non-gravitational forces such as air resistance can be
ignored as compared with the gravitational force. We can demonstrate this by dropping a heavy
object and a light object at the same time from the same height, such as a
textbook and a pencil. Both will hit the
ground at the same time even though the book is hundreds of times heavier than
the pencil! Although Galileo Galilei
first demonstrated that this is true, it was Isaac Newton who
explained mathematically why this is
true. Actually, Newton went even beyond
this; he proved mathematically that everything, no matter how light or how
heavy, falls toward any planet, moon, or star with the same acceleration,
ignoring all non-gravitational forces such as air resistance as usual. Isaac Newton discovered the
following equation for this acceleration due to gravity for any planet, moon,
or star in the universe: g = GM / R2, where g is the acceleration due to gravity
near the surface of the planet, moon, or star, G is Newton’s gravitational constant of the universe, M is the mass of the planet, moon, or
star, and R is the radius of the
planet, moon, or star. In other
words, the acceleration due to gravity near the surface of any planet, moon, or
star in the universe is G multiplied
by the mass of the planet, moon, or star and divided by the square of the
radius of the planet, moon, or star. For
example, if we multiply G by the mass
of the Earth and divide by the square of the radius of the Earth, the answer is
9.8 meters per second per second! As
another example, if we multiply G by
the mass of the Earth’s Moon and divide by the square of the radius of the
Earth’s Moon, the answer is 1.6 meters per second per second, which is only
one-sixth of the acceleration due to gravity near the surface of the
Earth. The Earth’s Moon has virtually no
atmosphere and therefore virtually no air resistance. Consequently, Newton’s equation was tested on the Earth’s Moon in a dramatic way. Fifty years ago, one of the astronauts on the
Moon dropped a hammer and a feather, and both fell with the same acceleration;
both hit the Moon’s ground at the same time!
As yet another example, if we multiply G by the mass of Mars and divide by the
square of the radius of Mars, the answer is 3.7 meters per second per second,
which is only one-third of the acceleration due to gravity near the surface of
the Earth.
Perhaps the most brilliant of
Newton’s achievements was the explanation of the tides. Sometimes the ocean is at flood tide (high
tide); sometimes the ocean is at ebb tide (low tide). Why do the tides occur? Newton proved mathematically that any object
exerts different gravitational forces (attractions) across
any other object due to the varying distances between different parts of the
two objects from each other.
Parts of the two objects that are closer to each other feel stronger
attractions, while parts of the two objects that are further from each other
feel weaker attractions. The differences in the gravitational forces
(attractions) across an object are called tidal forces, because they cause the
tides in the ocean. The Moon and the Sun
each exert roughly equal tidal forces on the Earth’s oceans causing them to
bulge, resulting in flood tides and ebb tides every day. When the Earth, the Moon, and the Sun happen
to form a nearly straight line (this occurs during New Moon or Full Moon), the
lunar tidal force and the solar tidal force reinforce each other. If we interpret the tides as waves, then
during this straight-line configuration, the lunar tidal crests and the solar
tidal crests meet each other, while the lunar tidal troughs and the solar tidal
troughs also meet each other. This is
constructive interference, resulting in severely high flood tides and severely
low ebb tides. These are
called the spring tides. When the
Earth, the Moon, and the Sun happen to form a nearly right angle (this occurs
during First Quarter Moon or Third Quarter Moon), the lunar tidal force and the
solar tidal force counteract each other.
If we interpret the tides as waves, then during this right-angle
configuration, the lunar tidal crests and the solar tidal troughs meet each
other, while the lunar tidal troughs and the solar tidal crests meet each
other. This is destructive interference,
resulting in modest flood tides and modest ebb tides. These are called the
neap tides. The tidal range is the wave
height of the tides, the difference between the flood tide and the ebb
tide. The spring tides have the largest
tidal range, since the flood tide is very high and the ebb tide is very low,
resulting in a large difference between them.
The neap tides have the smallest tidal range, since the flood tide is
modestly high and the ebb tide is modestly low, resulting in a small difference
between them. The Moon’s orbital period
around the Earth is roughly one month.
In fact, the word month is derived from the
word moon. If we take the word month,
remove the suffix -th, and
insert an extra letter o, we obtain the word moon! One month is roughly four weeks. If today is New Moon, we will have spring
tides with the largest tidal range (severely high flood tides and severely low
ebb tides). Roughly
one week later will be First Quarter Moon, and we will have neap tides with the
smallest tidal range (modest flood tides and modest ebb tides). Roughly one week
later will be Full Moon, and we will have spring tides again with the largest
tidal range (severely high flood tides and severely low ebb tides). Roughly one week
later will be Third Quarter Moon, and we will have neap tides again with the
smallest tidal range (modest flood tides and modest ebb tides). Roughly one week
later, we have returned to New Moon, roughly four weeks since the previous New
Moon. For thousands of years, humans
already noticed that there is a correlation between the changing appearance of
the Moon in the sky and the changing tidal range in the ocean, but it was Isaac
Newton who explained mathematically why this is the case. The lunar tidal force and the solar tidal
force not only cause the Earth’s oceans to bulge, but they also cause the shape
of the solid Earth itself to bulge. The
shape of the solid Earth itself suffers flood tides and ebb tides every day. When the solid Earth itself suffers a flood
tide, we are slightly further from the center of the Earth. Later when the solid Earth itself suffers an
ebb tide, we are slightly closer to the center of the Earth. Each and every day
of our lives, we move up and down roughly one meter, even while we believe
ourselves to be remaining still!
Although many Europeans were
convinced that the Newtonian model of the universe is correct, many other
Europeans were not convinced. This was
partly because Isaac Newton had made many enemies during his lifetime by his own
personal arrogance. As a result, the
enemies of Newton tried desperately to disprove the Newtonian model of the
universe. When the Halley comet passed
near the Earth during Newton’s lifetime, its orbital parameters were measured, and Newton’s equations were used to calculate
that it had an orbital period around the Sun of roughly seventy-four
years. After seventy-four years, the
Halley comet did not return as scheduled, and the enemies of Newton rejoiced
since they believed that this disproved the Newtonian model of the
universe. However, the original
calculation only included the gravitational attraction of the Sun. What about the gravitational attractions of
the planets? Physicists and
mathematicians recalculated the orbit of the Halley comet including the
gravitational attractions of the planets in addition to the gravitational
attraction of the Sun. This more correct
calculation revealed that the orbital period of the Halley comet was not
seventy-four years; it was in fact seventy-six years. In other words, the original calculation was
two years in error. Europeans waited two
more years, and the Halley comet returned!
Isaac Newton, who had died a few decades earlier, was
already considered a genius by many Europeans, but the return of the Halley
comet as correctly predicted by Newton’s equations convinced not only his
admirers but his enemies as well that he may have been the most brilliant
person who ever lived. The British poet
Alexander Pope wrote the following poem in honor of Isaac Newton. “Nature and Nature’s laws lay hid in
night. God said, ‘Let Newton be!’ and
all was light.” Isaac Newton’s
achievements were the culmination of the Age of Reason of seventeenth-century
Europe (the 1600s), which led directly to the Age of
Enlightenment of eighteenth-century Europe (the 1700s). During the Age of Enlightenment, scholars in
many different disciplines began to approach their subjects with mathematical
logic and scientific reasoning. For
example, mathematicians began to insist on rigorous proofs before any
mathematical statement would be regarded as a true
theorem. As another example, political
philosophers debated different systems of government in a reasoned and logical
fashion. As yet another example,
religious scholars used logic and reason when studying the Bible. Isaac Newton is not only one of the greatest
figures of scientific and mathematical history for his model of the universe,
but he is also one of the greatest figures of world history for his exceptional
role in the Age of Reason and for inspiring the Age of Enlightenment. The most authoritative biography of Isaac
Newton is Never at Rest: A Biography of
Isaac Newton, written by Richard S. Westfall. No one would dare question the Newtonian
model of the universe until two hundred years after Isaac Newton died. In the early twentieth century (early 1900s), Albert Einstein dared to question the Newtonian
model of the universe. The British poet
J. C. Squire wrote the following poem in honor of Albert Einstein as a sequel
to Alexander Pope’s poem. “It did not
last: the devil howling ‘Ho! Let Einstein be,’ restored the status quo.” We will study the Einsteinian model of the
universe later in this course.
Libarid A. Maljian homepage at the Department of Physics at CSLA at NJIT
Libarid A. Maljian profile at the Department of Physics at CSLA at NJIT
Department of Physics at CSLA at NJIT
College of Science and Liberal Arts at NJIT
New Jersey Institute of Technology
This webpage was most recently modified on Wednesday, the twenty-fifth day of September, anno Domini MMXXIV, at 03:45 ante meridiem EDT.