This is one of the webpages of Libarid A. Maljian at the Department of Physics at CSLA at NJIT.

 

 

 

New Jersey Institute of Technology

College of Science and Liberal Arts

Department of Physics

Introductory Astronomy and Cosmology, Section 101

Phys 202-101

Fall 2024

First Examination lecture notes

 

 

 

Introduction to the Universe: Our Location in Space and Time

 

A star system is a star with several planets orbiting that star and moons orbiting those planets.  The name of our home star system is the Solar System.  There is only one star in our Solar System: the Sun.  We live on planet Earth, the third planet orbiting the Sun.  A galaxy is a collection of billions of star systems held together by gravity.  The name of our home galaxy is the Milky Way Galaxy.  There are roughly one hundred billion star systems that together compose the Milky Way Galaxy, and the Solar System is just one of these one hundred billion star systems within the Milky Way Galaxy.  Galactic groups contain only a few dozen galaxies, while galactic clusters contain between several hundred and several thousand galaxies.  Our Milky Way Galaxy is not a member of a galactic cluster; our Milky Way Galaxy is a member of a galactic group.  The name of our home galactic group is the Local Galactic Group (or just the Local Group for short).  The Local Galactic Group is composed of a few dozen galaxies, although most of them are small galaxies.  There are only three major galaxies in the Local Galactic Group: our Milky Way Galaxy, the Andromeda Galaxy, and the Triangulum Galaxy.  Galactic superclusters are enormous organizations composed of between tens of thousands and hundreds of thousands of galaxies.  The name of our home galactic supercluster is the Laniakea Galactic Supercluster, and our Local Group is a small galactic group on the outskirts of the Laniakea Galactic Supercluster.  Cosmic filaments are colossal organizations composed of several galactic superclusters, and hence cosmic filaments contain between a few hundred thousand and a few million galaxies.  The name of our home cosmic filament is the Perseus-Pisces-Sculptor-Hercules Cosmic Filament, and our Laniakea Galactic Supercluster is one of several galactic superclusters within the Perseus-Pisces-Sculptor-Hercules Cosmic Filament.  The observable universe contains hundreds of thousands, perhaps millions, of cosmic filaments.  Since each cosmic filament contains between a few hundred thousand and a few million galaxies, the entire observable universe contains roughly one hundred billion galaxies.  Assuming that each galaxy contains on average one hundred billion star systems just like our Milky Way Galaxy, then there are roughly ten sextillion star systems in the observable universe.  (Please refer to the following multiplication table, where each number is one thousand times the previous number: one, one thousand, one million, one billion, one trillion, one quadrillion, one quintillion, one sextillion, one septillion, one octillion, one nonillion, one decillion.  Caution: this multiplication table is only correct in American English.  These same words are used for different numbers in British English.)  We can summarize our location in the universe with our cosmic address.  Whenever anyone asks for our mailing address, we provide a list of larger and larger organizations wherein we reside.  After our name comes a house number, then a street/avenue/road/boulevard (which is a collection of houses), then a municipality (which is a collection of streets/avenues/roads/boulevards), then a county (which is a collection of municipalities), then a state/province (which is a collection of counties), and then a country (which is a collection of states/provinces).  If we were to continue, we would then provide a continent (which is a collection of countries), then a planet (which is a collection of continents), then a star system (which is a collection of planets orbiting a star), then a galaxy (which is a collection of star systems), then a galactic group or a galactic cluster (which is a collection of galaxies), then a galactic supercluster (which is a collection of galactic groups and galactic clusters), then a cosmic filament (which is a collection of galactic superclusters), and finally a universe (which is a collection of cosmic filaments).  Every person we have ever met or ever will meet and every person we have ever heard of or will ever hear of has the same cosmic address starting with planet Earth followed by the Solar System, the Milky Way Galaxy, the Local Galactic Group, the Laniakea Galactic Supercluster, the Perseus-Pisces-Sculptor-Hercules Cosmic Filament, and the observable universe.

 

One light-year is the distance that light travels in a time of one year.  We must never forget that a light-year is a length of distance, not a duration of time.  This is easy to forget, since a year is a duration of time.  Nevertheless, a light-year is not a duration of time; a light-year is a length of distance.  How far is one light-year?  Light travels roughly three hundred thousand kilometers every second through the vacuum of outer space.  This is extraordinarily fast by human standards.  Since three hundred thousand kilometers is the distance light travels in one second, we multiply this by sixty to find how far light travels in one minute, since there are sixty seconds in one minute.  We multiply this result by another sixty to obtain how far light travels in one hour, since there are sixty minutes in one hour.  We multiply this result by twenty-four to find how far light travels in one day, since there are twenty-four hours in one day.  Finally, we multiply this result by 365.25 to obtain how far light travels in one year, since there are 365.25 days in one year.  The final result of this calculation is that one light-year is roughly 9.5 trillion kilometers.  This is close enough to ten trillion kilometers that throughout this course we will assume that one light-year is roughly ten trillion kilometers as a satisfactory approximation.  Our universe is so enormous that it actually takes years for light to travel from one star system to a neighboring star system.  It is for this reason that astronomers consider telescopes to be time machines.  When we observe a star that is one hundred light-years distant for example, we see that star as it appeared one hundred years ago (one hundred years in the past), since it took that long for its light to travel from that star to us.  The only way to know how that star actually appears at this moment is to wait another one hundred years for that light to travel to us.  We see a star one thousand light-years distant as it appeared one thousand years ago (one thousand years in the past), since it took that long for its light to travel from that star to us.  The only way to know how that star actually appears at this moment is to wait another one thousand years for that light to travel to us.  Not only are telescopes time machines, but the human eye is itself also a time machine.  If we look at the Sun (which we must never do since that would cause permanent blindness), we see the Sun as it appeared roughly eight minutes ago (roughly eight minutes in the past), since it takes that long for light to travel from the Sun to the Earth.  The only way to know how our Sun actually appears at this moment is to wait another eight minutes for that light to travel to us.  If we look at the Moon, we see it as it appeared roughly one second ago (roughly one second in the past), since it takes that long for light to travel from the Moon to the Earth.  The only way to know how our Moon actually appears at this moment is to wait another second for that light to travel to us.  When we look at the tables and chairs around us or even these words we are reading, we see those tables and chairs and these words as they appeared a few nanoseconds ago (a few nanoseconds in the past), since it takes that long for light to travel from the tables and chairs and even these words to our eyes.  Obviously, these time delays are so tiny in everyday life that we do not notice them at all, but they are real nevertheless.  The universe is so enormous that it takes a few years for light to travel from one star system to a neighboring star system, it often takes millions of years for light to travel from one galaxy to a neighboring galaxy, and it takes billions of years for light to travel from one side of the observable universe to the other side of the observable universe.  For example, our Milky Way Galaxy has a diameter of roughly one hundred thousand light-years, and the Andromeda Galaxy is more than two million light-years from our Milky Way Galaxy.

 

The universe is roughly fourteen billion years old.  This is unimaginably old by human standards.  The human mind can comprehend seconds, minutes, hours, days, weeks, months, and years of time.  Ten years is called a decade.  Ten decades (which is one hundred years) is called a century, which is roughly how long most humans live.  Ten centuries (which is one hundred decades or one thousand years) is called a millennium.  Ten millennia is almost twice as long as all of recorded human history, but it is still minuscule compared to the age of the universe.  One hundred millennia is almost how long our species Homo sapiens has existed, and one thousand millennia (which is one million years) is almost how long our genus Homo has existed.  Ten million years is almost how long our family of hominids has existed.  One hundred million years ago, there were no hominids; dinosaurs roamed the Earth.  One billion years is approaching the age of our planet Earth, but this is still not near the age of the universe.  Ten billion years is roughly the age of our Milky Way Galaxy, and this is finally approaching the fourteen-billion-year age of the entire universe.  We can gain a greater appreciation for the age of the universe through the cosmic calendar.  In the cosmic calendar, we pretend that the entire history of the universe fits into one calendar year.  In other words, when we use the cosmic calendar we pretend that the creation of the universe occurred at the very beginning of the calendar year on January 01st at midnight, and we pretend that the present day is at the very end of the calendar year on December 31st immediately before midnight.  If the creation of the universe occurred on January 01st at midnight, we must wait until September before the Earth forms!  We must then wait another month (October) before the most primitive unicellular microorganisms appear on Earth!  We must wait another month (November) before multicellular but still microscopic organisms appear on Earth!  We must wait another month (mid-December) before macroscopic but still invertebrate animals appear on Earth!  The most primitive vertebrate animals (fishes) appear on roughly December 20th, amphibians appear on roughly December 22nd, and reptiles appear on roughly December 24th.  The age of reptiles, commonly known as the age of dinosaurs, lasts from roughly December 25th to roughly December 30th, when the age of mammals begins.  Hominids do not appear until December 31st at roughly 05:30 p.m., and Homo sapiens do not appear until December 31st at roughly 11:52 p.m.!  Finally, all of recorded human history lasts for roughly fifteen seconds, beginning on December 31st at roughly 11:59:45 p.m.!  Compared with the recorded history of our species and even compared with the entire history of our species (unrecorded and recorded), the universe is unimaginably old.

 

 

Introduction to the Sky

 

Every point on the surface of the Earth can be labeled with the Geographic Coordinate System, a pair of coordinates (two numbers) called latitude and longitude.  Lines of latitude are parallel to one another.  The line of latitude at 0° is commonly called the equator, but we must call this line of latitude the Terrestrial Equator in this course, for reasons we will make clear shortly.  The Terrestrial Equator divides the surface of the Earth into two hemispheres: the northern hemisphere and the southern hemisphere.  Latitudes in the northern hemisphere are measured in degrees north, and latitudes in the southern hemisphere are measured in degrees south.  The Earth’s rotational axis (or axis of rotation) pierces the Earth at two points.  One of these points is at 90°N latitude and is commonly called the north pole, but we must call this line of latitude the North Terrestrial Pole in this course, for reasons we will make clear shortly.  The other point pierced by the Earth’s rotational axis (or axis of rotation) is at 90°S latitude which is commonly called the south pole, but we must call this line of latitude the South Terrestrial Pole in this course, for reasons we will make clear shortly.  Lines of longitude are not parallel to one another; they all begin together at the North Terrestrial Pole, they spread apart from one another until they are furthest apart from one another at the Terrestrial Equator, and they all converge back together at the South Terrestrial Pole.  Lines of longitude are measured from one hundred and eighty degrees west to one hundred and eighty degrees east.  Every point on the surface of the Earth has a unique latitude and longitude with only two exceptions.  Although the North Terrestrial Pole has the unique latitude 90°N, the North Terrestrial Pole has an undefined longitude.  Although the South Terrestrial Pole has the unique latitude 90°S, the South Terrestrial Pole has an undefined longitude.  These are the only two points that suffer from this tragedy; every other point on the surface of the Earth has a unique latitude and a unique longitude.

 

The Celestial Sphere is a giant imaginary sphere that does not exist physically but is nevertheless an essential concept in observational astronomy.  For the purposes of observational astronomy, we assume all astronomical objects in the sky (such as stars and galaxies) are located on this Celestial Sphere, and we assume the Earth is at the center of this Celestial Sphere.  From wherever we happen to be located on the Earth, the sky we see above us is only half of the Celestial Sphere, since the other half of the Celestial Sphere is below us where we see the ground instead.  The horizon is a giant imaginary circle that separates the ground from the sky.  Above the horizon, we see one-half of the Celestial Sphere; below the horizon, we see the ground that prevents us from seeing the other half of the Celestial Sphere.

 

All astronomical objects (such as stars and galaxies) can be labeled with the Horizon Coordinate System, a pair of coordinates (two numbers) called altitude and azimuth.  Altitude is the angle above or below the horizon.  Lines of altitude are parallel to one another.  The line of altitude at 0° is the horizon, which divides the Celestial Sphere into two hemispheres: the positive-altitude hemisphere and the negative-altitude hemisphere.  Altitudes in the positive-altitude hemisphere are measured in positive degrees, and altitudes in the negative-altitude hemisphere are measured in negative degrees.  Astronomical objects below the horizon have negative altitudes, which means we cannot see them since the ground is in the way.  Astronomical objects above the horizon have positive altitudes which means we can see them in the sky, unless it is raining or cloudy!  The point on the sky directly on top of us is +90° altitude and is called the zenith; the point directly opposite the zenith is −90° altitude and is called the nadir.  In colloquial English, the zenith of anything is its climax or apex or pinnacle, such as the apex of a civilization or the pinnacle of a person’s career.  Also in colloquial English, the nadir of anything is its rock bottom, such as the rock bottom of a classroom’s performance.  Lines of azimuth are not parallel to one another; they all begin together at the zenith, they spread apart from one another until they are furthest apart from one another at the horizon, and they all converge back together at the nadir.  Lines of azimuth are measured from 0° to 360°, starting at 0° for directly north.  The azimuth is 45° for directly northeast, 90° for directly east, 135° for directly southeast, 180° for directly south, 225° for directly southwest, 270° for directly west, 315° for directly northwest, and 360° for directly north (which is actually back to 0° for directly north).  We will call the point on the horizon directly north the North Point, the point on the horizon directly east we will call the East Point, the point on the horizon directly south we will call the South Point, and the point on the horizon directly west we will call the West Point.  Every astronomical object on the Celestial Sphere has a unique altitude and azimuth with only two exceptions.  Although the zenith has the unique altitude +90°, the zenith has an undefined azimuth.  Although the nadir has the unique altitude −90°, the nadir has an undefined azimuth.  These are the only two points that suffer from this tragedy; every other point on the Celestial Sphere has a unique altitude and a unique azimuth.

 

Although the Horizon Coordinate System is commonly used by the general public, the Horizon Coordinate System is unsatisfactory for observational astronomy for a couple of reasons.  Firstly, the Horizon Coordinates (the altitude and the azimuth) of a particular astronomical object depends upon where we happen to be located on the Earth.  An astronomical object may have a positive altitude from where we happen to be located on the Earth for example, but that same astronomical object will have a negative altitude from someone else’s different location on the Earth.  The azimuth will also be different.  The problem with the Horizon Coordinate System is even worse than this however.  Even if we remain fixed at one particular location on the Earth, the Horizon Coordinates (the altitude and the azimuth) of astronomical objects will still not remain fixed because the Earth is continuously rotating from west to east.  This continuous rotation causes astronomical objects to appear to cross the eastern horizon (rising) and then later cross the western horizon (setting).  The Celestial Meridian is a giant imaginary circle that is perpendicular to the horizon.  The Celestial Meridian begins at the North Point, runs through the zenith, runs through the South Point, runs though the nadir, and ends back at the North Point.  As the Earth rotates from west to east, astronomical objects appear to cross the eastern horizon (rising), and they appear to have a higher and higher altitude until they reach their highest altitude when they cross the Celestial Meridian (culminating).  As the Earth continues to rotate, astronomical objects then appear to have a lower and lower altitude until they cross the western horizon (setting), and they eventually rise crossing the eastern horizon again.  In summary, altitudes and azimuths both continuously change as a result of the Earth’s continuous rotation.  This daily apparent motion is most obvious for the Sun.  The moment when the Sun crosses the eastern horizon is called sunrise, the moment when the Sun crosses the Celestial Meridian above the horizon is called noon, the moment when the Sun crosses the western horizon is called sunset, and the moment when the Sun crosses the Celestial Meridian again but below the horizon is called midnight.  The first half of the day before the Sun crosses the Celestial Meridian at noon is called “before meridian” or “ante meridiem” which is abbreviated “a.m.,” and the second half of the day after the Sun has crossed the Celestial Meridian at noon is called “afternoon” or “after meridian” or “post meridiem” which is abbreviated “p.m.”

 

Since the Horizon Coordinates of astronomical objects continuously change as the Earth rotates, we need another pair of coordinates that remains fixed even though the Earth is rotating.  The most important such coordinate system is the Equatorial Coordinate System, a pair of coordinates (two numbers) called declination and right ascension.  Lines of declination are projections of lines of latitude onto the Celestial Sphere.  Since lines of latitude are parallel to one another, lines of declination are also parallel to one another.  The projection of the Terrestrial Equator at 0° latitude onto the Celestial Sphere is 0° declination.  This is called the Celestial Equator.  We now realize why we must never simply use the word equator in this course.  There are two equators!  The Earth’s equator is 0° latitude and is called the Terrestrial Equator, while the Celestial Sphere’s equator (the sky’s equator) is 0° declination and is called the Celestial Equator.  The Celestial Equator divides the Celestial Sphere into two hemispheres: the positive-declination hemisphere and the negative-declination hemisphere.  Declinations in the positive-declination hemisphere are measured in positive degrees, while declinations in the negative-declination hemisphere are measured in negative degrees.  The Earth’s rotational axis (or axis of rotation) pierces the Celestial Sphere at two points.  One of these points is at +90° declination and is called the North Celestial Pole.  We now realize why we must never simply use the term north pole in this course.  There are two north poles!  The Earth’s north pole is 90°N latitude and is called the North Terrestrial Pole, while the Celestial Sphere’s north pole (the sky’s north pole) is +90° declination and is called the North Celestial Pole.  The other point on the Celestial Sphere pierced by the Earth’s rotational axis (or axis of rotation) is at −90° declination and is called the South Celestial Pole.  We now realize why we must never simply use the term south pole in this course.  There are two south poles!  The Earth’s south pole is 90°S latitude and is called the South Terrestrial Pole, while the Celestial Sphere’s south pole (the sky’s south pole) is −90° declination and is called the South Celestial Pole.  Lines of right ascension on the Celestial Sphere are analogous to lines of longitude on Earth.  Just as lines of longitude are not parallel to one another, lines of right ascension are also not parallel to one another.  Lines of right ascension all begin together at the North Celestial Pole, they spread apart from one another until they are furthest apart from one another at the Celestial Equator, and they all converge back together at the South Celestial Pole.  Right ascension is measured in right-ascension hour-angles, running from 00h to 24h.  We will clearly define the line of right ascension at 00h shortly.  Each right-ascension hour-angle is actually 15° of right ascension, since 360° divided by 24h equals 15° of right ascension per 1h of right ascension.  Every point on the Celestial Sphere has a unique declination and right ascension with only two exceptions.  Although the North Celestial Pole has the unique declination +90°, the North Celestial Pole has an undefined right ascension.  Although the South Celestial Pole has the unique declination −90°, the South Celestial Pole has an undefined right ascension.  These are the only two points that suffer from this tragedy; every other point on the Celestial Sphere has a unique declination and a unique right ascension.

 

Notice that any spherical coordinate system we impose upon a sphere always results in two points on the sphere with undefined coordinates.  These are called the coordinate singularities of the coordinate system we have imposed.  For example, the North Terrestrial Pole and South Terrestrial Pole are the coordinate singularities of the Geographic Coordinate System, since the North Terrestrial Pole and the South Terrestrial Pole have undefined longitudes.  As another example, the zenith and the nadir are the coordinate singularities of the Horizon Coordinate System, since the zenith and the nadir have undefined azimuths.  As yet another example, the North Celestial Pole and the South Celestial Pole are the coordinate singularities of the Equatorial Coordinate System, since the North Celestial Pole and the South Celestial Pole have undefined right ascensions.

 

As the Earth rotates, the Horizon Coordinates of astronomical objects (the altitude and the azimuth) continuously change, but the Equatorial Coordinates (the declination and the right ascension) of all astronomical objects remain fixed.  Actually, this is not exactly correct.  As we will discuss later in the course, the Earth’s rotational axis (or axis of rotation) is slowly precessing and nutating, and these motions cause changes in Equatorial Coordinates.  In addition, stars and galaxies are also physically moving through the universe; these physical motions cause additional changes in Equatorial Coordinates.  Nevertheless, all these changes are sufficiently small that our naked eye does not notice them.  For the purposes of this discussion, we will assume that the Equatorial Coordinates of astronomical objects remain fixed as a satisfactory naked-eye approximation.

 

Suppose there happens to be an astronomical object (such as a star or galaxy) at either the North Celestial Pole or the South Celestial Pole.  Such an object will actually have fixed Horizon Coordinates even though the Earth is continuously rotating.  It just so happens that there is a star almost exactly at the North Celestial Pole.  Therefore, this star appears to remain fixed as everything else in the sky appears to circle around it.  This star has several names: α (alpha) Ursae Minoris, Polaris, the Pole Star, or the North Star.  There is no star almost exactly at the South Celestial Pole, but if there were such a star, then we would name it the South Star.  The North Star always has an azimuth of 0° since it is directly north, and the South Star (if there were one) always has an azimuth of 180° since it is directly south.  The altitude of the North Star is always equal to our latitude on Earth, while the altitude of the South Star (if there were one) is always equal to the opposite of our latitude on Earth.  For example, if we lived at 40°N latitude, then the Horizon Coordinates of the North Star would be +40° altitude and 0° azimuth, while the Horizon Coordinates of the South Star (if there were one) would be −40° altitude and 180° azimuth.  As another example, if we lived at 60°S latitude, then the Horizon Coordinates of the North Star would be −60° altitude and 0° azimuth, while the Horizon Coordinates of the South Star (if there were one) would be +60° altitude and 180° azimuth.  If we happen to live in the northern hemisphere, all astronomical objects (such as stars and galaxies) close enough to the North Star at the North Celestial Pole would never set; they would just appear to circle around the North Celestial Pole as the Earth rotates.  Also from the northern hemisphere, all astronomical objects (such as stars and galaxies) close enough to the South Star at the South Celestial Pole would never rise; they would just circle around the South Celestial Pole as the Earth rotates.  All of these astronomical objects are called circumpolar, since they appear to circle around the celestial poles.  The Latin root circum- means around in words such as circumference, circumscribe, or circumvent for example; hence, the word circumpolar literally means around the pole.  If we happen to live in the southern hemisphere, all astronomical objects (such as stars and galaxies) close enough to the South Star at the South Celestial Pole would never set; they would just appear to circle around the South Celestial Pole as the Earth rotates.  Also from the southern hemisphere, all astronomical objects (such as stars and galaxies) close enough to the North Star at the North Celestial Pole would never rise; they would just circle around the North Celestial Pole as the Earth rotates.  If we happen to live at the North Terrestrial Pole, our latitude would be 90°N, but this means that the North Celestial Pole is at +90° altitude which means it is at the zenith.  Also from the North Terrestrial Pole, the South Celestial Pole is at −90° altitude which means it is at the nadir.  As the Earth rotates, the entire sky would appear to be circumpolar, with half of the sky never setting and the other half of the sky never rising.  This stands to reason.  At the North Terrestrial Pole, there is no east for anything to rise from, nor is there west for anything to set to.  At the North Terrestrial Pole, all directions are south!  If we happen to live at the South Terrestrial Pole, our latitude would be 90°S, but this means that the North Celestial Pole is at −90° altitude which means it is at the nadir.  Also from the South Terrestrial Pole, the South Celestial Pole is at +90° altitude which means it is at the zenith.  As the Earth rotates, the entire sky would appear to be circumpolar, with half of the sky never setting and the other half of the sky never rising.  This stands to reason.  At the South Terrestrial Pole, there is no east for anything to rise from, nor is there west for anything to set to.  At the South Terrestrial Pole, all directions are north!  Also, note that the half of the sky that would never set at the North Terrestrial Pole would be the half of the sky that would never rise at the South Terrestrial Pole.  Conversely, the half of the sky that would never rise at the North Terrestrial Pole would be the half of the sky that would never set at the South Terrestrial Pole.  If we happen to live at the Terrestrial Equator, our latitude would be 0°, but this means that the North Celestial Pole is at 0° altitude, which means it is at the North Point on the horizon.  Also from the Terrestrial Equator, the South Celestial Pole is at 0° altitude, which means it is at the South Point on the horizon.  As the Earth rotates, nothing in the entire sky would appear to be circumpolar; all astronomical objects appear to rise and set.  Also, the Terrestrial Equator is the only location on the Earth where the three giant circles on the Celestial Sphere we have discussed (the horizon, the Celestial Meridian, and the Celestial Equator) are all perpendicular to one another.

 

There is a common misconception that the North Star is the brightest star in the sky.  This is false.  In fact, the North Star is among the dimmest stars in the sky; the North Star is so dim that it is barely visible to our naked eye.  The importance of the North Star is its location in the sky, not its brightness.  As we discussed, the North Star is almost exactly at the North Celestial Pole.  Therefore, the North Star appears to remain fixed as everything else in the sky appears to circle around it.  Since the North Star remains fixed, the cardinal directions (north, south, east, and west) are determined by first finding the North Star.  If we stand facing the direction of the North Star, then we are facing north.  To our left would be west, to our right would be east, and south would be behind us.  This is the only way to determine the cardinal directions.  It is a common misconception that we use a magnetic compass to determine the cardinal directions.  This is false.  As we will discuss later in the course, a magnetic compass points toward magnetic north, which is different from cardinal north (true north, toward the North Terrestrial Pole).  Admittedly, the direction of magnetic north is somewhat close to the direction of cardinal north (true north, toward the North Terrestrial Pole).  Nevertheless, we strictly emphasize the difference between magnetic north and cardinal north in this course.  Again, we do not use a magnetic compass to determine the cardinal directions (north, south, east, and west).  The only way to determine the cardinal directions (north, south, east, and west) is by finding the North Star, which is difficult since it is one of the dimmest stars in the sky, barely visible with the naked eye.

 

For thousands of years, humans have looked up into the sky and observed that the stars appear to remain fixed relative to one another while continuously rising and setting.  Many cultures formed pictures from groups of stars in the sky and named them constellations.  However, the modern definition of a constellation is a region of the Celestial Sphere defined by a boundary.  This modern definition of a constellation is analogous to the definition of a country, which is a region of a continent on Earth defined by a boundary (the country’s borders).  Just as the Earth is divided into roughly two hundred countries each defined by its own boundary (its own borders), the entire Celestial Sphere is divided into eighty-eight modern constellations, each defined by its own boundary.  Any astronomical object (whether it is a star, a galaxy, a planet, or even the Sun or the Moon) is considered to be within a certain constellation if it is within the boundary that defines that constellation.  An asterism is a group of stars in the sky that is not one of the eighty-eight strictly defined modern constellations.

 

There are several prominent circumpolar constellations.  The constellation Ursa Major (the big bear) includes seven bright stars that form the Big Dipper asterism.  Two of the stars in the Big Dipper asterism can be used to find the North Star, also known as α (alpha) Ursae Minoris or Polaris or the Pole Star.  The North Star is within the constellation Ursa Minor (the little bear), which includes the Little Dipper asterism.  Three other stars in the Big Dipper asterism can be used to find Arcturus, the brightest star in the constellation Boötes (the shepherd).  The constellation Cassiopeia (the queen of Aethiopia) has five bright stars shaped like the letter W.  There are several prominent summer constellations.  The constellation Cygnus (the swan) includes the Northern Cross asterism.  The brightest star in the constellation Cygnus (the swan) is Deneb.  The brightest star in the constellation Lyra (the harp) is Vega.  The brightest star in the constellation Aquila (the eagle) is Altair.  The Summer Triangle asterism is formed by connecting Vega, Deneb, and Altair.  There are several prominent winter constellations.  The constellation Orion (the hunter) includes seven bright stars, with Betelgeuse and Rigel among them.  The sword of Orion is actually the Orion Nebula, which we will discuss later in the course.  The brightest star in the constellation Canis Major (the big dog) is Sirius, the dog star.  The brightest star in the constellation Canis Minor (the little dog) is Procyon.  The Winter Triangle asterism is formed by connecting Betelgeuse, Sirius, and Procyon.  The brightest star in the constellation Auriga (the charioteer) is Capella.  The two brightest stars in the constellation Perseus (the hero) are Mirfak and Algol.

 

For thousands of years, humans have looked up into the sky and observed that the Sun appears to wander around the Celestial Sphere.  The giant circle that the Sun appears to wander around is called the ecliptic, and it takes the Sun one year to complete one journey around the ecliptic.  The constellations along the ecliptic are called the zodiac constellations.  If we arbitrarily decide to begin with the constellation Aquarius (the water bearer), next comes the constellation Pisces (the fish) followed by the constellation Aries (the ram).  Next comes the constellation Taurus (the bull).  The brightest star in the constellation Taurus (the bull) is Aldebaran, and the Pleiades is a star cluster within the constellation Taurus (the bull).  Next comes the constellation Gemini (the twins) with the two bright stars Pollux and Castor.  Next comes the constellation Cancer (the crab) followed by the constellation Leo (the lion).  The brightest star in the constellation Leo (the lion) is Regulus.  Next comes the constellation Virgo (the virgin).  The brightest star in the constellation Virgo (the virgin) is Spica.  Next comes the constellation Libra (the scales) followed by the constellation Scorpius (the scorpion).  The brightest star in the constellation Scorpius (the scorpion) is Antares.  Next comes the constellation Ophiuchus (the serpent bearer) followed by the constellation Sagittarius (the centaur archer) followed by the constellation Capricornus (the sea goat) followed by the constellation Aquarius (the water bearer), which is where we arbitrarily decided to begin our journey around the ecliptic.  Again, it takes the Sun one year to complete one journey around the ecliptic.  As the Sun journeys around the ecliptic, it appears to wander from within one zodiac constellation to another, spending approximately one month within each of these zodiac constellations.

 

The ecliptic intersects the Celestial Equator at two points called the equinoxes: the vernal equinox (or the spring equinox) is where the Sun appears to be on the ecliptic on roughly March 21st every year, and the autumn equinox is where the Sun appears to be on the ecliptic on roughly September 21st every year.  Both equinoxes are 0° declination, since they are on the Celestial Equator.  Astronomers have agreed to define 00h right ascension as the line of right ascension that intersects the vernal equinox (spring equinox).  The autumn equinox is therefore 12h right ascension since it is on the opposite side of the Celestial Sphere from the vernal equinox (spring equinox).  The furthest angle from the Celestial Equator the Sun ever wanders along the ecliptic is roughly 23˝°.  This occurs at two points halfway between the equinoxes called the solstices.  The summer solstice is where the Sun appears to be on the ecliptic on roughly June 21st every year, and the winter solstice is where the Sun appears to be on the ecliptic on roughly December 21st every year.  The summer solstice is 06h right ascension since it is halfway from the vernal equinox (spring equinox) at 00h to the autumn equinox at 12h.  The winter solstice is 18h right ascension since it is halfway from the autumn equinox at 12h to the vernal equinox at 24h (which is the same as 00h since 24h is all the way around the Celestial Sphere back to 00h).  The summer solstice is where the Sun has a positive maximum declination of +23˝°, and the winter solstice is where the Sun has a negative maximum declination of −23˝°.  In summary, the Equatorial Coordinates of the vernal equinox (spring equinox) where the Sun appears on the ecliptic on roughly March 21st every year is 0° declination and 00h right ascension, the Equatorial Coordinates of the summer solstice where the Sun appears on the ecliptic on roughly June 21st every year is +23˝° declination and 06h right ascension, the Equatorial Coordinates of the autumn equinox where the Sun appears on the ecliptic on roughly September 21st every year is 0° declination and 12h right ascension, and the Equatorial Coordinates of the winter solstice where the Sun appears on the ecliptic on roughly December 21st every year is −23˝° declination and 18h right ascension.  When the Sun is at the vernal equinox (spring equinox) with 0° declination, it appears to have an increasing declination as it journeys from the winter solstice in the negative-declination hemisphere of the Celestial Sphere toward the summer solstice in the positive-declination hemisphere of the Celestial Sphere.  When the Sun is at the autumn equinox with 0° declination, it appears to have a decreasing declination as it journeys from the summer solstice in the positive-declination hemisphere of the Celestial Sphere toward the winter solstice in the negative-declination hemisphere of the Celestial Sphere.

 

 

The Wave Theory of Light

 

A wave is a propagating (traveling) disturbance.  Therefore, a wave requires a medium through which to propagate, since we cannot have a propagating disturbance if there is nothing to disturb!  A transverse wave is a wave where the direction of the disturbance is perpendicular to the direction of propagation, while a longitudinal wave is a wave where the direction of the disturbance is parallel and antiparallel to the direction of propagation.  Light is a real-life example of a transverse wave, while sound is a real-life example of a longitudinal wave.  A wave can have a component of its disturbance perpendicular to the direction of propagation and another component parallel and antiparallel to the direction of propagation.  In other words, a wave can be both transverse and longitudinal.  Water waves that we see in the ocean are a real-life example of a wave that is both transverse and longitudinal.  The maximum magnitude of a wave’s disturbance is called the wave amplitude, and these amplitudes occur at what are called crests (maximum positive disturbance) and troughs (maximum negative disturbance).  The wave height is the amount of disturbance between a crest and a trough.  In other words, the wave height is double the wave amplitude (or the wave amplitude is half of the wave height).  The distance from one crest to the next crest (which is also the distance from one trough to the next trough) is called the wavelength of the wave and is always given the symbol λ (the lowercase Greek letter lambda).  Caution: the word wavelength is misleading, since it may lead us to conclude that the wavelength is the length of the entire wave, which it is not.  The wavelength of a wave is the length of only one cycle of the wave.  The frequency of a wave is the number of crests passing a point every second as well as the number of troughs passing a point every second.  We may also interpret the frequency of a wave as how many cycles or oscillations or vibrations the wave executes every second.  In other words, the frequency of a wave is how frequently the wave is vibrating or oscillating, which is why it is called the frequency!  A high-frequency wave is oscillating (vibrating) many cycles every second, while a low-frequency wave is oscillating (vibrating) a small number of cycles every second.  We will use the symbol f for frequency, and its units are cycles per second or vibrations per second or oscillations per second.  This unit is called a hertz with the abbreviation Hz, named for the German physicist Heinrich Hertz.  Again, one hertz (Hz) is one cycle per second or one vibration per second or one oscillation per second.  A kilohertz is one thousand hertz or one thousand cycles per second, since the metric prefix kilo- always means thousand.  For example, one kilometer is one thousand meters, and one kilogram is one thousand grams.  A megahertz is one million hertz or one million cycles per second, since the metric prefix mega- always means million.  On the amplitude-modulation radio band (AM radio), the radio-station numbers are kilohertz, while on the frequency-modulation radio band (FM radio), the radio-station numbers are megahertz.

 

The speed of a wave is a function of the properties of the medium through which the wave propagates.  For example, the speed of sound is some speed through gases such as air, a faster speed through liquids, and an even faster speed through solids.  The speed of sound through air is not even fixed; the speed of sound through air actually changes as the temperature of the air changes.  As another example, the speed of light is some speed through gases such as air, a slower speed through liquids such as water, and an even slower speed through solids such as glass.  The speed of any wave with wavelength λ and frequency f is determined by the equation v = f λ, where v is the speed (the velocity) of propagation of the wave.  If we solve this equation for the frequency, we deduce that f = v / λ.  Therefore, frequency and wavelength are inversely proportional to each other.  Waves with higher frequencies have shorter wavelengths, while waves with lower frequencies have longer wavelengths.  In addition to the properties of the medium of propagation, the speed of a wave is almost always also a function of the wave’s own wavelength or frequency thus resulting in dispersion, as we will discuss shortly.

 

The amplitude of any wave determines its energy.  In particular, the energy of a wave is directly proportional to the square of its amplitude.  Therefore, a wave with a larger amplitude has more energy, while a wave with a smaller amplitude has less energy.  For example, the amplitude of a sound wave determines its loudness.  A sound wave with a larger amplitude is more loud, while a sound wave with a smaller amplitude is less loud (more quiet).  As another example, the amplitude of a light wave determines its brightness.  A light wave with a larger amplitude is more bright, while a light wave with a smaller amplitude is less bright (more dim).  The frequency of a wave is difficult to interpret physically, and so we will interpret the frequency of a wave on a case-by-case basis.  For example, the frequency of a sound wave is its pitch, meaning that a sound wave with a higher frequency has a higher pitch while a sound wave with a lower frequency has a lower pitch.  As another example, the frequency of a visible light wave is its color.  In particular, a visible light wave with a high frequency is blue or violet, a visible light wave with a low frequency is red or orange, and a visible light wave with an intermediate frequency is yellow or green.  In order starting from the lowest frequency (which is also the longest wavelength), the colors of visible light are red, orange, yellow, green, blue, indigo, and violet at the highest frequency (which is also the shortest wavelength).  This is why the colors of the rainbow are in this order; a rainbow reveals the correct sequence of colors as determined by either the frequency or the wavelength.  We can memorize this sequence of colors with a mnemonic.  Using the letters r for red, o for orange, y for yellow, g for green, b for blue, i for indigo, and v for violet, we construct a fanciful name of an imaginary person: Roy G. Biv.

 

Whereas the frequency and the wavelength of a wave are constrained to one another through the equation v = f λ, no universal equation constrains the amplitude with the frequency.  Therefore, a wave can have a large amplitude and a high frequency, a wave can have a large amplitude and a low frequency, a wave can have a small amplitude and a high frequency, and a wave can have a small amplitude and a low frequency.  In other words, all of these combinations are physically possible.  For example, a sound wave with a large amplitude and a high frequency is a loud high-pitch sound, a sound wave with a large amplitude and a low frequency is a loud low-pitch sound, a sound wave with a small amplitude and a high frequency is a quiet high-pitch sound, and a sound wave with a small amplitude and a low frequency is a quiet low-pitch sound.  As another example, a visible light wave with a large amplitude and a high frequency is bright blue, a visible light wave with a large amplitude and a low frequency is bright red, a visible light wave with a small amplitude and a high frequency is dim blue, and a visible light wave with a small amplitude and a low frequency is dim red.

 

A wave that is a propagating (traveling) disturbance through a material medium (a medium composed of atoms) is called a mechanical wave.  Sound waves and water waves are real-life examples of mechanical waves.  We will call a wave that is a propagating disturbance through an abstract field medium a field wave.  Light waves and gravitational waves are real-life examples of field waves.  Light waves are propagating disturbances through the electromagnetic field created by charges, and gravitational waves are propagating disturbances through the gravitational field created by masses.  Since light waves are propagating disturbances through the electromagnetic field, then light is actually an electromagnetic wave.  The Electromagnetic Spectrum is a list of all the different types of electromagnetic waves in order as determined by either the frequency or the wavelength.  Starting with the lowest frequencies (which are also the longest wavelengths), we have radio waves, microwaves, infrared, visible light (the only type of electromagnetic wave our eyes can see), ultraviolet, X-rays, and gamma rays at the highest frequencies (which are also the shortest wavelengths).  All of these are electromagnetic waves.  Therefore, all of them may be regarded as different forms of light.  They all propagate at the same speed of light through the vacuum of outer space for example.  We now realize that whenever we use the word light in colloquial English, we probably mean to use the term visible light, since this is the only type of light that our eyes can actually see.  The visible light band of the Electromagnetic Spectrum is actually quite narrow.  Nevertheless, the visible light band of the Electromagnetic Spectrum can be subdivided.  In order, the subcategories of the visible light band of the Electromagnetic Spectrum starting at the lowest frequency (which is also the longest wavelength) are red, orange, yellow, green, blue, indigo, and violet at the highest frequency (which is also the shortest wavelength).  We now realize why electromagnetic waves with slightly lower frequencies (or with slightly longer wavelengths) than visible light are called infrared, since their frequencies (or wavelengths) are just beyond red visible light.  In other words, infrared light is more red than red!  We also realize why electromagnetic waves with slightly higher frequencies (or with slightly shorter wavelengths) than visible light are called ultraviolet, since their frequencies (or wavelengths) are just beyond violet visible light.  In other words, ultraviolet light is more purple than purple!

 

A wave detector detects a frequency shift if the source of the wave is moving or if the detector of the wave is moving or if both are moving.  This is called the Doppler-Fizeau effect or the Doppler-Fizeau shift, named for the Austrian physicist and mathematician Christian Doppler and the French physicist Hippolyte Fizeau, both of whom mathematically derived the existence of this effect.  According to the Doppler-Fizeau effect, a wave detector detects a higher-frequency shift if there is advancing motion (the source moves toward the detector, the detector moves toward the source, or both).  Conversely, a wave detector detects a lower-frequency shift if there is receding motion (the source moves away from the detector, the detector moves away from the source, or both).  For example, the human ear is a detector of sound waves.  Since the frequency of a sound wave is its pitch, our ears hear higher pitches as a police/ambulance/firetruck siren for example moves towards us, and our ears hear lower pitches as a police/ambulance/firetruck siren for example moves away from us.  Although the human eye is a detector of light waves, the Doppler-Fizeau effect for light is too small to be noticed by the human eye.  This is because the magnitude of the Doppler-Fizeau shift for any wave is a function of the speeds of the source and the detector as compared with the propagation speed of the wave itself.  The vacuum speed of light is nearly one million times faster than the speed of sound through air.  Since the vacuum speed of light is so extraordinarily fast, most objects in the universe move slowly as compared with this incredibly fast speed, resulting in tiny Doppler-Fizeau shifts for light.  The speed of sound through air is nearly one million times slower than the vacuum speed of light; thus, police/ambulance/firetruck sirens for example can move at appreciable fractions of the speed of sound, resulting in Doppler-Fizeau shifts for sound that the human ear actually notices.  Although the Doppler-Fizeau effect for light is too small for the human eye to notice, astronomers can experimentally measure tiny Doppler-Fizeau shifts of the light from astronomical objects such as stars and galaxies.  Our instruments detect higher-frequency shifts (which are also shorter-wavelength shifts) of light from stars and galaxies that move towards us.  Astronomers use the word blueshift for higher-frequency shifts (or shorter-wavelength shifts) of any type of electromagnetic wave.  Conversely, our instruments detect lower-frequency shifts (which are also longer-wavelength shifts) of light from stars and galaxies that move away from us.  Astronomers use the word redshift for lower-frequency shifts (or longer wavelength shifts) of any type of electromagnetic wave.  By measuring these blueshifts and redshifts, astronomers can determine not only the direction of motion of stars and galaxies (whether toward us or away from us) but in addition their speed of motion (whether fast or slow).  We can even determine the rotational speed of a star using the Doppler-Fizeau effect, as we will discuss shortly.

 

Consider any wave propagating in a certain medium that encounters a second medium.  This wave is called the incident wave.  At the boundary between the two media, a part of the wave will bounce back into the first medium while the rest of the wave will be transmitted into the second medium.  The wave that bounces back into the first medium is called the reflected wave, while the wave that is transmitted into the second medium is called the refracted wave.  We will make clear the meanings of the words reflection and refraction shortly.  Any line perpendicular to the boundary between the two media is called the normal to the boundary, since the word normal in physics and engineering means perpendicular.  The angle between the incident wave and the normal is called the angle of incidence with the symbol θ1.  The angle between the reflected wave and the normal is called the angle of reflection with the symbol θ3.  The angle between the refracted wave and the normal is called the angle of refraction with the symbol θ2.  The Law of Reflection states that θ1 = θ3 in all cases.  In other words, the angle of incidence is equal to the angle of reflection in all cases for all waves.  Reflection is the bouncing of a part of a wave off of another medium with no change in angle with respect to the normal.  The Law of Refraction states sin(θ1)/v1 = sin(θ2)/v2, where v1 is the speed of the wave in the first medium and v2 is the speed of the refracted wave in the second medium.  Refraction is the bending of a wave due to a change in speed of the wave.  According to the Law of Refraction, the angle of refraction θ2 is smaller than the angle of incidence θ1 if the transmitted speed v2 is slower than the incident speed v1, while the angle of refraction θ2 is larger than the angle of incidence θ1 if the transmitted speed v2 is faster than the incident speed v1.  More plainly, a wave is refracted (bent) toward the normal if the transmitted wave propagates slower than the incident wave, while a wave is refracted (bent) away from the normal if the transmitted wave propagates faster than the incident wave.

 

When two or more waves of the same nature (the same type) occupy the same space at the same time, they add together to become a combined wave.  This is called interference.  Caution: sometimes addition is indeed an addition, but sometimes addition is actually a subtraction.  For example, positive five added with positive three yields positive eight, and negative five added with negative three yields negative eight.  However, positive five added with negative three yields positive two.  Notice that two is actually the difference between five and three.  Again, whereas sometimes addition is indeed an addition, sometimes addition is actually a subtraction.  When two waves of the same nature (the same type) occupy the same space at the same time, crest and crest may meet, trough and trough may meet, or crest and trough may meet.  The crests of a wave are locations of maximum positive disturbance, and the troughs of a wave are locations of maximum negative disturbance.  When crest and crest meet, the waves interfere with one another to become a combined wave with an even greater positive disturbance, since adding positive numbers together yields even greater positive numbers.  When trough and trough meet, the waves interfere with one another to become a combined wave with an even greater negative disturbance, since adding negative numbers together yields even greater negative numbers (in magnitude).  Either of these scenarios is called constructive interference, since the combined wave has a larger amplitude than the individual waves that interfered to form the combined wave.  When crest and trough meet, the waves interfere with one another to become a combined wave with a smaller disturbance, since adding a positive number with a negative number actually results in subtraction, such as adding positive five with negative three to yield positive two.  This scenario is called destructive interference, since the combined wave has a smaller amplitude than some of the individual waves that interfered to form the combined wave.  Any type of interference between these two extremes is called mixed interference.

 

Consider many waves with many different wavelengths (or frequencies) that interfere with one another to form a combined wave.  The speed of the individual waves is called the phase speed, while the speed of the combined wave is called the group speed.  Suppose all the individual waves move at the same speed.  So, these individual waves will all move together.  The resulting combined wave will then move at the same speed as the individual waves.  In other words, the group speed and the phase are equal to each other.  Now suppose instead that the individual waves all move at different speeds.  The combined wave will then spread as faster-moving waves pull out ahead of slower-moving waves that lag behind.  This is called dispersion.  The verb to disperse means to spread out.  The combined wave spreads because the individual waves all move at different speeds.  The combined wave will also move at a different speed.  In other words, dispersion is the spreading of a wave because the phase speed and the group speed are different from each other.  When the individual waves all move at the same speed, they will all move together, resulting in the combined wave also moving at that same speed.  The combined wave does not spread, since all the individual waves move together.  This is called no dispersion.  A rainbow results from the dispersion of light within water or glass.  White light is a combination of individual light waves with different wavelengths or frequencies (different colors).  When white light propagates within water or glass, the different colors all propagate at different speeds.  This also causes these colors to refract (to bend) at different angles which separates the individual colors from each other, thus forming a rainbow.

 

Diffraction is the bending of a wave, but this is a different bending than the refraction of a wave.  Refraction is the bending of a wave due to a change in speed of the wave, but diffraction is the bending of a wave without involving a change in speed of the wave.  Diffraction is the bending of a wave around obstacles.  However, if the wavelength of a wave is small compared with the size of the obstacles, this diffractive bending will be negligible, and the wave will seem to propagate in straight lines.  This is why light from a flashlight or even more so from a laser pointer appears to travel a straight path.  The wavelength of visible light waves is so small compared with the sizes of everyday obstacles around us that the diffraction of visible light is not noticeable.  However, the wavelength of sound waves is not small compared with the sizes of everyday obstacles around us, such as hallways and doorways.  Thus, the diffraction of sound is quite severe.  This is why we can hear someone speaking who is nevertheless standing around a corner.  The wavelength of sound is not small compared to the size of the hallway; therefore, the sound diffracts around the corner instead of traveling a simple straight path.

 

As we discussed, a transverse wave is a wave where the direction of the disturbance is perpendicular to the direction of propagation.  The polarization of a transverse wave is the orientation and the motion of its perpendicular disturbance.  If the disturbance is a vertical vibration, a horizontal vibration, or a diagonal vibration, we say that the wave is linearly polarized.  If the disturbance traces out the shape of an ellipse, we say that the wave is elliptically polarized.  A special case of an elliptically polarized wave is a circularly polarized wave, since a circular shape is a special case of an elliptical shape.  We may interpret an elliptically polarized wave as the interference (the combination) of two appropriately linearly polarized waves.  Conversely, we may interpret a linearly polarized wave as the interference (the combination) of two appropriately elliptically polarized waves.  Since light is a transverse wave, light waves can be linearly polarized or elliptically polarized.  A random combination of a large number of light waves each with its own particular polarization results in unpolarized light.  In particular, light from the Sun (sunlight) is unpolarized light.

 

 

Quantum Physics: Photons, Atoms, and Spectra

 

According to the wave theory of light, radio, microwaves, infrared, visible light, ultraviolet, X-rays, and gamma rays are electromagnetic waves.  However, according to the quantum theory of light, radio, microwaves, infrared, visible light, ultraviolet, X-rays, and gamma rays are composed of particles called photons.  In other words, a photon is a particle of the electromagnetic field.  The energy of a photon is determined by the equation Ephoton = h f, where h is the Planck constant, one of the fundamental physical constants of the universe.  The Planck constant is named for the German physicist Max Planck, one of the grandfathers of Quantum Mechanics, the correct theory of molecules, atoms, and subatomic particles.  According to this equation Ephoton = h f, the energy of a photon is directly proportional to the frequency.  Therefore, higher frequencies of light are composed of higher-energy photons, and lower frequencies of light are composed of lower-energy photons.  Therefore, the Electromagnetic Spectrum starting with the lowest photon energy is radio, microwaves, infrared, visible light, ultraviolet, X-rays, and gamma rays at the highest photon energy.  Notice that ultraviolet photons have greater energy than visible light photons; this is why ultraviolet causes suntans and sunburns.  Also notice that X-ray photons have even greater energy, so much so that they penetrate most substances.  This is why X-rays are used to take X-rays!  The subcategories of the visible light band of the Electromagnetic Spectrum starting at the lowest photon energy are red, orange, yellow, green, blue, indigo, and violet at the highest photon energy.

 

Since every blue photon in the universe has more energy than every red photon in the universe, if we had blue light composed of more photons and red light composed of fewer photons, the blue light would certainly be brighter than the red light.  If we had red light and blue light each composed of the same number of photons, the blue light would still be brighter than the red light, since each blue photon composing the blue light has more energy than each red photon composing the red light.  How could red light ever be equally as bright as blue light?  In this case, the red light must be composed of significantly more photons than the blue light to ensure that the total energy of the red light were equal to the total energy of the blue light, even though each red photon actually has less energy than each blue photon.  Furthermore, if every blue photon in the universe has more energy than every red photon in the universe, could red light ever be even brighter than blue light?  In this case, the bright red light must be composed of many many more photons than the dim blue light to ensure that the total energy of the red light is greater than the total energy of the blue light even though each red photon actually has less energy than each blue photon.  In summary, the total energy of light is equal to the number of photons multiplied by the energy of each photon.  This reveals a connection between the wave theory of light and the quantum theory of light.  According to the wave theory of light, the total energy of light is directly proportional to the square of its amplitude.  According to the quantum theory of light, the total energy of light is directly proportional to the product of the number of photons and the frequency (since the energy of each photon is directly proportional to the frequency through the equation Ephoton = h f).  We conclude the following connection between the wave theory of light and the quantum theory of light: the square of the amplitude of a light wave must be directly proportional to the product of the number of photons that composes the light and the frequency of the light.

 

All materials in the universe (such as solids, liquids, and gases) are composed of atoms.  Atoms are composed of even smaller particles.  The center of the atom is called the nucleus, since the center of anything is often called its nucleus.  For example, the center of a biological cell is called the cellular nucleus, and the center of an entire galaxy is called the galactic nucleus.  The center of an atom is more properly called the atomic nucleus, but we will often simply refer to it as the nucleus.  Surrounding the atomic nucleus are electrons.  The atomic nucleus is positively charged, and electrons are negatively charged.  Like charges repel, and unlike charges attract.  In other words, positive and positive repel, negative and negative repel, and positive and negative attract.  It is the attraction between the positive nucleus and the surrounding negative electrons that holds the atom together.  The atomic nucleus is composed of even smaller particles: protons and neutrons.  Protons are positively charged.  In fact, it is because of the protons that the entire atomic nucleus has a positive charge.  Neutrons have zero electrical charge.  In other words, neutrons are neutral.  This is why they are called neutrons!

 

The number of protons in the nucleus is the single most important number of the atom.  The number of protons in the nucleus is so important that it is called the atomic number.  The atomic number, which is always the number of protons in the nucleus, is so important that an atom is named solely based on its atomic number.  For example, every atom in the universe with twelve protons in its nucleus is considered to be a magnesium atom.  As another example, every atom in the universe with seven protons in its nucleus is considered to be a nitrogen atom.  We are not saying that the number of neutrons is irrelevant, nor are we saying that the number of electrons is irrelevant.  The neutrons and the electrons are quite important.  We are saying that the atomic number is always the number of protons, and the name of an atom is based only upon its atomic number, the number of protons.

 

Consider an atom where the number of electrons balances the number of protons.  Since protons are positive and electrons are negative, the atom is neutral overall.  Now suppose we add extra electrons to the atom.  Since electrons are negative, the atom will no longer be neutral overall; the atom will now be negative overall.  Suppose instead that we removed electrons from the atom in the first place.  Now the atom will be positive overall.  A charged atom is called an ion.  Therefore, changing the number of electrons results in ions.  For example, consider the sodium atom with the symbol Na.  The atomic number of sodium is eleven, meaning that every sodium atom in the universe has eleven protons.  We will make this clear with a subscript before the atom’s symbol as follows: 11Na.  If the sodium atom were neutral, it would have eleven electrons as well, but suppose we add three more electrons.  Since electrons are negative, we now have an ion with a charge of negative three.  We write the charge as a superscript after the symbol of the atom as follows: 11Na3.  Even though the charge is read negative three, the superscript is written in the strange way 3–.  As another example, consider the aluminum atom with the symbol Al.  The atomic number of aluminum is thirteen, meaning that every aluminum atom in the universe has thirteen protons.  We make this clear with a subscript before the atom’s symbol as follows: 13Al.  If the aluminum atom were neutral, it would have thirteen electrons as well, but suppose we remove two of its electrons.  We now have an ion with a charge of positive two.  We write the charge as a superscript after the symbol of the atom as follows: 13Al2+.  Even though the charge is read positive two, the superscript is written in the strange way 2+.  A positive ion is called a cation, and a negative ion is called an anion.  In other words, an anion (a negative ion) has extra electrons, while a cation (a positive ion) is deficient (has lost) electrons.

 

If we change the number of neutrons, we do not get ions, since neutrons are neutral.  So, adding or removing neutrons does not change the charge at all.  If we change the number of neutrons, what we are changing is the mass of the atom.  The atomic mass of an atom is the number of protons plus the number of neutrons.  We do not include the electrons when calculating the mass of the atom because an electron is almost two thousand times less massive than a proton or a neutron.  Thus, electrons contribute a minuscule amount to the mass of an atom.  A proton and a neutron have roughly equal amounts of mass, which is why we count them equally.  When we change the number of neutrons, we are changing the atomic mass of the atom.  Two atoms with the same atomic number but different atomic mass are called isotopes.  Therefore, changing the number of neutrons results in isotopes.  For example, consider the carbon atom with the symbol C.  The atomic number of carbon is six, meaning that every carbon atom in the universe has six protons.  We make this clear with a subscript before the symbol of the atom as follows: 6C.  However, carbon has three isotopes: carbon-twelve, carbon-thirteen, and carbon-fourteen.  An isotope is named based on its atomic mass.  Thus, the numbers twelve, thirteen, and fourteen are the atomic masses of these isotopes of carbon.  We make this clear with a superscript before the symbol of the atom as follows:  for carbon-twelve,  for carbon-thirteen, and  for carbon-fourteen.  Notice that carbon always has six protons, but the carbon-fourteen isotope has eight neutrons, since six plus eight equals fourteen.  The carbon-thirteen isotope has seven neutrons, since six plus seven equals thirteen.  The carbon-twelve isotope has six neutrons, since six plus six equals twelve.  As another example, consider the oxygen atom with the symbol O.  The atomic number of oxygen is eight, meaning that every oxygen atom in the universe has eight protons.  We make this clear with a subscript before the symbol of the atom as follows: 8O.  However, oxygen has three isotopes: oxygen-sixteen, oxygen-seventeen, and oxygen-eighteen.  An isotope is named based on its atomic mass.  Thus, the numbers sixteen, seventeen, and eighteen are the atomic masses of these isotopes of oxygen.  We make this clear with a superscript before the symbol of the atom as follows:  for oxygen-sixteen,  for oxygen-seventeen, and  for oxygen-eighteen.  Notice that oxygen always has eight protons, but the oxygen-eighteen isotope has ten neutrons, since eight plus ten equals eighteen.  The oxygen-seventeen isotope has nine neutrons, since eight plus nine equals seventeen.  The oxygen-sixteen isotope has eight neutrons, since eight plus eight equals sixteen.

 

Let us apply everything we have discussed about atoms to the following examples.  Consider the neon atom with the symbol Ne.  Now suppose we write 2–.  This neon atom has ten protons, eleven neutrons, twelve electrons, an atomic number of ten, an atomic mass of twenty-one, and a charge of negative two.  As another example, consider the boron atom with the symbol B.  (There are borons in this class!)  Now suppose we write 3+.  This boron atom has five protons, four neutrons, two electrons, an atomic number of five, an atomic mass of nine, and a charge of positive three.

 

The two most important atoms in this course are hydrogen and helium, since most of the atoms in the universe are hydrogen atoms, and helium atoms are the second most abundant atom in the universe.  The symbol for the hydrogen atom is H.  The atomic number of hydrogen is one, meaning that every hydrogen atom in the universe has one proton in its nucleus.  We make this clear with a subscript before the symbol of the atom as follows: 1H.  However, hydrogen has three isotopes: hydrogen-one which is written , hydrogen-two which is written , and hydrogen-three which is written .  Hydrogen is so important that these three isotopes have additional names besides hydrogen-one, hydrogen-two, and hydrogen-three.  Hydrogen-one is also called protium.  It is also called ordinary hydrogen, since most of the hydrogen atoms in the universe are this isotope.  Hydrogen-two is also called deuterium.  It is also called heavy hydrogen, since it is twice as massive as ordinary hydrogen.  (When an oxygen atom chemically bonds to two ordinary hydrogen atoms, the result is a molecule of ordinary water.  When an oxygen atom chemically bonds to two heavy hydrogen atoms, the result is a molecule of heavy water.)  Hydrogen-three is also called tritium.  The names protium, deuterium, and tritium are derived from the names of the nuclei of these atoms.  The atomic number of hydrogen is one, meaning that every hydrogen atom in the universe has one proton in its nucleus.  This means that the hydrogen-one isotope (or protium or ordinary hydrogen) has no neutrons in its nucleus, since one plus zero equals one.  In other words, its nucleus is a single proton all by itself.  This is the simplest nucleus in the universe.  Since the nucleus is a proton, when we put an electron around it to build the entire atom, we name the entire atom protium, since its nucleus is a proton.  The hydrogen-two isotope (or deuterium or heavy hydrogen) has one neutron in its nucleus, since one plus one equals two.  In other words, its nucleus is a proton and a neutron stuck to each other.  A proton and a neutron stuck to each other is called a deuteron.  Since the nucleus is a deuteron, when we put an electron around it to build the entire atom, we name the entire atom deuterium, since its nucleus is a deuteron.  The hydrogen-three isotope (or tritium) has two neutrons in its nucleus, since one plus two equals three.  In other words, its nucleus is a proton and two neutrons all stuck to one another.  A proton and two neutrons all stuck to one another is called a triton.  Since the nucleus is a triton, when we put an electron around it to build the entire atom, we name the entire atom tritium, since its nucleus is a triton.  The helium atom with the symbol He has an atomic number of two, meaning that every helium atom in the universe has two protons in its nucleus.  We make this clear with a subscript before the symbol of the atom as follows: 2He.  Most of the helium atoms in the universe are the helium-four isotope which is written .  Helium-four is also called ordinary helium, since most of the helium atoms in the universe are this isotope.  The nucleus of helium-four is composed of two protons and two neutrons, since two plus two equals four.  In other words, the nucleus of helium-four is two protons and two neutrons all stuck to one another.  Two protons and two neutrons all stuck to one another is called an alpha particle.  To summarize, the nucleus of the protium atom is a proton, the nucleus of the deuterium atom is a deuteron, the nucleus of the tritium atom is a triton, and the nucleus of the ordinary helium atom is an alpha particle.

 

Electrons do not orbit an atomic nucleus like planets orbit the Sun.  In fact, the electrons do not orbit at all; they exist in abstract quantum-mechanical states that we will not discuss deeply in this course.  We simply state that there are definite energy levels within an atom.  Some levels are at lower energies, and other levels are at higher energies.  If an electron wishes to change its energy from a lower level to a higher level, it must absorb a photon, a particle of light.  However, not any photon will accomplish this transition.  The energy of the photon absorbed must be exactly equal to the difference in energy between the two levels.  If an electron wishes to change its energy from a higher level to a lower level, it must emit (spit out) a photon, but not any photon will accomplish this transition.  The energy of the photon emitted must be exactly equal to the difference in energy between the two levels.  Therefore, an atom can only absorb or emit photons of very specific energies (or very specific frequencies or very specific wavelengths).  The list of all the allowed photon energies (or frequencies or wavelengths) an atom is permitted to absorb is called the absorption spectrum of the atom, and the list of all the allowed photon energies (or frequencies or wavelengths) an atom is permitted to emit is called the emission spectrum of the atom.  The study and measurement of spectra is called spectroscopy.  Since different atoms have different energy levels, every atom has its own unique spectrum, different from the spectra of all other atoms.  Therefore, the spectrum of an atom is rather like its fingerprint, enabling us to uniquely identify an atom.  A spectacular application of spectroscopy was the discovery of the Sun’s composition, the discovery of which atoms compose the Sun.  In the early nineteenth century (early 1800s), the Bavarian physicist Joseph von Fraunhofer discovered missing wavelengths in the Sun’s light.  These absorption lines within sunlight were later called Fraunhofer lines in his honor.  By measuring the wavelengths of these absorption lines and consulting a table of absorption spectra, we can determine which atoms absorbed these missing wavelengths and thus determine the composition of the Sun.  We discover that the Sun is composed of all the atoms across the entire the Periodic Table of Elements, but not in equal amounts.  Only two atoms account for close to one hundred percent of the Sun’s mass; all the other atoms on the Periodic Table of Elements account for only a tiny fraction (tiny percentage) of the Sun’s mass.  What are these two elements that account for close to one hundred percent of the Sun’s mass?  We discover from the Fraunhofer lines in sunlight that hydrogen atoms account for roughly three-quarters (roughly seventy-five percent) of the Sun’s mass.  What about the remaining roughly one-quarter (roughly twenty-five percent) of the Sun’s mass?  During the nineteenth century (the 1800s), the wavelengths of the remaining absorption lines were not found in any atom’s tabulated absorption spectrum!  Evidently, roughly one-quarter (roughly twenty-five percent) of the Sun’s mass is composed of a new atom never before discovered!  This newly-discovered atom was called helium, named for Helios the personification of the Sun in ancient Greek mythology.  In the early twentieth century (early 1900s), helium was discovered on Earth as the product of certain nuclear reactions, and today helium is in common use, in blimps and in party balloons for example.  Nevertheless, helium was first discovered from its absorption lines in the Sun’s light!

 

What is temperature?  What do we mean when we say something is hot?  What do we mean when we say something is cold?  The temperature of an object is a measure of the average energy of the atoms that compose that object.  In this course, we may assume that the average energy of atoms corresponds to their average speed.  In other words, the atoms of a hotter object are moving relatively faster, whereas the atoms of a cooler object are moving relatively slower.  There are two scales of temperature in common use: degrees fahrenheit and degrees celsius.  However, neither degrees fahrenheit nor degrees celsius are acceptable units of temperature, since zero is in the wrong place in both of these scales.  What do we mean by saying zero is in the wrong place?  If the temperature of an object is a measure of the average speed of its atoms, then the coldest possible temperature of our universe is the temperature at which all the atoms of an object completely stop moving.  After all, there is no slower speed than not moving at all!  The temperature at which all atoms completely stop moving is commonly called the absolute zero of temperature.  However, this absolute zero of temperature is not zero degrees fahrenheit nor is it zero degrees celsius.  Atoms are still moving at zero degrees fahrenheit, and atoms are still moving at zero degrees celsius.  There are negative temperatures on both of these scales (commonly called temperatures below zero) where the atoms move slower still.  The absolute zero of temperature when all atoms completely stop moving is exactly negative 273.15 degrees celsius or exactly negative 459.67 degrees fahrenheit.  A correct unit of temperature must assign the number zero to the absolute zero of temperature.  The simplest way to correct degrees celsius is to add 273.15 to all degrees celsius.  Since absolute zero is negative 273.15 degrees celsius, then adding 273.15 would yield zero, and all other temperatures would become positive.  The simplest way to correct degrees fahrenheit is to add 459.67 to all degrees fahrenheit.  Since absolute zero is negative 459.67 degrees fahrenheit, then adding 459.67 would yield zero, and all other temperatures would become positive.  When we correct the celsius scale by adding 273.15 to all degrees celsius, the resulting correct units of temperature are called kelvins.  When we correct the fahrenheit scale by adding 459.67 to all degrees fahrenheit, the resulting correct units of temperature are called rankines.  To summarize, absolute zero temperature is negative 273.15 degrees celsius or negative 459.67 degrees fahrenheit using incorrect temperature scales, but absolute zero temperature is zero kelvins or zero rankines using correct units of temperature.  We will use kelvins throughout this course.  Although kelvins are a correct unit of temperature, it is somewhat difficult growing accustomed to kelvins.  For example, most humans consider 280 kelvins to be uncomfortably cold, most humans consider 300 kelvins to be a comfortable room temperature, and most humans consider 320 kelvins to be uncomfortably hot.

 

The Third Law of Thermodynamics states that it is impossible to cool an object to absolute zero temperature in a finite number of processes.  It follows that every object in the universe has a temperature that is warmer than absolute zero.  Therefore, every object in the universe has its atoms moving at some average speed.  Since atoms are composed of protons, neutrons, and electrons and since protons and electrons are charged, every object in the universe radiates electromagnetic waves from its moving atoms.  The neutrons also contribute to these electromagnetic waves.  Although neutrons are neutral, they still have some electromagnetic properties.  The amount of energy radiated from a hot, dense object often follows the blackbody spectrum, which is a continuous spectrum with its primary radiation within a band of the Electromagnetic Spectrum determined by the temperature of the object.  In particular, hotter temperatures correspond to higher photon energies (which are also at higher frequencies and shorter wavelengths), while cooler temperatures correspond to lower photon energies (which are also at lower frequencies and longer wavelengths).  In other words, a hot, dense object’s primary radiation is displaced as its temperature changes.  This is called the Wien displacement law, named for the German physicist Wilhelm Wien who discovered this relationship between a hot, dense object’s primary radiation and the object’s temperature.  More precisely, the Wien displacement law states that the wavelength of a hot, dense object’s primary radiation is inversely proportional to its temperature, assuming we measure temperature with correct units such as kelvins or rankines.  At extremely cold temperatures (close to absolute zero), objects radiate primarily in the microwave band of the Electromagnetic Spectrum.  At a few hundred kelvins (such as room temperatures), objects radiate primarily in the infrared band of the Electromagnetic Spectrum.  At roughly one or two thousand kelvins, objects radiate primarily red visible light.  At roughly three or four thousand kelvins, objects radiate primarily orange visible light.  At roughly five or six thousand kelvins, objects radiate primarily yellow visible light.  At roughly ten thousand kelvins, objects radiate primarily blue visible light.  At hundreds of thousands of kelvins, objects radiate primarily in the ultraviolet band of the Electromagnetic Spectrum.  At a couple million kelvins, objects radiate primarily in the X-ray band of the Electromagnetic Spectrum.  At tens of millions of kelvins, objects radiate primarily in the gamma-ray band of the Electromagnetic Spectrum.  Notice how hotter temperatures displace the primary radiation to higher and higher photon energies (which are also higher and higher frequencies and shorter and shorter wavelengths), while cooler temperatures displace the primary radiation to lower and lower photon energies (which are also lower and lower frequencies and longer and longer wavelengths).  We can demonstrate this Wien displacement law by heating metal.  A metal that is sufficiently hot radiates red visible light.  As the metal is made even hotter, it radiates orange visible light.  If the metal is made hotter still, it radiates yellow visible light.  We can also demonstrate this Wien displacement law with a flame on a stovetop.  At the lowest setting, the flame is red.  At a slightly higher setting, the flame is orange.  At an even higher setting, the flame is yellow, and the hottest part of the flame is blue.  The Sun is a yellow star, and from that yellow color we can correctly estimate that the surface temperature of the Sun is roughly six thousand kelvins.  Stars throughout the universe that are red in color are cooler than our Sun, stars that are blue in color are hotter than our Sun, and stars that are yellow in color are roughly the same temperature as our Sun.  We must emphasize that we are referring to the color that an object radiates because it is hot enough to be radiating that color.  Many objects have various different colors even though they are all at room temperature, such as red roses, yellow paint, green grass, and blue jeans.  These objects are not radiating these colors; these objects are reflecting these colors while absorbing all other colors.  We must be careful to make a distinction between the color of an object simply because it is reflecting that color versus the color of an object because it is actually hot enough to be radiating that color.  A red pen is at room temperature, while a piece of charcoal glowing red is at roughly one or two thousand kelvins of temperature!

 

We may summarize our discussion of spectroscopy with the three Kirchhoff laws of spectroscopy, named for the German physicist Gustav Kirchhoff.  The first Kirchhoff law of spectroscopy states that a dense object (such as a solid, a liquid, or even a sufficiently dense gas) will radiate (or emit) light at all wavelengths (or frequencies or photon energies).  This is a continuous spectrum.  The second Kirchhoff law of spectroscopy states that a tenuous (low density) gas that is sufficiently hot will radiate (or emit) light at particular wavelengths (or frequencies or photon energies) that are unique to the particular gas.  This is an emission spectrum.  The third Kirchhoff law of spectroscopy states that a tenuous (low density) gas that is sufficiently cool that is subjected to a continuous spectrum will remove light at particular wavelengths (or frequencies or photon energies) that are unique to the particular gas.  This is an absorption spectrum.  Generally, continuous spectra enable us to determine the temperature of an object, whereas emission spectra and absorption spectra enable us to determine the composition of an object.

 

 

Observational Astronomy

 

Since light is a wave, light must obey the Law of Reflection θ1 = θ3 and the Law of Refraction sin(θ1)/v1 = sin(θ2)/v2.  A device that reflects light is called a mirror.  A device that refracts light is called a lens.  Since most metals reflect light very well, we can construct a mirror by coating a piece of glass with a metal (often aluminum) and polishing the metal.  Any piece of glass may be regarded as a lens, since light will refract (bend) as it is transmitted from the air into the glass and will refract (bend) again as it is transmitted from within the glass back into the air.  In the following discussion of mirrors and lenses, we will assume that we are in the paraxial approximation.  In this approximation, all light rays incident upon a mirror or lens must be near the symmetry axis of the mirror or lens.  We can guarantee that we are in the paraxial approximation by constructing the mirror or lens with a large radius of curvature.

 

A curved mirror with its center of curvature and its focal point facing toward the light incident upon it is called a concave mirror.  In this case, we say that the focal point is in front of the mirror.  A curved mirror with its center of curvature and its focal point facing away from the light incident upon it is called a convex mirror.  In this case, we say that the focal point is behind the mirror.  In the paraxial approximation, light incident upon a concave mirror will reflect and converge at the focal point that is in front of the mirror.  For this reason, a concave mirror is also called a converging mirror.  Also in the paraxial approximation, light incident upon a convex mirror will reflect and diverge away from the focal point that is behind the mirror.  For this reason, a convex mirror is also called a diverging mirror.  A curved lens that is thicker in its middle than it is at its edge is called a convex lens.  A curved lens that is thinner in its middle than it is at its edge is called a concave lens.  In the paraxial approximation, light incident upon a convex lens will refract and converge at the focal point that is on the opposite side of the lens as the incident rays.  For this reason, a convex lens is also called a converging lens.  Also in the paraxial approximation, light incident upon a concave lens will refract and diverge away from the focal point that is on the same side of the lens as the incident rays.  For this reason, a concave lens is also called a diverging lens.  Notice that mirrors and lenses are completely opposite in character.  A concave mirror is converging, but a concave lens is diverging.  A convex mirror is diverging, but a convex lens is converging.  Even the geometry of convergence or divergence (as the case may be) is opposite in character.  In particular, light rays converge to a focus on the same side as the incident rays for a converging mirror, but light rays converge to a focus on the opposite side as the incident rays for a converging lens.  Also, light rays diverge away from a focus on the opposite side as the incident rays for a diverging mirror, but light rays diverge away from a focus on the same side as the incident rays for a diverging lens.

 

A telescope is a device that collects light from a large distant object.  We must never confuse a telescope with a microscope, which is a device that collects light from a small nearby object.  When we define a telescope as a device that collects light from a large distant object, we mean any type of light whatsoever.  In other words, a telescope is a device that collects from a large distant object electromagnetic waves (composed of photons) from any band whatsoever across the Electromagnetic Spectrum.  A telescope that collects radio waves is called a radio telescope.  A telescope that collects infrared is called an infrared telescope.  A telescope that collects ultraviolet is called an ultraviolet telescope.  A telescope that collects X-rays is called an X-ray telescope.  A telescope that collects gamma rays is called a gamma-ray telescope.  Caution: a telescope that collects visible light is called an optical telescope.  We now realize that whenever we use the word telescope in colloquial English, we probably mean to use the term optical telescope, since there are other types of telescopes that collect other forms of light that our eyes cannot see.  Whereas the optical telescope was invented more than four hundred years ago, it has only been in recent decades that other types of telescopes have been built thus giving astronomers a more complete understanding of the universe by collecting light from across the entire Electromagnetic Spectrum from astronomical objects such as stars and galaxies.

 

Optical telescopes are often divided into two categories: refracting telescopes (or just refractors for short) and reflecting telescopes (or just reflectors for short).  Refracting telescopes use lenses as their primary optical instruments, while reflecting telescopes use mirrors as their primary optical instruments.  Refracting telescopes and reflecting telescopes each have their own particular advantages and disadvantages, but astronomers have concluded in recent decades that the advantages of reflectors far outweigh their disadvantages and that the disadvantages of refractors far outweigh their advantages.  For example, dispersion causes different colors of light to refract through a lens by different angles, causing the final image to appear blurred with color.  This is called chromatic aberration.  Refracting telescopes suffer from chromatic aberration, since they use lenses as their primary optical instruments.  However, reflecting telescopes do not suffer from chromatic aberration, since they use mirrors as their primary optical instruments.  Mirrors reflect light, and the Law of Reflection states that the angle of incidence is equal to the angle of reflection in all cases regardless of color.  As another example of an advantage of reflectors and disadvantage of refractors, any small imperfection within the glass composing a lens may render the entire refracting telescope useless, since light must refract through the glass that composes the lens.  However, any imperfection within the glass supporting a mirror is irrelevant to the operation of a reflecting telescope, since light will not refract through the glass at all but will instead reflect off of the metal (often aluminum) that is coated on the surface of the glass.  Certainly, any imperfections within the metal coating will affect the operation of the reflecting telescope, but such imperfections are much easier to correct than imperfections within the glass of a lens.  Since astronomers have concluded in recent decades that reflectors are superior to refractors, all of the large optical telescopes built in recent decades have been and continue to be reflecting telescopes.  Nevertheless, the first optical telescopes ever built were small refractors.  To build a primitive refracting telescope, all that is required is two lenses with different focal lengths.  The lens with the smaller focal length is placed closer to the eye; this is called the ocular lens, commonly known as the eyepiece.  The lens with the larger focal length is placed further from the eye; this is called the objective lens, since the light from the object we are observing will first pass through that lens before passing through the ocular lens (the eyepiece) and then finally into our eye.  The two lenses (objective and ocular) must be aligned with each other so that they share the same symmetry axis.  The distance between the two lenses must be the sum of the two focal lengths of the lenses, and the magnification of the resulting image when looking through the telescope is equal to the ratio of the two focal lengths of the lenses.  For example, suppose we wish to build a small refracting telescope from two lenses, one with a three-inch focal length and the other with a twelve-inch focal length.  The focal length of the ocular lens is three inches, and the focal length of the objective lens is twelve inches.  The distance between these two lenses must be fifteen inches, since the sum of three and twelve is fifteen.  (The word sum means addition.  In this example, twelve plus three equals fifteen.)  Finally, everything observed through this telescope will be magnified four times, appearing to be four times larger or four times closer, since the ratio of twelve to three is four.  (The word ratio means division.  In this example, twelve divided by three equals four.)  If we looked through this telescope the wrong way around, everything would appear to be four times smaller or four times further, since the magnification would now be one-fourth instead of four.

 

It is commonly known that the image of an astronomical object is magnified by a telescope.  More precisely, the resolving power of a telescope is the maximum extent to which the image formed by the telescope can separate (or resolve) two distant objects.  For any telescope, diffraction (the bending of a wave around obstacles) limits the resolving power of the telescope.  Hence, the resolving power of a telescope is directly related to the size of the telescope’s primary optical instrument, whether mirror or lens as the case may be.  A larger telescope (larger mirror or lens) will result in greater resolving power (commonly known as higher magnification), while a smaller telescope (smaller mirror or lens) will result in poorer resolving power (commonly known as lower magnification).  A telescope that achieves its maximum resolving power as constrained by diffraction is said to be at its own diffraction limit.

 

Any telescope on planet Earth is called a ground-based telescope, while any telescope in outer space (usually orbiting the Earth) is called a space telescope.  The Earth’s atmosphere is the single most important factor that causes ground-based optical telescopes to operate far below their maximum resolving power, far below their diffraction limit.  This is for several reasons.  Firstly, light pollution is light from human activities (such as city lights and highway lights) that adds brightness to the night sky thus preventing astronomers from observing dim stars and galaxies.  Secondly, even in the absence of light pollution, the Earth’s atmosphere continuously refracts incoming light from outer space.  This is why stars appear to twinkle; the atmosphere’s continuous refraction of light is so severe that even our naked eyes observe stars appearing to twinkle as a result!  Note however that the planets in our Solar System do not appear to twinkle in the sky.  The planets in our Solar System are hundreds of thousands of times closer to us than the nearest stars besides our Sun, causing these planets to appear to have a perceptible (noticeable) size in the sky.  Although this perceptible (noticeable) size is still small and although the Earth’s atmosphere still continuously refracts the incoming light from these planets, the blurring is nevertheless smaller than their perceptible (noticeable) size in the sky, thus eliminating the twinkling effect, causing these planets to appear steady (non-twinkling) in the sky.  An extreme example of this lack of twinkling is our Moon.  The Moon is by far the closest astronomical object in the entire universe.  The Moon is so close to us that its size in the sky is unmistakable.  Although the Earth’s atmosphere still continuously refracts the incoming light from our Moon, the blurring is much smaller than its unmistakable size in the sky.  Thus, the image of the Moon in the sky appears to remain steady.  Astronomers use the word seeing to describe whether atmospheric conditions happen to be favorable or unfavorable for astronomical observing.  If atmospheric conditions happen to be particularly steady thus minimizing blurring, astronomers say that the seeing is good or excellent.  If atmospheric conditions happen to cause severe blurring, astronomers say that the seeing is poor or bad.  Furthermore, the Earth’s atmosphere is opaque to certain wavelengths of light.  In particular, the Earth’s atmosphere is opaque to X-rays, as we will discuss later in the course.  As a result, a ground-based X-ray telescope would not collect any X-rays from stars or galaxies at all!  All X-ray telescopes must therefore be space telescopes.  Similar difficulties arise with infrared telescopes and ultraviolet telescopes, since certain gases within the Earth’s atmosphere absorb far ultraviolet, while other gases within the Earth’s atmosphere absorb infrared, as we will discuss later in the course.  For all of these reasons, the National Aeronautics and Space Administration (NASA) has launched several space telescopes, each one covering a different band of the Electromagnetic Spectrum.  These telescopes are called the NASA Great Space Observatories, since astronomers have gained a more complete understanding of the universe using these space telescopes.  The Hubble telescope is the great optical space telescope (placed in Earth orbit in 1990 and is still in operation).  The Compton observatory was the great gamma-ray space telescope (in operation from 1991 to 2000).  The Compton observatory was replaced by the Swift observatory (placed in Earth orbit in 2004 and is still in operation) and the Fermi telescope (placed in Earth orbit in 2008 and is still in operation).  The Chandra observatory is the great X-ray space telescope (placed in Earth orbit in 1999 as is still in operation).  The Spitzer telescope was the great infrared space telescope (in operation from 2003 to 2020).  The Spitzer telescope was replaced by the Webb telescope (placed in solar orbit in 2022 and is still in operation).  The Cosmic Background Explorer was the great microwave space telescope (in operation from 1989 to 1993).  The Cosmic Background Explorer was replaced by the Wilkinson telescope (in operation from 2001 to 2010).  The Galaxy Evolution Explorer was the great ultraviolet space telescope (in operation from 2003 to 2013).  Although these space telescopes have given astronomers a more complete understanding of the universe, it is nevertheless expensive and dangerous to launch and to service space telescopes.  Therefore, astronomers continue to build and use ground-based telescopes.  There are many ground-based optical telescopes much larger than the Hubble telescope for example.  In order to minimize the negative effects of the Earth’s atmosphere, ground-based astronomical observatories should always be built at astronomically optimal locations.  Firstly, ground-based astronomical observatories should be built far from cities to minimize light pollution and thus maximize the darkness of the night sky.  Secondly, ground-based astronomical observatories should be built at high elevations, such as on mountaintops.  By operating significantly above sea level, light from outer space will propagate through less of the Earth’s atmosphere, thus reducing its negative effects.  Thirdly, ground-based astronomical observatories should be built in arid (dry) locations, such as in deserts, to further reduce the negative effects of the Earth’s atmosphere.  In summary, ground-based astronomical observatories should be built at locations that are dark (far from cities) and high (on mountaintops) and dry (in deserts).

 

In recent decades, extraordinary technologies have been developed enabling ground-based telescopes to match the quality of observations of space telescopes.  For example, instead of using photographic film, photoelectric detectors together with computers permit astronomers to digitally subtract the negative effects of the Earth’s atmosphere and even to digitally correct for any imperfections in the optical devices composing the telescope.  In particular, the charged-coupled device (CCD) is a small photoelectric detector that can detect light (photons) with extraordinary efficiency and sensitivity.  The manufacture of these charged-coupled devices (CCDs) eventually became so inexpensive that they are now integrated into every mobile telephone as a digital camera, although these digital cameras have nowhere nearly the same sensitivity as the charged-coupled devices (CCDs) used with telescopes.  Adaptive optics is another extraordinary technology that corrects for the continuous refraction of light caused by the Earth’s atmosphere.  This technology uses deformable mirrors that continuously adjust their shapes based upon how light has been blurred by the Earth’s atmosphere.  Astronomers use laser beams to continuously probe the motion of the air to determine how the shape of the deformable mirror is to be adjusted, thus forming images that have eliminated the blurring of the atmosphere.

 

Astronomers use telescopes to make a wide variety of measurements to study the universe.  Astrometry is the precise measurement of the position and the motion of astronomical objects as they appear on the Celestial Sphere (as they appear in the sky).  As we discussed, this reveals not only the physical motion of stars and galaxies through the universe but also the slow precession and nutation of the Earth’s own axis of rotation.  Photometry is the measurement of the apparent brightness of astronomical objects (the brightness as they appear in the sky).  For thousands of years before the invention of the telescope, photometric measurements were made with the human eye, which can modestly discriminate between somewhat brighter stars and somewhat dimmer stars.  Even after the telescope was invented, the human eye together with telescopes continued to make photometric measurements for a few centuries.  Photographic film together with telescopes began to be used for photometric measurements roughly one century ago, and in recent decades charged-coupled devices (CCDs) together with telescopes have made very precise photometric measurements of extraordinarily dim stars and galaxies.  Spectroscopy is the measurement of spectra, the measurement of wavelengths (or frequencies or photon energies) within the light from astronomical objects.  As we discussed, the measurement of either emission spectra or absorption spectra can reveal the composition of an astronomical object, and the measurement of the primary wavelength of a continuous spectrum can reveal the temperature of an astronomical object.  Spectroscopy also reveals the motion of an astronomical object.  In particular, shifts in the wavelengths (or the frequencies or the photon energies) of spectral lines (whether absorption or emission) within the light of an astronomical object reveals not only the direction of motion of the astronomical object (whether toward us or away from us) but in addition its speed of motion (whether fast or slow), as determined by the Doppler-Fizeau effect.  Spectroscopy can even reveal the rotational speed of a star.  As a star rotates, one side of it moves toward us, while the other side of the star moves away from us.  Hence, the light from one side of the star will be blueshifted, while the light from the other side of the star will be redshifted, again as determined by the Doppler-Fizeau effect.  As a result, any particular spectral line (whether absorption or emission) will actually be broadened over a range of wavelengths (or frequencies or photon energies).  If the star is rotating faster, the blueshift and the redshift from either side of the star will be more severe, resulting in more widely broadened spectral lines.  If the star is rotating slower, the blueshift and the redshift from either side of the star will be less severe, resulting in more narrowly broadened spectral lines.  Therefore, the width of broadened spectral lines (either absorption or emission) reveals the rotational speed of the star (whether fast or slow).  Polarimetry is the measurement of the polarization of the light from astronomical objects.  Interferometry is the use of more than one telescope to measure the interference (constructive, destructive, or mixed) among the light collected by the telescopes from the same astronomical object.  Interferometry increases the resolving power of each telescope tremendously.  In principle, interferometry permits two telescopes to together have the same resolving power as a single telescope with a size equal to the distance between the two individual telescopes.  For example, two telescopes on opposite sides of planet Earth used together as a single interferometer would in principle have the same resolving power as a single telescope the size of planet Earth!  In recent decades, many astronomical observatories use not one telescope but a pair of telescopes working together as a single interferometer.  However, the total amount of light collected by the two telescopes would still be determined by the size of the two individual telescopes combined, not the effective size of the entire interferometer.  For this reason, ideally an interferometer should be not just two telescopes but many telescopes working together to form a single interferometer to increase the light gathering power of the entire interferometer.  A radio interferometer for example often consists of dozens of radio telescopes all working together as a single interferometer.

 

 

History of Astronomy from Ancient Astronomy to the Beginnings of Modern Astronomy

 

For thousands of years before the invention of the telescope, humans looked up into the sky and tracked the motion of the Sun, the motion of the stars, and the motion of a band of milk wrapped around the entire sky they called the milky way.  For thousands of years, humans watched the Sun (during the daytime) rise in the eastern horizon and set in the western horizon, and they watched the stars and the milky way (during the nighttime) also rise in the eastern horizon and set in the western horizon.  Therefore, humans concluded that the Earth is at the center of the universe, and everything else in the universe moves around the Earth.  The first person to question whether or not this is actually the case was Aristarchus of Samos, an ancient Greek astronomer who lived twenty-three centuries ago.  Using geometry, he attempted to calculate the sizes of the Sun and the Moon relative to the size of the Earth.  He calculated that the Moon is smaller than the Earth, and he calculated that the Sun is larger than the Earth.  Today, we know that Aristarchus’s numerical results were significantly incorrect, since he made some false assumptions in his calculations.  Nevertheless, we know today that the Moon is indeed smaller than the Earth, and we know today that the Sun is indeed larger than the Earth.  In other words, Aristarchus’s results may not have been quantitatively correct, but they were at least qualitatively correct.  Aristarchus then reasoned that it made no sense for the Sun to move around the Earth if it was larger than the Earth; he argued that it made more sense for the Earth to move around the Sun.  Aristarchus’s Greek contemporaries persuaded him that the Earth could not be moving, since we would then see stars appear to suffer parallax if the Earth were moving.  Parallax is the apparent motion of an object when in actuality the observer is moving.  Since the ancient Greeks did not see stars suffer parallax, they continued to believe that the Earth is not moving and that everything else in the universe, including the Sun, moves around the Earth.  Today, we of course know that Aristarchus was correct; the Earth does move around the Sun, but his Greek contemporaries were also correct: stars must appear to suffer parallax due to the Earth’s motion around the Sun.  No one at the time realized how incredibly distant stars truly are.  Stars are so distant that our naked eyes cannot observe the tiny parallax they appear to suffer.  (The further distant a star, the smaller the parallax.)  These tiny parallax angles were finally measured during the Modern Ages thanks to the invention of the telescope, but ancient humans did not have the ability to measure these tiny angles.  Since the naked eye cannot see the parallax of stars, ancient humans continued to believe that the Earth is at the center of the universe, and that everything else in the universe moves around the Earth.

 

For thousands of years, humans looked up into the sky and observed that stars appeared to remain fixed relative to one another while continuously rising and setting.  However, ancient humans also noticed seven objects that do not remain fixed relative to the stars or even to each another.  These seven objects appeared to wander around the sky.  The Greek word for wanderer transliterates to the English word planet.  The seven planets (wanderers) of ancient astronomy are the Sun, the Moon, Mercury, Venus, Mars, Jupiter, and Saturn.  Today, we know that the Sun and the Moon are not truly planets, but the meaning of the word planet in ancient astronomy was wanderer, and the Sun and the Moon do indeed appear to wander around the sky.  Also note that ancient humans did not understand that the Earth is itself a planet.  The reason for this is obvious: we look up to see the planets, but we look down to see the Earth!  We will use the term ancient planets for these seven objects so that we will not confuse them with the modern and correct meaning of the word planet.  For thousands of years, humans observed only these seven ancient planets: the Sun, the Moon, Mercury, Venus, Mars, Jupiter, and Saturn.  For this reason, the number seven has attained tremendous importance in many different cultures to the present day.  For example, seven is considered to be a lucky number, even to the present day.  In addition, we continue to use to the present day a calendar with seven days in a week, and each of these days is named for one of the ancient planets.  This is most obvious for Sunday which means day of the Sun, Monday which means day of the Moon, and Saturday which means day of Saturn.  Thursday means day of Thor, who was the northern European mythological analogue to Jupiter in ancient Roman mythology.  Friday means day of Frea, who was the northern European mythological analogue to Venus in ancient Roman mythology.  Tuesday means day of Týr, who was the northern European mythological analogue to Mars in ancient Roman mythology.  Wednesday means day of Odin, who was the northern European mythological analogue to Mercury in ancient Roman mythology.  Today we know that other objects appear to wander around the sky, such as Uranus, Neptune, Pluto, Eris, Ceres, and Vesta for example, but these objects are too dim to be seen with the naked eye; they were not discovered until the Modern Ages after the telescope was invented.  Actually, it is possible to see Uranus with the naked eye under ideal conditions.  If ancient humans had discovered Uranus, then there would have been eight ancient planets instead of seven.  As a result, eight would have been considered a lucky number instead of seven.  Furthermore, we would be using a calendar with eight days in a week instead of seven, and the eighth day of the week would most certainly be called Uranusday!

 

For thousands of years, humans noticed that Mercury, Venus, Mars, Jupiter, and Saturn at times appear to slow down and stop and then turn around and move retrograde (backwards from their usual motion) until slowing down and stopping again before continuing with their original motion.  Ancient humans also noticed that the Sun and the Moon never move retrograde.  The Greco-Roman astronomer Claudius Ptolemy who lived nineteen centuries ago formulated a model of the universe to explain these motions.  Ptolemy’s model of the universe was a geocentric model, meaning that it placed the Earth at the center of the universe, as all humans believed at the time.  Anyone who believes that the Earth is the center of the universe is called a geocentrist.  The words geocentric and geocentrist are derived from the Greek root geo- for Gaia, the personification of the Earth in ancient Greek mythology.  Many other words are derived from the same Greek root, such as geology, geologist, and geography for example.  According to Ptolemy’s geocentric model, the Earth is at the center of the universe, and the Moon moves along a simple circle around the Earth, since the Moon never appears to move retrograde.  Next comes Mercury and Venus in this model.  In order to explain their occasional retrograde motions, Ptolemy claimed that they must be moving around small circles (called epicycles) while at the same time moving around large circles (called deferents) around the Earth.  Next comes the Sun, which according to Ptolemy moves along a simple circle around the Earth, since the Sun never appears to move retrograde.  Next comes Mars, Jupiter, and Saturn, which Ptolemy claimed must be moving around small epicycles while at the same time moving around large deferents around the Earth to explain their occasional retrograde motions.  Finally, Ptolemy claimed that the stars and the milky way were very far from the Earth and fixed relative to one another.  Although we know today that Ptolemy’s geocentric model of the universe is not correct, this model nevertheless predicted the motions of the ancient planets with fair reliability.  Therefore, humans believed quite strongly in Ptolemy’s geocentric model of the universe.  For the rest of the history of the Western Roman Empire, humans accepted Ptolemy’s geocentric model of the universe.  Even after the Western Roman Empire crumbled, Europeans during the Middle Ages continued to believe in Ptolemy’s geocentric model of the universe.  Truth be told, Middle-Age Europeans believed in Ptolemy’s geocentric model not only because of its fair reliability but also fearing punishment from the Catholic Church which had adopted this model as part of its doctrine.

 

Many historians agree that the Modern Ages of human history begins roughly five centuries ago due to the dramatic political, economic, social, artistic, religious, and scientific changes that occurred.  Artistically, the geniuses of the Age of Renaissance began to create magnificent paintings, sculptures, architecture, music, and literature.  Politically, several long-term wars finally concluded, including the Hundred Year War, the Wars of the Roses, the Byzantine-Ottoman Wars, and the Granada War.  The adventurers of the Age of Exploration discovered and explored the American continents, leading directly to major European powers expanding their empires into the American continents.  The global trade that emerged led directly to the establishment of modern banks.  Societally, the invention of the printing press resulted in the proliferation of books and hence dramatically increased literacy.  Religiously, the leaders of the Protestant Reformation questioned the doctrines and the authority of the Catholic Church.  Mathematics and the sciences, astronomy in particular, played an important role in these revolutionary changes in human history.

 

Modern astronomy begins with the work of the Polish astronomer Nicolaus Copernicus who lived five centuries ago.  Copernicus formulated a simpler model of the universe than Ptolemy’s geocentric model.  Copernicus’s model was a heliocentric model, meaning that it placed the Sun at the center of the universe.  Anyone who believes that the Sun is at the center of the universe is called a heliocentrist.  The words heliocentric and heliocentrist are derived from the Greek root helio- for Helios, the personification of the Sun in ancient Greek mythology.  Many other words are derived from the same Greek root, such as helium, helioseismology, helioseismologist, and helium for example.  According to Copernicus, Mercury and Venus move around the Sun along simple circles.  Next comes the Earth, which Copernicus claimed was a planet that also moves around the Sun along a simple circle.  Next comes Mars, Jupiter, and Saturn, which Copernicus claimed also move around the Sun along simple circles.  If Mercury, Venus, Earth, Mars, Jupiter, and Saturn all move around the Sun along simple circles, then how did the heliocentric Copernicus model explain occasional retrograde motions?  Copernicus claimed that whenever the Earth moved passed another planet, it would appear as if the planet moved backwards relative to the Earth when in fact the planet’s motion did not actually change.  Copernicus’s heliocentric model of the universe is certainly simpler than Ptolemy’s geocentric model of the universe.  Nevertheless, Copernicus’s heliocentric model did not predict the motions of the planets in the sky any more reliably than Ptolemy’s geocentric model.  Therefore, Europeans continued to believe that the Earth is at the center of the universe.  Again truth be told, Europeans continued to believe in Ptolemy’s geocentric model not only because of its fair reliability but also fearing punishment from the Catholic Church.  Copernicus himself waited until he was dying from natural causes before publishing his heliocentric model.

 

For thousands of years before the invention of the telescope, humans built ancient observatories that used large objects to point into the sky to track the motions of stars and ancient planets.  We will use the term ancient observatory so as not to cause confusion with modern observatories, which use telescopes.  Examples of ancient observatories include Stonehenge in England and the pyramids in Egypt and Mexico.  The Danish astronomer Tycho Brahe who lived more than four centuries ago built such an observatory and spent decades of his life tracking the motions of the ancient planets.  He collected so much data that he hired the German mathematician Johannes Kepler to analyze the data.  Tycho Brahe died shortly thereafter, and Kepler then proceeded to use Brahe’s measurements to attempt to prove with certainty that Copernicus was correct, that the Earth together with the other planets do indeed move around the Sun.  Kepler did not succeed until he abandoned the assumption that the planets move along simple circles.  After abandoning this assumption, Kepler used Brahe’s measurements to show with superb accuracy that the planets do indeed move around the Sun.  Moreover, he formulated what we today call Kepler’s three laws of planetary motion: the Law of Ellipses, the Law of Equal Areas, and the Law of Periods.

 

According to Kepler’s first law, the Law of Ellipses, the planets (including the Earth) move around the Sun along orbits that are ellipses.  An ellipse is an elongated circle with a major axis that is longer than and perpendicular to its minor axis.  Every ellipse has two focal points (or foci) on its major axis; each focus is the same distance from the center of the ellipse.  Half of the entire major axis of any ellipse is called its semi-major axis, always denoted a; half of the entire minor axis of any ellipse is called its semi-minor axis, always denoted b.  The Latin root semi- means half.  For example, a semicircle is half of a full circle, and a semiformal event is halfway between a formal event and an informal event.  Not only is the orbit of a planet around the Sun an ellipse, but the Sun is not at the center of the ellipse; the Sun is at one of the foci of the ellipse.  (There is nothing at the other focus.)  Since the Sun is at one of the foci of the elliptical orbit, there is only one point on the elliptical orbit where the planet is closest to the Sun, called the perihelion of the planet’s orbit.  Also, there is only one point on the elliptical orbit where the planet is furthest from the Sun, called the aphelion of the planet’s orbit.  The distance from the Sun to a planet’s perihelion is called the perihelion distance and is denoted rperihelion; the distance from the Sun to a planet’s aphelion is called the aphelion distance and is denoted raphelion.  Notice that the sum of the perihelion distance and the aphelion distance is equal to the entire major axis of the orbit, which is twice the semi-major axis of the orbit.  In other words, rperihelion + raphelion = 2a.  The time it takes a planet to move one complete orbit around the Sun is called the orbital period of the planet, denoted P.  Notice that the orbital period of any planet is the time it takes that planet to move from its perihelion all the way around its orbit, returning to its perihelion.  The orbital period is also the time it takes the planet to move from its aphelion all the way around its orbit, returning to its aphelion.  According to Kepler’s second law, the Law of Equal Areas, planets sweep out equal areas in equal times.  It follows from this that a planet moves faster while closer to the Sun (and fastest in fact at its perihelion) and moves slower while further from the Sun (and slowest in fact at its aphelion).  According to Kepler’s third law, the Law of Periods, the square of the orbital periods of all the planets (including the Earth) around the Sun are all directly proportional to the cube of the semi-major axes of the orbits of all the planets (including the Earth) around the Sun.  The orbital period of the Earth around the Sun is one Earth-year (1 yr).  The semi-major axis of the Earth’s orbit around the Sun is called one astronomical unit (1 au), which we know today is roughly equal to 150 million kilometers.  Assuming we agree to measure the orbital parameters of all planets around the Sun in terms of the Earth’s orbital parameters, we may write Kepler’s third law as P2 = a3, where P must be measured in Earth-years and a must be measured in astronomical units.  We may apply Kepler’s laws to perform simple orbit calculations.  For example, consider a hypothetical planet orbiting the Sun that is six astronomical units from the Sun at its perihelion and twelve astronomical units from the Sun at its aphelion.  In other words, we are given that rperihelion = 6 au and raphelion = 12 au.  Thus, the entire major axis of the orbit is eighteen astronomical units, since six plus twelve equals eighteen.  Therefore, the semi-major axis of the orbit is nine astronomical units, since half of eighteen equals nine.  In other words, a = 9 au.  Since Kepler’s third law states P2 = a3, we have now discovered that P2 = 729, since the cube of nine equals seven hundred and twenty-nine.  Finally, we conclude that the orbital period of this hypothetical planet around the Sun is twenty-seven Earth-years, since the square-root of seven hundred and twenty-nine equals twenty-seven.  In other words, P = 27 yr for this hypothetical planet.  Although Kepler deduced these three laws from Brahe’s measurements, he could not explain why any of these laws are true.  Although Kepler was a brilliant mathematician, he was nevertheless an empiricist, not a physicist.  An empiricist makes observations and deduces mathematical relationships that describe those observations but has no answers for deeper questions about those mathematical relationships.  In particular, an empiricist cannot explain why those mathematical relationships are true.  A physicist on the other hand seeks and discovers answers to these deeper questions.  In particular, a physicist seeks and discovers reasons why certain mathematical relationships that describe the universe are true.  Since Kepler was an empiricist but not a physicist, he could not explain why his own laws of planetary motion are true.  It would be Isaac Newton who would offer explanations for not just these three laws but for far more than Kepler or anyone else could ever dream any single person could ever accomplish.  All these thousands of years of astronomical history are leading up to the person who would become the first true physicist, Isaac Newton.

 

The Italian astronomer Galileo Galilei read about a new invention: the (optical) telescope.  He built his own telescope after reading about this new invention, and in the year 1609 he became the first person to ever make telescopic observations of the sky.  His discoveries were breathtaking.  Galileo Galilei discovered mountains and craters on the Moon.  Galileo Galilei discovered sunspots on the Sun.  (Never ever observe the Sun through a telescope.  Never ever observe the Sun through binoculars.  Never ever observe the Sun even with the naked eye.  Solar observations done incorrectly causes permanent blindness.)  Galileo Galilei discovered four moons orbiting around Jupiter that were later named the Galilean moons in his honor.  (Today we know that Jupiter has roughly one hundred moons.  Only four of them are large enough to be visible through a primitive telescope.)  Galileo Galilei discovered rings around Saturn.  (His telescope was too primitive to see that they are actually rings.  He speculated that they were moons around Saturn.)  Galileo Galilei discovered phases of Venus analogous to the phases of the Moon, including full, half, crescent (less than half), and gibbous (more than half).  Galileo Galilei discovered that the milky way is not in fact milk; the milky way is innumerable stars sufficiently crowded together in the sky that with the naked eye all of their light blends together so as to appear to be milk.  Only a telescope can produce enough magnification to reveal all of these magnificent discoveries.  The phases of Venus can only be correctly explained if Venus moves around the Sun, not the Earth.  Moreover, the discovery of four moons orbiting around Jupiter revealed that Jupiter is the center of its own mini-universe, further proving that the Earth is not the center of everything.  For all of these discoveries and for his formulation of the scientific method, we will regard Galileo Galilei as one of the grandfathers of logical/mathematical/scientific reasoning.  Other grandfathers of logical/mathematical/scientific reasoning include the British philosopher Francis Bacon, the French mathematician René Descartes, and the British philosopher John Locke.

 

 

The Newtonian Model of the Universe

 

The British mathematician and physicist Isaac Newton was among the most brilliant persons who have ever lived.  Isaac Newton discovered calculus (advanced mathematics) and became the first physicist when he discovered three universal laws of motion and the law of universal gravitation.  Physics is the search for, the study of, and the application of the mathematical equations that describe the universe; these mathematical equations are called the laws of physics.  Isaac Newton accomplished all of these achievements between the 1670s and the 1680s and published them in his textbook Philosophić Naturalis Principia Mathematica (Mathematical Principles of Natural Philosophy) or the Principia for short.  In this textbook, Isaac Newton presented what is today called the Newtonian model of the universe.  For all of these reasons, we will regard Isaac Newton as the true father of logical/mathematical/scientific reasoning.  We begin our discussion of the Newtonian model of the universe with Newton’s three universal laws of motion: the Law of Inertia, the Law of Acceleration, and the Law of Action-Reaction.

 

A force is a push or a pull.  For most of human history, humans believed that force causes motion.  Sadly, most humans even today still believe that force causes motion, but this misunderstanding is forgivable.  Often in our daily experience, objects seem to move only while we push or pull them, and objects seem to stop moving when we stop pushing or pulling them.  We are tempted to conclude that force does indeed cause motion; nevertheless, this conclusion is false.  Consider the following experiment.  If we give an object a brief push or pull and observe carefully, we notice that the object continues to move even after our brief push or pull has ended; the object continues moving even when we are no longer pushing or pulling it!  Of course, objects often slow down and eventually stop moving when we have stopped pushing or pulling them, but if we observe carefully we discover that it is other forces that cause the object to slow down and eventually stop moving.  Friction is often the force that is responsible for slowing down and eventually stopping objects from moving.  If we give a brief push or pull on an object that happens to be on a smooth surface, we notice that the object continues to move for a longer distance and a longer time after our brief push or pull has ended, since there is less friction from the smooth surface to slow down the object.  Bowling lanes are so smooth that a bowling ball travels the entire length of the bowling lane while slowing down only a small amount; in fact, the bowling ball only stops moving because it strikes the pins or the back wall at the very end of the bowling lane.  Notice that the bowling ball continues moving along the entire bowling lane even though nothing is pushing or pulling it in the direction that it moves.  We are compelled to conclude that if there were zero forces (no pushing or pulling), an object would continue to move without slowing down at all, and certainly without speeding up and certainly without changing the direction that it moves.  We conclude that force does not cause motion, since we have carefully observed that objects actually continue to move in the absence of force.  This is Newton’s first law of motion, the Law of Inertia.  Force does not cause motion, since an object moves in a straight line at a constant speed when there is zero force pushing or pulling the object.  Certainly, an object at rest (not moving) will remain at rest without forces pushing or pulling the object, but an object in motion will remain in motion along a straight line at a constant speed without forces pushing or pulling the object.

 

If force does not cause motion, this begs the following question: what does force do?  What does pushing or pulling an object accomplish?  This question is answered by Newton’s second law of motion, the Law of Acceleration.  According to this law, force (pushing or pulling) causes changes in the motion of an object.  This change may be speeding up the object, slowing down the object, changing the direction the object moves, or combinations of these changes.  Physicists use the word acceleration for the rate at which an object’s motion changes, but we must always remember that this could be any change in motion whatsoever.  An object that is speeding up is said to be accelerating, but an object that is slowing down is also said to be accelerating.  (In colloquial English, we would use the word decelerating instead.)  Moreover, an object that is neither speeding up nor slowing down but only changing the direction that it moves is also said to be accelerating.  Newton’s second law of motion is written mathematically as , where  is the net force (the sum of all the forces) acting on the object,  is the acceleration (the rate of any change in motion whatsoever) of the object, and m is the mass of the object.  Notice that the net force is directly proportional to the acceleration, meaning that stronger forces cause greater accelerations and weaker forces cause smaller accelerations.  If we solve this equation for the acceleration, we conclude , and we see that the acceleration is inversely proportional to the mass of the object.  In other words, larger-mass objects suffer smaller accelerations from a force (a push or a pull), while smaller-mass objects suffer larger accelerations from a force (a push or a pull).  This stands to reason.  For example, a baseball struck with a baseball bat will suffer a large acceleration since the baseball has a small mass, but a car struck with the same baseball bat will suffer a small acceleration since the car has a large mass.

 

According to Newton’s third law of motion, the Law of Action-Reaction, if one object exerts a force on another object, then the second object must also exert a force on the first object that is equal in magnitude but opposite in direction.  In some circumstances, this law is intuitive.  For example, if we lean against a wall, we are obviously exerting a force (pushing) the wall.  Therefore, the wall must also exert a force on us, and this is the force that prevents us from falling over as we lean against the wall.  However, there are other circumstances when Newton’s third law of motion is counterintuitive.  For example, if a truck strikes a pedestrian, the pedestrian will be killed with body parts everywhere while the truck barely suffers a scratch.  Did the truck exert a greater force on the pedestrian or did the pedestrian exert a greater force on the truck?  According to Newton’s third law of motion, the Law of Action-Reaction, the answer is neither.  The force that the truck exerted on the pedestrian was equal (in magnitude) to the force that the pedestrian exerted on the truck, but how can this be the case if the pedestrian was killed with body parts everywhere while the truck barely suffered a scratch?  The resolution of this paradox is as follows.  The pedestrian has a very small mass (as compared with the truck, which has a much larger mass) which caused the pedestrian to suffer a very large acceleration (as compared with the truck, which suffers a very small acceleration).  When a pedestrian is killed by a moving car or truck, they were killed not only by the force from the car or truck; the pedestrian was also killed by their own small mass, resulting in a large acceleration.

 

The Newtonian model of the universe is Newton’s three universal laws of motion together with Newton’s law of universal gravitation.  According to Newton’s law of universal gravitation, everything in the universe attracts everything else in the universe.  This is difficult to believe.  We do not observe tables and chairs and humans and mobile telephones attracting each other; we only observe planets, moons, and stars causing attractions.  (Actually, we do observe humans and mobile telephones attracting each other quite strongly, but this strong attraction is not gravitational!)  Nevertheless, Newton’s law of universal gravitation is correct: everything in the universe attracts everything else in the universe.  We do not notice tables and chairs and humans and mobile telephones attracting each another gravitationally because gravity is by far by far by far by far the weakest force in the entire universe.  Gravity is so weak that we never notice gravitational attractions among tables and chairs and humans and mobile telephones.  We only notice gravitational attractions from gargantuan objects, such as planets, moons, and stars.  Even in this case, gravity is noticeably weak.  Every time we walk up a staircase, we are effortlessly defying planet Earth’s gravitational attraction!  Newton’s law of universal gravitation can be written in mathematical form.  Consider any two objects whatsoever, one with mass m1 and the other with mass m2, and suppose the distance between these two objects is r.  The gravitational force (attraction) between these two objects is directly proportional to the product of their masses and is inversely proportional to the square of the distance between them.  Mathematically, Newton’s law of universal gravitation is written , where Fgravitational is the gravitational force (attraction) between the two objects.  Also, the symbol G is called Newton’s gravitational constant of the universe, which is an example of a fundamental God-given constant of the universe.  Within the mathematical equations that describe the universe (the laws of physics), there are certain fixed constants (fixed numbers).  These fixed constants (fixed numbers) are called the fundamental God-given constants of the universe.  Most physicists agree that the three most fundamental God-given constants of the universe are the vacuum speed of light (always written with the symbol c), the Planck constant (always written with the symbol h), and Newton’s gravitational constant of the universe (always written with the symbol G).  Each and every one of the fundamental God-given constants of the universe has an absolutely fixed value, having that same value everywhere in the universe and everywhen in the universe.  (The word everywhen means at all times in the past, present, and future.)  Newton’s gravitational constant of the universe is roughly equal to 6.67×1011 assuming that we agree to measure all masses in kilograms, all distances in meters, and all times in seconds.  This number 6.67×1011 is incredibly small: 103 is one thousandth, 106 is one millionth, 109 is one billionth, and 1011 is even smaller than one billionth.  We now realize why gravity is by far by far by far by far the weakest force in the entire universe.  Whenever we calculate a gravitational force (attraction), we must multiply by Newton’s gravitational constant of the universe G, which is so incredibly small that the resulting gravitational attraction is incredibly weak.  This is why we never notice gravitational attractions among tables and chairs and humans and mobile telephones.  The only hope we have of ever feeling gravitational attractions is from gargantuan objects, such as planets, moons, and stars, and even in this case gravity is noticeably weak.

 

According to Newton’s law of universal gravitation, the gravitational force (attraction) between any two objects in the universe is directly proportional to the product of their masses.  For example, if we double both masses, the gravitational force (attraction) strengthens by a factor of four, since the product of two and two is four.  (The word product means multiplication.)  As another example, if we triple both masses, the gravitational force (attraction) strengthens by a factor of nine, since the product of three and three is nine.  As yet another example, if we double one mass and triple the other mass, the gravitational force (attraction) strengthens by a factor of six, since the product of two and three is six.  If we double only one of the masses, the gravitational force (attraction) strengthens by a factor of two, since the product of two and one is two.  If we triple only one of the masses, the gravitational force (attraction) strengthens by a factor of three, since the product of three and one is three.  Also according to Newton’s law of universal gravitation, the gravitational force (attraction) between any two objects in the universe is inversely proportional to the square of the distance between them.  This means that increasing the distance between two objects weakens the gravitational force (attraction) between them, while decreasing the distance between two objects strengthens the gravitational force (attraction) between them.  This stands to reason; we expect the attraction between objects to be stronger when they are closer together, and we expect the attraction between objects to be weaker when they are further apart.  For example, if we triple the distance between two objects, the gravitational force (attraction) weakens by a factor of nine, since three squared equals nine.  As another example, if we quadruple the distance between two objects, the gravitational force (attraction) weakens by a factor of sixteen, since four squared equals sixteen.  As yet another example, if we double the distance between two objects, the gravitational force (attraction) weakens by a factor of four, since two squared equals four.  If we cut the distance between two objects down to one-third, the gravitational force (attraction) strengthens by a factor of nine.  If we cut the distance between two objects down to one-fourth, the gravitational force (attraction) strengthens by a factor of sixteen.  If we cut the distance between two objects down to one-half, the gravitational force (attraction) strengthens by a factor of four.  We can put all of this together with the following amusing example: if we double one mass, octuple the other mass, and quadruple the distance, the gravitational force (attraction) does not strengthen or weaken; it remains the same strength!

 

Isaac Newton combined his three universal laws of motion with his law of universal gravitation, and using calculus (which he also discovered) he proceeded to mathematically explain (what was believed at the time to be) everything that had ever been observed in the universe.  This model of the universe is called the Newtonian model of the universe, and physicists regard it as the first mathematically correct description of the universe.  We now discuss some of the greatest achievements of the Newtonian model of the universe.  Firstly, Newton explained why Kepler’s three laws of planetary motion are true, but he went even beyond this.  Newton generalized Kepler’s three laws of planetary motion, showing mathematically that Kepler’s own formulation of his own planetary laws was not precisely correct.

 

According to Kepler’s first law as Kepler formulated it, the orbits of the planets around the Sun are ellipses.  According to Newton, it is not just planets orbiting the Sun that should have elliptical orbits.  If gravitation is indeed universal, then the orbit of anything around anything else (such as a moon orbiting a planet) should also be an ellipse, but Newton went even beyond this.  He proved mathematically that the orbit could be a circle, an ellipse, a parabola, or a hyperbola.  The total energy of the two attracting objects determines the shape of the orbit.  If the total energy of the two attracting objects is sufficiently large, then the two objects will not remain bound to each other; they will escape from each other’s gravitational attraction.  The orbit in this case will be a parabola or a hyperbola.  Hence, parabolic orbits and hyperbolic orbits are called unbound orbits.  However, if the total energy of the two attracting objects is not this large, then the two objects will remain bound to each other; they cannot escape from each other’s gravitational attraction.  The orbit in this case will be a circle or an ellipse.  Hence, circular orbits and elliptical orbits are called bound orbits.  Circles, ellipses, parabolae, and hyperbolae are all conic sections, the intersection of a cone and a plane.  Conic sections are classified using a variable called the eccentricity.  A circle is a conic section with an eccentricity equal to zero, an ellipse is a conic section with an eccentricity anywhere between zero and one, a parabola is a conic section with an eccentricity equal to one, and a hyperbola is a conic section with an eccentricity anywhere greater than one.  In summary, whereas Kepler’s formulation of his own first law states that the orbit of a planet around the Sun is an ellipse, Newton’s formulation of Kepler’s first law states that the orbit of anything around anything else is a conic section.

 

According to Kepler’s second law as Kepler formulated it, planets sweep out equal areas in equal times, since a planet moves faster while closer to the Sun (fastest at its perihelion) and moves slower while further from the Sun (slowest at its aphelion).  According to Newton, it is not just planets orbiting the Sun that should sweep out equal areas in equal times.  If gravitation is indeed universal, then anything orbiting anything else (such as a moon orbiting a planet) should also sweep out equal areas in equal times, but Newton went even beyond this.  He proved mathematically that equal areas are swept in equal times because of the Conservation of Angular Momentum, another law of physics that he discovered.  Ice skaters spin faster when they pull their arms in, and ice skaters spin slower when they pull their arms out.  This ensures that the angular momentum of the ice skater remains conserved (remains constant).  Rather like ice skaters who spin faster when they pull their arms in, planets orbiting a star (or moons orbiting a planet) speed up as they move closer to their attractor, and rather like ice skaters who spin slower when they pull their arms out, planets orbiting a star (or moons orbiting a planet) slow down as they move further from their attractor.  This continuously changing speed keeps the angular momentum conserved, and Newton proved mathematically that this causes equal areas to be swept in equal times.  In summary, whereas Kepler’s formulation of his own second law states that planets sweep out equal areas in equal times, Newton’s formulation of Kepler’s second law states that the angular momentum of anything orbiting anything else must remain conserved.

 

According to Kepler’s third law as Kepler formulated it, the square of the orbital periods of all the planets around the Sun are directly proportional to the cube of the semi-major axes of the orbits of all the planets around the Sun.  According to Newton, it is not just planets orbiting the Sun where this direct proportionality should be true.  If gravitation is indeed universal, then the square of the orbital periods should always be directly proportional to the cube of the semi-major axes of the orbits.  This direct proportionality should also be true for moons orbiting a planet for example, but Newton went even beyond this.  He proved mathematically that this direct proportionality actually states , where P is the orbital period, π is roughly equal to 3.141 592 653 589 793 238 462 643 383 279 502 884 197 169 399 375 105 820 974 944 592, G is Newton’s gravitational constant of the universe, M is the total mass, and a is the semi-major axis of the orbit.  With Newton’s formulation of Kepler’s third law, the units of a need not be astronomical units, and the units of P need not be Earth-years.  All that is required is for the units of P, a, M, and G to all be consistent with one another.  For example, we may use G = 6.67×1011 if we agree to measure P, a, and M in seconds, meters, and kilograms, respectively.  Using this equation, astronomers have calculated the mass of the Sun from the orbital parameters of anything orbiting the Sun, such as planets, asteroids, and comets.  Using this equation, astronomers have calculated the mass of the Earth from the orbital parameters of anything orbiting the Earth, such as the Moon and artificial satellites.  Using this equation, astronomers have calculated the mass of Jupiter from the orbital parameters of the moons orbiting Jupiter.  In fact, it is not an exaggeration to say that the only way astronomers can accurately calculate the mass of any object in the universe (such as a star, planet, or moon) is to use this equation.  In summary, whereas Kepler’s formulation of his own third law states that P2 = a3 for the planets around the Sun where P must be measured in Earth-years and a must be measured in astronomical units, Newton’s formulation of Kepler’s third law states that  for anything in orbit around anything else, where P, a, M, and G may be measured in any units that are consistent with one another.

 

For most of human history, humans believed that heavier objects fall faster than lighter objects.  Sadly, most humans even today still believe that heavier objects fall faster than lighter objects.  In actuality, everything, no matter how heavy or how light, falls toward the Earth with the same acceleration near the surface of the Earth: 9.8 meters per second per second downward.  Caution: this is only the case when all non-gravitational forces such as air resistance can be ignored as compared with the gravitational force.  We can demonstrate this by dropping a heavy object and a light object at the same time from the same height, such as a textbook and a pencil.  Both will hit the ground at the same time even though the book is hundreds of times heavier than the pencil!  Although Galileo Galilei first demonstrated that this is true, it was Isaac Newton who explained mathematically why this is true.  Actually, Newton went even beyond this; he proved mathematically that everything, no matter how light or how heavy, falls toward any planet, moon, or star with the same acceleration, ignoring all non-gravitational forces such as air resistance as usual.  Isaac Newton discovered the following equation for this acceleration due to gravity for any planet, moon, or star in the universe: g = GM / R2, where g is the acceleration due to gravity near the surface of the planet, moon, or star, G is Newton’s gravitational constant of the universe, M is the mass of the planet, moon, or star, and R is the radius of the planet, moon, or star.  In other words, the acceleration due to gravity near the surface of any planet, moon, or star in the universe is G multiplied by the mass of the planet, moon, or star and divided by the square of the radius of the planet, moon, or star.  For example, if we multiply G by the mass of the Earth and divide by the square of the radius of the Earth, the answer is 9.8 meters per second per second!  As another example, if we multiply G by the mass of the Earth’s Moon and divide by the square of the radius of the Earth’s Moon, the answer is 1.6 meters per second per second, which is only one-sixth of the acceleration due to gravity near the surface of the Earth.  The Earth’s Moon has virtually no atmosphere and therefore virtually no air resistance.  Consequently, Newton’s equation was tested on the Earth’s Moon in a dramatic way.  Fifty years ago, one of the astronauts on the Moon dropped a hammer and a feather, and both fell with the same acceleration; both hit the Moon’s ground at the same time!  As yet another example, if we multiply G by the mass of Mars and divide by the square of the radius of Mars, the answer is 3.7 meters per second per second, which is only one-third of the acceleration due to gravity near the surface of the Earth.

 

Perhaps the most brilliant of Newton’s achievements was the explanation of the tides.  Sometimes the ocean is at flood tide (high tide); sometimes the ocean is at ebb tide (low tide).  Why do the tides occur?  Newton proved mathematically that any object exerts different gravitational forces (attractions) across any other object due to the varying distances between different parts of the two objects from each other.  Parts of the two objects that are closer to each other feel stronger attractions, while parts of the two objects that are further from each other feel weaker attractions.  The differences in the gravitational forces (attractions) across an object are called tidal forces, because they cause the tides in the ocean.  The Moon and the Sun each exert roughly equal tidal forces on the Earth’s oceans causing them to bulge, resulting in flood tides and ebb tides every day.  When the Earth, the Moon, and the Sun happen to form a nearly straight line (this occurs during New Moon or Full Moon), the lunar tidal force and the solar tidal force reinforce each other.  If we interpret the tides as waves, then during this straight-line configuration, the lunar tidal crests and the solar tidal crests meet each other, while the lunar tidal troughs and the solar tidal troughs also meet each other.  This is constructive interference, resulting in severely high flood tides and severely low ebb tides.  These are called the spring tides.  When the Earth, the Moon, and the Sun happen to form a nearly right angle (this occurs during First Quarter Moon or Third Quarter Moon), the lunar tidal force and the solar tidal force counteract each other.  If we interpret the tides as waves, then during this right-angle configuration, the lunar tidal crests and the solar tidal troughs meet each other, while the lunar tidal troughs and the solar tidal crests meet each other.  This is destructive interference, resulting in modest flood tides and modest ebb tides.  These are called the neap tides.  The tidal range is the wave height of the tides, the difference between the flood tide and the ebb tide.  The spring tides have the largest tidal range, since the flood tide is very high and the ebb tide is very low, resulting in a large difference between them.  The neap tides have the smallest tidal range, since the flood tide is modestly high and the ebb tide is modestly low, resulting in a small difference between them.  The Moon’s orbital period around the Earth is roughly one month.  In fact, the word month is derived from the word moon.  If we take the word month, remove the suffix -th, and insert an extra letter o, we obtain the word moon!  One month is roughly four weeks.  If today is New Moon, we will have spring tides with the largest tidal range (severely high flood tides and severely low ebb tides).  Roughly one week later will be First Quarter Moon, and we will have neap tides with the smallest tidal range (modest flood tides and modest ebb tides).  Roughly one week later will be Full Moon, and we will have spring tides again with the largest tidal range (severely high flood tides and severely low ebb tides).  Roughly one week later will be Third Quarter Moon, and we will have neap tides again with the smallest tidal range (modest flood tides and modest ebb tides).  Roughly one week later, we have returned to New Moon, roughly four weeks since the previous New Moon.  For thousands of years, humans already noticed that there is a correlation between the changing appearance of the Moon in the sky and the changing tidal range in the ocean, but it was Isaac Newton who explained mathematically why this is the case.  The lunar tidal force and the solar tidal force not only cause the Earth’s oceans to bulge, but they also cause the shape of the solid Earth itself to bulge.  The shape of the solid Earth itself suffers flood tides and ebb tides every day.  When the solid Earth itself suffers a flood tide, we are slightly further from the center of the Earth.  Later when the solid Earth itself suffers an ebb tide, we are slightly closer to the center of the Earth.  Each and every day of our lives, we move up and down roughly one meter, even while we believe ourselves to be remaining still!

 

Although many Europeans were convinced that the Newtonian model of the universe is correct, many other Europeans were not convinced.  This was partly because Isaac Newton had made many enemies during his lifetime by his own personal arrogance.  As a result, the enemies of Newton tried desperately to disprove the Newtonian model of the universe.  When the Halley comet passed near the Earth during Newton’s lifetime, its orbital parameters were measured, and Newton’s equations were used to calculate that it had an orbital period around the Sun of roughly seventy-four years.  After seventy-four years, the Halley comet did not return as scheduled, and the enemies of Newton rejoiced since they believed that this disproved the Newtonian model of the universe.  However, the original calculation only included the gravitational attraction of the Sun.  What about the gravitational attractions of the planets?  Physicists and mathematicians recalculated the orbit of the Halley comet including the gravitational attractions of the planets in addition to the gravitational attraction of the Sun.  This more correct calculation revealed that the orbital period of the Halley comet was not seventy-four years; it was in fact seventy-six years.  In other words, the original calculation was two years in error.  Europeans waited two more years, and the Halley comet returned!  Isaac Newton, who had died a few decades earlier, was already considered a genius by many Europeans, but the return of the Halley comet as correctly predicted by Newton’s equations convinced not only his admirers but his enemies as well that he may have been the most brilliant person who ever lived.  The British poet Alexander Pope wrote the following poem in honor of Isaac Newton.  “Nature and Nature’s laws lay hid in night.  God said, ‘Let Newton be!’ and all was light.”  Isaac Newton’s achievements were the culmination of the Age of Reason of seventeenth-century Europe (the 1600s), which led directly to the Age of Enlightenment of eighteenth-century Europe (the 1700s).  During the Age of Enlightenment, scholars in many different disciplines began to approach their subjects with mathematical logic and scientific reasoning.  For example, mathematicians began to insist on rigorous proofs before any mathematical statement would be regarded as a true theorem.  As another example, political philosophers debated different systems of government in a reasoned and logical fashion.  As yet another example, religious scholars used logic and reason when studying the Bible.  Isaac Newton is not only one of the greatest figures of scientific and mathematical history for his model of the universe, but he is also one of the greatest figures of world history for his exceptional role in the Age of Reason and for inspiring the Age of Enlightenment.  The most authoritative biography of Isaac Newton is Never at Rest: A Biography of Isaac Newton, written by Richard S. Westfall.  No one would dare question the Newtonian model of the universe until two hundred years after Isaac Newton died.  In the early twentieth century (early 1900s), Albert Einstein dared to question the Newtonian model of the universe.  The British poet J. C. Squire wrote the following poem in honor of Albert Einstein as a sequel to Alexander Pope’s poem.  “It did not last: the devil howling ‘Ho! Let Einstein be,’ restored the status quo.”  We will study the Einsteinian model of the universe later in this course.

 

 

 

Links

 

Libarid A. Maljian homepage at the Department of Physics at CSLA at NJIT

Libarid A. Maljian profile at the Department of Physics at CSLA at NJIT

Department of Physics at CSLA at NJIT

College of Science and Liberal Arts at NJIT

New Jersey Institute of Technology

 

 

 

This webpage was most recently modified on Wednesday, the twenty-fifth day of September, anno Domini MMXXIV, at 03:45 ante meridiem EDT.