Doper Physicists: Explain Einstein to me

First of all: I am no dummy. For my ME degree I had 10 or 12 hours of physics and got 2 A’s and a B. (Fortunately Einstein is not required in mechanical engineering. As near as I can tell, my grasp of physics stalled at 1895 or so.) What I have seen of Einstein’s work involved mathematical functions that I have never heard of.

Did Einstein start somewhere (Maxwell’s equations?) and go from there? Did he invent stuff from whole cloth? I know he never really accepted quantum theory, so how does that “illuminate” (?) his work? It seems to me that “E=mcc” is probably a special case of some more general equation: True?

Any help is appreciated. My learning about this won’t change the world, just my world.

Albert Einstein’s Theory of Relativity In Words of Four Letters or Less

As briefly as possible, Einstein took two premises:[ul][li]The speed of light in vacuo, c, is constant for all observers. (If I run at 20 mph and throw a ball at a further 20 mph, the ball travels at 40 mph if I throw it forwards, but drops behind me at 0 mph if I throw it backwards. But if instead of throwing a ball I throw light by shining a torch in front of or behind me, the speed of light doesn’t change, irrespective of my speed or the direction I shine the torch)Nobody is absolutely standing still. (The Earth spins and orbits the Sun, which itself orbits the centre of mass of the Solar System, which itself orbits the centre of mass of the galaxy, which is getting further from almost every other galaxy since space itself is expanding.)[/ul][/li]…and really really thought about the consequences, even if they seemed counterintuitive. One such consequence was the equivalence between mass and energy. Another was that clocks moving relative to Earth, or in stronger gravity, would run slow. Another was that gravity would bend distant starlight by literally warping spacetime.

All of these counterintuitive consequences were subsequently verified experimentally to astonishing accuracy.

Well my question is related…
My dad has tasked me with distilling an executive brief of relativity (just SR for now), and wanted to know what led Einstein to the 2 postulates.

The No_Absolute_Rest postulate is clearly motivated by the results of the MM experiments.

But what about the Constancy_of_Speed_of_Light postulate? What reasoning motivated this postulate?

And that story about him imagining how it would be to ride beside a beam of light at the same speed.
Wonder if someone can articulate the thought process… Perhaps it might be “What’s wrong with newtonian physics with no absolute rest?”

TIA

The Michaelson-Morley Experiment, the most successful failed experiement in physics.

Briefly, the split a light beam and directed it with mirrors so that half went in the direction the Earth was moving in space, and the other half went in the opposite direction. The motion of the Earth should have affected the light beam, which would have shown up as an interference pattern. But there was no interference pattern.

It was a well-known mystery of physics at the time Einstein was formulating relativity.

OTOH, Einstein is quoted as saying that the experiment only confirmed the line of reasoning he was following at the time.

Yeah, prior to Michelson-Morley it was believed that there was an ether through which light travelled, like how sound waves travel through air. Michelson-Morley showed that there was no ether, or that it cannot be detected. Several propopsals were put forward to explain the undetectability of the ether, before Einstein came along and said what he said :slight_smile:

Einstein himself did a great job of explaining Einstein. He wrote this book for the non-specialist audience, and it does a pretty good job of explaining relativity. I’ve heard complaints that scientists themselves don’t often make their work accessible to the non-expert, but there are a surprising number of such works out there. People just aren’t familiar with them.

http://www.amazon.com/gp/product/0517884410/103-9306395-7540623?v=glance&n=283155&s=books&v=glance

I think you’re overstating this a bit, especially at the end. None of your examples are consistent with special relativity, and Einstein himself didn’t believe the last one for a long time.

It’s more accurate to state that if I see you moving at a constant velocity, you see me moving at a constant velocity, and neither of us are “more right” than the other any more than we would be if we looked at the world with our heads tilted at different angles.

It seems clearly motivated by the MM experiments, but I’ve read that Einstein was unaware of MM at the time he came up with his ideas.

The no absolute reference frame idea comes from the idea that the speed of light is constant for all observers. If it’s constant for all observers, then there is no preferred reference frame.

The idea that c is constant for all observers was motivated by Maxwell’s Equations. They predict that waves propagate at c, but don’t have any mention of which observer this is relative to. This stumped physicists in the second half of the 19th century. The equations clearly imply that c is constant for all observers, but most physicists didn’t have the mathematical horsepower (or trusted their common sense too much) to follow that idea through and figure out how physics would work in a world like that.

Here’s a really condensed explanation:

Maxwell combined four equations for electromagnetism, corrected one by adding a missing term, and showed that they give rise to a wave equation for electromagnetic waves propagating at 300,000 km/s. Which just so happens to be the speed of light. Hence, he concluded that light is a wave, and its speed is what’s predicted by Maxwell’s equations.

But that raises the question: in what frame of reference do Maxwell’s equations apply? See, it was well understood at the time that speeds look different to different observers. If I throw a baseball at 60 miles per hour, it’s moving at 60 mph relative to me, but if someone drives by me in a car going 60 mph (in the same direction as the baseball), the ball will appear to be standing still. And who’s to say my frame of reference is right and theirs is wrong? After all, I’m standing on the Earth, which is rotating and orbiting the sun, so really I’m no more “standing still” than they are. (This last sentence is just a way of convincing students that there isn’t one “truly motionless” frame – as noted by other posters, this didn’t play a role in convincing Einstein of the principle of relativity.)

At any rate, this raises questions because Maxwell’s equations gave one specific number for the speed of light (the aforementioned 300,000 km/s), which was calculated from other empirically determined fundamental constants. But if speeds are different in different reference frames, one has to ask: in which reference frame do Maxwell’s equations apply? People originially thought it was the reference frame of the aether, some hypothetical invisible substance that provides the medium through which light traveled. But if that’s the case, we should be able to measure different values for the speed of light depending on how we’re moving relative to the aether. All the experimental attempts to detect this effect failed – i.e., they always measured the same speed for the speed of light.

Einstein’s answer to this came in 1905, when he proposed the postulate that the laws of physics, including Maxwell’s equations are the same in all inertial reference frames. (An inertial reference frame is one that is not accelerated – I’ll mention non-inertial frames in a bit.) Einstein’s second postulate follows from the first. If Maxwell’s equations are applicable in all inertial frames, then the speed of light must also be the same in all inertial frames, even though those frames of reference are moving relative to each other.

This simple idea had far reaching consequences. Before, we had a picture of the world in which observers in relative motion agreed on the lengths of objects and the lengths of time between events, but disagreed on the speeds of objects. Now, we have a picture where observers moving relative to each other agree on certain speeds (i.e., the speeds of electromagnetic waves), and as a consequence, they have to disagree about the lengths of objects and the amount of time that passes between two events. (There are various clever thought experiments that demonstrate this, but like I said, I’m trying to keep this short.) And because quantities like length and duration now depend on one’s frame of reference, the same turns out to be true for other related quantities like energy and momentum. However, it should be noted that when the relative differences in speed between the observers are at ordinary every-day values (i.e., much, much less than the speed of light), these differences are so tiny that we never notice them, and classical pre-Einsteinian physics is still a sufficiently good approximation for dealing with those cases.

Einstein followed up this paper with another one which examined what happens when a particle emits light. Specifically, he dealt with the case where a particle emits two identical light waves in opposite directions. In this case, the particle’s speed won’t change as a result of emitting the light (just as if I simultaneously fired two perfectly aligned identical guns in opposite directions – the recoil forces exactly cancel out). So, since the particle was initially at rest in its own reference frame, it stays at rest. Einstein compared this to another reference frame, one which was moving relative to the particle. In that frame, the particle moved with a constant velocity, since again the emission of equal quantities of light in opposite directions doesn’t change the speed of the particle.

The interesting physics showed up because the energy of the light being emitted was different for the two reference frames, as a consequence of Einstein’s theory of relativity. Einstein looked at the energy difference between the two frames, which represented the kinetic energy of the particle. (In the rest frame of the particle, it had internal energy only. In the moving frame, it had the same internal energy, plus some kinetic energy. So the difference was kinetic energy alone.) But because the light carried away different amounts of energy in the two frames, the energy difference between frames – the kinetic energy – changed. Since kinetic energy depends only on mass and speed, and the speed of the particle hadn’t changed, Einstein conluded that its mass must have changed, and in fact he found this change was equal to the energy of the emitted light times the square of the speed at which the light traveled. Thus, Einstein concluded that mass was proportional to energy content, and furthermore that mass could be converted into electromagnetic energy. The entire internal (non-kinetic) energy content of a particle of mass m was thus given by E = mc[sup]2[/sup], where c is the speed of light.

Although this equation deals with the rest-energy of a particle, there is a version that gives its total energy, namely E[sup]2[/sup] = p[sup]2[/sup]c[sup]2[/sup] + m[sup]2[/sup]c[sup]4[/sup], where p is the momentum of the particle. For particles with zero mass (like photons, the particles of light) this simplifies to E = pc, and for particles with mass m and zero momentum, this simplifies to E = mc[sup]2[/sup].

Now, at this point Einstein’s theory of relativity dealt only with inertial (i.e., unaccelerated) frames of reference. Because it is specialized to these cases, this has become known as the Special Theory of Relativity. In subsequent years, Einstein developed the General Theory of Relativity, which deals with accelerated frames. It’s based on the basic idea that accelerations are indistinguishable from gravitational forces. I.e., if you’re in an elevator and it suddenly starts accelerating upward, there’s no experiment you can do to determine whether the elevator is accelerating upward or a gravitational field is being applied to produce a downward force. This theory had various consequences, such as predicting that spacetime curves around massive bodies and that light rays can be bent by gravitational forces.

So what does this all have to do with quantum mechanics? Not too much. But in the same year that Einstein wrote his paper on Special Relativity and the subsequent E=mc[sup]2[/sup] paper (which didn’t actually contain the famous equation, since Einstein’s notation differed from that used today), Einstein also wrote two other papers of distinct significance. It is because of this that 1905 has come to be known as Einstein’s Miracle Year, and its hundreth anniversary is being celebrated throughout the world.

One of Einstein’s 1905 papers explained a phenomenon known as Brownian motion. The explanation was based on the existence of atoms, and as a result Brownian motion came to be seen as evidence that atoms really exist. But the other paper Einstein wrote that year dealt with a phenomenon called the photoelectric effect (essentially, light stiking metals and causing them to emit sparks). Einstein explained the photoelectric effect using the principle that light could only be absorbed in certain discrete amounts. This prediction of particle-like behavior for light eventually lead to the wave-particle duality and other quantum mechanical ideas. A similar idea had been used by Max Planck a few years earlier to derive his black-body radiation law, but there is some reason to think that Planck saw this as merely a mathematical trick, whereas Einstein saw it as a feature of physical reality. For this reason, some feel that Einstein, and not Planck, should be regarded as the true father of quantum mechanics. In spite of this, Einstein never fully accepted the probabilistic, non-deterministic nature of quantum mechanics, believing instead that the theory was an incomplete description of reality.

Incidentally, Einstein’s Nobel Prize was awarded mainly for his paper on the photoelectric effect, not for special relativity. Apparently, this was because the Nobel committee didn’t consider the theory of relativity to be fully confirmed at the time. But it’s generally accepted that in 1905 alone Einstein produced three Nobel-worthy papers (photoelectric effect, special relativity, and brownian motion).

Excellent condensation, tim. I wonder if there’s any way that can be made a sticky. “Read this before asking yet another introductory question about Special Relativity!”

Thanks for the responses. But one thing I want to know is: Surely his work wasn’t all thought-experiments. Didn’t it involve, and result in, equations that people still explore? (Or maybe I am thinking of Unified stuff…)

Where do you think equations come from? These days one doesn’t write equations to fit data, but rather thinks of a mathematical model (including equations) and then checks to see if it fits the data. Yes, he proposed experiments to test the theory, but the theory was created (as are most modern theoretical physics theories) from thought-experiments.


[line of people waiting to kill Shalmanese]

Hey!

no death threats in GQ! :stuck_out_tongue:

True. It’s a special case of:


           *m[sub]0[/sub]*
*m'* = ---------------
     SQRT (1 - *v[sup]2[/sup]*/*c[sup]2[/sup]*)

… where m[sub]0[/sub] is the rest mass of the object, v is the velocity of the object relative to the observer, and m’ is the mass of the object as measured by the observer.

The faster something goes, the more massive it becomes. See? So, as an object gains kinetic energy, it also gains mass. If you work out the numbers, using the full relativistic equation for calculating kinetic energy (which is a converging infinite series), you’ll discover that an object with a kinetic energy of E gains additional mass m over-and-above its rest mass equal to E/c[sup]2[/sup]. Multiply both sides by c[sup]2[/sup], and you get E=mc[sup]2[/sup].

Heh. I dropped into the thread to post the same link and found you had long beaten me to it.

Fortunately I dodged the death threats that way… :slight_smile:

Thanks for that impressively helpful post. Now, regarding these ‘clever thought experiments’… I’ll bite. Examples?

Well, one common example to demonstrate time dilation (time running slow for a system moving relative to the observer) is the idea of a light clock.

Basically, suppose you have two parallel mirrors – one on the ceiling and one on the floor – and a beam of light is reflecting back and forth between the two. You can use this system as a clock, with the time it takes to complete one cycle (light going from floor to ceiling and back again) being one tick of the clock. That is, you can determine the duration of an event by seeing how many ticks of your light clock go by over the course of that event.

But now, suppose we introduce a second observer (the first being the one at rest with respect to the light clock.) This observer runs along the floor (perpendicular to the direction that the beam of light is moving. But in his frame of reference, he’s standing still, and the light clock is moving backwards (i.e., moving in the opposite direction from the way he’s facing.) So, if he watches the clock, he sees the beam of light travel along a diagonal (up from the floor and backwards), then hit the ceiling mirror and travel along another diagonal (down from the floor and backwards.) (Here a diagram is helpful, I’ll link to one at the end. Also note that he wouldn’t really see the beam of light moving unless it hits his eyes – it’s more accurate to say he can measure its motion using some apparatus.)

The point is, in the running observer’s reference frame, the light is traveling along a longer, diagonal path, whereas in the standing observer’s reference frame it’s traveling along a shorter straight-up-and-down path. So in the running observer’s frame, it’s traveling a longer distance.

In pre-relativistic physics, this doesn’t give rise to a difference in time scales – we’d say the light travels a longer distance during one tick in the runner’s frame because it’s moving faster. Specifically, it would be moving at a speed of (v[sup]2[/sup] + c[sup]2[/sup])[sup]1/2[/sup] in the runner’s frame, where v is the runners speed in the standing observer’s frame, and c is the speed of light in the standing observer’s frame.

Not so with Special Relativity. According to Einstein’s theory, the light travels the same speed, c, in the runner’s frame as it does in the standing observer’s frame. But we also know it covers a greater distance during one tick of the clock in the runner’s frame. How can this be? The answer is that the tick of the light clock lasts longer in the runner’s frame then he would expect – he measures the ticking of the light clock to be slowed due to its motion relative to him. He can even build an identical light clock that he carries with him, in his own frame of reference, and he’ll see it ticking faster than the standing observer’s clock.

And it’s not just light clocks that are slowed down – the runner measures everything in the standing observer’s frame to be happening in slow motion, including the motion of the standing observer himself (his heartbeat, etc.) This must be the case, because the standing observer is in synch with his clock – in his frame of reference it is ticking at its usual rate.

This is all a lot clearer with diagrams. See, for instance, this pdf file. It also goes into some of the other standard thought experiments, such as to demonstrate length contraction and the relativity of simultaneity.

These other guys said it right. I would like to add a couple of things:
((I)) Albert Einstein and Max Planck knew each other fairly well. Einstein made plenty of contributions to nuclear as well as being a huge contributor to photon theory. Einstein agreed with him that there were “quanta” of Energy = photons [http://www.cox-internet.com/hermital/book/holopara1-4.htm].

It was the Copenhagen Philosophical interpretation of Quantum Mechanics that Einstein disagreed with, Einstein is quoted as having said: “God does not play dice with the Universe”. Nearly everyone agreed that the empirical, statistical observations toward quantum mechanics were correct/necessary, but the main-stream believed that there were NO fundamental physics concepts involved deeper than such statistics [AKA = “no hidden variables”]. Still today, the main stream of physics holds on to the view that Quantum Statistics are the pure forms, with no reason to the rhyme - using “Heisenberg’s Observational Uncertainty Principle” as a reason for their firm assertion.

((II)) Here’s a back_of_the_envelope, ballpark explanation of the Uncertainty Principle:

(1) Planck: Actively scan with photon: E = hf
(2) Einstein: E=p
c
so, E_injected = p_uncertainty_added*c
(3) DeBroglie: p= h/Lambda
(4) Heisenberg: Minimum Dist_observable > Lambda (Experimental Obs)
(5) Heisenberg: so, (Min_Dist) > h/(p_uncertainty_added)

(6) Heisenberg: so, (Min_Dist_obs)(p_uncert) > h [Heis Uncert Princ]

The real explanation is a bit more in-depth, but for this forum, I hope this will do.
If you want more, go to http://www.spaceandmotion.com/quantum-theory-werner-heisenberg-quotes.htm]

((III)) Here’s my damage - a Quick deriv of E=m’cc:

(1) Planck: E=hf (for any EM photon)
(2) DeBroglie: Lambda = h/p (for matter-wave theory)
(3) Hertz: Lambda
f = c (for any EM photon)
(4) Einstein: So, E=pc (for any EM photon)

 Destroy an amount of matter and shoot the energy out at the speed of light, and you sub the ad-hoc idea:  m' -> p'/c

 so: E=(m')*c*c, where m' increases with v.

In the end: E=sqrt((m_0cc)^2 + (p’*c)^2)

Hope it’s not too much brain damage.