Wolverhampton's Physics Blog

Warning: include_once(/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache-phase1.php): Failed to open stream: No such file or directory in /home3/camilope/public_html/wlmblog/wp-content/advanced-cache.php on line 22

Warning: include_once(): Failed opening '/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache-phase1.php' for inclusion (include_path='.:/opt/cpanel/ea-php83/root/usr/share/pear') in /home3/camilope/public_html/wlmblog/wp-content/advanced-cache.php on line 22

Warning: include(/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache-base.php): Failed to open stream: No such file or directory in /home3/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache.php on line 137

Warning: include(/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache-base.php): Failed to open stream: No such file or directory in /home3/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache.php on line 137

Warning: include(): Failed opening '/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache-base.php' for inclusion (include_path='.:/opt/cpanel/ea-php83/root/usr/share/pear') in /home3/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache.php on line 137

Warning: include_once(/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/ossdl-cdn.php): Failed to open stream: No such file or directory in /home3/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache.php on line 174

Warning: include_once(): Failed opening '/home1/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/ossdl-cdn.php' for inclusion (include_path='.:/opt/cpanel/ea-php83/root/usr/share/pear') in /home3/camilope/public_html/wlmblog/wp-content/plugins/wp-super-cache/wp-cache.php on line 174

Warning: Cannot modify header information - headers already sent by (output started at /home3/camilope/public_html/wlmblog/wp-content/advanced-cache.php:22) in /home3/camilope/public_html/wlmblog/wp-includes/feed-rss2.php on line 8
Wolverhampton's Physics Blog https://camilopez.org/wlmblog What I cannot create, I cannot understand Sat, 06 Jul 2019 09:01:23 +0000 en-US hourly 1 https://wordpress.org/?v=7.0.2 https://camilopez.org/wlmblog/wp-content/uploads/2018/09/cropped-01-Logo-2-32x32.png Wolverhampton's Physics Blog https://camilopez.org/wlmblog 32 32 Our first quantitative look at the magnetic breaking experiment https://camilopez.org/wlmblog/2019/07/05/our-first-quantitative-look-at-the-magnetic-breaking-experiment/ Fri, 05 Jul 2019 09:35:07 +0000 http://camilopez.org/wlmblog/?p=223 One of our favourite outreach setups is the magnetic pipe experiment: two pellets fall at the same rate in vacuum, but pass them through a pipe and one of the two takes ages to get through. If you peep in, you’ll see it being mysteriously slowed down without touching anything. What is going on?

One is a magnet. Although the pipe itself is not magnetic (the effect would otherwise be quite easy to understand), there is some sort of magnetic interaction. This is a manifestation of magnetic braking, itself a direct consequence of Maxwell’s equations as they turn into Faraday’s and Lenz’s laws. Faraday’s law tells you that a changing magnetic field creates an electric field. It is really the Maxwell-Faraday equation that tells you that:

$${\displaystyle \nabla \times \mathbf {E} =-{\frac {\partial \mathbf {B} }{\partial t}}}$$

where ${\frac {\partial \mathbf {B} }{\partial t}}$ means that the magnetic field $\mathbf{B}$ is changing and this causes a curl (or rotation) of the electric field $\mathbf{E}$.

If there is a conductor nearby, the electric field will set charges in motions, creating a current, and that is really where Faraday’s law gets on stage: it tells you the force of the so-called electromotive force that pushes the electrons in the conductor as a result of the changing $\mathbf{B}$. Now another of the Maxwell’s equations, that named after Ampère, says that an electric current $\mathbf{F}$ creates a magnetic field (and also that a changing electric field creates a magnetic field, because everything’s changing I give you the full equation: ${\nabla \times \mathbf {B} =\mu _{0}\left(\mathbf {J} +\varepsilon _{0}{\frac {\partial \mathbf {E} }{\partial t}}\right)}$). Lenz’s law tells us, basically, that the magnetic field created by the current itself created by the changing magnetic field opposes the change, that is, if the external magnetic field grows, the generated current will create an opposite magnetic field to keep it small, if the external magnetic field shrinks, the generated current will create an opposite magnetic field to keep it big. That comes as a minus sign in the equation that relates the electromotive force $\mathcal {E}$ and the change of the change (${d\over dt}$) of the magnetic flux $\int_\mathrm{Area}\mathbf {B} \cdot d\mathbf{S}$ (which is how the change of the magnetic field is to be tracked):

$$\displaystyle {\mathcal {E}}=-{d\over dt}\int_\mathrm{Area}\mathbf {B} \cdot d\mathbf{S}$$

It’s a lot of stuff for something so simple as a magnet falling through a pipe. There is something generated dynamically, which causes a back-reaction, leading to some equilibrium. Beautiful Physics. That deserves to be studied in details. So far, we only use it as a striking (and mysterious, if you have not heard of it before) demo with only the gist of the explanation. At some point, we will put a couple of students to study this in details as part of their Electromagnetism project. But our first quantitative study started on the occasion of a visit from our neighbours from the Oldbury wells school, who spent a morning with us along with their Physics teacher Mal (whom we meet at our IOP meetings) and with whom we discussed various aspects of free fall last Thursday. Regarding this particular experiment, we asked them to work out the velocity of the magnetic pellet inside the pipe. In however way they see fit.

What they did is best recorded through the recollection of one of the students, Annabel. She writes:

The aim of the experiment was to find the average velocity of the magnet over various distances in the tube. The idea was that if we were to take the average velocity for the magnet to fall the whole length of the tube, as the magnet would start at a higher velocity and decelerate before reaching terminal velocity, this reading would be higher than terminal velocity. However, if we measured its average velocity from say, half way down the tube to the bottom, this should give a lower value, and eventually, at some point along the tube and below, the measurements of average velocity will all be equal, and so this implies the magnet has reached terminal velocity.

To collect data for this, after measuring the total length of the tube, we made markings on the tube at various points and measured and recorded the distance between each marking and the bottom of the tube. Then we used the magnetic field detecter to sense where the magnet was in the tube. For each marking, we held the magnetic field detector level with the marking, as when this moved, it suggested the magnet was passing the mark inside the tube, this was when we started the timer. The timer was stopped when the magnet emerged at the end of the tube. Therefore the time recorded was the time taken for the magnet to travel the measured distance from the mark to the end of the tube. We did this 3 times to get an average time measurement for each distance. I have attached a picture of the results recorded.

This is the picture of the results recorded:

Neat and tidy. Even collecting several points for each measurement (we didn’t ask for that). But in this form, the results turn out to be very difficult to interpretate. Annabel and friends carried on to obtain the distance the magnet fell (from the mark to the end of the tube) and the average time taken for the magnet to fall this distance, from which they got average velocities for each distance. These are the results:

That gives an average speed of 0.288 m/s. Even working out the correct theory, we found it difficult to draw any conclusions on the terminal velocity from the experiment made in this way.

Therefore, we gave it another shot, this time in a more natural configuration of measuring from the top of the tube, rather than from the bottom. This was carefully done by our skilled assistant, Kasia, who got the following results for the same “observable” as above:

This is in agreement with the other measurement made in fairly different configuration. The average velocity in this case is 0.297 m/s. Now it escapes my memory if Kasia (and Gregor, she came with her own assistant on this occasion) also took error bars, that we ignored anyway, but it seems the two approaches are compatible. This approach, however, allows to measure the instantaneous speed, and gives clear and strong results on the terminal velocity:

It is around 0.29m/s indeed, but here, you also see that it is a terminal velocity (a plateau) and you can see how quickly it is reached. Beautiful. In this graph it’d be worth putting error bars, but this starts to look like a B.Sc semester project rather than a short session to get a grip of free-falling objects.

If you look at the literature, there are many interesting attempts at describing the terminal velocity of a falling magnet. Maybe the best description one can find is that of Levin et al. in the American Journal of Physics 74, 815 (2006) (also available freely on the arXiv). This link points us to a formula derived in Zangwill’s Modern Electrodynamics. In this other work, people built a device with two Hall-effect switches to directly measure the terminal velocity (as a function of the thickness of their pipe, following a more detailed theoretical model of Donoso et al. in Eur. J. Phys. 30 855). Someone else provides numerical simulations of the fields. This recent work has a great bibliography. Etc., etc, there is a lot of material available. This blog post is only our first attempt at contributing something into this direction. We will provide a closer look at both the theoretical and experimental aspects of this spectacular effect. Maybe even the young people of Oldburywells will be the one telling us the rest of the story would they come back and give this problem more thoughts and re-iterated efforst (as part of a University-student career, maybe).

]]> Two thin lenses https://camilopez.org/wlmblog/2018/11/07/two-thin-lenses/ Wed, 07 Nov 2018 05:37:16 +0000 http://camilopez.org/wlmblog/?p=205 Our Matrix Quiz for the Ray Matrices module had this entry:

Multiply two matrices for thin lenses, with focal lengths and . What do you find? What does this tell you about this compound lens?

Canvas actually allows you to typeset your answer but we cover $\LaTeX$ in 2nd semester, so we have to deal with ASCII tricks, but that doesn’t matter. What does is the answer, which is here correctly given by Dilem:

I have little to add. This is another basically correct entry, with a more loose articulation of the conclusion (“focal length equal to the sum of each lens“):

These are less successful or less inspired entries:

It’s dependent on the focal length

As the focal length is here in the singular, maybe it is meant that the compound object depends on its focal length. It sounds like a tautology, or a trivial statement. It looks like the student just tried a desperate reply that sounds like the sort of answer that could grab a few points (it did get 0.25 as it’s not clearly incorrect).

They magnify

Who is “They”? The compound lens could, or not, magnify. I think this has missed the point (which was to recognize that the compound object behaves as a lens, in the first place).

(1 , -1/f2) (-1/f1, -1/f1 x -1/f2 +1) I have no idea what this tells me

Here there seems to be some error in the computation, doubled with some confusing notation. No wonder this is totally unclear what this could tell. At least the student recognizes it’s unclear rather than making a mystifying statement that would convey the impression they have a clever idea.

The matrice for a compound lense is only reliant on -1/f

Another unclear—trying-to-guess-what-answer-could-work—statement. It could be interpreted as correct, with the implication that the compound object is a lens fully determined by its focal length, which also looks a tautology, also, $f$ is not clearly defined. So unless you know the answer, I believe it’s difficult to make much sense out of this input. Quarter of a point.

And the most popular reply:

We indeed got 2 students who left this entry blank (out of those who took the Quiz in time).

This is the reasoning (adapted from our Canvas module). The matrix of a compound lens formed by putting two thin lenses back to back is given by the product of their ray matrices:

$$M_{f_1}M_{f_2}=\begin{pmatrix}1&0\\\displaystyle-\frac1f_1&1\end{pmatrix}
\begin{pmatrix}1&0\\\displaystyle-\frac1f_2&1\end{pmatrix}=\begin{pmatrix}1&0\\\displaystyle-\frac1f_1-\frac1f_2&1\end{pmatrix}$$

We can write this in a more economical but also more insightful form:

$$M_{f_1}M_{f_2}=M_{f_1f_2/(f_1+f_2)}$$

What the notation highlights is precisely the fact that the product matrix has the same form as each indivual matrix, therefore, the compound object behaves collectively as any of its single part. So two thin-lenses put back to back make another thin lens! The only obvious difference is that we now have a more involved expression for the $C$ element of the $ABCD$ matrix, but this we can always write as $-1/f_\mathrm{compound}$, and this gives us, furthemore, an expression for the compound focal length:

$$f_\mathrm{compound}=\displaystyle\frac{f_1f_2}{f_1+f_2}$$

What would have been great is an analogy to similar results familiar to the students from other fields, something like:

Focal lengths of thin lenses add like resistances put in parallel in an electric circuit.

And this observation would lead to further questions: is there a way to assemble lenses so that their focal lengths add like “in series” ($f_1+f_2$)? The fact lenses behave in this way is why we introduce the notion of vergence of optical systems. That’s maybe close to what Dylan had in mind, but he didn’t pursue it any further, I believe because of the imprecision of his formulation. A proper statement often opens natural new questions of its own, and when such a process is started, one is not merely answering a quiz on Canvas, but is doing research already…

]]> Our radio wave setups https://camilopez.org/wlmblog/2018/10/26/our-radio-wave-setups/ Fri, 26 Oct 2018 06:53:53 +0000 http://camilopez.org/wlmblog/?p=200 Our radiowave setups have arrived and students have pioneered our first “radiowave optics”. This is our demonstration of the “filtering more yields more signal” experiment:

Interestingly, this is characterized as microwave optics by the provider, although these are short radiowaves. Indeed some nomenclature defines microwaves up to 10cm. Possibly because these are not useful for radio-applications, although I still find it strange to label micro something that is larger than your hand.

This is a first characterization of our setup, testing the inverse square law:

We like to plot these as $1/R^2$ to see if this follows the inverse square law, but in this case the natural variables are also interesting because, it doesn’t… there are oscillations, particularly notable at small distances (also at 70cm, it’s probably not an outlier). Here oscillations are caused by resonances and standing waves. The wavelength, I think I picked up from yesterday discussions in the lab, is 2.8cm, and more resolution is definitely called for to see the type of Physics that takes over geometry in this case, as well as possibly at larger distances where the law should be excellently satisfied (and there’s still plenty of signal at about 1m and the fall-off we have observed so far is sub-quadratic). That would require for students to tweak a bit the setup. So this is a good starting point. Maybe a nice student-research project, possibly combined with inverse square of gamma rays and light. There are great experiments we’re looking forward to do, including the double slit (of course) but also Bragg diffraction. So we’ll see much more of these things.

]]> Bending light https://camilopez.org/wlmblog/2018/10/24/bending-light/ Wed, 24 Oct 2018 18:50:43 +0000 http://camilopez.org/wlmblog/?p=150

Let us consider the following scenario: on a sunny day, some people decide to go to the beach and have some fun with the waves. A coastal guard is keeping an eye on the swimmers, and she notices that one of them appears to be drowning. To help the person in distress, she has to decide which path will take her to the swimmer in the least amount of time: she could try to get to there in a straight line (path A), but in this case she would have to swim most of the way. Alternatively, she could run a lot on the sand and swim the least amount of distance (path B). However, even though humans run faster than they can swim, the most efficient way to get to the surfer is making a compromise the amount of running on the sand and swimming on the water (path C).

Light behaves in this way too. When it has to go from a point $A$ to another point $B$, it will take the path that guarantees the least amount of time, and such a path, as in the case of the coast guard, may not be a straight line. Because of this, when we put a straw in a glass of water, it looks like the straw is bending. Well, it is actually the light that is bending!

While the concept of light bending was known by the ancient Greeks, it was not until the enlightenment that the description was put in mathematical terms. Willebrord Snellius showed that the relation between the angles with which light approaches the interface in one medium and leaves the interface in the other medium are related as $$\frac{\sin \theta_1}{\sin \theta_2} = \frac{v_1}{v_2}\,,$$ where $v_1$ and $v_2$ are the velocity at which light propagates on each of the media. As nothing can travel faster than the speed of light in vacuum, it is convenient to define the velocity of propagation of light in a given medium as $v=c/n$, where $n$ is the so-called index of refraction of the medium. Thus, the relation above becomes $$\frac{\sin \theta_1}{\sin \theta_2} = \frac{n_2}{n_1}\,,$$ which is the well-known Snell’s law.

Armed with this equation, we may tackle more involved problems, as the refraction of light when the interface between the two media is spherical, as shown in the figure below.

From the sketch becomes reasonable that an object placed in the medium with refraction index $n_1$ will generate an image (is it real or virtual?) on the medium with refraction index $n_2\,.$ The problem is as fundamental as it gets: we need to find the relation between the position of the object $o$, the position of the image $i$, the radius of the spherical surface $R$ and the refraction of the two media $n_1$ and $n_2$.

Let us consider a ray of light that goes from the position of the object $o$, to an arbitrary point $P$ on the spherical surface. The light impinges onto the surface at an angle $\theta_i$, as measured from the line perpendicular to the surface (shown in purple on the sketch). When passing to the medium with refraction index $n_2$, the light refracts and leaves the surface at an angle $\theta_r$ (from the sketch, does $n_2<n_1$ or $n_1<n_2$?), and continues its propagation until it reaches the optical axis.

Considering that the sum of the angles inside of a triangle is equal to $\pi$, we know that the angles in the sketch are related as follows:

$\delta + \gamma+\theta_r=\pi$.
$\beta+\gamma=\pi$.
$\alpha+\beta+(\pi-\theta_i)=\pi$.

Using 1. and 2. get can get rid of the angle $\gamma$ to find $\theta_r=\beta-\delta$, and simplifying 3. we have $\theta_i = \alpha+\beta$.

Conversely, we know that
$$\tan \alpha = \frac{PQ}{oQ}\,, \quad \quad \tan \delta = \frac{PQ}{Qi} \quad \quad \mathrm{and} \quad \quad \tan \beta = \frac{PQ}{QR}\,,$$
where, for instance, $PQ$ is the distance between the points $P$ and $Q$. Furthermore, in the paraxial approximation, in which we assume that the rays are barely separated from the optical axis, we have very smalls angles an therefore: $\tan \alpha \approx \alpha$, $\tan \delta \approx \delta$, $\tan \beta \approx \beta$, $\sin \theta_i \approx \theta_i$, $\sin \theta_r \approx \theta_r$ and notably $oQ \approx s$, $Qi \approx s’$ and $QR \approx R$ . Using these approximations we have the following equations:

From Snell’s law we have: $n_1 \theta_i = n_2 \theta_r$.
The relation $\theta_r=\beta-\delta$ becomes $\theta_r = \frac{PQ}{R} – \frac{PQ}{s’}$.
The relation $\theta_i = \alpha+\beta$ becomes $\theta_i = \frac{PQ}{s}+\frac{PQ}{R}$.

Joining these three equations we obtain
$$n_1 \left( \frac{PQ}{s}+\frac{PQ}{R} \right) =n_2 \left(\frac{PQ}{R} – \frac{PQ}{s’} \right)\,.$$
Dividing the two sides by the distance $PQ$ and rearranging the terms, we get
$$ \frac{n_1}{s} + \frac{n_2}{s’} = \frac{n_2-n_1}{R}\,,$$
which is the final form of our relation.

]]> A wrong magnification? https://camilopez.org/wlmblog/2018/10/10/a-wrong-magnification/ https://camilopez.org/wlmblog/2018/10/10/a-wrong-magnification/#comments Wed, 10 Oct 2018 16:00:23 +0000 http://camilopez.org/wlmblog/?p=122 A hypothetical student decided to investigate on his own (as one should) the magnification of a . He came up with this design

and reasoning:

The angles $\theta$ and $\theta’$ are equal as vertically opposite angles:

\begin{equation}
\label{eq:Wed10Oct162927BST2018}
\theta=\theta’
\end{equation}

and they can be both related to the important quantities of the problem as follows:

\begin{align}
\tan\theta=\frac{h}{s-R}\\
\tan\theta’=\frac{h’}{s’-R}\label{eq:Wed10Oct162841BST2018}
\end{align}

On the second line, I note that $h’$ is negative, which is okay as it is an algebraic quantity. Anyway, from the first equation, I can deduce that the magnification $m\equiv\displaystyle\frac{h’}{h}$ is given by

\begin{equation}
m=\frac{s’-R}{s-R}\,.
\end{equation}

However the formula which has been given in class for the magnification is:

\begin{equation}
\label{eq:Wed10Oct163138BST2018}
m=-\frac{s’}{s}\,.
\end{equation}

Can you explain to this student what is going on? (before next tutorial and/or next blog post, where an explanation will be proposed; you are welcome to share your suggestions in the comments below. If you figure it out, it should be clear what is going on).

]]> https://camilopez.org/wlmblog/2018/10/10/a-wrong-magnification/feed/ 1 The art of approximation https://camilopez.org/wlmblog/2018/10/07/the-art-of-approximation/ Sun, 07 Oct 2018 17:09:36 +0000 http://camilopez.org/wlmblog/?p=50 One of the most important skills we learn as physicists is how to approximate. The best physicists like or could turn this into a real art. Fermi for instance would tackle a question such as:

How many piano tuners are there in Chicago?

What’s your guess? Such problems became to be known as Fermi problems. By multiplying reasonable estimates (number of people in Chicago, number of people per house, percentage of households having a piano, how many times a year a piano needs tuning, how many pianos can a piano-tuner tune on a working day, etc.), one arrives at an estimate which, often, is surprisingly accurate. There is some statistics involved in the stability of such a procedure, that we will not go into today. Suffice to say that one of the most important and intriguing open questions ever posed to mankind relies on such an estimation (where is everybody? see the Drake equation).

Another useful approximation which every physicist uses is the linear approximation. This replaces an exact but awkward expression into an approximate but useful result. For instance, we have seen in the lecture on the double-slit experiment how the path difference reads straight from Pythagoras:

$$\begin{align}\Delta l&=l_2-l_1\\
&=\sqrt{(x+d/2)^2+L^2}-\sqrt{(x-d/2)^2+L^2}\tag{1}\label{eq:1}\\
&={L}\left(\sqrt{1+\left[\frac{x+d/2}{L}\right]^2}-\sqrt{1+\left[\frac{x-d/2}{L}\right]^2}\right)\\
&=\frac{xd}{L}\tag{2}\label{eq:2}\end{align}$$

and how this brings us to a useful formula—which we can keep in our collection of important results to remember—between the position of the $n$th bright fringe, at $x$, when light with wavelength $\lambda$ is projected on a screen a distance $L$ apart through a double slit of width $d$:

$$x=n\frac{\lambda L}{d}$$

The 2nd line, eq. ( $1$ ), is the exact result. The last line, eq. ( $2$ ), is the approximation when $L\gg x+d$. This is achieved by using so-called Taylor expansion, or Taylor series. Namely, we have used here the fact that:

$$\sqrt{1+\epsilon}\approx 1+\epsilon/2\tag{3}\label{eq:3}$$

which is a good approximation when $\epsilon\approx 0$. This is shown graphically below.

The blue line is ${\sqrt{1+\epsilon}}$ and the orange one is $1+\epsilon/2$. We have replaced a curvy shape by a line!

You can see how we approximate a complicated nonlinear function (here a square root) by a linear function. For instance, it is easy to compute that $\sqrt{1.012}$ is close to 1.006 (the real result is 1.00598, an error of less than 0.002%). It works fairly well even when $\epsilon$ is not that small. This is the percentage of relative error made:

and for a qualitative understanding, 10% is acceptable. So you can happily estimate $\sqrt{1.75}$ as 1.375 (by mental calculation, we’d go first with 1.35 (1+0.7/2) and the time to say that, you could add the 0.05/2=0.025 but as this is an overestimation, as shown on the first graph, it’d be wise to stop there anyway. The real result is 1.32288, an error of about 2%).

An engineer, or a computer, would want to keep the exact result. A physicist would typically look for as much simplifications as possible (but not more). Actually, so great is the temptation that we even simplify very simple things that would not seem to require it, such as $(1+x)^2\approx 1+2x$. Here it’s from the binomial theorem. But for the general case, where does this come from anyway? At the first order, simply from the derivative, which is defined as:

$$f'(x)=\lim_{\epsilon\rightarrow0}\frac{f(x+\epsilon)-f(x)}{\epsilon}$$

That’s the definition of a Mathematician. For us, it becomes:

$$f'(x)\approx\frac{f(x+\epsilon)-f(x)}{\epsilon}$$

The limit means that it becomes exact when $\epsilon$ is vanishing. For any finite value, this is only an approximation (a better one the smaller the $\epsilon$, but an approximation nonetheless). For us who, like in the army, are still happy with a 10% loss, the limit can just be overlooked. Rearranging, we get:
$$f(x+\epsilon)\approx f(x)+\epsilon f'(x)$$
If we take $f=\sqrt{}$, $x=1$ and $\epsilon$ a small number, remembering that $\sqrt{x}’=1/(2\sqrt{x})$, we arrive at the result above, eq. ( $3$ ). Two things are now in order:

We need to know this trick for all the recurring nonlinear functions we will meet (and we’ll meet a lot of them!)
Sometimes the 1st order is not enough, so it’d be good to have approximations beyond this.

This can be done thanks to the complete Taylor formula, which you will demonstrate in your Math course:

$$f(x+\epsilon)=\sum_{n=0}^\infty\frac{f^{(n)}(x)}{n!}\epsilon$$

This is an exact (Mathematical) result… although there’s no limit! it’s because we’re summing up to infinity. $f^{(n)}$ stands for the $n$th derivative. So this becomes, back to approximating (we don’t really have time for the infinite):

$$f(x+\epsilon)\approx f(x)+f'(x)\epsilon+f^{\prime\prime}(x)\frac{\epsilon^2}{2}+f^{\prime\prime\prime}(x)\frac{\epsilon^3}{6}+f^{\prime\prime\prime\prime}(x)\frac{\epsilon^4}{24}+\cdots$$

(you remember that n! is n-factorial, right?) Here we have a fourth-order expansion (and leaving open for more). The Taylor formula is also usefully (and probably more commonly) encountered in this (equivalent) form:

$$f(x)=\sum_{k=0}^\infty f^{(n)}(a)\frac{(x-a)}{n!}=f(a)+f'(a)(x-a)+f^{\prime\prime}(a)\frac{(x-a)^2}{2}+\cdots$$

Here’s an example with another important (and recurrent) nonlinear function: the inverse. This gives the geometric series $\sum x^k$:

$$\frac{1}{1-x}=1+x+x^2+x^3+\cdots$$

And this is the graphical representation.

Note that it does not apply everywhere. There is a divergence at $x=1$, and convergence is slow here (meaning that even though we put a lot of terms of the Taylor expansion, we don’t approach the correct value quickly). But around $x=0$, this is excellent.

Once we know such formulae, we can obtain many more, by combinations, e.g.:

$$\frac{1}{1+x}=1-x+x^2-x^3+x^4+\cdots$$

(which is equally important). Or:

$$\frac{1}{1+x^2}=1-x^2+x^4-x^6+\cdots$$

We will conclude with this one, because it has a very beautiful result cast in it. This is plotting below, again, the exact function $1/(1+x^2)$ (in thick blue) and various terms of the Taylor expansion for increasing numbers of terms.

There is something shocking here. The Taylor expansion also fits very well around $x=0$, but even though we add as many terms as we want, we can’t get to approximate it beyond $x=1$ (or below $x=-1$). It’s not even that we get a poor approximation, we get a diverging series of positive and negative numbers. This is actually the same as for $1/(1-x)$ where we couldn’t get to approximate the function for $|x|>1$. In the previous case, there was a divergence so we were somehow expecting troubles there. But now, this is strange because the function itself, $1/(1+x^2)$, has no divergence. It is everywhere well defined. And smooth, and so clearly wanting for a nice approximation everywhere. But even if we’re ready to pour in as many terms as we can, even up to infinity, we can’t approximate it at, say, $x=1.5$. Why is that?

We’ll need to learn of the analysis (or calculus) of other numbers to understand this apparent mystery, namely, complex numbers. When we have complex numbers in our pocket, it will become obvious why the Taylor expansion is so constrained, even for such a nicely defined function.

But before we turn to complex analysis, we need to sharpen our Taylor approximations for common real-valued function. Using the laws that you know (or will find a way to find back and remember), please compute for yourself the Taylor series for the following, extremely important functions. And share your computations with us:

$\sin(x)$
$\cos(x)$
$\tan(x)$
$\exp(x)$
$\ln(1+x)$
$(1+x)^\alpha$
$\sqrt{1+x}$ (that’s $(1+x)^{1/2}$)
$1/\sqrt{1+x}$ (that’s $(1+x)^{-1/2}$)

]]> Kepler’s laws https://camilopez.org/wlmblog/2018/09/30/keplers-laws/ Sun, 30 Sep 2018 22:36:39 +0000 http://camilopez.org/wlmblog/?p=33 2018 marks the 400th anniversary of (I’m not sure that many know about this birthday). In , the Dutch physicist reported a great discovery (his third law). It states that:

The square of the orbital period of a planet is proportional to the cube of the semi-major axis of its orbit.

Kepler’s centennial birthday – a beautiful discovery between cubes of distances and squares of times.

This resulted from many observations and a lot of fiddling with numbers (allegedly stolen from Tycho Brahe), until this mysterious relationship was discovered, a feast which impressed Einstein himself. The delight of the discovery was penned by the author of the harmonies of the world in a way unfortunately not available nowadays to scientific publications:

I first believed I was dreaming… But it is absolutely certain and exact that the ratio which exists between the period times of any two planets is precisely the ratio of the 3/2th power of the mean distance.

It was a bit strange and as most people who come with something genuinely new, poor Kepler met with much criticism. His own mentor (some now-forgotten Michael Mästlin) objected to him that “One should only treat astronomical things astronomically and not mix them with earthly physics.” The greatest breakthrough in Physics, Newton’s Universal theory of gravitation, would precisely show that Physics is what mixes earthly things like falling apples with astronomical ones like the falling moon.

We used this example during induction week as an illustration of the difference between an empirical law (Johannes’ observation cooked up into three particular laws) and a theoretical model (Newton’s theory expressed into three general and far-reaching laws). Another example we discussed is Rydberg’s formula and Bohr model of the atom, to explain the spectral lines of hydrogen.

How Newton derived Kepler’s law is a topic in Mechanics. Two of our students already knew the trick, and could reconstruct it… during the lecture! This is the note handed over by Luke after the class:

Well done! And came another, more detailed derivation from Joel, which went along the same line. To have something new to show, we asked Joel to compute the numerical value of Kepler’s constant in SI units (which we also covered) and this is what he found

That is, 7.52 MAU$^3$/day$^2$ (MAU is Mega-Astronomical units), vs 7.51 for Kepler’s average over six planets. These are the units from the Wikipedia (as also used by Kepler). They should really be in Mau$^3$/year$^2$ in which case the number is much more meaningful in itself (and is not exactly 1 for Earth, do you see why? We have a comment section if you want to contribute your thoughts).

Planetary motion will definitely make several comebacks in our Mechanics course, taught by an astro-solar-mathematico-physicist, and so will Rydberg’s formula, although it is not until the end of Year 5 that the connection will be established in this case (and so far we did not receive any back-of-the-envelope derivation for that). It’s also my birthday today, so I’ll leave the connection between empirical observations and theoretical models at that for today.

]]>