5

I don't understand why there is necessarily a diffraction limitation on optical systems. Where does this limitation in focusing light come from?

4 Answers4

6

That's a good question, and one that looks simple but has a complicated answer. Here's my attempt at an answer with no maths - as usual in physics you'll only really understand it by getting stuck into the mathematics.

It's commonly believed that lenses work by bending the light. You see diagrams like:

lens

showing the light ray bending as it passes through the lens. This is one way of looking at it, but a more fundamental explanation is that the lens changes the phase of the plane wave passing through it. Specifically the phase change produced by the lens varies with distance away from the centre line. So on the left side of the lens we have a plane wave of constant phase, while on the right side we have a plane wave with the phase varying with distance. The result is that on the right side we get an interference pattern - we generally call the interference pattern the image, but it is an interference pattern.

Incidentally, this is why a Fresnel lens can focus light even though it's a completely different shape to your usual convex lens. The Fresnel lens produces the same phase changes as a convex lens, so it focuses light in the same way.

But back to your question: the reason that the image isn't perfect is that it's formed by interference of only a finite portion of light wave i.e. the portion passing through the lens. The bigger the lens the more of the light wave forms the interference pattern and the better the image. But to get a perfect image you need all the light, i.e. an infinitely big lens, to contribute to the interference pattern.

Mathematically, the light intensity at the focal plane is the Fourier transform of the incoming light. The integration limits of a Fourier transform are from $-\infty$ to $+\infty$, but a finite size lens restricts the integration limits and changes the intensity in the focal plane away from the perfect case. It actually convolves the light intensity in the focal plane with the Fourier transform of the aperture through which the light passes. For a round lens this means your image is convolved with an Airy disk, and this smears out the image slightly.

John Rennie
  • 367,598
2

Well I suppose by perfectly focus, you mean to a mathematical point. And if we could do that, Heisenberg's principle implies that the momentum uncertainty would be infinite. So by the time we looked where the point was supposed to be, it would have moved to someplace else, in fact it could be anywhere at all, and we would never find it.

The problem is that the wavelength is not zero.

1

I'd like to add to George Smith's answer: "The problem is that the wavelength is not zero" because a full answer to this question begins and ends with that statement, altogether independently of lenses and other hardware.

Let's begin with the monochromatic case and think about how an image, i.e. a pattern of light, forms on the focal, or indeed any plane. Any free running (as opposed to evanescent - more below) solution to Maxwell's equations can be decomposed through Fourier analysis into a superposition of plane waves. All these plane waves have the same wavelength and wavenumber - they're just heading in different directions. The interference between these waves is what forms the image at the focal (or any other) plane. So a wave with wave vector (heading direction) pointing at an angle $\theta$ to the focal plane normal varies like $\exp(i\,k\,\sin\theta\,x)$ (or, if you like, $\cos(k\,\sin\theta\,x - \omega\,t+\delta)$) where increasing $x$ is in the plane containing both the focal plane normal and the wavevector. So a wave propagating orthogonal to the focal plane has its phase fronts parallel to the focal plane: it's disturbance at a given time doesn't vary at all in the focal plane. A wave slightly tilted wave gives a low spatial frequency ($k\,\sin\theta$ radians per unit length) variation in the focal plane, $i.e.$ this is a low frequency Fourier component. The highest spatial frequency Fourier component we can get comes from waves running parallel to focal plane, and their spatial frequency is $k$ radians per unit length.

So, simply put, there are no high spatial frequency components to build an arbitrarily-swiftly-varying-with position disturbance from. But if we wanted to have a perfect spot focus, we would need arbitrarily high spatial frequencies. But we don't have these, so we're just stuck with a spot whose characteristic spatial frequencies are in the range $2\pi/\lambda$. This is exactly how an Airy disk, diffraction limited focus looks.

There is a theoretical exception to the above, and that is the evanescent field. Solutions to Helmholtz's equation (which all the Cartesian components of the electric and magnetic field vectors fulfill) fulfill $k^2 = k_x^2 + k_y^2 + k_z^2$, where $(k_x,k_y,k_z)$ is the wavevector. There are solutions to Maxwell's equations where $k_x > k$ and $k_y > k$, i.e. arbitrarily high spatial frequencies in the focal plane, but such waves have imaginary values of $k_z$. These solutions are altogether valid, but their amplitudes dwindle exponentially with distance $z$ away from their source. Theoretically we have solutions growing exponentially away from the source too, but these are unphysical - they would violate energy conservation because evanescent waves, although they are not propagating in the sense of transporting energy, are still an energy store - they represent reactive power shuttling back and forth between neighbouring regions and are thus akin to capacitive and inductive energy stores in an AC circuit.

Therefore, owing to this exponential dwindling with distance from the source, these evanescent waves cannot propagate far from their sources, and so are practically excluded from any focusing field, whose constituent spatial frequencies are thus limited to a maximum of $k$ radians per unit length.

Selene Routley
  • 90,184
  • 7
  • 198
  • 428
0

To give a further wordy answer and extend the discussion of fourier transform I offer the following.

Start with three planes, the source, the image and an aperture plane where our lens sits. The distribution of intensity in the aperture plane will, in the far field limit, be the fourier transform of the intensity in the source plane. Note the 'in the far field' implies a planar wave in the aperture plane.

Taking a lens of finite size we can only intercept a portion of this plane, and therefore the information contained in the portions outside the aperture is lost.

That collected will undergo a reverse fourier transform and an image of the source can be formed. But the information that the image is formed from is limited to that which was collected in the lens. This strongly hints that we cannot 'undo' the loss of information, or image degradation, introduced by the lens. We can make our lens arbitarily large but until it encompasses the entire infinite plane we can't get a perfect image.

Nic
  • 3,907