Jump back to chapter selection.
Table of Contents
5.1 Plane Waves and the Helmholtz Equation
5.2 Paraxial Approximation
5.3 Fresnel Approximation
5.4 The Fraunhofer Limit: Far Field
5.5 Diffraction Patterns - Amplitude Modulation
5.6 Fourier Optics with a Lens
5.7 Holography
5.8 Paraxial Ray Optics
5 Fourier Optics
In this chapter, we will use a plane wave expansion of a monochromatic field to study light propagation through an optical system. The simplest of these systems is free space. It will soon become clear why this chapter is specifically titled 'Fourier' Optics.
5.1 Plane Waves and the Helmholtz Equation
As we have seen previously, an arbitrary function may often be constructed from a sum or integral of harmonic functions (plane waves) of different frequencies and complex amplitudes. This principle extends to multiple dimensions: an arbitrary spatial function
In the figures above, we describe the optical wave with a scalar function
As described in more detail here, a linear system is characterised by its impulse response or, equivalently, by its response to a harmonic function (its transfer function).
We begin the discussion of Fourier optics by recalling the wave equation for the electric field in a source-free, homogeneous, linear, isotropic, and non-dispersive medium (as derived in Chapter 1):
For a monochromatic wave, we may write the electric field as
with
For such waves, we define the notion of a wavefront: a wavefront is a surface of constant phase. That is, if
We can see that the wavefronts change curvature upon propagation in a Gaussian beam. For a plane wave, the wavefronts are planes. As we may expect, these wavefronts bend when passing through optical components, such as lenses:
5.2 Paraxial Approximation
In the paraxial approximation, we assume that the light rays (normals to the wavefronts) form only small angles with the principal axis of propagation (conventionally the
For a field
The inverse transformation is:
Each component
The paraxial approximation can be formally stated as
Applying the 2D Fourier transform (with respect to
So, we obtain an ordinary differential equation for
Recognising
The solution for forward propagation in
We define the transfer function of free space propagation over distance
Thus,
Knowing
The procedure is:
- Calculate the 2D spatial Fourier transform
of the input field . - Multiply
by the transfer function to obtain . - Apply the inverse 2D spatial Fourier transform to
to obtain .
It is important to remember that we are working with linear systems: harmonic components (spatial frequencies
5.3 Fresnel Approximation
In the expression for the transfer function
Using the Taylor expansion
The Fresnel approximation (or paraxial wave equation approximation) consists of keeping terms only up to the first order in
This approximates the spherical wavefront segment of each plane wave component with a parabolic one. The simplified transfer function becomes:
We can write
The Fresnel approximation implies that we are observing the field at a distance
Considering the source of the wave to be spherical, the Fresnel approximation approximates these spherical wavefronts with parabolas:
The validity of the Fresnel approximation requires that the next term in the Taylor expansion of the phase,
Next, we find the impulse response
- The 2D Fourier transform of
is . - Therefore,
. - We take the inverse Fourier transform:
We find that the impulse response function
This impulse response is useful because for a linear system, the output
Substituting
This is the Fresnel diffraction integral.
The general steps to find the electric field
- Find the 2D spatial Fourier transform
of . - Multiply by the appropriate transfer function
to obtain . - Apply the inverse 2D spatial Fourier transform to
to obtain .
5.4 The Fraunhofer Limit: Far Field
The Fraunhofer approximation, or far-field diffraction, is a limit of the Fresnel approximation valid at sufficiently large distances
The condition for Fraunhofer diffraction is often expressed using the Fresnel number
If
The crucial simplification in the Fraunhofer limit arises from approximating the quadratic phase term in the Fresnel integral
We expand
The field
Recognising
The Fraunhofer approximation essentially states that in the far field, the observed complex amplitude is proportional to the Fourier transform of the aperture distribution, multiplied by a phase factor. All plane wave components effectively interfere constructively only along specific directions corresponding to their
5.5 Diffraction Patterns - Amplitude Modulation
When an optical wave passes through an aperture or is otherwise spatially modulated in amplitude and/or phase, and then propagates some distance in free space, the resulting intensity distribution is called a diffraction pattern. From the discussion above, it should be clear that simply expecting the intensity pattern to be a geometric shadow of the aperture is an oversimplification, valid only in the limit of ray optics where the wave nature of light is ignored.
Consider an aperture described by an aperture function
If an incident wave
5.5.1 Rectangular Aperture
Consider a rectangular aperture of width
The 2D Fourier transform
where
Here
This result is expected, as the Fourier transform of a rectangular function (top-hat) is a sinc function. The intensity pattern is shown next:
5.5.2 Circular Aperture
Consider a circular aperture of diameter
where
This is the characteristic Airy disk pattern shown in the next figure.
5.6 Fourier Optics with a Lens
A thin lens introduces a quadratic phase transformation to an incident wavefront. For a lens with focal length
If an object
More generally, if an object
The integral is
A common configuration is a 2f system, where an object is placed at distance
In the Fourier plane (at
Consider the image example: A standard circular aperture in the Fourier plane acts as a low-pass filter, blurring the image by removing high spatial frequencies (sharp details). An opaque stop in the centre acts as a high-pass filter, enhancing edges and removing large-scale variations, making the man's skin appear dark while highlighting hair.
A Fresnel zone plate is another optical element that can focus light, but it operates based on diffraction rather than refraction.
Its transmission function
for a binary zone plate designed for focal length
The spacing of these Fresnel zones is such that light diffracted from the transparent zones interferes constructively at the desired focal point. Zone plates are inherently chromatic, focusing different wavelengths to different focal points.
5.7 Holography
Holograms are recordings that encode the full optical wave from an object, including both its amplitude and phase information. In principle, if we could create a transparency
If these two waves overlap at
Assuming the film's amplitude transmittance
where
When this hologram is illuminated by the original reference wave
If the reference wave
: A scaled version of the reference wave (DC term). : The reference wave modulated by the object intensity (distorted image). : A wave proportional to the original object wave . This term reconstructs the virtual image of the object. : A wave related to the conjugate of the object wave. This term reconstructs the real image (often called the twin image or conjugate image).
To separate these four waves spatially, a common technique is off-axis holography, where the reference wave
This angular separation ensures that upon reconstruction, the four terms propagate in different directions, allowing the desired
Holography generally requires light sources with high temporal and spatial coherence (such as lasers) for both recording and reconstruction. Variations like volume holography (where interference fringes are recorded throughout the depth of a thick medium) and rainbow holography (which allows viewing in white light) address some of these limitations.
Generally, we differentiate two types of hologram based on recording geometry:
- Reflection hologram: The reference and object beams approach the recording material from opposite sides. Interference fringes are typically formed in planes nearly parallel to the surface. Reconstruction is usually done by illuminating the hologram from the same side as the original reference beam was incident.
- Transmission hologram: The reference and object beams approach the film from the same side. Interference fringes are generally formed nearly perpendicular to the surface. Reconstruction is done by illuminating with the reference beam, and the reconstructed object wave is viewed by looking through the hologram.
An example of a transmission hologram setup is shown in the next figure (source).
In ordinary photography, only the intensity distribution
5.8 Paraxial Ray Optics
Often, for analysing simple optical systems, a full wave-optical (Fourier) treatment is not necessary, and the simpler ray approximation (geometric optics) suffices. This is especially true when effects of diffraction can be neglected, for instance, when all apertures and beam sizes are much larger than the wavelength of light. We define a ray as the local normal to a wavefront. We will work within the paraxial approximation, meaning all rays form small angles with respect to the optical axis (conventionally the
A single ray at a transverse plane can be described by a 2D vector, commonly its radial distance
where we have used the small angle approximation
This transformation can be expressed using a matrix, called the ABCD matrix or ray-transfer matrix, which relates the output ray vector to the input ray vector:
For the thin lens described above, the ray-transfer matrix is:
Similarly, for propagation through a homogeneous medium of length
For a planar interface between homogeneous media with refractive indices
The effect of a spherical mirror with concave radius of curvature
The advantage of this matrix formalism is that the overall ray-transfer matrix for a cascade of optical elements is found by multiplying the individual matrices in the correct order (last element encountered by the ray first in the matrix product if input is on the right, or first element first if input is on the left, depending on how the output vector is written). For example, consider propagation through a medium of length
The matrix