
Table of Contents

1.1 Microscopic Form of Maxwell's Equations in Vacuum
1.2 Maxwell's Equations in a Medium
1.3 The Material Equations
1.4 Macroscopic Approximation
1.5 Wave Equation
1.6 Solutions to the wave equation
1.7 Polarisation
1.8 Poynting Vector and Poynting's Theorem
1.9 Timescales
1.10 Momentum of Light

1 Electromagnetic Theory of Light

Light is an electromagnetic wave governed by the same theoretical principles that describe all forms of electromagnetic radiation. It consists of coupled oscillating electric and magnetic fields.

1.1 Microscopic Form of Maxwell's Equations in Vacuum

We begin with the simplest case by considering the electric and magnetic fields in free space, meaning there are no charges or currents present. The governing equations are Maxwell's equations:

\begin{aligned} \nabla \cdot E & = 0, \\ \nabla \cdot B & = 0, \\ \nabla \times E & = - \frac{\partial B}{\partial t}, \\ \nabla \times B & = ε_{0} μ_{0} \frac{\partial E}{\partial t} . \end{aligned}

Here, $ε_{0}$ is the permittivity of free space, and $μ_{0}$ is the permeability of free space. These equations describe how the electric field $E$ and the magnetic field $B$ evolve in time and space; both fields are functions of position and time, $E (r, t)$ and $B (r, t)$ .

A key property of Maxwell's equations is their linearity: any linear combination of solutions remains a valid solution. This has important implications, as we will see later. These equations have been experimentally confirmed for over a century and are fundamental to classical electrodynamics.

1.2 Maxwell's Equations in a Medium

To describe electromagnetic waves in a medium, we need a framework that accounts for the charge densities and currents at the atomic scale. The microscopic form of Maxwell's equations in a medium is given by:

\begin{aligned} 1) & \nabla \cdot E & = \frac{ρ}{ε_{0}}, \\ 2) & \nabla \cdot B & = 0, \\ 3) & \nabla \times E & = - \frac{\partial B}{\partial t}, \\ 4) & \nabla \times B & = μ_{0} (j + ε_{0} \frac{\partial E}{\partial t}), \end{aligned}

where $j$ is the total microscopic current density and $ρ$ is the total microscopic charge density. It is worth giving a meaning to each equation:

Gauss' Law: The electric field originates from charges. Positive charges act as sources, and negative charges act as sinks. The flux of $E$ through a closed surface is proportional to the enclosed charge.
Gauss' Law for Magnetism: There are no magnetic monopoles; magnetic field lines always form closed loops. This distinguishes magnetic fields from electric fields, which can have isolated point sources (charges).
Faraday's Law of Induction: A time-dependent magnetic field creates a circulating electric field. This principle underlies electromagnetic induction, which is the basis of electrical generators, transformers, and inductors.
Ampère-Maxwell Law: Magnetic fields are produced both by electric currents and by changing electric fields. The latter term, $ε_{0} \frac{\partial E}{\partial t}$ , is known as the displacement current density and allows electromagnetic waves to propagate even in the absence of actual charge flow.

While these equations describe the fundamental behaviour of electric and magnetic fields, solving them exactly in a material by tracking every individual charge is impractical. Instead, we often work with macroscopic versions of Maxwell’s equations. To achieve this, we introduce two auxiliary fields: the electric displacement field $D$ and the magnetic field $H$ (sometimes called magnetic field intensity). These quantities result from an effective spatial averaging of the microscopic fields over volumes that are large compared to atomic dimensions but small compared to the wavelength of the electromagnetic fields. This averaging is justified since atomic-scale structures are typically on the order of Angstroms or nanometres, whereas relevant optical wavelengths are often hundreds of nanometres or larger.

The macroscopic fields are defined as:

\begin{aligned} D & = ε_{0} E + P, \\ H & = \frac{1}{μ_{0}} B - M, \end{aligned}

where $P$ is the electric polarisation density (electric dipole moment per unit volume), and $M$ is the magnetisation density (magnetic dipole moment per unit volume). These definitions allow us to describe the response of the medium without explicitly tracking all individual microscopic charges and currents. In a dielectric medium, the polarisation $P$ is the macroscopic sum of the electric dipole moments induced by the electric field. The magnetisation $M$ is defined analogously for magnetic materials. Both the displacement field $D$ and the magnetic field $H$ are often referred to as auxiliary fields. The polarisation and magnetisation are related to the electric field $E$ and magnetic induction $B$ through material-dependent relations called constitutive relations. In free space, both polarisation $P$ and magnetisation $M$ are zero, so $D = ε_{0} E$ and $H = B / μ_{0}$ .

In this course, we will be mainly concerned with isotropic media, meaning that the material response is independent of direction. This implies that the dielectric function $ε_{r} (r, t)$ (relative permittivity) and the relative permeability $μ_{r} (r, t)$ are scalars (or tensors that reduce to scalars). Furthermore, in many practical optical problems, it is sufficient to solve for the electric field alone. This is because in the non-relativistic regime, the force exerted by the magnetic component of light on charges is often much weaker than that of the electric component for many interactions. This assumption is further justified by the fact that most materials relevant to optics are non-magnetic at optical frequencies. However, one must always keep in mind the presence and role of the magnetic field.

1.3 The Material Equations

Solving Maxwell's equations in a medium requires explicit relationships, known as material or constitutive equations, which describe how the medium responds to the fields. As mentioned earlier, these relationships depend on the material properties. To establish the macroscopic Maxwell's equations, we begin by separating both the total charge density $ρ$ and the total current density $j$ into free and bound contributions:

\begin{aligned} ρ (r, t) & = ρ_{f} (r, t) + ρ_{b} (r, t), \\ j (r, t) & = j_{f} (r, t) + j_{b} (r, t) . \end{aligned}

Free charges and currents are typically those that can move over macroscopic distances (like conduction electrons in a metal), while bound charges and currents are associated with localised atomic or molecular dipoles.

Our goal is to reformulate Maxwell's equations so that only free charges and currents appear explicitly as sources.
Starting from Gauss' Law in a medium, $\nabla \cdot E = (ρ_{f} + ρ_{b}) / ε_{0}$ , we can write $ε_{0} \nabla \cdot E - ρ_{b} = ρ_{f}$ .
By defining the polarisation density $P$ such that the bound charge density is given by

ρ_{b} = - \nabla \cdot P,

we substitute this into Gauss' Law: $ε_{0} \nabla \cdot E + \nabla \cdot P = ρ_{f}$ . This can be rewritten using the electric displacement $D = ε_{0} E + P$ as

\nabla \cdot D = ρ_{f} .

At this point, we have successfully removed explicit dependence on the bound charges.
A similar approach applies to the Ampère-Maxwell Law. The total current $j$ includes $j_{f}$ and $j_{b}$ . The bound current density $j_{b}$ can be expressed in terms of polarisation $P$ and magnetisation $M$ as

j_{b} = \frac{\partial P}{\partial t} + \nabla \times M .

Substituting this into the microscopic Ampère-Maxwell Law $\nabla \times B = μ_{0} (j_{f} + j_{b} + ε_{0} \frac{\partial E}{\partial t})$ :

\nabla \times B = μ_{0} (j_{f} + \frac{\partial P}{\partial t} + \nabla \times M + ε_{0} \frac{\partial E}{\partial t}) .

Rearranging gives $\nabla \times (\frac{B}{μ_{0}} - M) = j_{f} + \frac{\partial (ε_{0} E + P)}{\partial t}$ .
Using the definitions of $H = \frac{1}{μ_{0}} B - M$ and $D = ε_{0} E + P$ , this becomes

\nabla \times H = j_{f} + \frac{\partial D}{\partial t} .

We can summarise the microscopic Maxwell's equations (which are universally valid) and the macroscopic Maxwell's equations (useful for describing fields in media):

Name	Microscopic Maxwell's equations (in medium)	Macroscopic Maxwell's equations
Gauss' Law	$\nabla \cdot E = \frac{ρ_{total}}{ε_{0}}$	$\nabla \cdot D = ρ_{f}$
Gauss' Law for Magnetism	$\nabla \cdot B = 0$	$\nabla \cdot B = 0$
Faraday's Law of Induction	$\nabla \times E = - \frac{\partial B}{\partial t}$	$\nabla \times E = - \frac{\partial B}{\partial t}$
Ampère-Maxwell Law	$\nabla \times B = μ_{0} (j_{total} + ε_{0} \frac{\partial E}{\partial t})$	$\nabla \times H = j_{f} + \frac{\partial D}{\partial t}$

Additionally, the auxiliary relations defining $D$ and $H$ are:

D = ε_{0} E + P \quad \text{and} \quad H = \frac{1}{μ_{0}} B - M .

And the definitions relating bound sources to $P$ and $M$ are:

ρ_{b} = - \nabla \cdot P \quad \text{and} \quad j_{b} = \frac{\partial P}{\partial t} + \nabla \times M .
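
As a quick consistency check (a worked step that is not part of the derivation above), these definitions make the bound sources conserve charge automatically, because the divergence of a curl vanishes and the time and space derivatives commute:

\nabla \cdot j_{b} + \frac{\partial ρ_{b}}{\partial t} = \nabla \cdot \frac{\partial P}{\partial t} + \nabla \cdot (\nabla \times M) - \frac{\partial}{\partial t} (\nabla \cdot P) = 0 .

Taking the divergence of the macroscopic Ampère-Maxwell Law in the same way gives $\nabla \cdot j_{f} + \partial ρ_{f} / \partial t = 0$ , so the free charges obey their own continuity equation.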

1.4 Macroscopic Approximation

The macroscopic quantities $P$ and $M$ (and thus $D$ and $H$ ) are obtained by averaging microscopic properties over physically infinitesimal volumes that are nevertheless large enough to contain many atoms or molecules. The total charge in a macroscopic volume $V$ at position $R$ is

q_{R} = \int_{V} ρ (r^{'}) d^{3} r^{'},

while the total current through a surface element associated with this volume is related to

i_{R} = \int_{V} j (r^{'}) d^{3} r^{'} .

The electric dipole moment of the volume $V$ is defined as

p_{total, R} = \int_{V} (r^{'} - R) ρ (r^{'}) d^{3} r^{'},

while the magnetic dipole moment is

m_{total, R} = \frac{1}{2} \int_{V} (r^{'} - R) \times j (r^{'}) d^{3} r^{'} .

The free charge and free current densities are then given by averages:

ρ_{f} (R, t) = \frac{⟨ q_{f, R} ⟩}{V} \quad \text{and} \quad j_{f} (R, t) = \frac{⟨ i_{f, R} ⟩}{V},

while the macroscopic polarisation and magnetisation are dipole moments per unit volume:

P (R, t) = \frac{p_{total bound, R}}{V} \quad \text{and} \quad M (R, t) = \frac{m_{total bound, R}}{V} .

The macroscopic Maxwell equations effectively describe the fields averaged over these volumes. This approximation is valid as long as the fields do not vary significantly over the scale of the averaging volume.

To proceed further with solving problems, we require constitutive relations that link $P$ to $E$ and $M$ to $H$ (or $B$ ). These relations depend on the material's properties and are often established through a set of approximations:

Electric and magnetic field dependence: It is often assumed that $P$ depends primarily on $E$ and not on $B$ , while $M$ depends primarily on $B$ (or $H$ ) and not on $E$ . This is a good approximation for many materials at optical frequencies, although magneto-optic effects do exist where fields cross-couple.
Locality: $P (r)$ and $M (r)$ are assumed to depend only on the fields $E (r)$ and $B (r)$ at the same position $r$ . This implies that the response is local and bound charges/currents do not move significantly relative to the scale over which the fields change. This is part of the long-wavelength approximation (wavelength much larger than atomic scales). Spatial dispersion occurs when this is not true.
Homogeneity: The functional dependence of $P$ and $M$ on $E$ and $B$ respectively, does not vary with position $r$ in the medium, implying the medium is optically homogeneous.
Instantaneous Response (No Temporal Dispersion): $P$ and $M$ at time $t$ are assumed to depend only on the values of $E$ and $B$ at the same time $t$ , eliminating time integrals (convolutions) in the time domain. This assumption is only valid for optically transparent materials far from any absorption resonances. In reality, this is often a poor approximation for many materials over a broad range of frequencies, and temporal dispersion (frequency dependence of material parameters like $ε_{r} (ω)$ ) is crucial. This will be refined later.
Linearity: $P$ and $M$ are assumed to be linear functions of $E$ and $B$ (or $H$ ), respectively. This is the domain of linear optics. Non-linear optics deals with higher-order dependencies.
Isotropy: The response of the medium is assumed to be independent of the direction of the applied fields. For isotropic media, $P$ is parallel to $E$ , and $M$ is parallel to $H$ (or $B$ ). This means susceptibilities and permittivities can be treated as scalars. This assumption is violated in anisotropic materials like many crystals.

Let us examine the effect of these assumptions. With assumptions 1 and 2 (separate electric and magnetic responses, and locality), we may generally write the response at position $r$ as a functional of the field history at that same position. For instance:

\begin{aligned} P (r, t) & = \int_{- \infty}^{t} f_{P} (r, E (r, t^{'}), t - t^{'}) d t^{'} \quad \text{and} \\ M (r, t) & = \int_{- \infty}^{t} f_{M} (r, H (r, t^{'}), t - t^{'}) d t^{'} . \end{aligned}

With assumptions 3 and 4 (homogeneity and instantaneous response), the explicit dependence on $r$ and the time integral drop out, and these simplify to:

\begin{aligned} P (r, t) & = f_{P} (E (r, t)) \\ M (r, t) & = f_{M} (H (r, t)) . \end{aligned}

With assumption 5 (linearity), we can expand $P$ and $M$ in a Taylor series and keep only the linear terms (assuming no permanent dipole moments $P^{(0)}, M^{(0)}$ , or that they are zero for symmetry reasons):

\begin{aligned} P_{i} (r, t) & \approx ε_{0} \sum_{j} χ_{i j}^{(1)} E_{j} (r, t) \\ M_{i} (r, t) & \approx \sum_{j} χ_{i j}^{(m 1)} H_{j} (r, t) . \end{aligned}

Here, $χ_{i j}^{(1)}$ is the linear electric susceptibility tensor and $χ_{i j}^{(m 1)}$ is the linear magnetic susceptibility tensor. Higher-order terms (like $χ^{(2)}, χ^{(3)}$ ) are the focus of non-linear optics, but for this course, low-intensity light and linear responses are generally assumed unless stated otherwise.
Finally, with assumption 6 (isotropy), the susceptibility tensors reduce to scalars multiplied by the identity tensor, so $χ_{i j}^{(1)} = χ δ_{i j}$ and $χ_{i j}^{(m 1)} = χ_{m} δ_{i j}$ . This leads to:

\begin{aligned} P (r, t) & \approx ε_{0} χ E (r, t) \quad \text{and} \\ M (r, t) & \approx χ_{m} H (r, t) . \end{aligned}

The relative permittivity (dielectric constant) $ε_{r}$ and relative permeability $μ_{r}$ are then defined as:

ε_{r} = 1 + χ \quad \text{and} \quad μ_{r} = 1 + χ_{m} .

Therefore, we can write the macroscopic constitutive relations as:

D = ε_{0} ε_{r} E \quad \text{and} \quad B = μ_{0} μ_{r} H .

In optics, we generally deal with non-magnetic media, so $M \approx 0$ , which implies $χ_{m} \approx 0$ and thus $μ_{r} \approx 1$ . In such cases, $B \approx μ_{0} H$ .
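
The chain from the susceptibility tensor to the scalar constitutive relation can be checked numerically. The following is a minimal sketch, assuming a hypothetical isotropic medium with $χ = 1.25$ and an arbitrary illustrative field; the function names are not from these notes.

```python
# Vacuum permittivity in F/m (rounded CODATA value).
EPS0 = 8.8541878128e-12


def polarisation(chi, e_field):
    """Linear response P_i = eps0 * sum_j chi_ij E_j for a 3x3 tensor chi."""
    return [EPS0 * sum(chi[i][j] * e_field[j] for j in range(3))
            for i in range(3)]


def displacement(e_field, p_field):
    """Definition D = eps0 E + P, applied component by component."""
    return [EPS0 * e + p for e, p in zip(e_field, p_field)]


# Hypothetical isotropic medium: chi_ij = chi * delta_ij.
chi_scalar = 1.25
chi_tensor = [[chi_scalar if i == j else 0.0 for j in range(3)]
              for i in range(3)]
eps_r = 1.0 + chi_scalar

E = [100.0, -50.0, 20.0]  # V/m, arbitrary illustrative values
P = polarisation(chi_tensor, E)
D = displacement(E, P)

# The scalar constitutive relation D = eps0 eps_r E should agree with D above.
D_direct = [EPS0 * eps_r * e for e in E]
```

Replacing `chi_tensor` by a tensor with unequal diagonal entries would model an anisotropic crystal, in which case `D` is no longer parallel to `E`.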

1.5 Wave Equation

To describe the propagation of light, we seek an equation that relates the temporal evolution of the fields to their spatial variation. We derive this for the case of a homogeneous, isotropic, linear, and non-magnetic ( $μ_{r} = 1$ ) medium, with no free charges ( $ρ_{f} = 0$ ) or free currents ( $j_{f} = 0$ ), and initially assuming no dispersion (so $ε_{r}$ is constant).
Consider the macroscopic curl equation (Faraday's Law):

\nabla \times E = - \frac{\partial B}{\partial t} .

Apply the curl operator to both sides:

\nabla \times (\nabla \times E) = - \frac{\partial (\nabla \times B)}{\partial t} .

Using the vector identity $\nabla \times (\nabla \times A) = \nabla (\nabla \cdot A) - \nabla^{2} A$ , the left side becomes $\nabla (\nabla \cdot E) - \nabla^{2} E$ .
Since $ρ_{f} = 0$ and the medium is homogeneous, $\nabla \cdot D = ε_{0} ε_{r} \nabla \cdot E = 0$ , which implies $\nabla \cdot E = 0$ .
Thus, $\nabla \times (\nabla \times E) = - \nabla^{2} E$ .
For the right side, we use the Ampère-Maxwell Law (macroscopic, no free currents): $\nabla \times H = \frac{\partial D}{\partial t}$ .
Since $B = μ_{0} H$ (for $μ_{r} = 1$ ) and $D = ε_{0} ε_{r} E$ , we have $\nabla \times B = μ_{0} \frac{\partial (ε_{0} ε_{r} E)}{\partial t} = μ_{0} ε_{0} ε_{r} \frac{\partial E}{\partial t}$ .
Substituting these into the curled Faraday's Law:

- \nabla^{2} E = - \frac{\partial}{\partial t} (μ_{0} ε_{0} ε_{r} \frac{\partial E}{\partial t}) = - μ_{0} ε_{0} ε_{r} \frac{\partial^{2} E}{\partial t^{2}} .

This yields the wave equation for $E$ :

\nabla^{2} E = μ_{0} ε_{0} ε_{r} \frac{\partial^{2} E}{\partial t^{2}} .

A similar derivation yields the wave equation for $H$ (or $B$ ).
Both equations have the form of a generic linear wave equation. The wave propagation speed, or phase velocity $v_{p}$ , in the medium is given by $v_{p}^{2} = 1 / (μ_{0} ε_{0} ε_{r})$ .
The speed of light $c$ in vacuum is given by $c^{2} = 1 / (ε_{0} μ_{0})$ , so $c = 1 / \sqrt{ε_{0} μ_{0}}$ .
The refractive index $n$ of the medium is defined as the ratio of the speed of light in vacuum to the phase velocity in the medium:

n = \frac{c}{v_{p}} = \frac{\sqrt{ε_{0} μ_{0} ε_{r}}}{\sqrt{ε_{0} μ_{0}}} = \sqrt{ε_{r}} .

This uses the assumption of a non-magnetic medium ( $μ_{r} = 1$ ). Therefore, the wave equation can be written as:

\nabla^{2} E = \frac{n^{2}}{c^{2}} \frac{\partial^{2} E}{\partial t^{2}} = \frac{1}{v_{p}^{2}} \frac{\partial^{2} E}{\partial t^{2}} .
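
To attach numbers to these relations, the sketch below evaluates $c$ , $n$ and $v_{p}$ from the vacuum constants. The value $ε_{r} = 2.25$ is a hypothetical, glass-like example and the function names are illustrative.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, F/m (rounded CODATA value)
MU0 = 1.25663706212e-6    # vacuum permeability, H/m (rounded CODATA value)


def speed_of_light():
    """c = 1 / sqrt(eps0 mu0)."""
    return 1.0 / math.sqrt(EPS0 * MU0)


def refractive_index(eps_r):
    """n = sqrt(eps_r) for a non-magnetic medium (mu_r = 1)."""
    return math.sqrt(eps_r)


def phase_velocity(eps_r):
    """v_p = 1 / sqrt(mu0 eps0 eps_r)."""
    return 1.0 / math.sqrt(MU0 * EPS0 * eps_r)


c = speed_of_light()
eps_r = 2.25                  # hypothetical glass-like value
n = refractive_index(eps_r)   # 1.5 for this choice
v_p = phase_velocity(eps_r)   # equals c / n up to rounding
```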

1.6 Solutions to the wave equation

One fundamental solution of the wave equation is the monochromatic plane wave:

E (r, t) = E_{0} \cos (ω t - k \cdot r + ϕ),

where $E_{0}$ is the constant amplitude vector, $ω$ is the angular frequency, $k$ is the wavevector, and $ϕ$ is a phase constant. The angular frequency and the magnitude of the wavevector, $k = | k |$ , are related by the dispersion relation:

ω = v_{p} k = \frac{c}{n} k .

The relation between the wavenumber $k$ and the wavelength in the medium $λ_{n}$ is $k = 2 π / λ_{n}$ . The wavelength in vacuum is $λ_{0} = n λ_{n}$ .
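
As a small numerical illustration of the dispersion relation, the sketch below converts a vacuum wavelength into $ω$ , $k$ and $λ_{n}$ . The inputs ( $λ_{0} = 600$ nm, $n = 1.5$ ) are assumed example values.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def wave_parameters(lambda0, n):
    """Return (omega, k, lambda_n) for vacuum wavelength lambda0 and index n."""
    lambda_n = lambda0 / n           # wavelength inside the medium
    k = 2.0 * math.pi / lambda_n     # wavenumber in the medium
    omega = (C / n) * k              # dispersion relation omega = v_p k
    return omega, k, lambda_n


omega, k, lambda_n = wave_parameters(600e-9, 1.5)
```

Note that `omega` comes out independent of `n`: the frequency is fixed by the source, while the wavelength and wavenumber adapt to the medium.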

The wave equation is linear (since derivatives are linear operators), and thus any superposition of solutions is also a solution. While plane waves are simple solutions, they form a complete basis, meaning any solution to the wave equation can be expressed as a linear combination (or integral) of plane waves (Fourier decomposition).
From $\nabla \cdot E = 0$ (for a uniform medium with no free charges), it follows that plane waves are transverse:

\begin{aligned} \nabla \cdot E & = \nabla \cdot (E_{0} \cos (ω t - k \cdot r + ϕ)) \\ & = E_{0} \cdot (- k) (- \sin (ω t - k \cdot r + ϕ)) \\ & = (E_{0} \cdot k) \sin (ω t - k \cdot r + ϕ) . \end{aligned}

For this to be zero at all times and positions, we require $E_{0} \cdot k = 0$ , meaning the electric field vector is perpendicular to the direction of propagation.
Similarly for the magnetic field, a solution is:

B (r, t) = B_{0} \cos (ω t - k \cdot r + ϕ_{m}) .

The electric and magnetic fields of a plane wave are not independent. From Maxwell's equation $\nabla \times E = - \frac{\partial B}{\partial t}$ , by substituting the plane wave solutions, we find the relation:

k \times E_{0} = ω B_{0} .

This implies that $B_{0}$ is perpendicular to both $k$ and $E_{0}$ . Therefore, for a plane wave in an isotropic medium, $E$ , $B$ , and $k$ form a mutually orthogonal triad. This is illustrated in the next figure:

Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/01_Introduction_and_theoretical_foundation.webp|700

The relationship between the amplitudes can also be expressed using the wave impedance of the medium, $Z = \sqrt{μ / ε} = \sqrt{μ_{0} μ_{r} / (ε_{0} ε_{r})}$ . For non-magnetic media ( $μ_{r} = 1$ ), $Z = \sqrt{μ_{0} / (ε_{0} ε_{r})} = Z_{0} / n$ , where $Z_{0} = \sqrt{μ_{0} / ε_{0}} \approx 377 Ω$ is the impedance of free space. Then $| E_{0} | = Z | H_{0} |$ , and $H_{0} = \frac{1}{Z} (\hat{k} \times E_{0})$ , where $\hat{k} = k / | k |$ .
Because the electric and magnetic fields are orthogonal to the direction of propagation, these waves are also called transverse electro-magnetic (TEM) waves.
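
The relations $k \times E_{0} = ω B_{0}$ and $| E_{0} | = Z | H_{0} |$ can be verified with a few lines of vector arithmetic. This is a minimal sketch under assumed values (propagation along $z$ , $E_{0}$ along $x$ , $n = 1.5$ , $λ_{0} = 600$ nm); the helper functions are illustrative.

```python
import math

EPS0 = 8.8541878128e-12   # F/m
MU0 = 1.25663706212e-6    # H/m


def cross(a, b):
    """Cross product of two 3-vectors given as lists."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def norm(a):
    return math.sqrt(dot(a, a))


n = 1.5                                   # assumed refractive index
c = 1.0 / math.sqrt(EPS0 * MU0)
k_mag = 2.0 * math.pi * n / 600e-9        # wavenumber in the medium
omega = c * k_mag / n                     # dispersion relation

k_vec = [0.0, 0.0, k_mag]                 # propagation along z
E0 = [1.0, 0.0, 0.0]                      # V/m, polarised along x

B0 = [comp / omega for comp in cross(k_vec, E0)]   # from k x E0 = omega B0
H0 = [b / MU0 for b in B0]                         # non-magnetic medium
Z = math.sqrt(MU0 / (EPS0 * n ** 2))               # Z0 / n
```

With these choices `B0` points along $+ y$ , so $E$ , $B$ and $k$ form a right-handed triad.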
It is often more convenient to use complex notation:

E (r, t) = Re [{\tilde{E}}_{0} e^{i (k \cdot r - ω t)}] \quad \text{or simply} \quad E (r, t) = \tilde{E} (r) e^{- i ω t},

where $\tilde{E} (r) = {\tilde{E}}_{0} e^{i k \cdot r}$ is the complex amplitude (phasor), and ${\tilde{E}}_{0}$ may itself be complex to include the phase constant $ϕ$ . The physical field is obtained by taking the real part. Often, the $Re [\cdot]$ is dropped for brevity in intermediate calculations, but it must be reinstated when calculating real physical quantities, especially those that depend nonlinearly on the fields, such as intensity or the Poynting vector.

1.7 Polarisation

The polarisation of light describes the orientation of the electric field vector oscillation. For a plane wave propagating in the $z$ -direction ( $k = k \hat{z}$ ), the electric field vector $E_{0}$ lies in the $x - y$ plane.
Linear polarisation means that the electric field vector oscillates along a fixed straight line in the $x - y$ plane:

E (z, t) = E_{0} (\hat{x} \cos α_{p} + \hat{y} \sin α_{p}) \cos (ω t - k z + ϕ),

where $α_{p}$ is the angle of the polarisation direction with respect to the $x$ -axis.

The underlying physics of how materials respond to polarised light relates to how their constituent charges (and thus dipole moments) interact with the electric field. A microscopic electric dipole moment is $p = q L$ , where $q$ is charge and $L$ is the vector separating charges. The macroscopic polarisation $P$ is the vector sum of these microscopic dipole moments per unit volume: if there are $N$ dipoles per unit volume, and $⟨ p ⟩$ is the average dipole moment, then $P = N ⟨ p ⟩$ .

Polarisation is important in the interaction of light with matter: the amount of light reflected from or transmitted through a surface depends on it (Fresnel equations), as does the amount of light absorbed in many materials. This is even more general - light scattering is often polarisation dependent. The refractive index itself can be polarisation dependent in anisotropic materials.
Light does not have to be linearly polarised. The general case is elliptical polarisation, where the tip of the electric field vector traces an ellipse in the $x - y$ plane over one optical cycle.

This is best shown graphically. In the following figures, a wave (red) propagates in the $z$ -direction. Its projections onto the $x$ - and $y$ -axes are shown in green and blue, respectively.

Linear polarisation - the total electric field vector oscillates along a straight line in the $x - y$ plane:

Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/linear.gif|700

Elliptical polarisation - the total electric field vector traces an ellipse in the $x - y$ plane:
Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/elliptical.gif|700

Circular polarisation - the total electric field vector traces a circle in the $x - y$ plane:
Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/circular.gif|700

It becomes clear that circular polarisation is a special case of elliptical polarisation, where the $x$ - and $y$ -amplitudes of the electric field components are equal ( $| E_{0 x} | = | E_{0 y} |$ ), and their phase difference is $\pm π / 2$ . The case of a $+ π / 2$ phase difference (e.g., $E_{y}$ leads $E_{x}$ ) can define right-hand circular polarisation (RHCP) by one convention, while a $- π / 2$ phase difference defines left-hand circular polarisation (LHCP) (conventions vary, often depending on whether viewed from source or receiver).
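
The statement about equal amplitudes and a $\pm π / 2$ phase difference can be checked by sampling the field tip over one optical cycle. The sketch below uses unit amplitudes and an illustrative helper `field_tip`; it deliberately does not take a side on the handedness convention.

```python
import math


def field_tip(phase, amp_x, amp_y, delta):
    """Transverse field (E_x, E_y) at optical phase (omega t - k z).

    delta is the phase shift of E_y relative to E_x.
    """
    return (amp_x * math.cos(phase), amp_y * math.cos(phase + delta))


# Sixteen sample points spread over one optical cycle.
phases = [2.0 * math.pi * i / 16 for i in range(16)]

# Equal amplitudes, pi/2 phase difference: the tip stays on a circle.
circular = [field_tip(p, 1.0, 1.0, math.pi / 2) for p in phases]
radii = [math.hypot(ex, ey) for ex, ey in circular]

# Equal amplitudes, zero phase difference: the tip stays on the line E_x = E_y.
linear = [field_tip(p, 1.0, 1.0, 0.0) for p in phases]
```

Any other combination of amplitudes and phase difference gives a varying radius, which is the elliptical case.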
A linear polariser is an optical element that transmits light of a specific polarisation while blocking light of the orthogonal polarisation. If $\hat{e}$ is the unit vector along the transmission axis of the polariser, then for a given input electric field $E_{in}$ , the output field is its projection onto this axis:

E_{out} = (\hat{e} \cdot E_{in}) \hat{e} .

Here the transmission axis can be written as $\hat{e} = \cos θ_{p} \, \hat{x} + \sin θ_{p} \, \hat{y}$ , where $θ_{p}$ is the angle of the polariser's axis with respect to the $x$ -axis.
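
The projection rule for an ideal linear polariser is easy to try out numerically. The sketch below is an illustration with assumed inputs (input field along $x$ , axis at 60° and at 90°); the resulting $\cos^{2}$ dependence of the transmitted power is the well-known Malus's law, which is not derived in these notes.

```python
import math


def polariser_output(e_in, theta_p):
    """Project a transverse field (E_x, E_y) onto the transmission axis.

    theta_p is the angle of the axis with respect to the x-axis, in radians.
    """
    ex, ey = math.cos(theta_p), math.sin(theta_p)   # unit vector e_hat
    amplitude = ex * e_in[0] + ey * e_in[1]         # e_hat . E_in
    return (amplitude * ex, amplitude * ey)         # (e_hat . E_in) e_hat


# Input linearly polarised along x, polariser axis at 60 degrees.
e_out = polariser_output((1.0, 0.0), math.radians(60.0))
transmitted_fraction = e_out[0] ** 2 + e_out[1] ** 2   # relative to |E_in|^2

# Crossed polariser (axis at 90 degrees) blocks the field.
crossed = polariser_output((1.0, 0.0), math.pi / 2)
```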

1.8 Poynting Vector and Poynting's Theorem

Light carries energy. The rate and direction of electromagnetic energy flow per unit area are described by the Poynting vector $S$ , defined for instantaneous real fields as:

S = E \times H .

Its units are Watts per square metre (W/m $^{2}$ ). Note that for calculating instantaneous power flow, real physical fields $E (r, t)$ and $H (r, t)$ must be used, not their complex representations directly, as energy and power are real, non-linear quantities in terms of fields. The Poynting vector indicates that the energy flow is orthogonal to both the electric and magnetic fields.

The Poynting theorem expresses energy conservation for electromagnetic fields. It states that the rate of decrease of electromagnetic energy stored within a volume equals the rate of energy flowing out through the surface of that volume plus the rate of work done by the fields on the free charges within the volume. In differential form:

- \frac{\partial u}{\partial t} = \nabla \cdot S + j_{f} \cdot E, \quad \text{or} \quad \nabla \cdot S + \frac{\partial u}{\partial t} = - j_{f} \cdot E .

The electromagnetic energy density $u$ in a linear, isotropic medium is given by:

u = \frac{1}{2} (E \cdot D + B \cdot H) .

The Poynting theorem represents the conservation or balance of energy: the power flow out of a volume plus the rate of increase of stored energy within that volume equals the negative of the power delivered to free charges (Ohmic losses if $j_{f} = σ E$ ). This form is valid for media where $u$ is well-defined as above (e.g., non-dispersive or carefully treated dispersive cases). The term $j_{f} \cdot E$ is the rate per unit volume at which the fields do work on the free charges, converting electromagnetic energy to other forms (like heat); it appears with a minus sign because it removes energy from the field.

To gain some intuition, the sign of the divergence of the Poynting vector indicates whether a point is a net source or a net sink of energy flow:

If $\nabla \cdot S > 0$ at a point, more energy flows away from that point than into it. By the theorem, this outflow must be fed by a decrease in the stored energy ( $\partial u / \partial t < 0$ ) or by energy supplied to the field by the charges ( $j_{f} \cdot E < 0$ ).
If $\nabla \cdot S < 0$ at a point, there is a net flow of energy into that point, which either increases the stored energy or is delivered to the free charges (for example as Ohmic heating).

Proof of Poynting's Theorem:
We use real, instantaneous fields and currents. Start with Maxwell's curl equations (macroscopic form):

(1) $\nabla \times E = - \frac{\partial B}{\partial t}$
(2) $\nabla \times H = j_{f} + \frac{\partial D}{\partial t}$

Take the dot product of (1) with $H$ : $H \cdot (\nabla \times E) = - H \cdot \frac{\partial B}{\partial t}$ .
Take the dot product of (2) with $E$ : $E \cdot (\nabla \times H) = E \cdot j_{f} + E \cdot \frac{\partial D}{\partial t}$ .
Subtract the second result from the first:

H \cdot (\nabla \times E) - E \cdot (\nabla \times H) = - H \cdot \frac{\partial B}{\partial t} - E \cdot \frac{\partial D}{\partial t} - E \cdot j_{f} .

Using the vector identity $\nabla \cdot (E \times H) = H \cdot (\nabla \times E) - E \cdot (\nabla \times H)$ , the left side is $\nabla \cdot (E \times H)$ .
So, $\nabla \cdot (E \times H) = - (H \cdot \frac{\partial B}{\partial t} + E \cdot \frac{\partial D}{\partial t}) - E \cdot j_{f}$ .
For linear, non-dispersive media, $D = ε_{0} ε_{r} E$ and $B = μ_{0} μ_{r} H$ , so $E \cdot \frac{\partial D}{\partial t} = E \cdot (ε_{0} ε_{r} \frac{\partial E}{\partial t}) = \frac{1}{2} \frac{\partial}{\partial t} (ε_{0} ε_{r} E \cdot E) = \frac{\partial}{\partial t} (\frac{1}{2} E \cdot D)$ .
Similarly, $H \cdot \frac{\partial B}{\partial t} = \frac{\partial}{\partial t} (\frac{1}{2} H \cdot B)$ .
Thus, the term in parentheses is $\frac{\partial u}{\partial t}$ , where $u = \frac{1}{2} (E \cdot D + H \cdot B)$ .
Substituting $S = E \times H$ , we get:

\nabla \cdot S = - \frac{\partial u}{\partial t} - E \cdot j_{f},

which rearranges to the Poynting theorem: $\nabla \cdot S + \frac{\partial u}{\partial t} = - j_{f} \cdot E$ .
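
The theorem can also be checked numerically for the simplest case. The sketch below takes a plane wave in vacuum with $j_{f} = 0$ ( $E$ along $x$ , $H$ along $y$ , propagation along $z$ ) and estimates $\partial u / \partial t + \partial S_{z} / \partial z$ by central finite differences. The wavelength, amplitude, step sizes and sample points are assumed illustrative values.

```python
import math

EPS0 = 8.8541878128e-12   # F/m
MU0 = 1.25663706212e-6    # H/m
C = 1.0 / math.sqrt(EPS0 * MU0)
Z0 = math.sqrt(MU0 / EPS0)

E0 = 1.0                  # V/m, assumed amplitude
WAVELENGTH = 600e-9       # m, assumed vacuum wavelength
K = 2.0 * math.pi / WAVELENGTH
OMEGA = C * K


def fields(z, t):
    """Real fields E_x and H_y of the plane wave at position z and time t."""
    e_x = E0 * math.cos(OMEGA * t - K * z)
    return e_x, e_x / Z0


def energy_density(z, t):
    """u = (1/2)(E.D + B.H) evaluated in vacuum."""
    e_x, h_y = fields(z, t)
    return 0.5 * (EPS0 * e_x ** 2 + MU0 * h_y ** 2)


def poynting_z(z, t):
    """z-component of S = E x H."""
    e_x, h_y = fields(z, t)
    return e_x * h_y


def residual(z, t):
    """Finite-difference estimate of du/dt + dS_z/dz (should vanish)."""
    dt = 1e-4 * 2.0 * math.pi / OMEGA
    dz = 1e-4 * WAVELENGTH
    du_dt = (energy_density(z, t + dt) - energy_density(z, t - dt)) / (2.0 * dt)
    ds_dz = (poynting_z(z + dz, t) - poynting_z(z - dz, t)) / (2.0 * dz)
    return du_dt + ds_dz


# Compare the residual with the natural size of each term, eps0 E0^2 omega.
scale = EPS0 * E0 ** 2 * OMEGA
points = [(0.0, 0.0), (1.3e-7, 4.0e-16), (5.0e-7, 1.1e-15)]
worst = max(abs(residual(z, t)) for z, t in points) / scale
```

Each of the two derivatives is individually of order `scale`, so a value of `worst` far below one indicates that they cancel as the theorem requires.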

Lastly, an animation to illustrate the electromagnetic wave and its Poynting vector in vacuum:

Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/vacuum.gif|700

1.9 Timescales

If one optical cycle lasts $T_{0} = 2 π / ω_{0}$ , which is about $2 fs$ for visible light (vacuum wavelength near $600 nm$ ), and typical laser pulse durations are $T_{pulse}$ and measurement durations are $T_{m}$ , then generally we have:

T_{pulse} ≫ T_{0} \quad \text{and} \quad T_{m} ≫ T_{0} .
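
To get a feeling for the separation of timescales, the sketch below computes the optical period and the number of cycles inside a pulse. The wavelength (600 nm) and pulse duration (100 fs) are assumed example values.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s


def optical_period(lambda0):
    """T0 = lambda0 / c = 2 pi / omega0 for vacuum wavelength lambda0."""
    return lambda0 / C


def cycles_in_pulse(pulse_duration, lambda0):
    """Number of optical cycles that fit into a pulse of the given duration."""
    return pulse_duration / optical_period(lambda0)


T0 = optical_period(600e-9)                   # about 2 fs
n_cycles = cycles_in_pulse(100e-15, 600e-9)   # about 50 cycles
```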

The duration of one pulse is typically many optical cycles, while the duration of measurement can range from nanoseconds to milliseconds or longer. We can often separate the electric field into a slowly-varying envelope $A (r, t)$ , and a fast oscillation at the carrier frequency $ω_{0}$ :

E (r, t) = Re [A (r, t) e^{i ω_{0} t}] .

This is depicted in the next figure. The overall pulse shape (left box-like behaviour in the example) is captured by the slowly-varying envelope $A (r, t),$ while the fast (second, right) oscillation is captured by the $e^{i ω_{0} t}$ term:

Attachments/Notes/2025 Quantum Electronics/1 Electromagnetic Theory of Light/01_Introduction_and_theoretical_foundation 8.webp|700

Let us next explicitly calculate the instantaneous Poynting vector for such fields. If we define the complex envelopes $\tilde{E} (r, t)$ and $\tilde{H} (r, t)$ such that the physical fields are $E_{phys} = Re [\tilde{E} (r, t) e^{i ω_{0} t}]$ and $H_{phys} = Re [\tilde{H} (r, t) e^{i ω_{0} t}]$ (where $\tilde{E}$ and $\tilde{H}$ are the slowly varying complex amplitudes; note that this section uses the $e^{+ i ω_{0} t}$ sign convention, whereas Section 1.6 used $e^{- i ω t}$ , and physical results do not depend on this choice):

\begin{aligned} S (r, t) & = E_{phys} (r, t) \times H_{phys} (r, t) \\ & = \frac{1}{2} Re [\tilde{E} (r, t) \times {\tilde{H}}^{*} (r, t)] \\ & \quad + \frac{1}{2} Re [\tilde{E} (r, t) \times \tilde{H} (r, t) e^{i 2 ω_{0} t}] . \end{aligned}

The first term is slowly varying, while the second term oscillates rapidly at $2 ω_{0}$ . Therefore, the Poynting vector has slowly varying contributions which change over timescales of the pulse envelope, and fast contributions changing over timescales of the optical cycle (through $2 ω_{0}$ ).

To obtain a measure for the average energy flux over a measurement time interval $T_{m} ≫ T_{0}$ , we calculate

⟨ S (r, t) ⟩_{T_{m}} = \frac{1}{T_{m}} \int_{t - T_{m} / 2}^{t + T_{m} / 2} S (r, t^{'}) d t^{'} .

If $T_{m}$ is much longer than the optical period $T_{0} = 2 π / ω_{0}$ , the fast oscillating terms (at $2 ω_{0}$ ) average to zero over $T_{m}$ . Thus, if the envelopes $\tilde{E}$ and $\tilde{H}$ are approximately constant over $T_{m}$ (or $T_{m}$ is chosen as an integer multiple of optical periods and envelopes vary slowly within $T_{m}$ ), only the slowly-varying term contributes significantly to the average:

⟨ S (r, t) ⟩_{T_{m}} \approx \frac{1}{2} Re [\tilde{E} (r, t) \times {\tilde{H}}^{*} (r, t)] .

For stationary, monochromatic fields, the complex amplitudes $\tilde{E} (r)$ and $\tilde{H} (r)$ are independent of time (the envelope is constant). The time-averaged Poynting vector is then

⟨ S (r) ⟩ = \frac{1}{2} Re [\tilde{E} (r) \times {\tilde{H}}^{*} (r)] .

This time-averaged Poynting vector is often used to define the optical intensity $I$ :

I (r) \equiv | ⟨ S (r) ⟩ | .

The intensity is therefore the magnitude of the time-averaged Poynting vector, and its units are typically Watts per square metre (W/m $^{2}$ ) or Watts per square centimetre (W/cm $^{2}$ ). By measuring intensity, we average over the fast varying optical cycle, and thus typically discard direct phase information of the optical field.
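
The claim that the $2 ω_{0}$ term averages out can be tested directly. The sketch below compares $\frac{1}{2} Re [\tilde{E} \times {\tilde{H}}^{*}]$ with a brute-force time average of the real fields for a monochromatic plane wave in vacuum ( $\tilde{E}$ along $x$ , $\tilde{H}$ along $y$ , so only the $z$ -component of $S$ is non-zero). The amplitude, frequency and sampling parameters are assumed values.

```python
import math

Z0 = 376.730313668  # impedance of free space in ohm (rounded)


def average_poynting_complex(e_amp, h_amp):
    """z-component of (1/2) Re[E~ x H~*] for E~ along x and H~ along y."""
    return 0.5 * (e_amp * h_amp.conjugate()).real


def average_poynting_numeric(e_amp, h_amp, omega, cycles=50, samples_per_cycle=200):
    """Average of E_phys * H_phys over an integer number of optical cycles."""
    period = 2.0 * math.pi / omega
    n_samples = cycles * samples_per_cycle
    total = 0.0
    for i in range(n_samples):
        t = i * period / samples_per_cycle
        rot = complex(math.cos(omega * t), math.sin(omega * t))  # e^{i omega t}
        total += (e_amp * rot).real * (h_amp * rot).real
    return total / n_samples


omega0 = 2.0 * math.pi * 5.0e14   # rad/s, assumed optical frequency
e_tilde = complex(3.0, 4.0)       # V/m, complex amplitude with |E~| = 5
h_tilde = e_tilde / Z0            # A/m, plane wave in vacuum

s_complex = average_poynting_complex(e_tilde, h_tilde)
s_numeric = average_poynting_numeric(e_tilde, h_tilde, omega0)
intensity = abs(s_complex)        # I = |<S>| = |E~|^2 / (2 Z0) in W/m^2
```

The phase of `e_tilde` has no effect on `intensity`, which illustrates why an intensity measurement discards the optical phase.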

1.10 Momentum of Light

Light carries not only energy but also momentum. The momentum density of an electromagnetic field in a medium with refractive index $n$ is taken here in the Minkowski form, $g = D \times B = S / v_{p}^{2} = n^{2} S / c^{2}$ (for a linear, non-magnetic, non-dispersive medium). The total momentum $p_{field}$ contained in a volume is $\int_{V} g d V$ .
When light is absorbed or reflected by an object, it exerts a force (radiation pressure) due to the transfer of momentum.

The momentum $Δ p$ transferred to an object that completely absorbs an incident light pulse of energy $U_{pulse}$ propagating in a medium of refractive index $n$ is $Δ p = (U_{pulse} / c) n \hat{k}$ , where $\hat{k}$ is the direction of light propagation.
For a light beam with time-averaged Poynting vector $⟨ S ⟩$ (intensity $I = | ⟨ S ⟩ |$ ) normally incident on a perfectly absorbing surface of area $A$ for a duration $Δ t$ , the total energy incident is $U = I A Δ t$ . The momentum transferred is:

Δ p = \frac{n}{c} U = \frac{n}{c} I A Δ t .

If the surface is inclined such that its normal makes an angle $θ$ with the incident Poynting vector $⟨ S ⟩$ , the projected area normal to the beam is $A_{⊥} = A \cos θ$ . The energy incident on area $A$ is $I A \cos θ Δ t$ . Thus, the magnitude of momentum transferred to a perfectly absorbing disk is:

⟨ Δ p ⟩ = \frac{n}{c} I_{0} Δ t A \cos θ,

where $I_{0}$ is the incident intensity.

The linear momentum of light finds applications such as in laser cooling and trapping of atoms, optical tweezers, and proposals for light sails for spacecraft. The force exerted is the radiation pressure. For example, for a high-power ultrafast laser with pulse duration $10 fs$ , $10 PW$ peak power, and an intensity of $10^{23} {W/cm}^{2}$ , the radiation pressure $P_{rad} = I / c$ (for normal incidence, perfect absorption in vacuum) can be immense, $P_{rad} \approx (10^{23} \times 10^{4} {W/m}^{2}) / (3 \times 10^{8} m/s) \approx 3.3 \times 10^{18} Pa$ , which is about $3 \times 10^{13}$ bar.
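
The momentum and pressure formulae of this section can be evaluated for the laser example above. The sketch below assumes, purely for illustration, a flat-top pulse whose illuminated area is the peak power divided by the intensity; the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def momentum_transferred(intensity, area, duration, theta=0.0, n=1.0):
    """Momentum given to a perfect absorber: (n / c) I A cos(theta) dt."""
    return (n / C) * intensity * area * math.cos(theta) * duration


def radiation_pressure(intensity, n=1.0):
    """Pressure on a perfect absorber at normal incidence: n I / c."""
    return n * intensity / C


intensity = 1.0e23 * 1.0e4        # 10^23 W/cm^2 converted to W/m^2
peak_power = 10.0e15              # 10 PW
duration = 10.0e-15               # 10 fs
area = peak_power / intensity     # assumed flat-top spot, 1e-11 m^2

p_rad = radiation_pressure(intensity)       # about 3.3e18 Pa
p_rad_bar = p_rad / 1.0e5                   # about 3.3e13 bar
dp = momentum_transferred(intensity, area, duration)   # pulse energy / c
dp_tilted = momentum_transferred(intensity, area, duration,
                                 theta=math.radians(60.0))
```

Under these assumptions the pulse energy is 100 J, so `dp` is only about $3 \times 10^{- 7}$ kg m/s; the pressure is enormous because this momentum arrives within 10 fs on a very small area.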