Jump back to chapter selection.


Table of Contents

4.1 Describing the Light Field
4.2 Multipole Expansion of the Light Field
4.3 Microscopic Approach to Magneto-Optics


4 Light Matter Interaction

To investigate light-matter interaction rigorously, a quantum mechanical treatment is necessary. We describe an undisturbed physical system (such as an atom or a unit cell in a crystal) by a Hilbert space spanned by a set of mutually orthogonal eigenstates |n of the unperturbed Hamiltonian H^0. The energy eigenvalues are given by:

H^0|n=ϵn|n.

The eigenstates form an orthonormal basis:

Ψn(r)Ψn(r)d3rn|n=δn,n.

The matrix elements of the unperturbed Hamiltonian in this basis are diagonal:

n|H^0|n=ϵnδn,n.

This orthogonality implies that transitions between different eigenstates, for instance from a ground state |g to a final state |f (where fg), are not possible in an unperturbed, time-independent system due to energy conservation and the absence of a coupling mechanism. However, if the system experiences an external perturbation, such as an incident light field, this perturbation is added to the Hamiltonian as an interaction term H^int:

H^=H^0+H^int.

For an electron in an atom, H^0=p22m+V(r). The interaction term H^int mixes the eigenstates of H^0, creating new perturbed eigenstates of H^. This mixing, which can also be viewed as inducing transitions between the states |n, is described by the off-diagonal matrix elements of H^int:

n|H^int|n(generally non-diagonal for nn).

Now, a transition |g|f becomes possible, mediated by H^int.

4.1 Describing the Light Field

So far, much of the physics was treated classically, where perturbations lead to forces, described by equations of motion. We will continue the description in terms of generalised position ri and momentum pi coordinates, using Hamiltonian mechanics. This approach maintains the formal structure of classical mechanics while providing a pathway to quantum mechanics. Hamilton's equations of motion are:

r˙i=H^piandp˙i=H^ri,

for i=x,y,z. The potential term V(r) in H^=H^kin+V^(r) is defined up to an arbitrary constant; a common gauge choice is limrV(r)=0.

Let us consider charged particles moving in an electromagnetic field described by the scalar potential Φ(r,t) and the vector potential A(r,t). The Hamiltonian for a particle of charge q (for an electron, q=e) and mass m is given by:

H^=(pqA)22m+qΦ.

The potentials Φ and A are related to the electric field E and magnetic induction B by:

B=×AandE=ΦAt.

These definitions automatically satisfy two of Maxwell's equations: B=0 and ×E=Bt.
The potentials are not unique; they can be changed by a gauge transformation:

AA=A+χandΦΦ=Φχt,

where χ(r,t) is an arbitrary scalar function. This transformation does not change the physical fields E and B. The equations of motion derived from this Hamiltonian are:

r˙i=H^pi=1m(piqAi),p˙i=H^ri=qmj(pjqAj)AjriqΦri=qjr˙jAjriqΦri.

Taking the time derivative of mr˙i=piqAi:

mr¨i=p˙iqdAidt=p˙iq(Ait+jr˙jAirj).

Substituting p˙i:

mr¨i=qjr˙jAjriqΦriqAitqjr˙jAirj=q(ΦriAit)+qjr˙j(AjriAirj).

Recognising Ei=ΦriAit and

(r˙×(×A))i=jr˙j(iAjjAi)=jr˙j(Aj/riAi/rj)

using Levi-Civita notation for cross product, we arrive at the Lorentz force law:

mr¨=qE+q(r˙×B).

Thus, the Hamiltonian form correctly describes the motion and implies the "minimal substitution" principle for introducing electromagnetic interactions: replace the canonical momentum p with the kinetic momentum pqA (or p+eA for electrons, q=e) in the kinetic energy term of the field-free Hamiltonian.
Summing over all crystal electrons and adding the energy of the electromagnetic field itself yields the complete Hamiltonian function:

H=electrons[(pkqkA(rk,t))22mk+qkΦ(rk,t)]+12(ε0E2+1μ0B2)d3r=electrons[(pk22mk+qkΦ)Matter (H^0)+(qkmkApk+qk22mkA2)Light-Matter Interaction (H^int)]+12(ε0E2+1μ0B2)d3rEM Field Energy.

(Assuming Coulomb gauge A=0, so Ap=pA).

4.2 Multipole Expansion of the Light Field

The complete analysis of this Hamiltonian is quite tedious. In many optical phenomena, a less complex approach based on a multipole expansion of the interaction term yields meaningful results. We are primarily interested in the light-matter interaction terms. For typical light intensities encountered in linear optics and many nonlinear optics scenarios (but not strong-field physics), the term quadratic in A (A2) is often much smaller than the term linear in A (Ap) and can be neglected. Therefore, the dominant interaction term is often taken as:

H^intkqkmkA(rk,t)pk.

Consider a single electron (q=e,m=me). We describe the vector potential of the light field as a monochromatic plane wave with angular frequency ωL, wavevector kL, and polarisation vector e^:

A(r,t)=A0e^cos(kLrωLt)=A0e^2(ei(kLrωLt)+ei(kLrωLt)).

We omit the explicit time dependence when calculating matrix elements for transitions induced by absorption or emission of a photon ωL. The interaction Hamiltonian (considering one Fourier component) is effectively H^intpe^eikLr.
The transition matrix element between an initial state |g and a final state |f is:

f|H^int |gf|e^peikLr|g.

For many light-matter interactions at optical or lower frequencies, the wavelength of light λL=2π/|kL| is much larger than the spatial extent a0 of the relevant electronic wavefunctions (such as atomic or molecular dimensions). This means kLra0/λL1 over the region where ΨfΨg is significant. This allows for a Taylor expansion of the exponential term, known as the multipole expansion:

eikLr=1+ikLr12(kLr)2+.

As a starting point, we consider the zeroth-order term (electric dipole approximation): eikLr1. The matrix element becomes:

f|H^int (0)|gf|e^p|g.

Using the commutation relation [H^0,r]=ip/me, we have p=ime[H^0,r].
Thus, f|p|g=ime(EfEg)f|r|g=imeωfgf|r|g.
The matrix element becomes proportional to e^f|(er)|gmeωfgee^dfg, where dfg=f|(er)|g is the electric dipole transition moment. The interaction Hamiltonian in this approximation is effectively H^ED=dEopt, where Eopt=A/t=iωLA for the relevant Fourier component.

Now let us consider the first-order term in the expansion, ikLr. The interaction matrix element is:

f|H^int (1)|gf|(e^p)(ikLr)|g.

The operator (e^p)(kLr) can be decomposed into terms corresponding to magnetic dipole (MD) and electric quadrupole (EQ) interactions:

(e^p)(kLr)=12((e^p)(kLr)(kLp)(e^r))+12((e^p)(kLr)+(kLp)(e^r)).

The first part (antisymmetric) is related to the magnetic dipole interaction (kL×e^)(r×p)=(kL×e^)L. Since BoptkL×EoptkL×e^, this term gives rise to H^MD=μBopt.
The second part (symmetric) is related to the electric quadrupole interaction H^EQijQijiEj.
With correct factors and including electron spin S, the interaction Hamiltonians are approximately:

The effective "light fields" in this multipole interaction picture are Eopt and Bopt (and field gradients for EQ), while the "matter moments" are the electric dipole P (from d), magnetisation M (from μ), and electric quadrupole density Q (from Qij). Both magnetic dipole and electric quadrupole interactions arise from the first-order term in the eikLr expansion and are typically much weaker than electric dipole interactions at optical frequencies unless ED transitions are forbidden by selection rules.


4.3 Microscopic Approach to Magneto-Optics

In the earlier chapter, we described magneto-optic effects using a classical harmonic oscillator model, which led to a dielectric tensor of the form:

εr(ω)=(ε1ig0igε1000ε2).

However, this macroscopic approach does not explain the origin or magnitude of the tensor components, particularly the gyrotropic factor g. This requires a microscopic, quantum mechanical approach. The harmonic oscillator model provides an intuition that resonances are dominant factors. The microscopic details are usually derived using quantum mechanical perturbation theory.

We distinguish three regimes of light intensity for light-matter interaction:

  1. Small light intensities (Linear Optics): First-order perturbation theory is sufficient. Only single light-matter interaction processes (absorption/emission of single photons) are significant. This gives rise to linear optical phenomena, described by linear susceptibilities. The interaction matrix element is f|H^int|g.
  2. Intense light fields (Nonlinear Optics): Higher-order perturbation theory is required. Multiple light-matter interactions become important, leading to nonlinear effects such as frequency conversion (second harmonic generation, sum/difference frequency generation). Mathematically, such interactions involve sequences of matrix elements, for instance, n2n1|H^int|n2n2|H^int|n3.
  3. Very intense light fields (Strong-Field Physics): The perturbative expansion breaks down. This regime gives rise to phenomena like high-order harmonic generation and above-threshold ionisation.

4.3.1 Microscopy for Linear Optics

In linear optics, the induced polarisation P(ω) is linearly related to the electric field E(ω) via the electric susceptibility tensor χe(ω):

Pi(ω)=ε0j(χe)ij(ω)Ej(ω)with(εr)ij(ω)=δij+(χe)ij(ω).

(Nonlinear optics considers terms in E2,E3, etc.).
A quantum mechanical expression for the linear susceptibility (derived from first-order time-dependent perturbation theory, considering electric dipole transitions) is:

(χe)ij(ω)=Nε0ng(ρg(0)ρn(0))[(di)gn(dj)ngωngωiΓng+(dj)gn(di)ngωng+ω+iΓng].

Here:

A real excitation involves transfer of population to state |n, requiring energy conservation ωωng. A virtual excitation does not change the population of |n; the system briefly enters state |n and returns to |g, allowed by the energy-time uncertainty principle ΔEΔt.

4.3.2 Local Field Corrections

In condensed matter, the electric field experienced by a single atom/molecule (the local field Eloc) is not generally the same as the macroscopic average electric field E (or an externally applied field Eext). This difference arises from the polarisation of the surrounding medium, which creates depolarisation fields.

Eloc(ω)Emacro(ω).

The macroscopic polarisation P is related to Emacro via the macroscopic susceptibility χe: P=ε0χeEmacro.
Microscopically, P is related to the average induced dipole moment per unit volume, which depends on the atomic/molecular polarisability αmol and the local field: P=NαmolEloc.
For a dense medium, particularly for a spherical or ellipsoidal sample in a uniform external field, or atoms in a cubic lattice site, the local field can be related to the macroscopic field by the Lorentz local field correction (for isotropic media or cubic symmetry):

Attachments/Notes/Crystal Optics in Intense Light Fields/4 Light-Matter Interaction/Script (Unofficial) 4.webp|700

Eloc=Emacro+P3ε0.

Using P=ε0(εr1)Emacro:

Eloc=Emacro+εr13Emacro=εr+23Emacro.

Combining P=NαmolEloc and P=ε0(εr1)Emacro, we get:

ε0(εr1)Emacro=Nαmolεr+23Emacro.

This leads to the Clausius-Mossotti relation (or Lorentz-Lorenz equation if using refractive index n2=εr):

Nαmol3ε0=εr1εr+2orNαmol3ε0=n21n2+2.

The macroscopic susceptibility is then related to the molecular polarisability by:

χe=εr1=Nαmol/ε01Nαmol/(3ε0).

Or, the polarisation expressed using Emacro:

P=ε0NαmolEloc=ε0Nαmolεr+23Emacro,

which allows finding the macroscopic susceptibility χe=Nαmolεr+23.

4.3.3 Microscopy of the Faraday Effect

We can now connect the macroscopic description of the Faraday effect (characterised by the gyrotropic factor g in the dielectric tensor) to the microscopic polarisability.
Recall Di=ε0j(εr)ijEj. The off-diagonal elements of εr induced by the magnetic field, ±ig, are responsible for Faraday rotation. This g is related to a change in macroscopic susceptibility δχe due to the magnetic field, modified by local field factors.
The complex Faraday rotation per unit length Φ~F (containing rotation ΦF and ellipticity/dichroism ΦF) is:

Φ~F=ΦF+iΦF=ωg2cn0,

where n0 is the refractive index in absence of Bext, and g is the off-diagonal element of εr.
The parameter g reflects the difference in the material's response to LCP (σ) and RCP (σ+) light, induced by the magnetic field. This difference arises from Zeeman splitting of energy levels. Microscopically, g is related to the difference in atomic/molecular polarisability for LCP and RCP light, δα=α(σ+)α(σ).

gN(δα)LLF,

where LLF is a Lorentz local field factor, such as (n02+23)2 if δα relates to microscopic fields.
The difference δα can be calculated from the quantum mechanical expression for χij(ω) by considering transitions allowed by σ+ and σ light (which have different selection rules for ΔmJ) between Zeeman-split levels. The relevant part of the transition matrix elements will be |(d+)ng|2|(d)ng|2, where d±=dx±idy are dipole operators for circular polarisations.
The complex Faraday rotation Φ~F can then be written as a sum over transitions:

Φ~F=NωLLF4ε0cn0ng(ρg(0)ρn(0))(|(d+)gn|2|(d)gn|2ωngωiΓng+non-resonant).

This expression can be separated into real and imaginary parts. The real part (ΦF), describing rotation, is often expressed using oscillator strengths f± for σ± transitions and a dispersive line shape function φD(ω,ωng):

ΦFng(f+,ngf,ng)φD(ω,ωng).

The imaginary part (ΦF), describing magnetic circular dichroism (MCD), is related to an absorptive line shape function.
Two limiting cases are often discussed:

Diamagnetic type: The ground state |g is non-degenerate (e.g., singlet), and the magnetic field splits the excited states |n. The splitting ΔEn=±μBgnBz is often symmetric (ωng±=ωn0±Δn). If the oscillator strengths f+,ngf,ngfn0 are nearly equal, the Faraday rotation arises from the difference of two oppositely shifted dispersive profiles. Near resonance ωωn0, and for small splitting ΔnΓn0:

ΦF(dia)ωΔnfn0(ωn0ω)2+Γn02.

This lineshape is symmetric (dispersive-looking) around ωn0.

Attachments/Script (Unofficial) 2.webp|700

Paramagnetic type: The ground state |g is degenerate and split by the magnetic field. The populations of these split ground sub-levels, ρg±(0), become unequal due to thermal Boltzmann distribution, proportional to magnetisation M(T). Transitions from these sub-levels to an excited state |n (which may also be split) have different strengths for σ+ and σ light, or the same oscillator strength but different populations. If the excited state splitting is negligible compared to linewidth:

ΦF(para)(ρg+(0)ρg(0))fn0φA(ω,ωn0)M(T)φA(ω,ωn0),

where φA(ω,ωn0) is an absorptive-looking (antisymmetric if χ is taken) lineshape function peaked near ωn0.

Attachments/Script (Unofficial) 3.webp|700

Typically, paramagnetic rotation is larger than diamagnetic rotation, especially at low temperatures where M(T) is large. The ratio scales as ΦF,max(dia)/ΦF,max(para)Δn/Γn0(1/m(T)), which is often less than 1.

4.3.4 Important Interactions

The energy levels |n and transition frequencies ωng are determined by a hierarchy of interactions in an atom/ion within a crystal:

Attachments/Script (Unofficial) 4.webp|700