Up to now, we have mostly assumed that the interaction between electrons leads to secondary effects. This was, essentially, the message of the Fermi liquid theory, the standard model of condensed matter physics. There, the interactions of course renormalise the properties of a metal, but their description is still possible by using a language of nearly independent fermionic quasiparticles with a few modifications. Even in connection with the magnetism of itinerant electrons, where interactions proved to be crucial, the description was in terms of extended Bloch states. Many properties were determined by the band structure of the electrons in the lattice; the electrons were preferably described in -space.
However, in this chapter, we will consider situations where it is less clear whether we should describe the electrons in momentum or in real space. The problem becomes obvious with the following Gedanken experiment: We look at a regular lattice of H-atoms. The lattice constant should be large enough such that the atoms can be considered to be independent for now. In the ground state, each H-atom contains exactly one electron in the -state, which is the only atomic orbital we consider at the moment. The transfer of one electron to another atom would cost the relatively high energy of , since it corresponds to an ionisation. Therefore, the electrons remain localised on the individual H-atoms and the description of the electron states is obviously best done in real space. The reduction of the lattice constant will gradually increase the overlap of the electron wave functions of neighbouring atoms. In analogy to the molecule, the electrons can now extend over neighbouring atoms, but the cost in energy remains that of an "ionisation". Thus, transfer processes are only possible virtually; there are not yet itinerant electrons in the sense of a metal.
On the other hand, we know the example of the alkali metals, which release their outermost ns-electron into an extended Bloch state and build a metallic (half-filled) band. This would actually work well for H-atoms for a sufficiently small lattice constant too. In nature, this can only be induced by enormous pressures – metallic hydrogen probably exists in the centres of the large gas planets Jupiter and Saturn due to the gravitational pressure. Obviously, a transition between the two limiting behaviours should exist. This metal-insulator transition occurs if the gain in kinetic energy surpasses the energy cost for charge transfer. The insulating side is known as a Mott insulator.
While the obviously metallic state is reliably described by the band picture and can be sufficiently well approximated by the previously discussed methods, this viewpoint becomes obsolete when approaching the metal-insulator transition. According to band theory, a half-filled band must produce a metal, which is definitely incorrect when entering the insulating side of the transition. Unfortunately, no well-controlled approximation for the description of this metal-insulator transition exists, since there are no small parameters for a perturbation theory.
Another important aspect is the fact that, in a standard Mott insulator, each atom features an electron in the outermost occupied orbital and, hence, a degree of freedom in the form of a localised spin , in the simplest case. While charge degrees of freedom (motion of electrons) are frozen at low temperatures, the same does not apply to these spin degrees of freedom. Many interesting magnetic phenomena are produced by the coupling of these spins. Other, more general forms of Mott insulators exist as well, which include more complex forms of localised degrees of freedom, for instance, partially occupied degenerate orbital states.
8.1 Mott Transition
First, we investigate the metal-insulator transition. Its description is difficult, since it does not constitute a transition between an ordered and a disordered state in the usual sense. We will, however, use some simple considerations which will allow us to gain some insight into the behaviour of such systems.
8.1.1 Hubbard Model
We introduce a model, which is based on the tight-binding approximation we have introduced in chapter 1. It is inevitable to go back to a description based on a lattice and give up a continuum description. The model describes the motion of electrons, if their wave functions on neighbouring lattice sites only weakly overlap. Furthermore, the Coulomb repulsion, leading to an increase in energy if a site is doubly occupied, is taken into account. We include this with the lattice analogue of the contact interaction. The model, called the Hubbard model, has the form
where we consider hopping between nearest neighbours only, via the matrix element . Note that are real-space field operators on the lattice (site index ) and is the density operator. We focus on half filling, , one electron per site on average. There are two obvious limiting cases:
Insulating atomic limit: We put . The ground state has exactly one electron on each lattice site. This state is, however, highly degenerate. In fact, the degeneracy is (number of sites ), since each electron has spin , thus,
where the spin configuration can be chosen arbitrarily. We will deal with the lifting of this degeneracy later. The first excited states feature one lattice site without an electron and one doubly occupied site. This state has energy and its degeneracy is even higher, namely . Even higher excited states correspond to more empty and doubly occupied sites. The system is an insulator and the density of states is shown here:
Metallic band limit: We set . The electrons are independent and move freely via hopping processes. The band energy is found through a Fourier transform of the Hamiltonian. With
we can rewrite
where
and the sum runs over all vectors connecting nearest neighbours. Obviously, this system is metallic, with a unique ground state
Note that at half filling, whereas the bandwidth .
8.1.2 Insulating State
We consider the two lowest energy sectors for the case . The ground state sector has already been defined: one electron sits on each lattice site. The lowest excited states create the sector with one empty and one doubly occupied site (as depicted in the previous figure showing the atomic limit). With the finite hopping matrix element, the empty (holon) and the doubly occupied (doublon) site become "mobile". A
fraction of the degeneracy () is hereby lifted and the energy obtains a momentum dependence,
Even though ignoring the spin configurations here is a daring approximation, we obtain a qualitatively good picture of the situation. Note that the motion of an empty site (holon) or doubly occupied site (doublon) is not independent of the spin configuration which is altered by moving these objects. As a consequence, the holon/doublon motion is not entirely free leading to a reduction of the bandwidth. Therefore the bandwidth seen in the next figure is smaller than , in general. One notices that, with increasing , the two energy sectors approach each other, until they finally overlap. In the left panel the holon-doublon excitation spectrum is depicted by two bands, the lower and upper Hubbard bands, where the holon is a hole in the lower and the doublon a particle in the upper Hubbard band:
The excitation gap is the gap between the two bands and we may interpret this system as an insulator, called a Mott insulator. (Note, however, that this band structure depends strongly on correlation effects, for instance, spin correlation, and is not rigid like the band structure of a semiconductor.) The band overlap (closing of the gap) indicates a transition, after which a perturbative treatment is definitely inapplicable. This is, in fact, the metal-insulator transition.
8.1.3 The Metallic State
On the metallic side, the initial state is better defined since the ground state is a filled Fermi sea without degeneracy. The treatment of the Coulomb repulsion turns out to become difficult, once we approach the Mott transition, where the electrons suffer a strong impediment in their mobility. In this region, there is no straightforward way of a perturbative treatment. The so-called Gutzwiller approximation, however, provides qualitative and very instructive insight into the properties of the strongly correlated electrons.
For this approximation we introduce the following important densities:
: electron density (average number of electrons per site, for half-filling)
: density of the singly occupied lattice sites with spin
: density of the singly occupied lattice sites with spin
: density of the doubly occupied sites
: density of the empty sites
It is easily seen that and , as long as no uniform magnetisation is present. Note that determines the energy contribution of the interaction term to , which we regard as the index of fixed interaction energy sectors. Furthermore, for half-filling (),
holds. The viewpoint of the Gutzwiller approximation is based on the renormalisation of the probability of the hopping process due to the correlation of the electrons, beyond the restrictions imposed by the Pauli principle on non-interacting electrons. With this, the importance of the spatial configuration of the electrons is enhanced. In the Gutzwiller approximation, the latter is taken into account statistically by simple probabilities for the occupation of lattice sites.
We fix the density of the doubly occupied sites and investigate the hopping processes which keep constant. First, we consider an electron hopping from a singly occupied to an empty site . Hopping probability depends on the availability of the initial configuration. We compare the probability to find this initial state for the correlated and the uncorrelated () case and write
The factor will eventually appear as the renormalisation of the hopping probability and, thus, leads to an effective kinetic energy of the system due to correlations. We determine both sides statistically. In the correlated case (fixed , so and ), the joint probability for to be singly occupied and to be empty is obviously
In the uncorrelated case (where is not fixed, and average occupancy per spin state is ), we have
The case for follows accordingly (). In order to collect the total result for hopping processes which keep constant, we have to do the same calculation for the hopping process , which leads to the same result. Processes of the type change and are ignored for calculating . With this, we obtain in all cases the same renormalisation factor for the kinetic energy,
meaning, . We consider the correlations by treating the electrons as independent but with a renormalised matrix element . The energy in the sector becomes
For fixed and , we can minimise this with respect to (note that this is not a variational calculation in a strict sense, the resulting energy is not an upper bound to the ground state energy), and find
with the critical value
For , double occupancy and, thus, hopping is completely suppressed; electrons become localised. This observation by Brinkman and Rice provides a qualitative description of the metal-insulator transition to a Mott insulator, but takes into account only local correlations, while correlations between different lattice sites are not considered. Moreover, correlations between the spin degrees of freedom are entirely neglected. The charge excitations contain contributions between different energy scales: (1) a metallic part, described via the renormalised effective Hamiltonian
and (2) a part with higher energy, corresponding to charge excitations on the energy scale , meaning to excitations raising the number of doubly occupied sites by one (or more).
We can estimate the contribution to the metallic conduction. Since in the tight-binding description the current operator contains the hopping matrix element and is thus subject to the same renormalisation as the kinetic energy, we obtain
where we have used the relation for a perfect conductor (no residual resistivity in a perfect lattice), as discussed in Chapter 6. There is a high-energy part which we do not specify here. The plasma frequency is renormalised, , such that the -sum rule, also from Chapter 6, yields
For , the coherent metallic part becomes weaker and weaker,
According to the -sum rule, the lost weight must gradually be transferred to the "high-energy" contribution.
8.1.4 Fermi Liquid Properties of the Metallic State
The just discussed approximation allows us to discuss a few Fermi liquid properties of the metallic state close to the metal-insulator transition in a simplified way. Let us investigate the momentum distribution. According to the above definition,
where the sum runs over all in the Fermi sea (FS). One can show within the above approximation that the distribution is constant within () and outside () the Fermi surface for finite , such that, for in the first Brillouin zone,
and
Taking into account particle-hole symmetry, which means
we are able to determine and ,
With this, the jump in the distribution at the Fermi energy is equal to , which corresponds to the quasiparticle weight :
Quasiparticle weight as the jump in the momentum distribution.
For it vanishes; the quasiparticles cease to exist for . Without going into the details of the calculation, we provide a few Fermi liquid parameters. It is easy to see that the effective mass ratio is
and thus
where is related to the bare electron mass (for instance, for a simple lattice) and the density of states . Furthermore,
It follows that the compressibility vanishes for as expected, since it becomes more and more difficult to compress the electrons or to add more electrons, respectively. The insulator is, of course, incompressible. The spin susceptibility diverges because of the diverging density of states . This indicates that local spins form, which exist as completely independent degrees of freedom at . Only the antiferromagnetic correlation between the spins would lead to a renormalisation, which turns finite. This correlation is, as mentioned above, neglected in the Gutzwiller approximation. The effective mass diverges and shows that the quasiparticles are more and more localised close to the transition, since the occupation of a lattice site is getting more rigidly fixed to . This can be observed within the Gutzwiller approximation in the form of local fluctuations of the particle number. For this, we introduce the density matrix of the electron states on an arbitrary lattice site,
from which we deduce the variance of the occupation number,
The deviation from single occupation vanishes with , meaning with the approach of the metal-insulator transition. Note that the fluctuation-dissipation theorem connects to the compressibility. As a last remark, it turns out that the Gutzwiller approximation is well suited to describe the strongly correlated Fermi liquid .
8.2 The Mott Insulator as a Quantum Spin System
One of the most important characteristics of the Mott insulator is the presence of spin degrees of freedom after the freezing of the charge. This is one of the most profound features distinguishing a Mott insulator from a band insulator. In our simple discussion, we have seen that the atomic limit of the Mott insulator provides us with a highly degenerate ground state, where a spin- degree of freedom is present on each lattice site. We lift this degeneracy by taking into account the kinetic energy term (for ). In this way new physics appears on a low-energy scale, which can be described by an effective spin Hamiltonian. Prominent examples for such spin systems are transition-metal oxides like the cuprates or vanadates , .
8.2.1 The Effective Hamiltonian
In order to employ our perturbative considerations, it is sufficient to observe the spins of two neighbouring lattice sites and to consider perturbation theory for discrete degenerate states. Here, this is preferably done in real space. There are 4 degenerate configurations, . The application of yields
where, in the last two cases, the resulting states (such as , denoting site 1 doubly occupied and site 2 empty) have an energy higher by and lie outside the ground state sector. Thus, it becomes clear that we have to proceed to second order perturbation, where the states of higher energy will appear only virtually:
We obtain the matrix elements for the effective Hamiltonian in the degenerate subspace
where represents an intermediate state like or , such that the energy denominator is . We end up with (for distinct initial and final states within the subspace)
Note that the signs originate from the anti-commutation properties of the Fermion operators during the specific evaluation of matrix elements of . In the subspace , the Hamiltonian matrix is . The eigenstates of the respective secular equations are,
Since the states and have energy (no hopping possible in second order that returns to these states within the degenerate manifold without changing spins), the sector with total spin (triplet: if we associate the state with the singlet and the other combinations form the triplet. This requires care: the states have energy . The symmetric combination corresponds to (ground state, spin singlet ). The antisymmetric combination (part of spin triplet ) has energy .
An effective Hamiltonian with the same energy spectrum for the spin configurations can be written with the help of the spin operators and on the two lattice sites (where are spin angular momentum operators with eigenvalues for , and )
Using the common form where would give eigenvalues for triplet and for singlet.
The provided form is .
For (triplet), . .
For (singlet), . . This matches the derived energies.
This mechanism of spin-spin coupling is called superexchange, a mechanism introduced by P.W. Anderson. Since this relation is valid between all neighbouring lattice sites, we can write the total Hamiltonian as
This model, reduced to spins only, is called the Heisenberg model. The Hamiltonian is invariant under a global spin rotation, generated by , where .
Thus, the total spin is a good quantum number, as we have seen in the two-spin case. The coupling constant (as defined: ) is positive and favors an antiparallel alignment of neighboring spins (since is then minimized). The ground state is therefore not ferromagnetic.
8.2.2 Mean Field Approximation of the Antiferromagnet
There are a few exact results for the Heisenberg model, but not even the ground state energy can be calculated exactly (except in the case of the one-dimensional spin chain which can be solved by means of a Bethe Ansatz). The difficulty lies predominantly in the treatment of quantum fluctuations; the zero-point motion of coupled spins. It is easiest seen already with two spins, where the ground state is a singlet and maximally entangled. The ground state of all antiferromagnetic systems is a spin singlet (). In the so-called thermodynamic limit () there is long-range antiferromagnetic order in the ground state for dimensions . Contrary, the fully polarised ferromagnetic state (ground state for a model with ) is known exactly, and as a state with maximal spin quantum number it features no quantum fluctuations.
In order to describe the antiferromagnetic state, we apply the mean field approximation again. We can characterise the equilibrium state of the classical Heisenberg model (spins as simple vectors without quantum properties) by splitting the lattice into two sublattices and , where each -site has only -sites as neighbors, and vice-versa. Lattices which allow for such a separation are called bipartite. There are lattices where this is not possible, for instance, triangular or face-centred cubic lattices. There, frustration phenomena appear, a further complication of antiferromagnetically coupled systems. On the - (-) sublattice, the spins point up (down). This is unique up to a global spin rotation. Note that this spin configuration doubles the unit cell.
We introduce the respective mean field,
and the mean-field Hamiltonian on a site interacting with its neighbours is approximated by replacing terms like with .
This leads to the mean field Hamiltonian
with the coordination number , the number of nearest neighbors ( in a simple cubic lattice). Note the constant term has been adjusted for standard mean-field derivation. It is simple to calculate the partition sum of this Hamiltonian,
The free energy per spin is consequently given by
At fixed temperature, we minimise the free energy with respect to to determine the thermal equilibrium state, meaning set and find
Actually, a staggered magnetic field would be another equilibrium variable (next to the temperature). We set it to zero. This is the self-consistency equation of the mean field theory. It provides a critical temperature (Néel temperature), below which the mean moment is finite. For approaches 0 continuously. Thus, can be found from a linearised self-consistency equation,
and thus (using )
This means that scales with the coupling constant and with . The larger and the more neighbours are present, the more stable is the ordered state. At infinite , the mean field approximation becomes exact. For close to , we can expand the free energy in ,
This is a Landau theory for a phase transition of second order, where a symmetry is spontaneously broken. The breaking of the symmetry (from the high-temperature phase with high symmetry to the low-temperature phase with low symmetry) is described by the order parameter . The minimisation of with respect to yields
8.3 Collective Modes - Spin Wave Excitations
Besides its favourable properties, the mean field approximation also has a number of insufficiencies. Quantum and some part of thermal fluctuations are neglected, and the insight into the low-energy excitations remains vague. As a matter of fact, as in the case of the ferromagnet, collective excitations exist here. In order to investigate these, we write the Heisenberg model in its spin components,
In the ordered state, the moments shall be aligned along the -axis.
To observe the dynamics of a flipped spin, we apply the operator on the ground state , and determine the spectrum by solving the resulting eigenvalue equation
with the ground state energy . Using the spin-commutation relations (for angular momentum operators including )
then yields the equation (after linearisation, replacing )
where and are the mean-field values on sites and . We decouple this complicated problem by replacing the operators by their mean fields. Therefore, we have to distinguish between and sublattices (), such that we end up with two equations,
Choosing
we introduce the operators
Inserting and into the equations of motion and using the definitions for and (as Fourier sums of operators) leads to:
where and represent the sums of spin operators for sublattices A and B respectively, and for a simple cubic lattice. For a non-trivial solution , the determinant must vanish:
This eigenvalue equation is easily solved leading to the description of spin waves in the antiferromagnet. The energy spectrum is given by
Note that only the positive energies make physical sense. It is interesting to investigate the limit of small , where for some constant .
Then .
This means that, in contrast to the ferromagnet, the spin waves of the antiferromagnet have a linear low-energy spectrum (as shown in the figure below). The same applies here if we expand the spectrum around (folding of the Brillouin zone due to the doubling of the unit cell). After a suitable normalisation, the operators and (or their Bogoliubov-transformed combinations) are of bosonic nature. This is an approximation where spin operators are treated as bosons, often via Holstein-Primakoff transformation, especially for small deviations from the ordered state. For instance, (constant for fixed ).
The zero-point fluctuations of these bosons yield quantum fluctuations, which reduce the moment from its mean field value. In a one-dimensional spin chain these fluctuations are strong enough to suppress antiferromagnetic order even for the ground state. The fact that the spectrum starting at zero energy has to do with the infinite degeneracy of the ground state (continuous symmetry of spin rotation). This property is known under the name Goldstone theorem, which states that each ordered state resulting from the spontaneous breaking of a continuous symmetry has collective excitations with arbitrarily small (positive) energies. The linear spectrum is normal for collective excitations of this kind; the quadratic spectrum of the ferromagnet is also a Goldstone mode but with different power due to conserved total magnetisation.
These spin excitations show the difference between a band and a Mott insulator very clearly. While in the band insulator both charge and spin excitations have an energy gap and are inert, the Mott insulator has only gapped charge excitations. However, the spin degrees of freedom form a low-energy sector that can even form gapless excitations, as shown above.