# Gauge theory (mathematics)

https://en.wikipedia.org/wiki/Gauge_theory_(mathematics)

In mathematics, and especially differential geometry and mathematical physics, gauge theory is the general study of connections on vector bundles, principal bundles, and fibre bundles. Gauge theory in mathematics should not be confused with the closely related concept of a gauge theory in physics, which is a field theory that admits gauge symmetry. In mathematics, theory means a mathematical theory, encapsulating the general study of a collection of concepts or phenomena, whereas in the physical sense a gauge theory is a mathematical model of some natural phenomenon.

Gauge theory in mathematics is typically concerned with the study of gauge-theoretic equations. These are differential equations involving connections on vector bundles or principal bundles, or involving sections of vector bundles, and so there are strong links between gauge theory and geometric analysis. These equations are often physically meaningful, corresponding to important concepts in quantum field theory or string theory, but also have important mathematical significance. For example, the Yang–Mills equations are a system of partial differential equations for a connection on a principal bundle, and in physics solutions to these equations correspond to vacuum solutions to the equations of motion for a classical field theory, particles known as instantons.

Gauge theory has found uses in constructing new invariants of smooth manifolds, the construction of exotic geometric structures such as hyperkähler manifolds, as well as giving alternative descriptions of important structures in algebraic geometry such as moduli spaces of vector bundles and coherent sheaves.

## History

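The ${\displaystyle dx^{1}\otimes \sigma _{3}}$ coefficient of a BPST instanton on the ${\displaystyle (x^{1},x^{2})}$-slice of ${\displaystyle \mathbb {R} ^{4}}$, where ${\displaystyle \sigma _{3}}$ is the third Pauli matrix (top left). The ${\displaystyle dx^{2}\otimes \sigma _{3}}$ coefficient (top right). These coefficients determine the restriction of the BPST instanton ${\displaystyle A}$ with ${\displaystyle g=2,\rho =1,z=0}$ to this slice. The corresponding field strength centered around ${\displaystyle z=0}$ (bottom left). A visual representation of the field strength of a BPST instanton with center ${\displaystyle z}$ on the compactification ${\displaystyle S^{4}}$ of ${\displaystyle \mathbb {R} ^{4}}$ (bottom right). The BPST instanton is a classical instanton solution to the Yang–Mills equations on ${\displaystyle \mathbb {R} ^{4}}$.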

Gauge theory has its origins as far back as the formulation of Maxwell's equations describing classical electromagnetism, which may be phrased as a gauge theory with structure group the circle group. Work of Paul Dirac on magnetic monopoles and relativistic quantum mechanics encouraged the idea that bundles and connections were the correct way of phrasing many problems in quantum mechanics. Gauge theory in mathematical physics arose as a significant field of study with the seminal work of Robert Mills and Chen-Ning Yang on so-called Yang–Mills gauge theory, which is now the fundamental model that underpins the standard model of particle physics. [1]

The mathematical investigation of gauge theory has its origins in the work of Michael Atiyah, Isadore Singer, and Nigel Hitchin on the self-duality equations on a Riemannian manifold in four dimensions. [2] [3] In this work the moduli space of self-dual connections (instantons) on Euclidean space was studied, and shown to be of dimension ${\displaystyle 8k-3}$ where ${\displaystyle k}$ is a positive integer parameter. This linked up with the discovery by physicists of BPST instantons, vacuum solutions to the Yang–Mills equations in four dimensions with ${\displaystyle k=1}$. Such instantons are defined by a choice of 5 parameters, the center ${\displaystyle z\in \mathbb {R} ^{4}}$ and scale ${\displaystyle \rho \in \mathbb {R} _{>0}}$, corresponding to the ${\displaystyle 8-3=5}$-dimensional moduli space. A BPST instanton is depicted in the figure captioned at the start of this section.

Around the same time Atiyah and Richard Ward discovered links between solutions to the self-duality equations and algebraic bundles over the complex projective space ${\displaystyle \mathbb {CP} ^{3}}$. [4] Another significant early discovery was the development of the ADHM construction by Atiyah, Vladimir Drinfeld, Hitchin, and Yuri Manin. [5] This construction allowed for the solution to the anti-self-duality equations on Euclidean space ${\displaystyle \mathbb {R} ^{4}}$ from purely linear algebraic data.

Significant breakthroughs encouraging the development of mathematical gauge theory occurred in the early 1980s. At this time the important work of Atiyah and Raoul Bott on the Yang–Mills equations over Riemann surfaces showed that gauge-theoretic problems could give rise to interesting geometric structures, spurring the development of infinite-dimensional moment maps, equivariant Morse theory, and relations between gauge theory and algebraic geometry. [6] Important analytical tools in geometric analysis were developed at this time by Karen Uhlenbeck, who studied the analytical properties of connections and curvature, proving important compactness results. [7] The most significant advancements in the field occurred due to the work of Simon Donaldson and Edward Witten.

Donaldson used a combination of algebraic geometry and geometric analysis techniques to construct new invariants of four-manifolds, now known as Donaldson invariants. [8] [9] With these invariants, novel results such as the existence of topological manifolds admitting no smooth structures, or the existence of many distinct smooth structures on the Euclidean space ${\displaystyle \mathbb {R} ^{4}}$, could be proved. For this work Donaldson was awarded the Fields Medal in 1986.

Witten similarly observed the power of gauge theory to describe topological invariants, by relating quantities arising from Chern–Simons theory in three dimensions to the Jones polynomial, an invariant of knots. [10] This work and the discovery of Donaldson invariants, as well as novel work of Andreas Floer on Floer homology, inspired the study of topological quantum field theory.

After the discovery of the power of gauge theory to define invariants of manifolds, the field of mathematical gauge theory expanded in popularity. Further invariants were discovered, such as Seiberg–Witten invariants and Vafa–Witten invariants. [11] [12] Strong links to algebraic geometry were realised by the work of Donaldson, Uhlenbeck, and Shing-Tung Yau on the Kobayashi–Hitchin correspondence relating Yang–Mills connections to stable vector bundles. [13] [14] Work of Nigel Hitchin and Carlos Simpson on Higgs bundles demonstrated that moduli spaces arising out of gauge theory could have exotic geometric structures such as that of hyperkähler manifolds, as well as links to integrable systems through the Hitchin system. [15] [16] Links to string theory and mirror symmetry were realised, where gauge theory is essential to phrasing the homological mirror symmetry conjecture and the AdS/CFT correspondence.

## Fundamental objects of interest

The fundamental objects of interest in gauge theory are connections on vector bundles and principal bundles. In this section we briefly recall these constructions, and refer to the main articles on them for details. The structures described here are standard within the differential geometry literature, and an introduction to the topic from a gauge-theoretic perspective can be found in the book of Donaldson and Peter Kronheimer. [17]

### Principal bundles

Non-trivial Z/2Z principal bundle over the circle. There is no obvious way to identify which point corresponds to +1 or -1 in each fibre. This bundle is non-trivial as there is no globally defined section of the projection π.
The frame bundle ${\displaystyle {\mathcal {F}}(E)}$ of the Möbius strip ${\displaystyle E}$ is a non-trivial principal ${\displaystyle \mathbb {Z} /2\mathbb {Z} }$-bundle over the circle.

The central objects of study in gauge theory are principal bundles and vector bundles. The choice of which to study is essentially arbitrary, as one may pass between them, but principal bundles are the natural objects from the physical perspective to describe gauge fields, and mathematically they more elegantly encode the corresponding theory of connections and curvature for vector bundles associated to them.

A principal bundle with structure group ${\displaystyle G}$, or a principal ${\displaystyle G}$-bundle, consists of a quintuple ${\displaystyle (P,X,\pi ,G,\rho )}$ where ${\displaystyle \pi :P\to X}$ is a smooth fibre bundle with fibre space isomorphic to a Lie group ${\displaystyle G}$, and ${\displaystyle \rho }$ represents a free right group action of ${\displaystyle G}$ on ${\displaystyle P}$ which preserves the fibres, in the sense that for all ${\displaystyle p\in P}$, ${\displaystyle \pi (pg)=\pi (p)}$ for all ${\displaystyle g\in G}$, and which is transitive on each fibre. Here ${\displaystyle P}$ is the total space, and ${\displaystyle X}$ the base space. Using the right group action, for each ${\displaystyle x\in X}$ and any choice of ${\displaystyle p\in P_{x}}$, the map ${\displaystyle g\mapsto pg}$ defines a diffeomorphism ${\displaystyle P_{x}\cong G}$ between the fibre over ${\displaystyle x}$ and the Lie group ${\displaystyle G}$ as smooth manifolds. Note however that there is no natural way of equipping the fibres of ${\displaystyle P}$ with the structure of Lie groups, as there is no natural choice of element ${\displaystyle p\in P_{x}}$ for every ${\displaystyle x\in X}$.

The simplest examples of principal bundles are given when ${\displaystyle G=\operatorname {U} (1)}$ is the circle group. In this case the principal bundle has dimension ${\displaystyle \dim P=n+1}$ where ${\displaystyle \dim X=n}$. Another natural example occurs when ${\displaystyle P={\mathcal {F}}(TX)}$ is the frame bundle of the tangent bundle of the manifold ${\displaystyle X}$, or more generally the frame bundle of a vector bundle over ${\displaystyle X}$. In this case the fibre of ${\displaystyle P}$ is given by the general linear group ${\displaystyle \operatorname {GL} (n,\mathbb {R} )}$.

Since a principal bundle is a fibre bundle, it locally has the structure of a product. That is, there exists an open covering ${\displaystyle \{U_{\alpha }\}}$ of ${\displaystyle X}$ and diffeomorphisms ${\displaystyle \varphi _{\alpha }:P_{U_{\alpha }}\to U_{\alpha }\times G}$ commuting with the projections ${\displaystyle \pi }$ and ${\displaystyle \operatorname {pr} _{1}}$, such that the transition functions ${\displaystyle g_{\alpha \beta }:U_{\alpha }\cap U_{\beta }\to G}$ defined by ${\displaystyle \varphi _{\alpha }\circ \varphi _{\beta }^{-1}(x,g)=(x,g_{\alpha \beta }(x)g)}$ satisfy the cocycle condition

${\displaystyle g_{\alpha \beta }(x)g_{\beta \gamma }(x)=g_{\alpha \gamma }(x)}$

on any triple overlap ${\displaystyle U_{\alpha }\cap U_{\beta }\cap U_{\gamma }}$. In order to define a principal bundle it is enough to specify such a choice of transition functions. The bundle is then defined by gluing trivial bundles ${\displaystyle U_{\alpha }\times G}$ along the intersections ${\displaystyle U_{\alpha }\cap U_{\beta }}$ using the transition functions. The cocycle condition ensures precisely that this defines an equivalence relation on the disjoint union ${\displaystyle \bigsqcup _{\alpha }U_{\alpha }\times G}$ and therefore that the quotient space ${\displaystyle P=\left(\bigsqcup _{\alpha }U_{\alpha }\times G\right)/{\sim }}$ is well-defined. This is known as the fibre bundle construction theorem, and the same process works for any fibre bundle described by transition functions, not just principal bundles or vector bundles.
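As an illustrative sketch, the cocycle condition can be checked numerically for a simple family of transition functions. The example below assumes a two-chart cover with hypothetical labels `"N"` and `"S"`, an overlap parametrised by an angle, and the ${\displaystyle \operatorname {U} (1)}$-valued transition function ${\displaystyle g_{NS}(\theta )=e^{ik\theta }}$. The helper names `make_transitions` and `satisfies_cocycle` are introduced only for this illustration.

```python
import cmath
from itertools import product

def make_transitions(k):
    """Transition functions for an assumed two-chart U(1)-bundle of degree k.

    On the overlap, parametrised by an angle theta, we take
    g_NS(theta) = exp(i*k*theta) and g_SN = g_NS^{-1}.
    """
    def g_NS(theta):
        return cmath.exp(1j * k * theta)

    return {
        ("N", "N"): lambda theta: 1 + 0j,
        ("S", "S"): lambda theta: 1 + 0j,
        ("N", "S"): g_NS,
        ("S", "N"): lambda theta: 1 / g_NS(theta),
    }

def satisfies_cocycle(transitions, labels, sample_points, tol=1e-12):
    """Check g_ab(x) * g_bc(x) == g_ac(x) for all label triples at sample points."""
    for a, b, c in product(labels, repeat=3):
        for x in sample_points:
            lhs = transitions[(a, b)](x) * transitions[(b, c)](x)
            rhs = transitions[(a, c)](x)
            if abs(lhs - rhs) > tol:
                return False
    return True

samples = [2 * cmath.pi * j / 16 for j in range(16)]
labels = ["N", "S"]

good = make_transitions(k=1)
print(satisfies_cocycle(good, labels, samples))

# Replacing g_SN by g_NS (instead of its inverse) breaks the condition,
# so this data would not glue to a well-defined bundle.
bad = dict(good)
bad[("S", "N")] = good[("N", "S")]
print(satisfies_cocycle(bad, labels, samples))
```

With only two charts the triples necessarily repeat a label, so the check reduces to ${\displaystyle g_{\alpha \alpha }=1}$ and ${\displaystyle g_{\alpha \beta }g_{\beta \alpha }=1}$. A cover with genuine triple overlaps would use the same function unchanged.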

Notice that a choice of local section ${\displaystyle s_{\alpha }:U_{\alpha }\to P_{U_{\alpha }}}$ satisfying ${\displaystyle \pi \circ s_{\alpha }=\operatorname {Id} }$ is an equivalent method of specifying a local trivialisation map. Namely, one can define ${\displaystyle \varphi _{\alpha }(p)=(\pi (p),{\tilde {s}}_{\alpha }(p))}$ where ${\displaystyle {\tilde {s}}_{\alpha }(p)\in G}$ is the unique group element such that ${\displaystyle p{\tilde {s}}_{\alpha }(p)^{-1}=s_{\alpha }(\pi (p))}$.

### Vector bundles

A vector bundle ${\displaystyle E}$ over a base ${\displaystyle M}$ with a section ${\displaystyle s}$.

A vector bundle is a triple ${\displaystyle (E,X,\pi )}$ where ${\displaystyle \pi :E\to X}$ is a fibre bundle with fibre given by a vector space ${\displaystyle \mathbb {K} ^{r}}$, where ${\displaystyle \mathbb {K} }$ is the field ${\displaystyle \mathbb {R} }$ or ${\displaystyle \mathbb {C} }$. The number ${\displaystyle r}$ is the rank of the vector bundle. Again one has a local description of a vector bundle in terms of a trivialising open cover. If ${\displaystyle \{U_{\alpha }\}}$ is such a cover, then under the isomorphism

${\displaystyle \varphi _{\alpha }:E_{U_{\alpha }}\to U_{\alpha }\times \mathbb {K} ^{r}}$

one obtains ${\displaystyle r}$ distinguished local sections of ${\displaystyle E}$ corresponding to the ${\displaystyle r}$ coordinate basis vectors ${\displaystyle e_{1},\dots ,e_{r}}$ of ${\displaystyle \mathbb {K} ^{r}}$, denoted ${\displaystyle {\boldsymbol {e}}_{1},\dots ,{\boldsymbol {e}}_{r}}$. These are defined by the equation

${\displaystyle \varphi _{\alpha }({\boldsymbol {e}}_{i}(x))=(x,e_{i}).}$

To specify a trivialisation it is therefore equivalent to give a collection of ${\displaystyle r}$ local sections which are everywhere linearly independent, and use this expression to define the corresponding isomorphism. Such a collection of local sections is called a frame.

Similarly to principal bundles, one obtains transition functions ${\displaystyle g_{\alpha \beta }:U_{\alpha }\cap U_{\beta }\to \operatorname {GL} (r,\mathbb {K} )}$ for a vector bundle, defined by

${\displaystyle \varphi _{\alpha }\circ \varphi _{\beta }^{-1}(x,v)=(x,g_{\alpha \beta }(x)v).}$

If one takes these transition functions and uses them to construct the local trivialisation for a principal bundle with fibre equal to the structure group ${\displaystyle \operatorname {GL} (r,\mathbb {K} )}$, one obtains exactly the frame bundle of ${\displaystyle E}$, a principal ${\displaystyle \operatorname {GL} (r,\mathbb {K} )}$-bundle.

### Associated bundles

Given a principal ${\displaystyle G}$-bundle ${\displaystyle P}$ and a representation ${\displaystyle \rho }$ of ${\displaystyle G}$ on a vector space ${\displaystyle V}$, one can construct an associated vector bundle ${\displaystyle E=P\times _{\rho }V}$ with fibre the vector space ${\displaystyle V}$. To define this vector bundle, one considers the right action on the product ${\displaystyle P\times V}$ defined by ${\displaystyle (p,v)g=(pg,\rho (g^{-1})v)}$ and defines ${\displaystyle P\times _{\rho }V=(P\times V)/G}$ as the quotient space with respect to this action.
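The quotient construction can be made concrete in a toy case. The sketch below assumes the simplest possible setting, which is not taken from the text: the base is a single point, so ${\displaystyle P=G}$, with ${\displaystyle G}$ the symmetric group on three letters acting on ${\displaystyle V=\mathbb {R} ^{3}}$ by permuting coordinates. It checks that ${\displaystyle (p,v)g=(pg,\rho (g^{-1})v)}$ is a right action, and that ${\displaystyle \rho (p)v}$ is constant on orbits, so it can be used to label points of the quotient. All function names are hypothetical.

```python
from itertools import permutations, product

def compose(g, h):
    """Composition of permutations stored as tuples: (g o h)(i) = g[h[i]]."""
    return tuple(g[h[i]] for i in range(len(h)))

def inverse(g):
    """Inverse permutation."""
    inv = [0] * len(g)
    for i, gi in enumerate(g):
        inv[gi] = i
    return tuple(inv)

def rho(g, v):
    """Permutation representation: rho(g) sends the basis vector e_i to e_{g(i)}."""
    w = [0] * len(v)
    for i, vi in enumerate(v):
        w[g[i]] = vi
    return tuple(w)

def act(point, g):
    """Right action (p, v) . g = (p g, rho(g^{-1}) v) on P x V."""
    p, v = point
    return (compose(p, g), rho(inverse(g), v))

def fibre_coordinate(point):
    """rho(p) v is unchanged by the action, so it labels the class [p, v]."""
    p, v = point
    return rho(p, v)

G = list(permutations(range(3)))
v0 = (10, 20, 30)

# ((p, v) g) h == (p, v)(g h) for all group elements.
is_right_action = all(
    act(act((p, v0), g), h) == act((p, v0), compose(g, h))
    for p, g, h in product(G, repeat=3)
)

# The fibre coordinate is constant along each G-orbit in P x V.
is_orbit_invariant = all(
    fibre_coordinate(act((p, v0), g)) == fibre_coordinate((p, v0))
    for p, g in product(G, repeat=2)
)

print(is_right_action, is_orbit_invariant)
```

A finite group is used here only so that every identity can be checked exactly by enumeration. For a Lie group the same formulas apply, but the check would have to be symbolic or sampled.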

In terms of transition functions the associated bundle can be understood more simply. If the principal bundle ${\displaystyle P}$ has transition functions ${\displaystyle g_{\alpha \beta }}$ with respect to a local trivialisation ${\displaystyle \{U_{\alpha }\}}$, then one constructs the associated vector bundle using the transition functions ${\displaystyle \rho \circ g_{\alpha \beta }:U_{\alpha }\cap U_{\beta }\to \operatorname {GL} (V)}$.

The associated bundle construction can be performed for any fibre space ${\displaystyle F}$, not just a vector space, provided ${\displaystyle \rho :G\to \operatorname {Aut} (F)}$ is a group homomorphism. One key example is the capital A adjoint bundle ${\displaystyle \operatorname {Ad} (P)}$ with fibre ${\displaystyle G}$, constructed using the group homomorphism ${\displaystyle \rho :G\to \operatorname {Aut} (G)}$ defined by conjugation ${\displaystyle g\mapsto (h\mapsto ghg^{-1})}$. Note that despite having fibre ${\displaystyle G}$, the Adjoint bundle is neither a principal bundle nor isomorphic as a fibre bundle to ${\displaystyle P}$ itself. For example, if ${\displaystyle G}$ is Abelian, then the conjugation action is trivial and ${\displaystyle \operatorname {Ad} (P)}$ will be the trivial ${\displaystyle G}$-fibre bundle over ${\displaystyle X}$ regardless of whether or not ${\displaystyle P}$ is trivial as a fibre bundle. Another key example is the lowercase a adjoint bundle ${\displaystyle \operatorname {ad} (P)}$ constructed using the adjoint representation ${\displaystyle \rho :G\to \operatorname {Aut} ({\mathfrak {g}})}$ where ${\displaystyle {\mathfrak {g}}}$ is the Lie algebra of ${\displaystyle G}$.

### Gauge transformations

A gauge transformation of a vector bundle or principal bundle is an automorphism of this object. For a principal bundle, a gauge transformation consists of a diffeomorphism ${\displaystyle \varphi :P\to P}$ commuting with the projection operator ${\displaystyle \pi }$ and the right action ${\displaystyle \rho }$. For a vector bundle a gauge transformation is similarly defined by a diffeomorphism ${\displaystyle \varphi :E\to E}$ commuting with the projection operator ${\displaystyle \pi }$ which is a linear isomorphism of vector spaces on each fibre.

The gauge transformations (of ${\displaystyle P}$ or ${\displaystyle E}$) form a group under composition, called the gauge group, typically denoted ${\displaystyle {\mathcal {G}}}$. This group can be characterised as the space of global sections ${\displaystyle {\mathcal {G}}=\Gamma (\operatorname {Ad} (P))}$ of the adjoint bundle, or ${\displaystyle {\mathcal {G}}=\Gamma (\operatorname {Ad} ({\mathcal {F}}(E)))}$ in the case of a vector bundle, where ${\displaystyle {\mathcal {F}}(E)}$ denotes the frame bundle.

One can also define a local gauge transformation as a local bundle isomorphism over a trivialising open subset ${\displaystyle U_{\alpha }}$. This can be uniquely specified as a map ${\displaystyle g_{\alpha }:U_{\alpha }\to G}$ (taking ${\displaystyle G=\operatorname {GL} (r,\mathbb {K} )}$ in the case of vector bundles), where the induced bundle isomorphism is defined by

${\displaystyle \varphi _{\alpha }(p)=pg_{\alpha }(\pi (p))}$

and similarly for vector bundles.

Notice that given two local trivialisations of a principal bundle over the same open subset ${\displaystyle U_{\alpha }}$, the transition function is precisely a local gauge transformation ${\displaystyle g_{\alpha \alpha }:U_{\alpha }\to G}$. That is, local gauge transformations are changes of local trivialisation for principal bundles or vector bundles.

### Connections on principal bundles

A principal bundle connection is required to be compatible with the right group action of ${\displaystyle G}$ on ${\displaystyle P}$. This can be visualized as the right multiplication ${\displaystyle R_{g}}$ taking the horizontal subspaces into each other. This equivariance of the horizontal subspaces ${\displaystyle H\subset TP}$ interpreted in terms of the connection form ${\displaystyle \omega }$ leads to its characteristic equivariance properties.
A principal bundle connection form ${\displaystyle \omega }$ may be thought of as a projection operator on the tangent bundle ${\displaystyle TP}$ of the principal bundle ${\displaystyle P}$. The kernel of the connection form is given by the horizontal subspaces for the associated Ehresmann connection.

A connection on a principal bundle is a method of connecting nearby fibres so as to capture the notion of a section ${\displaystyle s:X\to P}$ being constant or horizontal. Since the fibres of an abstract principal bundle are not naturally identified with each other, or indeed with the fibre space ${\displaystyle G}$ itself, there is no canonical way of specifying which sections are constant. A choice of local trivialisation leads to one possible choice, where if ${\displaystyle P}$ is trivial over a set ${\displaystyle U_{\alpha }}$, then a local section could be said to be horizontal if it is constant with respect to this trivialisation, in the sense that ${\displaystyle \varphi _{\alpha }(s(x))=(x,g)}$ for all ${\displaystyle x\in U_{\alpha }}$ and one ${\displaystyle g\in G}$. In particular a trivial principal bundle ${\displaystyle P=X\times G}$ comes equipped with a trivial connection.

In general a connection is given by a choice of horizontal subspaces ${\displaystyle H_{p}\subset T_{p}P}$ of the tangent spaces at every point ${\displaystyle p\in P}$, such that at every point one has ${\displaystyle T_{p}P=H_{p}\oplus V_{p}}$ where ${\displaystyle V}$ is the vertical bundle defined by ${\displaystyle V=\ker d\pi }$. These horizontal subspaces must be compatible with the principal bundle structure by requiring that the horizontal distribution ${\displaystyle H}$ is invariant under the right group action: ${\displaystyle H_{pg}=d(R_{g})(H_{p})}$ where ${\displaystyle R_{g}:P\to P}$ denotes right multiplication by ${\displaystyle g}$. A section ${\displaystyle s}$ is said to be horizontal if ${\displaystyle T_{p}s\subset H_{p}}$ where ${\displaystyle s}$ is identified with its image inside ${\displaystyle P}$, which is a submanifold of ${\displaystyle P}$ with tangent bundle ${\displaystyle Ts}$. Given a vector field ${\displaystyle v\in \Gamma (TX)}$, there is a unique horizontal lift ${\displaystyle v^{\#}\in \Gamma (H)}$. The curvature of the connection ${\displaystyle H}$ is given by the two-form with values in the adjoint bundle ${\displaystyle F\in \Omega ^{2}(X,\operatorname {ad} (P))}$ defined by

${\displaystyle F(v_{1},v_{2})=[v_{1}^{\#},v_{2}^{\#}]-[v_{1},v_{2}]^{\#}}$

where ${\displaystyle [\cdot ,\cdot ]}$ is the Lie bracket of vector fields. Since the vertical bundle consists of the tangent spaces to the fibres of ${\displaystyle P}$ and these fibres are isomorphic to the Lie group ${\displaystyle G}$ whose tangent bundle is canonically identified with ${\displaystyle TG=G\times {\mathfrak {g}}}$, there is a unique Lie algebra-valued two-form ${\displaystyle F\in \Omega ^{2}(P,{\mathfrak {g}})}$ corresponding to the curvature. From the perspective of the Frobenius integrability theorem, the curvature measures precisely the extent to which the horizontal distribution fails to be integrable, and therefore the extent to which ${\displaystyle X}$ fails to embed inside ${\displaystyle P}$ as a horizontal submanifold locally.

The choice of horizontal subspaces may be equivalently expressed by a projection operator ${\displaystyle \nu :TP\to V}$ which is equivariant in the correct sense, called the connection one-form. For a horizontal distribution ${\displaystyle H}$, this is defined by ${\displaystyle \nu _{H}(h+v)=v}$ where ${\displaystyle h+v}$ denotes the decomposition of a tangent vector with respect to the direct sum decomposition ${\displaystyle TP=H\oplus V}$. Due to the equivariance, this projection one-form may be taken to be Lie algebra-valued, giving some ${\displaystyle \nu \in \Omega ^{1}(P,{\mathfrak {g}})}$.

A local trivialisation for ${\displaystyle P}$ is equivalently given by a local section ${\displaystyle s_{\alpha }:U_{\alpha }\to P_{U_{\alpha }}}$ and the connection one-form and curvature can be pulled back along this smooth map. This gives the local connection one-form ${\displaystyle A_{\alpha }=s_{\alpha }^{*}\nu \in \Omega ^{1}(U_{\alpha },\operatorname {ad} (P))}$ which takes values in the adjoint bundle of ${\displaystyle P}$. Cartan's structure equation says that the curvature may be expressed in terms of the local one-form ${\displaystyle A_{\alpha }}$ by the expression

${\displaystyle F=dA_{\alpha }+{\frac {1}{2}}[A_{\alpha },A_{\alpha }]}$

where we use the Lie bracket on the Lie algebra bundle ${\displaystyle \operatorname {ad} (P)}$ which is identified with ${\displaystyle U_{\alpha }\times {\mathfrak {g}}}$ on the local trivialisation ${\displaystyle U_{\alpha }}$.

Under a local gauge transformation ${\displaystyle g:U_{\alpha }\to G}$ so that ${\displaystyle {\tilde {A}}_{\alpha }=(g\circ s)^{*}\nu }$, the local connection one-form transforms by the expression

${\displaystyle {\tilde {A}}_{\alpha }=\operatorname {Ad} (g)\circ A_{\alpha }+(g^{-1})^{*}\theta }$

where ${\displaystyle \theta }$ denotes the Maurer–Cartan form of the Lie group ${\displaystyle G}$. In the case where ${\displaystyle G}$ is a matrix Lie group, one has the simpler expression ${\displaystyle {\tilde {A}}_{\alpha }=gA_{\alpha }g^{-1}-(dg)g^{-1}.}$
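In the abelian case ${\displaystyle G=\operatorname {U} (1)}$ the matrix formula simplifies, and its effect on the curvature can be checked exactly. The sketch below assumes a base ${\displaystyle \mathbb {R} ^{2}}$ with coordinates ${\displaystyle (x,y)}$, writes ${\displaystyle A=i(a_{x}\,dx+a_{y}\,dy)}$ and ${\displaystyle g=e^{i\lambda }}$ with polynomial ${\displaystyle a_{x},a_{y},\lambda }$, so that the transformed potential is ${\displaystyle a-d\lambda }$ and the curvature is ${\displaystyle i(\partial _{x}a_{y}-\partial _{y}a_{x})\,dx\wedge dy}$. The polynomial encoding and the function names are choices made for this illustration only.

```python
from collections import defaultdict

X, Y = 0, 1  # indices of the two coordinates

def diff(poly, var):
    """Partial derivative of a polynomial {(i, j): c}, meaning sum of c * x^i * y^j."""
    out = defaultdict(int)
    for (i, j), c in poly.items():
        powers = [i, j]
        if powers[var] > 0:
            coeff = c * powers[var]
            powers[var] -= 1
            out[tuple(powers)] += coeff
    return {k: c for k, c in out.items() if c != 0}

def add(p, q, sign=1):
    """Return p + sign * q, dropping zero coefficients."""
    out = defaultdict(int)
    for k, c in p.items():
        out[k] += c
    for k, c in q.items():
        out[k] += sign * c
    return {k: c for k, c in out.items() if c != 0}

def curvature(a_x, a_y):
    """Coefficient of dx ^ dy in d(a_x dx + a_y dy)."""
    return add(diff(a_y, X), diff(a_x, Y), sign=-1)

# Potential a = -y dx + x dy, with constant curvature coefficient 2.
a_x = {(0, 1): -1}
a_y = {(1, 0): 1}

# Gauge parameter lambda = 3 x^2 y + 5 y^3 (an arbitrary choice).
lam = {(2, 1): 3, (0, 3): 5}

# Transformed potential a - d(lambda).
new_a_x = add(a_x, diff(lam, X), sign=-1)
new_a_y = add(a_y, diff(lam, Y), sign=-1)

print(curvature(a_x, a_y))
print(curvature(new_a_x, new_a_y))
```

Both printed dictionaries agree because the extra term is ${\displaystyle d(d\lambda )=0}$. This is the familiar gauge invariance of the electromagnetic field strength, and it is special to abelian structure groups: in general the curvature is only conjugated.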

### Connections on vector bundles

The covariant derivative of a connection on a vector bundle may be recovered from its parallel transport. The values ${\displaystyle s(\gamma (t))}$ of a section ${\displaystyle s\in \Gamma (E)}$ are parallel transported along the path ${\displaystyle \gamma }$ back to ${\displaystyle \gamma (0)=x}$, and then the covariant derivative is taken in the fixed vector space, the fibre ${\displaystyle E_{x}}$ over ${\displaystyle x}$.

A connection on a vector bundle may be specified, as in the case of principal bundles above, by a choice of horizontal subspaces; this description is known as an Ehresmann connection. However, vector bundle connections admit a more powerful description in terms of a differential operator. A connection on a vector bundle is a choice of ${\displaystyle \mathbb {K} }$-linear differential operator

${\displaystyle \nabla :\Gamma (E)\to \Gamma (T^{*}X\otimes E)=\Omega ^{1}(E)}$

such that

${\displaystyle \nabla (fs)=df\otimes s+f\nabla s}$

for all ${\displaystyle f\in C^{\infty }(X)}$ and sections ${\displaystyle s\in \Gamma (E)}$. The covariant derivative of a section ${\displaystyle s}$ in the direction of a vector field ${\displaystyle v}$ is defined by

${\displaystyle \nabla _{v}(s)=\nabla s(v)}$

where on the right we use the natural pairing between ${\displaystyle \Omega ^{1}(X)}$ and ${\displaystyle TX}$. This is a new section of the vector bundle ${\displaystyle E}$, thought of as the derivative of ${\displaystyle s}$ in the direction of ${\displaystyle v}$. The operator ${\displaystyle \nabla _{v}}$ is the covariant derivative operator in the direction of ${\displaystyle v}$. The curvature of ${\displaystyle \nabla }$ is given by the operator ${\displaystyle F_{\nabla }\in \Omega ^{2}(\operatorname {End} (E))}$ with values in the endomorphism bundle, defined by

${\displaystyle F_{\nabla }(v_{1},v_{2})=\nabla _{v_{1}}\nabla _{v_{2}}-\nabla _{v_{2}}\nabla _{v_{1}}-\nabla _{[v_{1},v_{2}]}.}$

In a local trivialisation the exterior derivative ${\displaystyle d}$ acts as a trivial connection (corresponding in the principal bundle picture to the trivial connection discussed above). Namely for a local frame ${\displaystyle {\boldsymbol {e}}_{1},\dots ,{\boldsymbol {e}}_{r}}$ one defines

${\displaystyle d(s^{i}{\boldsymbol {e}}_{i})=ds^{i}\otimes {\boldsymbol {e}}_{i}}$

where here we have used Einstein notation for a local section ${\displaystyle s=s^{i}{\boldsymbol {e}}_{i}}$.

Any two connections ${\displaystyle \nabla _{1},\nabla _{2}}$ differ by an ${\displaystyle \operatorname {End} (E)}$-valued one-form ${\displaystyle A}$. To see this, observe that the difference of two connections is ${\displaystyle C^{\infty }(X)}$-linear:

${\displaystyle (\nabla _{1}-\nabla _{2})(fs)=f(\nabla _{1}-\nabla _{2})(s).}$

In particular since every vector bundle admits a connection (using partitions of unity and the local trivial connections), the set of connections on a vector bundle has the structure of an infinite-dimensional affine space modelled on the vector space ${\displaystyle \Omega ^{1}(\operatorname {End} (E))}$. This space is commonly denoted ${\displaystyle {\mathcal {A}}}$.
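The Leibniz rule and the ${\displaystyle C^{\infty }(X)}$-linearity of the difference of two connections can be checked exactly in a minimal setting. The sketch below assumes the trivial rank-one bundle over ${\displaystyle \mathbb {R} }$, polynomial functions and sections, and connections of the form ${\displaystyle \nabla =d+a\,dx}$, recording only the ${\displaystyle dx}$-coefficient. Polynomials are stored as coefficient lists, and all names are hypothetical.

```python
def trim(p):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def neg(p):
    return [-c for c in p]

def mul(p, q):
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def deriv(p):
    """Derivative of c_0 + c_1 x + c_2 x^2 + ..."""
    return trim([i * c for i, c in enumerate(p)][1:])

def connection(a):
    """The operator s -> s' + a s, the dx-coefficient of (d + a dx) s."""
    return lambda s: add(deriv(s), mul(a, s))

f = [1, 2]        # f(x) = 1 + 2x
s = [0, 0, 1]     # s(x) = x^2
nabla1 = connection([3])      # a_1 = 3
nabla2 = connection([0, 1])   # a_2 = x

fs = mul(f, s)

# Leibniz rule: nabla(f s) = f' s + f nabla(s).
leibniz_holds = nabla1(fs) == add(mul(deriv(f), s), mul(f, nabla1(s)))

# The difference of two connections is linear over functions.
difference_is_linear = (
    add(nabla1(fs), neg(nabla2(fs)))
    == mul(f, add(nabla1(s), neg(nabla2(s))))
)

print(leibniz_holds, difference_is_linear)
```

In this setting the difference ${\displaystyle \nabla _{1}-\nabla _{2}}$ is multiplication by ${\displaystyle a_{1}-a_{2}}$, which is the rank-one instance of an ${\displaystyle \operatorname {End} (E)}$-valued one-form.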

Applying this observation locally, every connection over a trivialising subset ${\displaystyle U_{\alpha }}$ differs from the trivial connection ${\displaystyle d}$ by some local connection one-form ${\displaystyle A_{\alpha }\in \Omega ^{1}(U_{\alpha },\operatorname {End} (E))}$, with the property that ${\displaystyle \nabla =d+A_{\alpha }}$ on ${\displaystyle U_{\alpha }}$. In terms of this local connection form, the curvature may be written as

${\displaystyle F_{A}=dA_{\alpha }+A_{\alpha }\wedge A_{\alpha }}$

where the wedge product occurs on the one-form component, and one composes endomorphisms on the endomorphism component. To link back to the theory of principal bundles, notice that ${\displaystyle A\wedge A={\frac {1}{2}}[A,A]}$ where on the right we now perform wedge of one-forms and commutator of endomorphisms.
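The identity ${\displaystyle A\wedge A={\tfrac {1}{2}}[A,A]}$ can be verified directly for a connection form with constant coefficients. The sketch below assumes a local form ${\displaystyle A=A_{x}\,dx+A_{y}\,dy}$ on ${\displaystyle \mathbb {R} ^{2}}$ with ${\displaystyle A_{x}=i\sigma _{1}}$ and ${\displaystyle A_{y}=i\sigma _{2}}$, an arbitrary choice built from the Pauli matrices, and evaluates both sides on the coordinate vector fields ${\displaystyle (\partial _{x},\partial _{y})}$. Since the coefficients are constant, ${\displaystyle dA=0}$ and the curvature reduces to the quadratic term.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def commutator(a, b):
    return sub(matmul(a, b), matmul(b, a))

# Pauli matrices.
sigma1 = [[0, 1], [1, 0]]
sigma2 = [[0, -1j], [1j, 0]]
sigma3 = [[1, 0], [0, -1]]

# Constant coefficients of the assumed connection form A = A_x dx + A_y dy.
A_x = scale(1j, sigma1)
A_y = scale(1j, sigma2)

# (A ^ A)(d_x, d_y): wedge the one-form parts, compose the endomorphisms.
wedge_square = sub(matmul(A_x, A_y), matmul(A_y, A_x))

# (1/2)[A, A](d_x, d_y): wedge the one-form parts, take commutators.
half_bracket = scale(0.5, sub(commutator(A_x, A_y), commutator(A_y, A_x)))

# With dA = 0 the curvature coefficient F_xy equals the quadratic term.
F_xy = wedge_square

print(wedge_square == half_bracket)
print(F_xy == scale(-2j, sigma3))
```

The nonzero result ${\displaystyle F_{xy}=-2i\sigma _{3}}$ shows that, for a non-abelian structure group, a connection with constant local coefficients need not be flat.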

Under a gauge transformation ${\displaystyle u}$ of the vector bundle ${\displaystyle E}$, a connection ${\displaystyle \nabla }$ transforms into a connection ${\displaystyle u\cdot \nabla }$ by the conjugation ${\displaystyle (u\cdot \nabla )_{v}(s)=u(\nabla _{v}(u^{-1}(s)))}$. The difference is given by ${\displaystyle u\cdot \nabla -\nabla =-(\nabla u)u^{-1}}$, where here ${\displaystyle \nabla }$ is acting on the endomorphisms of ${\displaystyle E}$. Under a local gauge transformation ${\displaystyle g}$ one obtains the same expression

${\displaystyle {\tilde {A}}_{\alpha }=gA_{\alpha }g^{-1}-(dg)g^{-1}}$

as in the case of principal bundles.

### Induced connections

A connection on a principal bundle induces connections on associated vector bundles. One way to see this is in terms of the local connection forms described above. Namely, if a principal bundle connection ${\displaystyle H}$ has local connection forms ${\displaystyle A_{\alpha }\in \Omega ^{1}(U_{\alpha },\operatorname {ad} (P))}$, and ${\displaystyle \rho :G\to \operatorname {Aut} (V)}$ is a representation of ${\displaystyle G}$ defining an associated vector bundle ${\displaystyle E=P\times _{\rho }V}$, then the induced local connection one-forms are defined by

${\displaystyle \rho _{*}A_{\alpha }\in \Omega ^{1}(U_{\alpha },\operatorname {End} (E)).}$

Here ${\displaystyle \rho _{*}}$ is the induced Lie algebra homomorphism from ${\displaystyle {\mathfrak {g}}\to \operatorname {End} (V)}$, and we use the fact that this map induces a homomorphism of vector bundles ${\displaystyle \operatorname {ad} (P)\to \operatorname {End} (E)}$.

The induced curvature can be simply defined by

${\displaystyle \rho _{*}F_{A}\in \Omega ^{2}(U_{\alpha },\operatorname {End} (E)).}$

Here one sees how the local expressions for curvature are related for principal bundles and vector bundles, as the Lie bracket on the Lie algebra ${\displaystyle {\mathfrak {g}}}$ is sent to the commutator of endomorphisms of ${\displaystyle \operatorname {End} (V)}$ under the Lie algebra homomorphism ${\displaystyle \rho _{*}}$.
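The bracket-preserving property of ${\displaystyle \rho _{*}}$ can be checked on a standard example. The sketch below assumes ${\displaystyle G=\operatorname {SU} (2)}$ with Lie algebra basis ${\displaystyle T_{k}=-{\tfrac {i}{2}}\sigma _{k}}$, and takes ${\displaystyle \rho }$ to be the adjoint representation on ${\displaystyle \mathbb {R} ^{3}}$, so that ${\displaystyle \rho _{*}(T_{k})=L_{k}}$ with ${\displaystyle (L_{i})_{jk}=-\varepsilon _{ijk}}$. This choice of example is not made in the text. Since ${\displaystyle \rho _{*}}$ is linear, it suffices to confirm that both bases satisfy the same structure relations ${\displaystyle [T_{i},T_{j}]=\varepsilon _{ijk}T_{k}}$ and ${\displaystyle [L_{i},L_{j}]=\varepsilon _{ijk}L_{k}}$.

```python
def eps(i, j, k):
    """Levi-Civita symbol on the indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(ab, ba)]

def structure_sum(i, j, basis):
    """Return sum over k of eps(i, j, k) * basis[k]."""
    n = len(basis[0])
    return [[sum(eps(i, j, k) * basis[k][r][c] for k in range(3))
             for c in range(n)] for r in range(n)]

def satisfies_su2_relations(basis):
    """Check [B_i, B_j] == eps_ijk B_k for all pairs of basis elements."""
    return all(
        commutator(basis[i], basis[j]) == structure_sum(i, j, basis)
        for i in range(3) for j in range(3)
    )

# Basis of su(2): T_k = -(i/2) sigma_k.
sigmas = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
T = [[[-0.5j * x for x in row] for row in s] for s in sigmas]

# Images under the assumed rho_*: (L_i)_{jk} = -eps_{ijk}.
L = [[[-eps(i, j, k) for k in range(3)] for j in range(3)] for i in range(3)]

print(satisfies_su2_relations(T), satisfies_su2_relations(L))
```

Because the two bases obey identical relations, the linear map ${\displaystyle T_{k}\mapsto L_{k}}$ sends the Lie bracket on ${\displaystyle {\mathfrak {su}}(2)}$ to the commutator of endomorphisms of ${\displaystyle \mathbb {R} ^{3}}$, which is what allows ${\displaystyle \rho _{*}}$ to be applied term by term to ${\displaystyle dA_{\alpha }+{\tfrac {1}{2}}[A_{\alpha },A_{\alpha }]}$.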

### Space of connections

The central object of study in mathematical gauge theory is the space of connections on a vector bundle or principal bundle. This is an infinite-dimensional affine space ${\displaystyle {\mathcal {A}}}$ modelled on the vector space ${\displaystyle \Omega ^{1}(X,\operatorname {ad} (P))}$ (or ${\displaystyle \Omega ^{1}(X,\operatorname {End} (E))}$ in the case of vector bundles). Two connections ${\displaystyle A,A'\in {\mathcal {A}}}$ are said to be gauge equivalent if there exists a gauge transformation ${\displaystyle u}$ such that ${\displaystyle A'=u\cdot A}$. Gauge theory is concerned with gauge equivalence classes of connections. In some sense gauge theory is therefore concerned with the properties of the quotient space ${\displaystyle {\mathcal {A}}/{\mathcal {G}}}$, which is in general neither a Hausdorff space nor a smooth manifold.

Many interesting properties of the base manifold ${\displaystyle X}$ can be encoded in the geometry and topology of moduli spaces of connections on principal bundles and vector bundles over ${\displaystyle X}$. Invariants of ${\displaystyle X}$, such as Donaldson invariants or Seiberg–Witten invariants, can be obtained by computing numerical quantities derived from moduli spaces of connections over ${\displaystyle X}$. The most famous application of this idea is Donaldson's theorem, which uses the moduli space of Yang–Mills connections on a principal ${\displaystyle \operatorname {SU} (2)}$-bundle over a simply connected four-manifold ${\displaystyle X}$ to study its intersection form. For this work Donaldson was awarded a Fields Medal.

## Notational conventions

There are various notational conventions used for connections on vector bundles and principal bundles which will be summarised here.

• The letter ${\displaystyle A}$ is the most common symbol used to represent a connection on a vector bundle or principal bundle. It comes from the fact that if one chooses a fixed connection ${\displaystyle \nabla _{0}\in {\mathcal {A}}}$ in the space of all connections, then any other connection may be written ${\displaystyle \nabla =\nabla _{0}+A}$ for some unique one-form ${\displaystyle A\in \Omega ^{1}(X,\operatorname {ad} (P))}$. It also comes from the use of ${\displaystyle A_{\alpha }}$ to denote the local form of the connection on a vector bundle, which in turn comes from the electromagnetic potential ${\displaystyle A}$ in physics. Sometimes the symbol ${\displaystyle \omega }$ is also used to refer to the connection form, usually on a principal bundle, and usually in this case ${\displaystyle \omega }$ refers to the global connection one-form ${\displaystyle \omega \in \Omega ^{1}(P,{\mathfrak {g}})}$ on the total space of the principal bundle, rather than the corresponding local connection forms. This convention is usually avoided in the mathematical literature as it often clashes with the use of ${\displaystyle \omega }$ for a Kähler form when the underlying manifold ${\displaystyle X}$ is a Kähler manifold.
• The symbol ${\displaystyle \nabla }$ is most commonly used to represent a connection on a vector bundle as a differential operator, and in that sense is used interchangeably with the letter ${\displaystyle A}$. It is also used to refer to the covariant derivative operators ${\displaystyle \nabla _{X}}$. Alternative notation for the connection operator and covariant derivative operators is ${\displaystyle \nabla _{A}}$ to emphasize the dependence on the choice of ${\displaystyle A\in {\mathcal {A}}}$, or ${\displaystyle D_{A}}$ or ${\displaystyle d_{A}}$.
• The operator ${\displaystyle d_{A}}$ most commonly refers to the exterior covariant derivative of a connection ${\displaystyle A}$ (and so is sometimes written ${\displaystyle d_{\nabla }}$ for a connection ${\displaystyle \nabla }$). Since the exterior covariant derivative in degree 0 is the same as the regular covariant derivative, the connection or covariant derivative itself is often denoted ${\displaystyle d_{A}}$ instead of ${\displaystyle \nabla }$.
• The symbol ${\displaystyle F_{A}}$ or ${\displaystyle F_{\nabla }}$ is most commonly used to refer to the curvature of a connection. When the connection is referred to by ${\displaystyle \omega }$, the curvature is referred to by ${\displaystyle \Omega }$ rather than ${\displaystyle F_{\omega }}$. Other conventions involve ${\displaystyle R}$ or ${\displaystyle R_{A}}$ or ${\displaystyle R_{\nabla }}$, by analogy with the Riemannian curvature tensor in Riemannian geometry which is denoted by ${\displaystyle R}$.
• The letter ${\displaystyle H}$ is often used to denote a principal bundle connection or Ehresmann connection when emphasis is to be placed on the horizontal distribution ${\displaystyle H\subset TP}$. In this case the vertical projection operator corresponding to ${\displaystyle H}$ (the connection one-form on ${\displaystyle P}$) is usually denoted ${\displaystyle \omega }$, or ${\displaystyle v}$, or ${\displaystyle \nu }$. Using this convention the curvature is sometimes denoted ${\displaystyle F_{H}}$ to emphasize the dependence, and ${\displaystyle F_{H}}$ may refer to either the curvature operator on the total space ${\displaystyle F_{H}\in \Omega ^{2}(P,{\mathfrak {g}})}$, or the curvature on the base ${\displaystyle F_{H}\in \Omega ^{2}(X,\operatorname {ad} (P))}$.
• The Lie algebra adjoint bundle is usually denoted ${\displaystyle \operatorname {ad} (P)}$, and the Lie group adjoint bundle by ${\displaystyle \operatorname {Ad} (P)}$. This disagrees with the convention in the theory of Lie groups, where ${\displaystyle \operatorname {Ad} }$ refers to the representation of ${\displaystyle G}$ on ${\displaystyle {\mathfrak {g}}}$, and ${\displaystyle \operatorname {ad} }$ refers to the Lie algebra representation of ${\displaystyle {\mathfrak {g}}}$ on itself by the Lie bracket. In Lie group theory the conjugation action (which defines the bundle ${\displaystyle \operatorname {Ad} (P)}$) is often denoted by ${\displaystyle \Psi _{g}}$.

### Dictionary of mathematical and physical terminology

The mathematical and physical fields of gauge theory involve the study of the same objects, but use different terminology to describe them. Below is a summary of how these terms relate to each other.

Comparison of concepts in mathematical and physical gauge theory

| Mathematics | Physics |
| --- | --- |
| Principal bundle | Instanton sector or charge sector |
| Structure group | Gauge group or local gauge group |
| Gauge group | Group of global gauge transformations or global gauge group |
| Gauge transformation | Gauge transformation or gauge symmetry |
| Change of local trivialisation | Local gauge transformation |
| Local trivialisation | Gauge |
| Choice of local trivialisation | Fixing a gauge |
| Functional defined on the space of connections | Lagrangian of gauge theory |
| Object does not change under the effects of a gauge transformation | Gauge invariance |
| Gauge transformations that are covariantly constant with respect to the connection | Global gauge symmetry |
| Gauge transformations which are not covariantly constant with respect to the connection | Local gauge symmetry |
| Connection | Gauge field or gauge potential |
| Curvature | Gauge field strength or field strength |
| Induced connection/covariant derivative on associated bundle | Minimal coupling |
| Section of associated vector bundle | Matter field |
| Term in Lagrangian functional involving multiple different quantities (e.g. the covariant derivative applied to a section of an associated bundle, or a multiplication of two terms) | Interaction |
| Section of real or complex (usually trivial) line bundle | (Real or complex) Scalar field |

As a demonstration of this dictionary, consider an interaction term between an electron-positron field and the electromagnetic field in the Lagrangian of quantum electrodynamics: [18]

${\displaystyle {\mathcal {L}}={\bar {\psi }}(i\gamma ^{\mu }D_{\mu }-m)\psi -{\frac {1}{4}}F_{\mu \nu }F^{\mu \nu },}$

Mathematically this might be rewritten

${\displaystyle {\mathcal {L}}=\langle \psi ,({D\!\!\!\!/}_{A}-m)\psi \rangle _{L^{2}}+\|F_{A}\|_{L^{2}}^{2}}$

where ${\displaystyle A}$ is a connection on a principal ${\displaystyle \operatorname {U} (1)}$ bundle ${\displaystyle P}$, ${\displaystyle \psi }$ is a section of an associated spinor bundle and ${\displaystyle {D\!\!\!\!/}_{A}}$ is the induced Dirac operator of the induced covariant derivative ${\displaystyle \nabla _{A}}$ on this associated bundle. The first term is an interacting term in the Lagrangian between the spinor field (the field representing the electron-positron) and the gauge field (representing the electromagnetic field). The second term is the regular Yang–Mills functional which describes the basic non-interacting properties of the electromagnetic field (the connection ${\displaystyle A}$). The term of the form ${\displaystyle \nabla _{A}\psi }$ is an example of what in physics is called minimal coupling, that is, the simplest possible interaction between a matter field ${\displaystyle \psi }$ and a gauge field ${\displaystyle A}$.

## Yang–Mills theory

The predominant theory that occurs in mathematical gauge theory is Yang–Mills theory. This theory involves the study of connections which are critical points of the Yang–Mills functional defined by

${\displaystyle \operatorname {YM} (A)=\int _{X}\|F_{A}\|^{2}\,d\mathrm {vol} _{g}}$

where ${\displaystyle (X,g)}$ is an oriented Riemannian manifold with ${\displaystyle d\mathrm {vol} _{g}}$ the Riemannian volume form and ${\displaystyle \|\cdot \|}$ the pointwise norm on two-forms with values in the adjoint bundle ${\displaystyle \operatorname {ad} (P)}$, induced by ${\displaystyle g}$ and an invariant inner product on the Lie algebra. This functional is the square of the ${\displaystyle L^{2}}$-norm of the curvature of the connection ${\displaystyle A}$, so connections which are critical points of this functional are those with curvature as small as possible (or higher local minima of ${\displaystyle \operatorname {YM} }$).

These critical points are characterised as solutions of the associated Euler–Lagrange equations, the Yang–Mills equations

${\displaystyle d_{A}\star F_{A}=0}$

where ${\displaystyle d_{A}}$ is the induced exterior covariant derivative of ${\displaystyle \nabla _{A}}$ on ${\displaystyle \operatorname {ad} (P)}$ and ${\displaystyle \star }$ is the Hodge star operator. Such solutions are called Yang–Mills connections and are of significant geometric interest.

The Bianchi identity asserts that for any connection, ${\displaystyle d_{A}F_{A}=0}$. By analogy, for differential forms a harmonic form ${\displaystyle \omega }$ is characterised by the condition

${\displaystyle d\star \omega =d\omega =0.}$

If one defined a harmonic connection by the condition that

${\displaystyle d_{A}\star F_{A}=d_{A}F_{A}=0}$

then the study of Yang–Mills connections is similar in nature to that of harmonic forms. Hodge theory provides a unique harmonic representative of every de Rham cohomology class ${\displaystyle [\omega ]}$. Replacing a cohomology class by a gauge orbit ${\displaystyle \{u\cdot A\mid u\in {\mathcal {G}}\}}$, the study of Yang–Mills connections can be seen as trying to find unique representatives for each orbit in the quotient space ${\displaystyle {\mathcal {A}}/{\mathcal {G}}}$ of connections modulo gauge transformations.

### Self-duality and anti-self-duality equations

In dimension four the Hodge star operator sends two-forms to two-forms, ${\displaystyle \star :\Omega ^{2}(X)\to \Omega ^{2}(X)}$, and squares to the identity operator, ${\displaystyle \star ^{2}=\operatorname {Id} }$. Thus the Hodge star operating on two-forms has eigenvalues ${\displaystyle \pm 1}$, and the two-forms on an oriented Riemannian four-manifold split as a direct sum

${\displaystyle \Omega ^{2}(X)=\Omega _{+}(X)\oplus \Omega _{-}(X)}$

into the self-dual and anti-self-dual two-forms, given by the ${\displaystyle +1}$ and ${\displaystyle -1}$ eigenspaces of the Hodge star operator respectively. That is, ${\displaystyle \alpha \in \Omega ^{2}(X)}$ is self-dual if ${\displaystyle \star \alpha =\alpha }$, and anti-self dual if ${\displaystyle \star \alpha =-\alpha }$, and every differential two-form admits a splitting ${\displaystyle \alpha =\alpha _{+}+\alpha _{-}}$ into self-dual and anti-self-dual parts.
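The splitting can be made concrete on flat ${\displaystyle \mathbb {R} ^{4}}$ with its standard orientation. The following sketch (the basis ordering and variable names are illustrative choices) builds the Hodge star on constant two-forms as a 6×6 matrix in the basis ${\displaystyle dx^{i}\wedge dx^{j}}$, ${\displaystyle i<j}$, and checks that it squares to the identity, has eigenvalues ${\displaystyle \pm 1}$ with three-dimensional eigenspaces, and yields the projections ${\displaystyle \alpha _{\pm }={\tfrac {1}{2}}(\alpha \pm \star \alpha )}$.

```python
import itertools
import numpy as np

def perm_sign(p):
    """Sign of a permutation given as a tuple of distinct integers."""
    sign = 1
    for i in range(len(p)):
        for j in range(i + 1, len(p)):
            if p[i] > p[j]:
                sign = -sign
    return sign

# Basis of two-forms on R^4: dx^i ^ dx^j with i < j (indices 0..3).
pairs = list(itertools.combinations(range(4), 2))

# star(dx^i ^ dx^j) = sign(i, j, k, l) dx^k ^ dx^l, with {k, l} the complementary indices.
star = np.zeros((6, 6))
for col, (i, j) in enumerate(pairs):
    k, l = [m for m in range(4) if m not in (i, j)]
    star[pairs.index((k, l)), col] = perm_sign((i, j, k, l))

eigenvalues = np.linalg.eigvalsh(star)  # star is a symmetric matrix

# Split an arbitrary two-form into self-dual and anti-self-dual parts.
alpha = np.random.default_rng(0).normal(size=6)
alpha_plus = 0.5 * (alpha + star @ alpha)
alpha_minus = 0.5 * (alpha - star @ alpha)
```

Here `star @ alpha_plus` equals `alpha_plus` and `star @ alpha_minus` equals `-alpha_minus`, which is the eigenspace decomposition described above in this flat model case.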

If the curvature of a connection ${\displaystyle A}$ on a principal bundle over a four-manifold is self-dual or anti-self-dual then by the Bianchi identity ${\displaystyle d_{A}\star F_{A}=\pm d_{A}F_{A}=0}$, so the connection is automatically a Yang–Mills connection. The equation

${\displaystyle \star F_{A}=\pm F_{A}}$

is a first order partial differential equation for the connection ${\displaystyle A}$, and therefore is simpler to study than the full second order Yang–Mills equation. The equation ${\displaystyle \star F_{A}=F_{A}}$ is called the self-duality equation, and the equation ${\displaystyle \star F_{A}=-F_{A}}$ is called the anti-self-duality equation, and solutions to these equations are self-dual connections or anti-self-dual connections respectively.

### Dimensional reduction

One way to derive new and interesting gauge-theoretic equations is to apply the process of dimensional reduction to the Yang–Mills equations. This process involves taking the Yang–Mills equations over a manifold ${\displaystyle X}$ (usually taken to be the Euclidean space ${\displaystyle X=\mathbb {R} ^{4}}$), and imposing that the solutions of the equations be invariant under a group of translational or other symmetries. Through this process the Yang–Mills equations lead to the Bogomolny equations describing monopoles on ${\displaystyle \mathbb {R} ^{3}}$, Hitchin's equations describing Higgs bundles on Riemann surfaces, and the Nahm equations on real intervals, by imposing symmetry under translations in one, two, and three directions respectively.

## Gauge theory in one and two dimensions

Here the Yang–Mills equations are discussed in the case where the base manifold ${\displaystyle X}$ is of low dimension. In this setting the equations simplify dramatically due to the fact that in dimension one there are no two-forms, and in dimension two the Hodge star operator on two-forms acts as ${\displaystyle \star :\Omega ^{2}(X)\to C^{\infty }(X)}$.

### Yang–Mills theory

One may study the Yang–Mills equations directly on a manifold of dimension two. The theory of Yang–Mills equations when the base manifold is a compact Riemann surface was carried out by Michael Atiyah and Raoul Bott. [6] In this case the moduli space of Yang–Mills connections on a complex vector bundle ${\displaystyle E}$ admits various rich interpretations, and the theory serves as the simplest case to understand the equations in higher dimensions. The Yang–Mills equations in this case become

${\displaystyle \star F_{A}=\lambda (E)\operatorname {Id} _{E}}$

for some topological constant ${\displaystyle \lambda (E)\in \mathbb {C} }$ depending on ${\displaystyle E}$. Such connections are called projectively flat, and in the case where the vector bundle is topologically trivial (so ${\displaystyle \lambda (E)=0}$) they are precisely the flat connections.

When the rank and degree of the vector bundle are coprime, the moduli space ${\displaystyle {\mathcal {M}}}$ of Yang–Mills connections is smooth and has a natural structure of a symplectic manifold. Atiyah and Bott observed that since the Yang–Mills connections are projectively flat, their holonomy gives projective unitary representations of the fundamental group of the surface, so that this space has an equivalent description as a moduli space of projective unitary representations of the fundamental group of the Riemann surface, a character variety. The theorem of Narasimhan and Seshadri gives an alternative description of this space of representations as the moduli space of stable holomorphic vector bundles which are smoothly isomorphic to ${\displaystyle E}$. [19] Through this isomorphism the moduli space of Yang–Mills connections gains a complex structure, which interacts with the symplectic structure of Atiyah and Bott to make it a compact Kähler manifold.

Simon Donaldson gave an alternative proof of the theorem of Narasimhan and Seshadri that directly passed from Yang–Mills connections to stable holomorphic structures. [20] Atiyah and Bott used this rephrasing of the problem to illuminate the intimate relationship between the extremal Yang–Mills connections and the stability of the vector bundles, as an infinite-dimensional moment map for the action of the gauge group ${\displaystyle {\mathcal {G}}}$, given by the curvature map ${\displaystyle A\mapsto F_{A}}$ itself. This observation phrases the Narasimhan–Seshadri theorem as a kind of infinite-dimensional version of the Kempf–Ness theorem from geometric invariant theory, relating critical points of the norm squared of the moment map (in this case Yang–Mills connections) to stable points on the corresponding algebraic quotient (in this case stable holomorphic vector bundles). This idea has been subsequently very influential in gauge theory and complex geometry since its introduction.

### Nahm equations

The Nahm equations, introduced by Werner Nahm, are obtained as the dimensional reduction of the anti-self-duality equations in four dimensions to one dimension, by imposing translational invariance in three directions. [21] Concretely, one requires that the connection form ${\displaystyle A=A_{0}\,dx^{0}+A_{1}\,dx^{1}+A_{2}\,dx^{2}+A_{3}\,dx^{3}}$ does not depend on the coordinates ${\displaystyle x^{1},x^{2},x^{3}}$. In this setting the Nahm equations become a system of equations on an interval ${\displaystyle I\subset \mathbb {R} }$ for four matrix-valued functions ${\displaystyle T_{0},T_{1},T_{2},T_{3}\in C^{\infty }(I,{\mathfrak {g}})}$ satisfying the triple of equations

${\displaystyle {\begin{cases}{\frac {dT_{1}}{dt}}+[T_{0},T_{1}]+[T_{2},T_{3}]=0\\{\frac {dT_{2}}{dt}}+[T_{0},T_{2}]+[T_{3},T_{1}]=0\\{\frac {dT_{3}}{dt}}+[T_{0},T_{3}]+[T_{1},T_{2}]=0.\end{cases}}}$

It was shown by Nahm that the solutions to these equations (which can be obtained fairly easily as they are a system of ordinary differential equations) can be used to construct solutions to the Bogomolny equations, which describe monopoles on ${\displaystyle \mathbb {R} ^{3}}$. Nigel Hitchin showed that solutions to the Bogomolny equations could be used to construct solutions to the Nahm equations, showing solutions to the two problems were equivalent. [22] Donaldson further showed that solutions to the Nahm equations are equivalent to rational maps of degree ${\displaystyle k}$ from the complex projective line ${\displaystyle \mathbb {CP} ^{1}}$ to itself, where ${\displaystyle k}$ is the charge of the corresponding magnetic monopole. [23]

The moduli space of solutions to the Nahm equations has the structure of a hyperkähler manifold.
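Because the Nahm equations are ordinary differential equations, candidate solutions are easy to check numerically. The sketch below tests one elementary ${\displaystyle {\mathfrak {su}}(2)}$-valued candidate, ${\displaystyle T_{0}=0}$ and ${\displaystyle T_{j}(t)=e_{j}/t}$ with ${\displaystyle e_{j}=-{\tfrac {i}{2}}\sigma _{j}}$ built from the Pauli matrices. This choice of candidate, the sample times, and the finite-difference step are assumptions made for illustration; the residuals are the left-hand sides of the three equations displayed above.

```python
import numpy as np

# Pauli matrices and the basis e_j = -i * sigma_j / 2 of su(2),
# which satisfies [e_1, e_2] = e_3 and its cyclic permutations.
sigma = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
e = [-0.5j * s for s in sigma]

def comm(a, b):
    """Matrix commutator [a, b]."""
    return a @ b - b @ a

def T(t):
    """Candidate Nahm data (T0, T1, T2, T3) with T0 = 0 and Tj = e_j / t, for t > 0."""
    return [np.zeros((2, 2), dtype=complex)] + [ej / t for ej in e]

def nahm_residuals(T, t, h=1e-5):
    """Left-hand sides of the three Nahm equations at time t (central differences for dTj/dt)."""
    T0, T1, T2, T3 = T(t)
    dT = [(p - m) / (2 * h) for p, m in zip(T(t + h), T(t - h))]
    return [dT[1] + comm(T0, T1) + comm(T2, T3),
            dT[2] + comm(T0, T2) + comm(T3, T1),
            dT[3] + comm(T0, T3) + comm(T1, T2)]

max_residual = max(np.abs(r).max()
                   for t in (0.5, 1.0, 2.0)
                   for r in nahm_residuals(T, t))
print(max_residual)  # should be at the level of finite-difference error
```

The candidate works because ${\displaystyle {\tfrac {d}{dt}}(1/t)=-1/t^{2}}$ cancels the commutator term ${\displaystyle [e_{2},e_{3}]/t^{2}=e_{1}/t^{2}}$, and similarly for the other two equations.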

### Hitchin's equations and Higgs bundles

Hitchin's equations, introduced by Nigel Hitchin, are obtained as the dimensional reduction of the self-duality equations in four dimensions to two dimensions by imposing translation invariance in two directions. [24] In this setting the two extra connection form components ${\displaystyle A_{3}\,dx^{3}+A_{4}\,dx^{4}}$ can be combined into a single complex-valued endomorphism ${\displaystyle \Phi =A_{3}+iA_{4}}$, and when phrased in this way the equations become conformally invariant and therefore are natural to study on a compact Riemann surface rather than ${\displaystyle \mathbb {R} ^{2}}$. Hitchin's equations state that a pair ${\displaystyle (A,\Phi )}$ on a complex vector bundle ${\displaystyle E\to \Sigma }$, where ${\displaystyle \Phi \in \Omega ^{1,0}(\Sigma ,\operatorname {End} (E))}$, satisfies

${\displaystyle {\begin{cases}F_{A}+[\Phi ,\Phi ^{*}]=0\\{\bar {\partial }}_{A}\Phi =0\end{cases}}}$

where ${\displaystyle {\bar {\partial }}_{A}}$ is the ${\displaystyle (0,1)}$-component of ${\displaystyle d_{A}}$. Solutions of Hitchin's equations are called Hitchin pairs.

Whereas solutions to the Yang–Mills equations on a compact Riemann surface correspond to projective unitary representations of the surface group, Hitchin showed that solutions to Hitchin's equations correspond to projective complex representations of the surface group. The moduli space of Hitchin pairs naturally has (when the rank and degree of the bundle are coprime) the structure of a Kähler manifold. Through an analogue of Atiyah and Bott's observation about the Yang–Mills equations, Hitchin showed that Hitchin pairs correspond to so-called stable Higgs bundles, where a Higgs bundle is a pair ${\displaystyle (E,\Phi )}$ where ${\displaystyle E\to \Sigma }$ is a holomorphic vector bundle and ${\displaystyle \Phi :E\to E\otimes K}$ is a holomorphic endomorphism of ${\displaystyle E}$ with values in the canonical bundle of the Riemann surface ${\displaystyle \Sigma }$. This is shown through an infinite-dimensional moment map construction, and this moduli space of Higgs bundles also has a complex structure, which is different to that coming from the Hitchin pairs, leading to two complex structures on the moduli space ${\displaystyle {\mathcal {M}}}$ of Higgs bundles. These combine to give a third making this moduli space a hyperkähler manifold.

Hitchin's work was subsequently vastly generalised by Carlos Simpson, and the correspondence between solutions to Hitchin's equations and Higgs bundles over an arbitrary Kähler manifold is known as the nonabelian Hodge theorem. [25] [26] [27] [28] [29]

## Gauge theory in three dimensions

### Monopoles

The dimensional reduction of the Yang–Mills equations to three dimensions by imposing translational invariance in one direction gives rise to the Bogomolny equations for a pair ${\displaystyle (A,\Phi )}$ where ${\displaystyle \Phi :\mathbb {R} ^{3}\to {\mathfrak {g}}}$ is a family of matrices. [30] The equations are

${\displaystyle F_{A}=\star d_{A}\Phi .}$

When the principal bundle ${\displaystyle P\to \mathbb {R} ^{3}}$ has structure group ${\displaystyle \operatorname {U} (1)}$ the circle group, solutions to the Bogomolny equations model the Dirac monopole describing a magnetic monopole in classical electromagnetism. The work of Nahm and Hitchin shows that when the structure group is the special unitary group ${\displaystyle \operatorname {SU} (2)}$ solutions to the monopole equations correspond to solutions to the Nahm equations, and by work of Donaldson these further correspond to rational maps from ${\displaystyle \mathbb {CP} ^{1}}$ to itself of degree ${\displaystyle k}$ where ${\displaystyle k}$ is the charge of the monopole. This charge is defined as the limit

${\displaystyle \lim _{R\to \infty }\int _{S_{R}}(\Phi ,F_{A})=4\pi k}$

of the integral of the pairing ${\displaystyle (\Phi ,F_{A})\in \Omega ^{2}(\mathbb {R} ^{3})}$ over spheres ${\displaystyle S_{R}}$ in ${\displaystyle \mathbb {R} ^{3}}$ of increasing radius ${\displaystyle R}$.
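In the abelian case ${\displaystyle \operatorname {U} (1)}$ the adjoint bundle is trivial, so ${\displaystyle d_{A}\Phi =d\Phi }$ and the Bianchi identity applied to ${\displaystyle F_{A}=\star d\Phi }$ forces ${\displaystyle d\star d\Phi =0}$: the Higgs field is harmonic away from its singularities. The sketch below checks this numerically for the assumed profile ${\displaystyle \Phi =k/(2r)}$ (the normalisation and sign are conventions chosen here for illustration), and computes the flux of ${\displaystyle \star d\Phi }$ through spheres by identifying it with the flux of the gradient of ${\displaystyle \Phi }$.

```python
import numpy as np

k = 1.0  # hypothetical charge parameter

def phi(x, y, z):
    """Assumed Higgs field profile k / (2 r), singular at the origin."""
    return k / (2.0 * np.sqrt(x * x + y * y + z * z))

def laplacian(f, p, h=1e-3):
    """Second-order central-difference Laplacian of f at the point p."""
    x, y, z = p
    return (f(x + h, y, z) + f(x - h, y, z)
            + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h) - 6.0 * f(x, y, z)) / h**2

def flux(R, n=400, h=1e-5):
    """Flux of grad(phi) through the sphere of radius R (midpoint rule in both angles)."""
    theta = (np.arange(n) + 0.5) * np.pi / n        # polar angle in (0, pi)
    varphi = (np.arange(2 * n) + 0.5) * np.pi / n   # azimuthal angle in (0, 2 pi)
    th, ph = np.meshgrid(theta, varphi, indexing="ij")
    nx, ny, nz = np.sin(th) * np.cos(ph), np.sin(th) * np.sin(ph), np.cos(th)
    # Derivative of phi along the outward unit normal, by central differences.
    dphi_dr = (phi((R + h) * nx, (R + h) * ny, (R + h) * nz)
               - phi((R - h) * nx, (R - h) * ny, (R - h) * nz)) / (2 * h)
    area = R**2 * np.sin(th) * (np.pi / n) ** 2
    return np.sum(dphi_dr * area)

lap = laplacian(phi, (1.0, 2.0, 2.0))
flux_small, flux_large = flux(1.0), flux(3.0)
```

With this normalisation the flux is independent of the radius and has magnitude ${\displaystyle 2\pi k}$, so it detects the singularity at the origin in the same way that the limit above detects the charge.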

### Chern–Simons theory

Chern–Simons theory in 3 dimensions is a topological quantum field theory with an action functional proportional to the integral of the Chern–Simons form, a three-form defined by

${\displaystyle \operatorname {Tr} (F_{A}\wedge A-{\frac {1}{3}}A\wedge A\wedge A).}$
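The three-form above is often written instead as ${\displaystyle \operatorname {Tr} (A\wedge dA+{\tfrac {2}{3}}A\wedge A\wedge A)}$. The following sketch checks that the two expressions agree pointwise, assuming the convention ${\displaystyle F_{A}=dA+A\wedge A}$. The random matrices stand in for the values of the components ${\displaystyle A_{i}}$ and their partial derivatives ${\displaystyle \partial _{i}A_{j}}$ at a single point of a coordinate chart; they are illustrative data rather than an actual connection.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 3  # matrix size; the identity is algebraic and holds for any n

# Levi-Civita symbol in three dimensions.
eps = np.array([[[(i - j) * (j - k) * (k - i) / 2 for k in range(3)]
                 for j in range(3)] for i in range(3)])

def rand(*shape):
    """Random complex array used as stand-in data at a point."""
    return rng.normal(size=shape) + 1j * rng.normal(size=shape)

A = rand(3, n, n)      # A[i] is the matrix coefficient of dx^i
dA = rand(3, 3, n, n)  # dA[i, j] plays the role of the partial derivative d_i A_j

# Curvature components F_ij = d_i A_j - d_j A_i + [A_i, A_j].
AA = np.einsum("iab,jbc->ijac", A, A)
F = dA - dA.transpose(1, 0, 2, 3) + AA - AA.transpose(1, 0, 2, 3)

# Coefficients of dx^1 ^ dx^2 ^ dx^3 in each trace expression.
cubic = np.einsum("ijk,iab,jbc,kca->", eps, A, A, A)       # Tr(A ^ A ^ A)
lhs = 0.5 * np.einsum("ijk,ijab,kba->", eps, F, A) - cubic / 3.0
rhs = np.einsum("ijk,iab,jkba->", eps, A, dA) + 2.0 * cubic / 3.0
print(abs(lhs - rhs))  # difference between the two forms of the Chern-Simons density
```

The agreement follows from cyclicity of the trace: ${\displaystyle \operatorname {Tr} (dA\wedge A)=\operatorname {Tr} (A\wedge dA)}$, and the ${\displaystyle A\wedge A}$ part of the curvature contributes one full copy of the cubic term.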

Classical solutions to the Euler–Lagrange equations of the Chern–Simons functional on a closed 3-manifold ${\displaystyle X}$ correspond to flat connections on the principal ${\displaystyle G}$-bundle ${\displaystyle P\to X}$. However, when ${\displaystyle X}$ has a boundary the situation becomes more complicated. Chern–Simons theory was used by Edward Witten to express the Jones polynomial, a knot invariant, in terms of the vacuum expectation value of a Wilson loop in ${\displaystyle \operatorname {SU} (2)}$ Chern–Simons theory on the three-sphere ${\displaystyle S^{3}}$. [10] This was a stark demonstration of the power of gauge theoretic problems to provide new insight in topology, and was one of the first instances of a topological quantum field theory.

In the quantization of the classical Chern–Simons theory, one studies the induced flat or projectively flat connections on the principal bundle restricted to surfaces ${\displaystyle \Sigma \subset X}$ inside the 3-manifold. The classical state spaces corresponding to each surface are precisely the moduli spaces of Yang–Mills connections studied by Atiyah and Bott. [6] The geometric quantization of these spaces was achieved by Nigel Hitchin and Axelrod–Della Pietra–Witten independently, and in the case where the structure group is complex, the configuration space is the moduli space of Higgs bundles and its quantization was achieved by Witten. [31] [32] [33]

### Floer homology

Andreas Floer introduced a type of homology on 3-manifolds defined in analogy with Morse homology in finite dimensions. [34] In this homology theory, the Morse function is the Chern–Simons functional on the space of connections on an ${\displaystyle \operatorname {SU} (2)}$ principal bundle over the 3-manifold ${\displaystyle X}$. The critical points are the flat connections, and the flow lines are defined to be the Yang–Mills instantons on ${\displaystyle X\times I}$ that restrict to the critical flat connections on the two boundary components. This leads to instanton Floer homology. The Atiyah–Floer conjecture asserts that instanton Floer homology agrees with the Lagrangian intersection Floer homology of the moduli space of flat connections on the surface ${\displaystyle \Sigma \subset X}$ defining a Heegaard splitting of ${\displaystyle X}$, which is symplectic due to the observations of Atiyah and Bott.

In analogy with instanton Floer homology one may define Seiberg–Witten Floer homology where instantons are replaced with solutions of the Seiberg–Witten equations. By work of Clifford Taubes this is known to be isomorphic to embedded contact homology and subsequently Heegaard Floer homology.

## Gauge theory in four dimensions

Gauge theory has been most intensively studied in four dimensions. Here the mathematical study of gauge theory overlaps significantly with its physical origins, as the standard model of particle physics can be thought of as a quantum field theory on a four-dimensional spacetime. The study of gauge theory problems in four dimensions naturally leads to the study of topological quantum field theory. Such theories are physical gauge theories that are insensitive to changes in the Riemannian metric of the underlying four-manifold, and therefore can be used to define topological (or smooth structure) invariants of the manifold.

### Anti-self-duality equations

In four dimensions the Yang–Mills equations admit a simplification to the first order anti-self-duality equations ${\displaystyle \star F_{A}=-F_{A}}$ for a connection ${\displaystyle A}$ on a principal bundle ${\displaystyle P\to X}$ over an oriented Riemannian four-manifold ${\displaystyle X}$. [17] These solutions to the Yang–Mills equations represent the absolute minima of the Yang–Mills functional, and the higher critical points correspond to the solutions ${\displaystyle d_{A}\star F_{A}=0}$ that do not arise from anti-self-dual connections. The moduli space of solutions to the anti-self-duality equations, ${\displaystyle {\mathcal {M}}_{P}}$, allows one to derive useful invariants about the underlying four-manifold.

Cobordism given by moduli space of anti-self-dual connections in Donaldson's theorem

This theory is most effective in the case where ${\displaystyle X}$ is simply connected. For example, in this case Donaldson's theorem asserts that if the four-manifold has negative-definite intersection form, and if the principal bundle has structure group the special unitary group ${\displaystyle \operatorname {SU} (2)}$ and second Chern class ${\displaystyle c_{2}(P)=1}$, then the moduli space ${\displaystyle {\mathcal {M}}_{P}}$ is five-dimensional and gives a cobordism between ${\displaystyle X}$ itself and a disjoint union of ${\displaystyle b_{2}(X)}$ copies of ${\displaystyle \mathbb {CP} ^{2}}$ with its orientation reversed. This implies that the intersection form of such a four-manifold is diagonalisable. There are examples of simply connected topological four-manifolds with non-diagonalisable intersection form, such as the E8 manifold, so Donaldson's theorem implies the existence of topological four-manifolds with no smooth structure. This is in stark contrast with two or three dimensions, in which topological structures and smooth structures are equivalent: any topological manifold of dimension less than or equal to 3 has a unique smooth structure on it.
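To see why the E8 form cannot be diagonalised over the integers, one can inspect its Gram matrix directly. The sketch below builds the E8 Cartan matrix from one labelling of the Dynkin diagram (the node numbering is an arbitrary choice made here) and checks that the form is unimodular, definite, and even. A definite unimodular form that is diagonalisable over the integers has basis vectors of square ${\displaystyle \pm 1}$, which an even form does not allow. The same argument applies to the negative of this matrix, which is the negative-definite case relevant to Donaldson's theorem.

```python
import numpy as np

# Edges of the E8 Dynkin diagram: a chain 0-1-2-3-4-5-6 with node 7 attached to node 2.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 6), (2, 7)]

# Gram matrix of the E8 form: 2 on the diagonal, -1 for each edge.
Q = 2 * np.eye(8, dtype=int)
for i, j in edges:
    Q[i, j] = Q[j, i] = -1

det = int(round(np.linalg.det(Q)))     # unimodular if this is +1 or -1
eigenvalues = np.linalg.eigvalsh(Q)    # all positive for a positive-definite form

# Evaluate the quadratic form x^T Q x on a sample of integer vectors.
rng = np.random.default_rng(0)
x = rng.integers(-5, 6, size=(1000, 8))
norms = np.einsum("ni,ij,nj->n", x, Q, x)
all_even = bool(np.all(norms % 2 == 0))
```

The sampled values are all even because the diagonal entries of `Q` are even and the off-diagonal terms appear twice in the quadratic form, so evenness holds for every integer vector and not only for the sample.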

Similar techniques were used by Clifford Taubes and Donaldson to show that Euclidean space ${\displaystyle \mathbb {R} ^{4}}$ admits uncountably many distinct smooth structures. This is in stark contrast to any dimension other than four, where Euclidean space has a unique smooth structure.

An extension of these ideas leads to Donaldson theory, which constructs further invariants of smooth four-manifolds out of the moduli spaces of connections over them. These invariants are obtained by evaluating cohomology classes on the moduli space against a fundamental class, which exists due to analytical work showing the orientability and compactness of the moduli space by Karen Uhlenbeck, Taubes, and Donaldson.

When the four-manifold is a Kähler manifold or algebraic surface and the principal bundle has vanishing first Chern class, the anti-self-duality equations are equivalent to the Hermitian Yang–Mills equations on the complex manifold ${\displaystyle X}$. The Kobayashi–Hitchin correspondence, proven for algebraic surfaces by Donaldson and in general by Uhlenbeck and Yau, asserts that solutions to the HYM equations correspond to stable holomorphic vector bundles. This work gave an alternate algebraic description of the moduli space and its compactification, because the moduli space of semistable holomorphic vector bundles over a complex manifold is a projective variety, and therefore compact. This indicates that one way of compactifying the moduli space of connections is to add in connections corresponding to semistable vector bundles, so-called almost Hermitian Yang–Mills connections.

### Seiberg–Witten equations

During their investigation of supersymmetry in four dimensions, Edward Witten and Nathan Seiberg uncovered a system of equations now called the Seiberg–Witten equations, for a connection ${\displaystyle A}$ and spinor field ${\displaystyle \psi }$. [11] In this case the four-manifold must admit a ${\displaystyle \operatorname {Spin} ^{c}}$ structure, which defines a principal ${\displaystyle \operatorname {Spin} ^{c}}$ bundle ${\displaystyle P}$ with determinant line bundle ${\displaystyle L}$, and an associated spinor bundle ${\displaystyle S^{+}}$. The connection ${\displaystyle A}$ is on ${\displaystyle L}$, and the spinor field is a section ${\displaystyle \psi \in \Gamma (S^{+})}$. The Seiberg–Witten equations are given by

${\displaystyle {\begin{cases}F_{A}^{+}=\psi \otimes \psi ^{*}-{\frac {1}{2}}|\psi |^{2}\\{D\!\!\!\!/}_{A}\psi =0.\end{cases}}}$

Here ${\displaystyle {D\!\!\!\!/}_{A}}$ is the Dirac operator on ${\displaystyle S^{+}}$ induced by the connection ${\displaystyle A}$. Solutions to the Seiberg–Witten equations are called monopoles. The moduli space of solutions to the Seiberg–Witten equations, ${\displaystyle {\mathcal {M}}_{\sigma }}$ where ${\displaystyle \sigma }$ denotes the choice of ${\displaystyle \operatorname {Spin} ^{c}}$ structure, is used to derive the Seiberg–Witten invariants. The Seiberg–Witten equations have an advantage over the anti-self-duality equations, in that the equations themselves may be perturbed slightly to give the moduli space of solutions better properties. To do this, an arbitrary self-dual two-form is added on to the first equation. For generic choices of metric ${\displaystyle g}$ on the underlying four-manifold, and choice of perturbing two-form, the moduli space of solutions is a compact smooth manifold. In good circumstances (when the manifold ${\displaystyle X}$ is of simple type), this moduli space is zero-dimensional: a finite collection of points. The Seiberg–Witten invariant in this case is simply the number of points in the moduli space. The Seiberg–Witten invariants can be used to prove many of the same results as Donaldson invariants, but often with easier proofs which apply in more generality.

## Gauge theory in higher dimensions

### Hermitian Yang–Mills equations

A particular class of Yang–Mills connections can be studied over Kähler manifolds or Hermitian manifolds. The Hermitian Yang–Mills equations generalise the anti-self-duality equations occurring in four-dimensional Yang–Mills theory to holomorphic vector bundles over Hermitian complex manifolds in any dimension. Let ${\displaystyle E\to X}$ be a holomorphic vector bundle over a compact Kähler manifold ${\displaystyle (X,\omega )}$, and let ${\displaystyle A}$ be a Hermitian connection on ${\displaystyle E}$ with respect to some Hermitian metric ${\displaystyle h}$. The Hermitian Yang–Mills equations are

${\displaystyle {\begin{cases}F_{A}^{0,2}=0\\\Lambda _{\omega }F_{A}=\lambda (E)\operatorname {Id} _{E},\end{cases}}}$

where ${\displaystyle \lambda (E)\in \mathbb {C} }$ is a topological constant depending on ${\displaystyle E}$. These may be viewed either as an equation for the Hermitian connection ${\displaystyle A}$ or for the corresponding Hermitian metric ${\displaystyle h}$ with associated Chern connection ${\displaystyle A}$. In four dimensions the HYM equations are equivalent to the ASD equations. In two dimensions the HYM equations correspond to the Yang–Mills equations considered by Atiyah and Bott. The Kobayashi–Hitchin correspondence asserts that solutions of the HYM equations are in correspondence with polystable holomorphic vector bundles. In the case of compact Riemann surfaces this is the theorem of Narasimhan and Seshadri as proven by Donaldson. For algebraic surfaces it was proven by Donaldson, and in general it was proven by Karen Uhlenbeck and Shing-Tung Yau. [13] [14] This theorem is generalised in the nonabelian Hodge theorem by Simpson, and is in fact a special case of it where the Higgs field of a Higgs bundle ${\displaystyle (E,\Phi )}$ is set to zero. [25]

### Exceptional holonomy instantons

The effectiveness of solutions of the Yang–Mills equations in defining invariants of four-manifolds has led to interest in whether analogous solutions may help distinguish between exceptional holonomy manifolds such as G2 manifolds in dimension 7 and Spin(7) manifolds in dimension 8, as well as related structures such as Calabi–Yau 6-manifolds and nearly Kähler manifolds. [35] [36]

### String theory

New gauge-theoretic problems arise out of superstring theory models. In such models the universe is 10-dimensional, consisting of four dimensions of regular spacetime and a 6-dimensional Calabi–Yau manifold. In such theories the fields which act on strings live on bundles over these higher-dimensional spaces, and one is interested in gauge-theoretic problems relating to them. For example, the limit of the natural field theories in superstring theory as the string radius approaches zero (the so-called large volume limit) on a Calabi–Yau 6-fold is given by the Hermitian Yang–Mills equations on this manifold. Moving away from the large volume limit one obtains the deformed Hermitian Yang–Mills equation, which describes the equations of motion for a D-brane in the B-model of superstring theory. Mirror symmetry predicts that solutions to these equations should correspond to special Lagrangian submanifolds of the mirror dual Calabi–Yau. [37]

## References

1. ^ Yang, C.N. and Mills, R.L., 1954. Conservation of isotopic spin and isotopic gauge invariance. Physical Review, 96(1), p. 191.
2. ^ Atiyah, M.F., Hitchin, N.J. and Singer, I.M., 1977. Deformations of instantons. Proceedings of the National Academy of Sciences, 74(7), pp. 2662–2663.
3. ^ Atiyah, M.F., Hitchin, N.J. and Singer, I.M., 1978. Self-duality in four-dimensional Riemannian geometry. Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences, 362(1711), pp. 425–461.
4. ^ Atiyah, M.F. and Ward, R.S., 1977. Instantons and algebraic geometry. Communications in Mathematical Physics, 55(2), pp. 117–124.
5. ^ Atiyah, M.F., Hitchin, N.J., Drinfeld, V.G. and Manin, Y.I., 1978. Construction of instantons. Physics Letters A, 65(3), pp. 185–187.
6. ^ a b c Atiyah, M.F. and Bott, R., 1983. The Yang–Mills equations over Riemann surfaces. Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences, 308(1505), pp. 523–615.
7. ^ Uhlenbeck, K.K., 1982. Connections with ${\displaystyle L^{p}}$ bounds on curvature. Communications in Mathematical Physics, 83(1), pp. 31–42.
8. ^ Donaldson, S.K., 1983. An application of gauge theory to four-dimensional topology. Journal of Differential Geometry, 18(2), pp. 279–315.
9. ^ Donaldson, S.K., 1990. Polynomial invariants for smooth four-manifolds. Topology, 29(3), pp. 257–315.
10. ^ a b Witten, E., 1989. Quantum field theory and the Jones polynomial. Communications in Mathematical Physics, 121(3), pp. 351–399.
11. ^ a b Witten, Edward (1994), "Monopoles and four-manifolds.", Mathematical Research Letters, 1 (6): 769–796, arXiv:hep-th/9411102, Bibcode:1994MRLet...1..769W, doi:10.4310/MRL.1994.v1.n6.a13, MR 1306021, archived from the original on 2013-06-29
12. ^ Vafa, C. and Witten, E., 1994. A strong coupling test of S-duality. arXiv preprint hep-th/9408074.
13. ^ a b Simon K. Donaldson, Anti self-dual Yang-Mills connections over complex algebraic surfaces and stable vector bundles, Proceedings of the London Mathematical Society (3) 50 (1985), 1-26.
14. ^ a b Karen Uhlenbeck and Shing-Tung Yau, On the existence of Hermitian–Yang-Mills connections in stable vector bundles. Frontiers of the mathematical sciences: 1985 (New York, 1985). Communications on Pure and Applied
15. ^ Hitchin, N.J., 1987. The self-duality equations on a Riemann surface. Proceedings of the London Mathematical Society, 3(1), pp. 59–126.
16. ^ Simpson, Carlos T. Higgs bundles and local systems. Publications Mathématiques de l'IHÉS, Volume 75 (1992), pp. 5–95. http://www.numdam.org/item/PMIHES_1992__75__5_0/
17. ^ a b Donaldson, S.K. and Kronheimer, P.B., 1990. The geometry of four-manifolds. Oxford University Press.
18. ^ Peskin, Michael; Schroeder, Daniel (1995). An introduction to quantum field theory (Reprint ed.). Westview Press. ISBN 978-0201503975.
19. ^ Narasimhan, M.S. and Seshadri, C.S., 1965. Stable and unitary vector bundles on a compact Riemann surface. Annals of Mathematics, pp. 540–567.
20. ^ Donaldson, S.K., 1983. A new proof of a theorem of Narasimhan and Seshadri. Journal of Differential Geometry, 18(2), pp. 269–277.
21. ^ Nahm, W., 1983. All self-dual multimonopoles for arbitrary gauge groups. In Structural elements in particle physics and statistical mechanics (pp. 301–310). Springer, Boston, MA.
22. ^ Hitchin, N.J., 1983. On the construction of monopoles. Communications in Mathematical Physics, 89(2), pp. 145–190.
23. ^ Donaldson, S.K., 1984. Nahm's equations and the classification of monopoles. Communications in Mathematical Physics, 96(3), pp. 387–408.
24. ^ Hitchin, N.J., 1987. The self‐duality equations on a Riemann surface. Proceedings of the London Mathematical Society, 3(1), pp. 59–126.
25. ^ a b Simpson, C.T., 1988. Constructing variations of Hodge structure using Yang-Mills theory and applications to uniformization. Journal of the American Mathematical Society, 1(4), pp. 867–918.
26. ^ Simpson, C.T., 1992. Higgs bundles and local systems. Publications Mathématiques de l'IHÉS, 75, pp. 5–95.
27. ^ Simpson, C.T., 1994. Moduli of representations of the fundamental group of a smooth projective variety I. Publications Mathématiques de l'IHÉS, 79, pp.47–129.
28. ^ Simpson, C.T., 1994. Moduli of representations of the fundamental group of a smooth projective variety II. Publications Mathématiques de l'IHÉS, 80, pp. 5–79. https://doi.org/10.1007/BF02698895
29. ^ Simpson, C., 1996. The Hodge filtration on nonabelian cohomology. arXiv preprint alg-geom/9604005.
30. ^ Atiyah, Michael; Hitchin, Nigel (1988), The geometry and dynamics of magnetic monopoles, M. B. Porter Lectures, Princeton University Press, ISBN 978-0-691-08480-0, MR 0934202
31. ^ Hitchin, N.J., 1990. Flat connections and geometric quantization. Communications in Mathematical Physics, 131(2), pp. 347–380.
32. ^ Axelrod, S., Della Pietra, S. and Witten, E., 1991. Geometric quantization of Chern–Simons gauge theory. Journal of Differential Geometry, 33(3), pp. 787–902.
33. ^ Witten, E., 1991. Quantization of Chern-Simons gauge theory with complex gauge group. Communications in Mathematical Physics, 137(1), pp. 29–66.
34. ^ Floer, A., 1988. An instanton-invariant for 3-manifolds. Communications in Mathematical Physics, 118(2), pp. 215–240.
35. ^ S. K. Donaldson and R. P. Thomas. Gauge theory in higher dimensions. In The Geometric Universe (Oxford, 1996), pages 31–47. Oxford Univ. Press, Oxford, 1998.
36. ^ Simon Donaldson and Ed Segal. Gauge theory in higher dimensions, II. In Surveys in differential geometry. Volume XVI. Geometry of special holonomy and related topics, volume 16 of Surv. Differ. Geom., pages 1–41. Int. Press, Somerville, MA, 2011.
37. ^ Leung, N.C., Yau, S.T. and Zaslow, E., 2000. From special Lagrangian to Hermitian–Yang–Mills via Fourier–Mukai transform. arXiv preprint math/0005118.