One of the pillars of functional analysis is the Hahn-Banach theorem, so it makes sense to dedicate a post to this theorem. On normed spaces, the theorem has a plethora of interesting corollaries, some of which will be stated here. The locally convex spaces are of interest since they are the most rudimentary topological vector spaces on which the the Hahn-Banach theorem can be used to extend continuous linear functionals, and encompasses a sizable chunk of the topological vector spaces one might meet in the wild.
– Locally Convex Topological Vector Spaces –
Locally convex topological vector spaces (LCTVS) can be characterized either as a topological vector space
where the origin has a neighborhood basis consisting of convex open sets
each of which satisfy
\begin{array}{lll}
& \lambda x \in U & \text{for all }\lambda \leq 1 \text{ and all }x\in U & \textbf{(Balanced)}\\
& \bigcup^{\infty}_{t=1}tU = X & & \textbf{(Absorbent)}. \\
\end{array}
(recall that on topological vector spaces, translation is a homeomorphism, by definition, so we only need to specify a local neighborhood basis at one point to determine the topology). Note also that any neighborhood of 0 of a topological vector space must be absorbent, this can be shown since “scaling” is continuous.
Alternatively, one can define it as a topological vector spaces whose topology is “induced” by a family of seminorms, (this will be made precise shortly). Here I will follow the latter route.
Let
be a topological vector space, we say that a map
is a seminorm if for all
and
, it satisfies
\begin{array}{ll}
& \sigma(x) \geq 0 \\
& \sigma(x + y) \leq \sigma(x) + \sigma(y) \\
& \sigma(\lambda x) = |\lambda|\sigma(x)
\end{array}
That is, a seminorm is a norm that might not be “positive definite”, e.i.
.
With
![]()
is a seminorm on ![]()
The topology induced by an arbitrary family of seminorms
is the weak topology induced by the seminorm maps
which is the weakest (or coarsest) topology for which all the seminorms are continuous. Explicitly, it is the topology induced by the neighborhood subbasis elements of the origin given by
![]()
Norms are of course also seminorms, and Banach spaces are LCTVS’s since their topologies are weak topologies induced by a single “seminorm” (an actual norm this time).
Let
be a convex set containing
. The point 0 is said to be an internal point of
if the intersection of every line through 0 with
is a non-trivial interval. Note that this is satisfied if
is an interior point of
, or if
is balanced and absorbent. This is the minimum structure we need to define the following (non-linear) functional, as it ensures the infimum is taken over a non-empty set:
Let
![]()
The Minkowski functionals (sometimes called the Gauge functionals or the support functions for C) bridges the gap between the two definitions of LCTVS’s by assigning to each convex absorbent and balanced neighborhood basis element
of
a functional
, and employing the following fact, which I state here without proof,
If
The following Lemma in addition to its intrinsic value will be of use later on in the proof of the Hahn-Banach theorem.
A linear functional
It is an easy exercise to check that same conclusion holds if
or
are bounded in the above lemma (using the formula
), and the lemma holds for real vector spaces as well.
– A word on metrizability –
Since every metrizable space must be (at the very least) first countable and Hausdorff, Proposition 1 has the consequence that if a LCTVS is metrizable it must be possible to induce it’s topology by a countable family of seminorms that separate points, namely the Minkowski functionals associated with the (countable) neighborhood basis of the origin. The first condition comes from the requirement that the space be first countable (i.e. has a countable neighborhood basis at 0) the second from the fact that it must be Hausdorff. Recall that a family
of seminorms on
is said to separate points if for each distinct
there exists a seminorm
such that
. The important fact worth remembering is that this is actually a sufficient requirement:
A locally convex topological vector space is metrizable if and only if its topology can be induced by a countable family of separating seminorms.
![]()
can be shown to induce the same topology.
So for LCTVS metrizability is equivalent to being Hausdorff and second countable, which is equivalent to the existence of a countable family of separating seminorms which induce the topology. Be warned, some authors define topological vector spaces to be Hausdorff out of the box, so they might not even mention the “separates points” requirement, as that is baked into the definition of topological vector spaces.
Let
![]()
for each compact
in the (countable) collection of compact sets which cover
.
– The Hahn-Banach Theorem –
Now that we got some of the basic theory of LCTVS’s out of the way, let’s proceed to the main topic of this post. Here is the original Hahn-Banach theorem for real vector spaces. A function
on a real vector space
is said to be sublinear if it satisfies
\begin{array}{cl}
&p(u + v) \leq p(u) + p(v), & \text{for all } u, v\in V\\
&p(r v) = rp(v)& \text{for all } r\geq 0.
\end{array}
If
![]()
![]()
for some constant
. Choosing
cleverly one can ensure that
.
Now form the collection of all linear extensions of
dominated by
and order them by “extension”, that is,
if
is an extension of
. The collection is non-empty (by the above) and can be shown to be inductively ordered, hence by Zorn’s lemma it has a maximal element, which must be defined on the whole of
, or else one could extend it further (again by the above argument), which contradicts the maximality assumption.
If the vector space is equipped with a countable (Hamel) basis, it may be tempting to rewrite the above proof as an induction argument on the basis, and hope to get away without relying on any of the choice axiom (in particular the axiom of dependent choices). But unfortunately there is no canonical way to choose the value of
at each step in the iteration, so we would still need the axiom. Be warned that the reliance on the axiom of dependent choices is deemed so innocuous that many authors understate or completely miss it.
The Hahn-Banach theorem was first discovered by Hahn and later independently by Banach. At this stage we have not had any use for the condition that the extension be dominated by a sublinear functional, and it may seem strange that both authors came up with this seemingly arbitrary condition, but there is a method to the madness. First of, note that norms, seminorms and the Minkowski functionals are examples of sublinear functionals, so one can think the sublinear functionals as distilling the essential features of these maps, which ensures that an extension of a continuous linear functional on a normed space is also continuous (see the next corollary). Secondly, Hahn never actually formulated the theorem by means of sublinear functionals at all, but something to the extent of the next corollary. The above theorem is due to Banach. Thirdly, both authors likely knew about a previous result of Helly which showed that continuous linear functionals on normed spaces could be extended to continuous linear functionals on the space obtained by adjoining a single basis vector to the domain. So the theorem should be seen as an attempt to generalize Helly’s result to arbitrary continuous extensions of continuous linear functionals on normed spaces.
The above theorem can be extended to complex vector spaces. First we define a complex sublinear functional on
to be a functional satisfying
![Rendered by QuickLaTeX.com \[ \begin{array}{l} & p(u + v) \leq p(u ) + p(v) & \text{for all $u, v \in V$} \\ & p(\alpha u) = |\alpha| p(u) & \text{for all $\alpha \in \mathbb{C}$ and all $u\in V$} \end{array} \]](https://www.mathblog.realhjelp.no/wp-content/ql-cache/quicklatex.com-1cb6481aa8e8e3376cc2fbce4f3bd819_l3.png)
that is,
is just like a seminorm on
that can attain negative values.
If
![]()
![]()
where
is a real real linear functional on
. We may extend
to a real linear functional
on V such that
(by Theorem 1). The corresponding complex linear functional
does the trick, since for any
, with
![Rendered by QuickLaTeX.com \[\alpha = \begin{cases} \frac{\overline{l(v)}}{|l(v)|} & \text{if } l(v) \neq 0\\ 1 & else \end{cases}\]](https://www.mathblog.realhjelp.no/wp-content/ql-cache/quicklatex.com-f053a8a99b94ae28b137d97ee96c6377_l3.png)
we have that
![]()
The next theorem is sometimes called the analytic version of the Hahn-Banach theorem, or simply just the Hahn-Banach theorem. All vector spaces will henceforth be assumed to be complex.
If
If


(the annihilator of
)
![]()
and check that it is bounded (by 1) then employ Corollary 2.1. The resulting extension does the trick.
If
The first supremum is always attained.
![]()
which concludes the proof.
In the above corollary, if
happens to be a Banach space (that is, it is complete with respect to its norm), the second supremum is attained if and only if the space is reflexive, which just means it is isomorphic to its second dual under the canonical imbedding. This is called James Theorem, and is a deep result of functional analysis which will not be covered here, but it should definitely be mentioned.
– Separation Theorems –
Now that some of the corollaries to the Hahn-Banach theorem for normed spaces have been covered we turn to the case of LCTVS. Here we will need the following analogue to Theorem 1,
Let
![]()
![]()
Using Theorem 1 we may extend
to a (real) linear functional
on
such that
, for all
. Check that this functional has the desired properties.
The preceding “separation” theorem, sometimes called the Hahn-Banach separation theorem, though to the best of my knowledge neither Hahn nor Banach actually stated it, has the following interesting consequence on locally convex spaces
Let
![]()
![]()
By Lemma 2,
is continuous. Now
(since
). This inequality is strict. To see this, recall that earlier we noted that any neighborhood of
of a topological vector space is absorbent, hence there is for any
a scalar
such that
. But then with
such that
, if we pick
such that
, we get
, which implies
and, since
was arbitrary, that
. This contradicts that
was non-trivial so we conclude that
![]()
which concludes the proof.
Let
![]()
![]()
which concludes the proof
If
![]()
![]()
Assume there is a
such that
. Then for any
we have
. But then
for all
, which, since
is finite, implies that
. Hence
. Scaling
by
concludes the proof.
