The theory of hyperfunctions, created by the Japanese school of Mikio Sato, Masaki Kashiwara et al. is one of the many variants of the theory of generalized functions. Unlike the earlier theory of distributions of Schwarz et alt. it is not based on duals of some spaces of smooth functions but rather on boundary values of holomorphic functions. This allows usage of sheaf theory and includes Schwartz distributions as a special case. For example in Schwarz’s theory apart from measures one has functions with discrete support which are linear combinations of finite derivatives of delta functions. Here one can allow things like the exponential of a differential operator applied to a delta function. So in a sense one has distributions of infinite order.
Hyperfunctions are a very useful tool in the study of D-modules, holonomic systems of differential equations, and especially some aspects of symplectic geometry and harmonic analysis that are part of microlocal analysis, especially algebraic microlocalization.
This page is about hyperfunctions of one variable only, for the multiple variable case see hyperfunction of multiple variables?.
We define hyperfunctions and explain some basic properties.
The theory of hyperfunctions of one variable is considerably easier than the one for multiple variables, for this reason we will split the exposition to handle the one dimensional case first, and then generalize to multiple dimensions: hyperfunction of multiple variables?.
The exposition of the one dimensional theory will try to illuminate the following points:
A basic strategy of the exposition will therefore be to stress both similarities and differences of hyperfunctions and distributions (real analytic and smooth setting).
Hyperfunctions require less technical machinery, at least in one dimension, since they can be defined and studied with basic complex analysis only. That will nevertheless make it possible to give a rigorous definitions of otherwise only formal expressions that are often used by engineers and physicists, like $\int_a^b \delta (x) dx$.
Hyperfunctions and their microfunction?s both form flabby sheaves, which is the starting point of a study of singularities of hyperfunctions and the algebraic analysis of systems of differential equations.
Let $I = (a, b) \subseteq \mathbb{R}$ be an open interval, a complex neighbourhood of $I$ is an open set $U \subset \mathbb{C}$ such that $U \bigcap \mathbb{R} = I$. For $V \subset \mathbb{C}$ open let $\mathcal{O}(V)$ be the sheaf of holomorphic functions on $V$.
The set of hyperfunctions on $I$ for a complex neighbourhood $U$ of $I$ is defined to be the quotient
The set $\mathcal{B}(I)$ does not depend on the chosen complex neighbourhood $U$, which is a consequence of Mittag-Leffler's theorem? in the one-dimensional case. The definition easily generalizes to open subsets of $\mathbb{R}$.
Let $U^+ \coloneqq \{u \in U : \operatorname{Im}(u) \gt 0 \}$ and $U^- \coloneqq \{u \in U : \operatorname{Im}(u) \lt 0 \}$, then a nontrivial hyperfunction can be described by two functions $F^+$ holomorphic on $U^+$ and $F^-$ holomorphic on $U^-$ such that there is no function $F$ holomorphic on $U$ that restricts to $F^+$ and $F^-$. In this sense hyperfunctions are “differences” of holomorphic functions on a “boundary” (we did not specify an algebraic object that contains both $F^+$ and $F^-$, so strictly speaking we cannot subtract them).
In the following we will often write $F = (F^+, F^-)$ for a hyperfunction $F$, with $(F^+, F^-)$ being a representative of the equivalence class that is $F$ in $\mathcal{B}$ on some complex neighbourhood. If the algebraic expression of $F^+$ and $F^-$ coincides we can further simplify the notation and simply write $[F]$.
The $\delta$ distribution can be represented by $\delta = [\frac{i}{2 \pi z}]$. We can prove that this equality is simply a restatement of Cauchy’s integral formula after we define the concept of integration for a compactly supported hyperfunction below.
The Heaviside function $H(x)$ has as a representation:
Here we use the main branch of the logarithm that is defined on the complex plane minus the negative real axis, then $log(-z)$ takes the value $log(|z|) - \pi \, i$ on the upper side of the positive real axis and $log(|z|) + \pi \, i$ on the lower side of the positive real axis, while being holomorphic on the negative real axis. This results in
and a singularity in $x=0$.
Let $U \subseteq \mathbb{R}$ be open.
The derivative of a hyperfunction $F = [F^+, F^-]$ is
We immediatly see that the derivative of a hyperfunction is again a hyperfunction and that all hyperfunctions are indefinitly differentiable.
As the derivative of the Heaviside function we obtain:
Note that we can derive this relationship without any resort to dualities of TVS.
(hyperfunctions are a module over analytic functions) Let $\mathcal{A}(U)$ be the algebra of real analytic functions on $U$, then $\mathcal{B}(U)$ is a module over $\mathcal{A}(U)$.
In fact every real analytic function $f$ is naturally an element of $\mathcal{B}(U)$ represented by $[f, 0]$ or $[0, -f]$ on a complex neighbourhood. Given any hyperfunction $F = (F^+, F^-)$ we can define the product by $f F := (f F^+, F^-) = (F^+, -f F^-)$ which can be shown to be independent of the various choices involved (representation of $F$, complex neighbourhood of $U$, domain of the extension of the domain of $f$).
(hyperfunctions form a flabby sheaf) The sets of hyperfunctions of open subsets of $U$ form a flabby sheaf of vector spaces over $U$.
(distributions are a subsheaf) There is a linear injection from the space of distributions $\mathcal{D}'(U)$ to the space of hyperfunctions $\mathcal{B}(U)$. The sheaf of germs of distributions is therefore a subsheaf of the sheaf of germs of hyperfunctions.
There is no product of hyperfunctions that reduces to the ordinary product for real analytic functions and respects the chain rule of differential calculus, just as in the case of distributions. Nevertheless certain hyperfunctions may be multiplied, more on that later.
First let us note that the naive definition of the product of two hyperfunctions $F = [F^+, F^-], G = [G^+, G^-]$ via
is obviously not well defined on equivalence classes. …
In general we cannot speak of the value of a hyperfunction at a certain point, much like we cannot speak of the value of a distribution at a certain point. We say that a hyperfunction $F$ vanishes on an open subset $U'$ of $U$ if it coincides with the zero hyperfunction, which is of course equivalent to stating that there is a representation of $F = (F^+, F^-)$ such that the boundary values of $F^+$ and $F^-$ coincide on $U'$ and therefore define a real analytic function on $U'$.
The support of a hyperfunction $F$ is the complement of the largest open subset of $U$ on which $F$ vanishes in the sense described above.
For any compact $K$ we will denote by $\mathcal{B}_K$ all hyperfunctions whose support is contained in $K$.
Since we defined hyperfunctions for open subsets only, we do not yet know anything about hyperfunctions $\mathcal{B}_K$ with support in a compact subset $K$, but the following theorem tells us that there is no difference:
(characterization of compactly supported hyperfunctions) Let $\mathcal{B}_K$ be all hyperfunctions with support contained in $K \subset \mathbb{R}$ compact and $V$ be any complex neighbourhood of $K$. Then we have the following isomorphism:
Let $a, b \in \mathbb{R}$ with $a \leq b$ and suppose that $F$ is a hyperfunction that is analytical in $a$ and $b$. Then we can choose a complex neighbourhood $V$ of $[a, b]$, paths $\tau^+$ in $V^+$ and $\tau^-$ in $V^-$ from a to b and a representation of $F = (F^+, F^-)$ such that we can define the integral
independently of all the arbitrary choices we made, thanks to the holomorphy of the functions on the right side (and Cauchy’s integral formula for holomorphic functions, of course).
Using the definition of an integral of a hyperfunction above, we can easily prove
for all hyperfunctions $F$ that are analytic in $a$ and $b$. Therefore we just made rigorous sense of the formula
for $a, b \neq 0$ and $H$ the Heaviside function.
Warning: As of this moment this paragraph consists of some handwaving.
A well known structure theorem in the theory of distributions says that every distribution whose support consists of one single point $P$ is in fact a finite linear combination of distributional derivatives of the $\delta$ function at $P$.
As an example of a hyperfunction that is not a distribution we therefore have $F = [\exp{\frac{1}{z}}]$ with $\supp{F} = \{ 0 \}$.
Using the representation of the $\delta$ distribution, we see that
Using this representation, for any real analytic test function $a$ we see that the following identity holds, which motivates the use of the $\delta$ as in distribution theory:
Comparing the Laurent series of $F$ with the representation of $\delta^{(n)}$ we get
with $(-1)! := 1$. Given the structure theorem about distributions cited above, we see that the hyperfunction $[\exp{\frac{1}{z}}]$ cannot be a distribution. There is an analog structure theorem for hyperfunctions, however:
(structure of hyperfunctions supported at a single point) Let $F$ be a hyperfunction supported at the origin, then $F$ has a representation as
where the coefficients satisfy
We include a sketch of the proof of the first statement to illustrate the ease of the use of hyperfunctions.
Let $F$ be a hyperfunction supported at the origin, that means that $F$ is analytic in a complex neighbourhood of the origin (excluding the origin itself). Therefore $F$ has a representation by a Laurent series in a neighbourhood of the origin:
In the representation of $F$ we may remove the holomorphic part of the Laurent series. Now we may insert the representation of the $\delta$ hyperfunctions and get
The last sum is understood to converge in the topology of uniform convergence on compact subsets where the summands are holomorph.
We can introduce a canonical topology on the space $\mathcal{B}_K$ of hyperfunctions with support in a compact $K$ and show that the topological dual is the space $\mathcal{A}_K$ of germs of real analytical functions, that is the inductive limit of the spaces of holomorphic functions on a cofinal set of complex neighbourhoods of $K$, and vice versa.
The algebraic statement is known as Köthe’s (duality) theorem.
This will also hint at the fact that we will not be able to introduce a topology on $\mathcal{B}(\mathbb{R})$ that resembles the situation in the theory of distributions, since its dual space would be something like “the space of compactly supported real analytic functions”.
…
One striking example of the use of hyperfunctions in the theory of differential equations with real analytic coefficients is this result due to Sato:
Let $U \subseteq \mathbb{R}$ be open and $P$ be a linear, finite order differential operator with real analytic coefficients defined on $U$:
with $a_n \neq 0$.
(solvability of differential equations) For every $f \in \mathcal{B}(U)$ there is a solution $u \in \mathcal{B}(U)$ of the equation $P u = f$. Every such solution can be extended to an open set $V \supset U$ iff the coefficients $a_i$ and $f$ can be extended to $V$.
Briefly: $P$ is surjective on hyperfunctions.
Two related results are:
$P$ is a sheaf endomorphismus, both of the sheaf of real analytic functions and of the sheaf of hyperfunctions.
$P$ does not enlarge the support of hyperfunctions.
Much information about a given hyperfunction is encoded in the kind of singularities that it has. A first step into the theory of singularities is the following definition:
The singular support sing supp($F$) of a hyperfunction $F$ is the complement of the largest open set on which $F$ is real analytic.
This definition does not seem particularly useful, since it only pinpoints the singular locus of a hyperfunction, without explaining the differences between various singularities. One question of crucial importance to physics is for example “when can two distributions” be multiplied?
The singular support does not help much: The $\delta$ distribution cannot be squared, while the distribution defined by $[\frac{1}{x + \epsilon i}]$ can. (The latter is a distribution defined as a Cauchy principal value). The singular support of both consists of the origin. But in a certain sense the singularity of the $\delta$ distribution is worse than that of $[\frac{1}{x + \epsilon i}]$. Going a step further requires the notion of wavefront sets in the smooth setting. Note that the definition of a wavefront set needs the concept of a cutoff function, that is a smooth function with compact support, which cannot be used in the real analytic setting.
…
A gentle introduction with examples is the booklet
K. Yosida, Operational calculus. A theory of hyperfunctions, Applied Mathematical Sciences, 55. Springer-Verlag, New York, 1984. x+170 pp.
Mikio Sato, Theory of hyperfunctions I, pdf scan
M. Kashiwara, T. Kawai, T. Kimura, Foundation of algebraic analysis, Princeton Univ. Press (1986) ((translated from the Japanese))
Mitsuo Morimoto: An introduction to Sato’s hyperfunctions. (ZMATH entry)
A. Eida, S. Pilipović, On the microlocal decomposition of same classes of hyperfunctions, Math. Proc. Camb. Phil. Soc. 125, 455-461 (1999)
It is possible to use hyperfunctions as an introduction to generalized functions for physicists and engineers with a minimal background in complex analysis, see