On the Functional Optimality of Neural Networks
M. Unser
Proceedings of the XXII Congresso dell’Unione Matematica Italiana (UMI'23), Pisa, Italian Republic, September 4-9, 2023.
Let L be a linear shift-invariant and isotropic operator that is characterized by its radial Fourier profile L^rad : ℝ → ℝ. We further assume that L^rad is non-vanishing, except for a zero of order (n0 − 1) at the origin. This operator is in one-to-one correspondence with the activation function σL = ℱ−1{1 ∕ L^rad} : ℝ → ℝ where ℱ−1 denotes the inverse Fourier transform. We define the corresponding Radon domain regularization operator LR = KradRL : ℳLR(ℝd) → ℳeven(ℝ ⨉ 𝕊d−1) where R is the Radon transform, Krad is the filtering operator of computed tomography (such that R*KradR = Id), and ℳeven is the space of even hyper-spherical bounded measures (see [1] for the precise definition of these elements).
References
-
M. Unser, "From Kernel Methods to Neural Networks: A Unifying Variational Formulation," arXiv:2206.14625 [cs.LG]
@INPROCEEDINGS(http://bigwww.epfl.ch/publications/unser2302.html,
AUTHOR="Unser, M.",
TITLE="On the Functional Optimality of Neural Networks",
BOOKTITLE="Proceedings of the {XXII} Congresso dell’Unione Matematica
Italiana ({UMI'23})",
YEAR="2023",
editor="",
volume="",
series="",
pages="",
address="Pisa, Italian Republic",
month="September 4-9,",
organization="",
publisher="",
note="")