On the Functional Optimality of Neural Networks
M. Unser
Proceedings of the XXII Congresso dell’Unione Matematica Italiana (UMI'23), Pisa, Italian Republic, September 4-9, 2023.
Let L be a linear shift-invariant and isotropic operator that is characterized by its radial Fourier profile L^rad : ℝ → ℝ. We further assume that L^rad is non-vanishing, except for a zero of order (n0 − 1) at the origin. This operator is in one-to-one correspondence with the activation function σL = ℱ−1{1 ∕ L^rad} : ℝ → ℝ where ℱ−1 denotes the inverse Fourier transform. We define the corresponding Radon domain regularization operator LR = KradRL : ℳLR(ℝd) → ℳeven(ℝ ⨉ 𝕊d−1) where R is the Radon transform, Krad is the filtering operator of computed tomography (such that R*KradR = Id), and ℳeven is the space of even hyper-spherical bounded measures (see [1] for the precise definition of these elements).
References
-
M. Unser, "From Kernel Methods to Neural Networks: A Unifying Variational Formulation," arXiv:2206.14625 [cs.LG]
@INPROCEEDINGS(http://bigwww.epfl.ch/publications/unser2302.html, AUTHOR="Unser, M.", TITLE="On the Functional Optimality of Neural Networks", BOOKTITLE="Proceedings of the {XXII} Congresso dell’Unione Matematica Italiana ({UMI'23})", YEAR="2023", editor="", volume="", series="", pages="", address="Pisa, Italian Republic", month="September 4-9,", organization="", publisher="", note="")