Deep Learning as a Convex Paradigm of Computation: Minimizing Circuit Size with ResNets

arXiv — stat.ML · Thursday, November 27, 2025 at 5:00:00 AM
  • A recent paper argues that deep neural networks (DNNs) can be viewed as a computational Occam's razor, effectively identifying the simplest algorithms that fit the data. The study highlights the convexity of the real-valued functions approximable by binary circuits in the harder-than-Monte-Carlo regime, particularly when using ResNets, which admits a new complexity measure on their parameters.
  • This development is significant as it provides a theoretical foundation for understanding the efficiency of DNNs, particularly ResNets, in minimizing circuit size while maintaining performance. The findings could influence future designs and applications of neural networks in various fields.
  • The exploration of DNNs' efficiency ties into ongoing discussions about optimizing neural network architectures, including methods for improving convergence rates and recovering accuracy post-pruning. These advancements reflect a broader trend in AI research focused on enhancing model performance while reducing complexity and resource requirements.
— via World Pulse Now AI Editorial System
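The summary mentions a complexity measure defined on ResNet parameters but does not spell out its form. As a purely hypothetical illustration (not the paper's actual definition), a norm-based measure on the residual weights captures the intuition that a ResNet whose blocks are small perturbations of the identity computes a "simple" function:

```python
import numpy as np

rng = np.random.default_rng(0)

def resnet_forward(x, blocks):
    """Toy ResNet forward pass: x -> x + W2 @ relu(W1 @ x) per block."""
    for W1, W2 in blocks:
        x = x + W2 @ np.maximum(W1 @ x, 0.0)
    return x

def norm_complexity(blocks):
    """Illustrative complexity measure: sum over blocks of the product of
    spectral norms of the residual weights. Small values mean every block
    stays close to the identity map."""
    return sum(np.linalg.norm(W1, 2) * np.linalg.norm(W2, 2)
               for W1, W2 in blocks)

# Three small residual blocks acting on 4-dimensional inputs.
blocks = [(0.1 * rng.standard_normal((4, 4)),
           0.1 * rng.standard_normal((4, 4))) for _ in range(3)]
x = rng.standard_normal(4)
y = resnet_forward(x, blocks)
complexity = norm_complexity(blocks)
```

Shrinking the residual weights drives this measure toward zero while the network approaches the identity function, which is the kind of parameter-to-simplicity link the paper formalizes.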

Continue Reading
Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness
Positive · Artificial Intelligence
A new approach to enhancing the robustness of deep neural networks (DNNs) has been proposed, focusing on Lipschitz continuity to mitigate adversarial attacks. This method offers a cost-effective alternative to traditional adversarial training, requiring only a single dataset pass without gradient estimation, thus improving efficiency and practicality for real-world applications.
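The blurb does not detail how the Lipschitz constant is controlled; a standard way to bound it without any gradient estimation, shown here as a minimal sketch, is to take the product of per-layer spectral norms (ReLU activations are 1-Lipschitz, so they contribute a factor of 1):

```python
import numpy as np

def lipschitz_upper_bound(weights):
    """Upper bound on the Lipschitz constant of a feed-forward ReLU net:
    the product of each weight matrix's largest singular value."""
    bound = 1.0
    for W in weights:
        bound *= np.linalg.norm(W, 2)  # spectral norm
    return bound

# Two linear layers scaling by 2 and 3: the composition is at most 6-Lipschitz.
bound = lipschitz_upper_bound([2.0 * np.eye(2), 3.0 * np.eye(2)])
```

Keeping this product small limits how much any input perturbation, adversarial or otherwise, can move the output.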
Post-Pruning Accuracy Recovery via Data-Free Knowledge Distillation
Positive · Artificial Intelligence
A new framework for Data-Free Knowledge Distillation has been proposed to address the accuracy loss associated with model pruning in Deep Neural Networks (DNNs). This method synthesizes privacy-preserving images from a pre-trained teacher model, allowing knowledge transfer to pruned student networks without requiring access to original training data, which is often restricted due to privacy regulations like GDPR and HIPAA.
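Whatever the image-synthesis step looks like, the transfer itself typically uses the standard distillation objective: a temperature-softened KL divergence between teacher and student outputs. A minimal sketch of that loss (the framework's exact objective is not given in the summary):

```python
import numpy as np

def softmax(z, T=1.0):
    """Numerically stable softmax with temperature T."""
    z = np.asarray(z, dtype=float) / T
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, T=2.0):
    """KL(teacher || student) on temperature-softened distributions,
    scaled by T^2 as in standard knowledge distillation."""
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return T * T * np.sum(p * (np.log(p) - np.log(q)))

zero_loss = distillation_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])
pos_loss = distillation_loss([1.0, 2.0, 3.0], [3.0, 2.0, 1.0])
```

Because the loss needs only the teacher's logits on synthesized inputs, no original training data ever has to leave the teacher's side.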
SUPN: Shallow Universal Polynomial Networks
Positive · Artificial Intelligence
A new study introduces Shallow Universal Polynomial Networks (SUPNs), which aim to enhance function approximation by replacing most hidden layers in deep neural networks with a single layer of polynomials. This approach seeks to reduce the number of trainable parameters while maintaining expressivity, addressing issues of overparameterization and local minima that can affect model accuracy.
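The core idea, replacing a stack of hidden layers with one polynomial layer, can be sketched as a monomial feature expansion followed by a linear readout. This is an illustrative reconstruction, not the SUPN paper's exact architecture:

```python
import numpy as np
from itertools import combinations_with_replacement

def poly_features(x, degree):
    """All monomials of the input coordinates up to the given degree,
    including the constant term."""
    feats = [1.0]
    for d in range(1, degree + 1):
        for idx in combinations_with_replacement(range(len(x)), d):
            feats.append(np.prod([x[i] for i in idx]))
    return np.array(feats)

def supn_forward(x, w, degree=2):
    """Single polynomial layer followed by a linear readout with weights w."""
    return poly_features(x, degree) @ w

# Degree-2 expansion of a 2-d input: [1, x1, x2, x1^2, x1*x2, x2^2].
phi = poly_features(np.array([2.0, 3.0]), 2)
```

The trainable parameters are just the readout weights `w` (one per monomial), which is where the claimed parameter savings relative to a deep stack would come from.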