SheepNav
新上线今天0 投票

Singular Learning and Occam's Razor in Deep Monomial Networks

arXiv:2606.28464v1 Announce Type: new Abstract: In the optimization of neural networks, gradient dynamics are influenced by critical points that arise from the model's architecture. These critical points occur where the Jacobian of the model's parametrization is rank-deficient, and are the most pronounced singularities studied in Singular Learning Theory. We investigate such points in deep fully-connected networks with monomial activations via tools from polynomial algebra such as Mason's Theore

延伸阅读

  1. The AI jobs debate just got messier
  2. 反事实残差数据增强:为回归任务注入新的生命力
  3. scKDGM:基于KAN的动态图掩码学习框架,革新单细胞RNA-seq聚类
查看原文