Skip to main content
King Abdullah University of Science and Technology
Computer Science
Computer Science
  • Study
    • Prospective Students
    • Current Students
  • Research
    • Research Areas
    • Research Groups
  • People
    • All People
    • Faculty
    • Affiliate Faculty
    • Instructional Faculty
    • Research Scientists
    • Research Staff
    • Postdoctoral Fellows
    • Administrative Staff
    • Alumni
    • Students
  • News
  • Events
  • About
  • CEMSE Division
  • Apply

ReLUs

Dynamics and Convergence of Weight Normalization for Training Neural Networks

Guido Montufar, Assistant Professor, Departments of Mathematics and Statistics, University of California, Los Angeles (UCLA)

Jan 29, 13:00 - 14:30

B1 L3 R3119

machine learning neural network optimization ReLUs

We present a result on the convergence of weight normalized training of artificial neural networks. In the analysis, we consider over-parameterized 2-layer networks with rectified linear units (ReLUs) initialized at random and trained with batch gradient descent and a fixed step size. The proof builds on recent theoretical works that bound the trajectory of parameters from their initialization and monitor the network predictions via the evolution of a ''neural tangent kernel'' (Jacot et al. 2018). We discover that training with weight normalization decomposes such a kernel via the so called ''length-direction decoupling''. This in turn leads to two convergence regimes. From the modified convergence we make a few curious observations including a natural form of ''lazy training'' where the direction of each weight vector remains stationary.

Computer Science (CS)

Footer

  • A-Z Directory
    • All Content
    • Browse Related Sites
  • Site Management
    • Log in

© 2025 King Abdullah University of Science and Technology. All rights reserved. Privacy Notice

Disclaimer: The views and opinions expressed in this page are strictly those of the page author. The contents of this page have not been reviewed or approved by the King Abdullah University of Science and Technology.