We present a novel methodology, based on a Taylor expansion of the network
output, for obtaining analytical expressions for the expected value of the
network weights and output under stochastic training. Using these analytical
expressions, we study the effects of the hyperparameters and of the noise
variance of the optimization algorithm on the performance of the deep neural
network. In the early phases of training with a small noise coefficient, the
output is equivalent to a linear model. In this case the network can
generalize better because the noise prevents the output from fully converging
on the training data; however, the noise does not result in any explicit
regularization.
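To make the linear-model claim concrete, consider a first-order Taylor
expansion of the output around a reference point $w_0$ (an illustrative
sketch; the symbols $f$, $x$, $w$, and $w_0$ are our notation, not taken from
the paper's definitions):
\[
f(x; w) \approx f(x; w_0) + \nabla_w f(x; w_0)^\top (w - w_0).
\]
Because this approximation is linear in $w$, the expectation passes through
it: $\mathbb{E}[f(x; w)] \approx f(x; w_0) + \nabla_w f(x; w_0)^\top
(\mathbb{E}[w] - w_0)$. Zero-mean weight noise therefore affects the expected
output only through $\mathbb{E}[w]$ and contributes no explicit
regularization term.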
In the later stages of training, when higher-order approximations are
required, the impact of the noise becomes more significant: in a model that
is non-linear in the weights, the noise can regularize the output function,
resulting in better generalization, as witnessed by its influence on the
weight Hessian, a commonly used metric for generalization capability.
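Retaining the second-order term makes the role of the weight Hessian
explicit. As an illustrative sketch, assume weights $w = \bar{w} +
\varepsilon$ with zero-mean noise $\varepsilon$ of covariance $\Sigma$ (again
our notation, not the paper's); a second-order expansion around $\bar{w}$
gives
\[
\mathbb{E}[f(x; w)] \approx f(x; \bar{w}) +
\tfrac{1}{2}\,\operatorname{tr}\!\bigl(H_w f(x; \bar{w})\,\Sigma\bigr),
\]
where $H_w f$ denotes the weight Hessian of the output. The noise variance
now enters the expected output directly through the Hessian, which is the
sense in which noise can act as a regularizer in the non-linear regime.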