Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313142249> ?p ?o ?g. }
Showing items 1 to 82 of
82
with 100 items per page.
- W4313142249 endingPage "136" @default.
- W4313142249 startingPage "121" @default.
- W4313142249 abstract "Modern deepKim, Seung Hwan neural networks are equipped with normalization layers such as batch normalization or layer normalization to enhance and stabilize training dynamics. If a network contains such normalization layers, the optimization objective is invariant to the scale of the neural network parameters. The scale-invariance induces the neural network’s output to be only affected by the weights’ direction and not the weights’ scale. We first find a common feature of good hyperparameter combinations on such a scale-invariant network, including learning rate, weight decay, number of data samples, and batch size. Then we observe that hyperparameter setups that lead to good performance show similar degrees of angular update during one epoch. Using a stochastic differential equation, we analyze the angular update and show how each hyperparameter affects it. With this relationship, we can derive a simple hyperparameter tuning method and apply it to the efficient hyperparameter search." @default.
- W4313142249 created "2023-01-06" @default.
- W4313142249 creator A5003346555 @default.
- W4313142249 creator A5033951605 @default.
- W4313142249 creator A5066874418 @default.
- W4313142249 creator A5074809370 @default.
- W4313142249 creator A5083790649 @default.
- W4313142249 creator A5084478127 @default.
- W4313142249 date "2022-01-01" @default.
- W4313142249 modified "2023-10-16" @default.
- W4313142249 title "On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network" @default.
- W4313142249 cites W2117539524 @default.
- W4313142249 cites W2194775991 @default.
- W4313142249 cites W2549139847 @default.
- W4313142249 cites W2752782242 @default.
- W4313142249 cites W2962971773 @default.
- W4313142249 cites W2963446712 @default.
- W4313142249 cites W2964137095 @default.
- W4313142249 cites W2982083293 @default.
- W4313142249 cites W3104181075 @default.
- W4313142249 cites W4250482878 @default.
- W4313142249 doi "https://doi.org/10.1007/978-3-031-19775-8_8" @default.
- W4313142249 hasPublicationYear "2022" @default.
- W4313142249 type Work @default.
- W4313142249 citedByCount "0" @default.
- W4313142249 crossrefType "book-chapter" @default.
- W4313142249 hasAuthorship W4313142249A5003346555 @default.
- W4313142249 hasAuthorship W4313142249A5033951605 @default.
- W4313142249 hasAuthorship W4313142249A5066874418 @default.
- W4313142249 hasAuthorship W4313142249A5074809370 @default.
- W4313142249 hasAuthorship W4313142249A5083790649 @default.
- W4313142249 hasAuthorship W4313142249A5084478127 @default.
- W4313142249 hasConcept C10485038 @default.
- W4313142249 hasConcept C105795698 @default.
- W4313142249 hasConcept C11413529 @default.
- W4313142249 hasConcept C12267149 @default.
- W4313142249 hasConcept C135593079 @default.
- W4313142249 hasConcept C136886441 @default.
- W4313142249 hasConcept C144024400 @default.
- W4313142249 hasConcept C153180895 @default.
- W4313142249 hasConcept C154945302 @default.
- W4313142249 hasConcept C190470478 @default.
- W4313142249 hasConcept C19165224 @default.
- W4313142249 hasConcept C33923547 @default.
- W4313142249 hasConcept C37914503 @default.
- W4313142249 hasConcept C41008148 @default.
- W4313142249 hasConcept C50644808 @default.
- W4313142249 hasConcept C8642999 @default.
- W4313142249 hasConceptScore W4313142249C10485038 @default.
- W4313142249 hasConceptScore W4313142249C105795698 @default.
- W4313142249 hasConceptScore W4313142249C11413529 @default.
- W4313142249 hasConceptScore W4313142249C12267149 @default.
- W4313142249 hasConceptScore W4313142249C135593079 @default.
- W4313142249 hasConceptScore W4313142249C136886441 @default.
- W4313142249 hasConceptScore W4313142249C144024400 @default.
- W4313142249 hasConceptScore W4313142249C153180895 @default.
- W4313142249 hasConceptScore W4313142249C154945302 @default.
- W4313142249 hasConceptScore W4313142249C190470478 @default.
- W4313142249 hasConceptScore W4313142249C19165224 @default.
- W4313142249 hasConceptScore W4313142249C33923547 @default.
- W4313142249 hasConceptScore W4313142249C37914503 @default.
- W4313142249 hasConceptScore W4313142249C41008148 @default.
- W4313142249 hasConceptScore W4313142249C50644808 @default.
- W4313142249 hasConceptScore W4313142249C8642999 @default.
- W4313142249 hasLocation W43131422491 @default.
- W4313142249 hasOpenAccess W4313142249 @default.
- W4313142249 hasPrimaryLocation W43131422491 @default.
- W4313142249 hasRelatedWork W2016839265 @default.
- W4313142249 hasRelatedWork W2018445155 @default.
- W4313142249 hasRelatedWork W2026355170 @default.
- W4313142249 hasRelatedWork W2533072256 @default.
- W4313142249 hasRelatedWork W2946038180 @default.
- W4313142249 hasRelatedWork W3099966684 @default.
- W4313142249 hasRelatedWork W4280535922 @default.
- W4313142249 hasRelatedWork W4295309597 @default.
- W4313142249 hasRelatedWork W4313142249 @default.
- W4313142249 hasRelatedWork W4323894855 @default.
- W4313142249 isParatext "false" @default.
- W4313142249 isRetracted "false" @default.
- W4313142249 workType "book-chapter" @default.