Matches in SemOpenAlex for { <https://semopenalex.org/work/W3204184292> ?p ?o ?g. }
Showing items 1 to 95 of 95 with 100 items per page.
- W3204184292 abstract "A proper initialization of parameters in a neural network can facilitate its training. The Xavier initialization introduced by Glorot and Bengio, which was later generalized to the Kaiming initialization by He, Zhang, Ren and Sun, is now widely used. However, from experiments we find that networks with heavy weight sharing are difficult to train even with the Xavier or the Kaiming initialization. We also notice that a certain simple network can be decomposed in two ways, where one is difficult to train while the other is easy to train, when both are properly initialized by the Xavier or the Kaiming initialization. In this paper we will propose a new initialization method which will increase training speed and training stability of neural networks with heavy weight sharing. We will also propose a simple yet efficient method to adjust learning rates layer by layer, which is indispensable to our initialization." @default.
- W3204184292 created "2021-10-11" @default.
- W3204184292 creator A5001257681 @default.
- W3204184292 creator A5004126693 @default.
- W3204184292 creator A5021574704 @default.
- W3204184292 creator A5033203506 @default.
- W3204184292 creator A5059443966 @default.
- W3204184292 creator A5082994985 @default.
- W3204184292 date "2021-01-01" @default.
- W3204184292 modified "2023-09-28" @default.
- W3204184292 title "A New Initialization Method for Neural Networks with Weight Sharing" @default.
- W3204184292 cites W1677182931 @default.
- W3204184292 cites W179875071 @default.
- W3204184292 cites W1938976761 @default.
- W3204184292 cites W1995562189 @default.
- W3204184292 cites W2064675550 @default.
- W3204184292 cites W2160815625 @default.
- W3204184292 cites W2194775991 @default.
- W3204184292 cites W2257979135 @default.
- W3204184292 cites W2285660444 @default.
- W3204184292 cites W2545985378 @default.
- W3204184292 cites W2964233199 @default.
- W3204184292 doi "https://doi.org/10.1007/978-981-16-2701-9_9" @default.
- W3204184292 hasPublicationYear "2021" @default.
- W3204184292 type Work @default.
- W3204184292 sameAs 3204184292 @default.
- W3204184292 citedByCount "0" @default.
- W3204184292 crossrefType "book-chapter" @default.
- W3204184292 hasAuthorship W3204184292A5001257681 @default.
- W3204184292 hasAuthorship W3204184292A5004126693 @default.
- W3204184292 hasAuthorship W3204184292A5021574704 @default.
- W3204184292 hasAuthorship W3204184292A5033203506 @default.
- W3204184292 hasAuthorship W3204184292A5059443966 @default.
- W3204184292 hasAuthorship W3204184292A5082994985 @default.
- W3204184292 hasConcept C111472728 @default.
- W3204184292 hasConcept C112972136 @default.
- W3204184292 hasConcept C11413529 @default.
- W3204184292 hasConcept C114466953 @default.
- W3204184292 hasConcept C119857082 @default.
- W3204184292 hasConcept C138885662 @default.
- W3204184292 hasConcept C154945302 @default.
- W3204184292 hasConcept C17744445 @default.
- W3204184292 hasConcept C178790620 @default.
- W3204184292 hasConcept C185592680 @default.
- W3204184292 hasConcept C199360897 @default.
- W3204184292 hasConcept C199539241 @default.
- W3204184292 hasConcept C2779227376 @default.
- W3204184292 hasConcept C2779913896 @default.
- W3204184292 hasConcept C2780586882 @default.
- W3204184292 hasConcept C41008148 @default.
- W3204184292 hasConcept C50644808 @default.
- W3204184292 hasConceptScore W3204184292C111472728 @default.
- W3204184292 hasConceptScore W3204184292C112972136 @default.
- W3204184292 hasConceptScore W3204184292C11413529 @default.
- W3204184292 hasConceptScore W3204184292C114466953 @default.
- W3204184292 hasConceptScore W3204184292C119857082 @default.
- W3204184292 hasConceptScore W3204184292C138885662 @default.
- W3204184292 hasConceptScore W3204184292C154945302 @default.
- W3204184292 hasConceptScore W3204184292C17744445 @default.
- W3204184292 hasConceptScore W3204184292C178790620 @default.
- W3204184292 hasConceptScore W3204184292C185592680 @default.
- W3204184292 hasConceptScore W3204184292C199360897 @default.
- W3204184292 hasConceptScore W3204184292C199539241 @default.
- W3204184292 hasConceptScore W3204184292C2779227376 @default.
- W3204184292 hasConceptScore W3204184292C2779913896 @default.
- W3204184292 hasConceptScore W3204184292C2780586882 @default.
- W3204184292 hasConceptScore W3204184292C41008148 @default.
- W3204184292 hasConceptScore W3204184292C50644808 @default.
- W3204184292 hasLocation W32041842921 @default.
- W3204184292 hasOpenAccess W3204184292 @default.
- W3204184292 hasPrimaryLocation W32041842921 @default.
- W3204184292 hasRelatedWork W1989422077 @default.
- W3204184292 hasRelatedWork W2027682888 @default.
- W3204184292 hasRelatedWork W2116322539 @default.
- W3204184292 hasRelatedWork W2355375122 @default.
- W3204184292 hasRelatedWork W2413005285 @default.
- W3204184292 hasRelatedWork W2472117222 @default.
- W3204184292 hasRelatedWork W2492796309 @default.
- W3204184292 hasRelatedWork W2802636049 @default.
- W3204184292 hasRelatedWork W2812009592 @default.
- W3204184292 hasRelatedWork W2916965824 @default.
- W3204184292 hasRelatedWork W2951388117 @default.
- W3204184292 hasRelatedWork W2996243516 @default.
- W3204184292 hasRelatedWork W3019371514 @default.
- W3204184292 hasRelatedWork W3087429855 @default.
- W3204184292 hasRelatedWork W3130602232 @default.
- W3204184292 hasRelatedWork W3138788873 @default.
- W3204184292 hasRelatedWork W3173369353 @default.
- W3204184292 hasRelatedWork W3174185214 @default.
- W3204184292 hasRelatedWork W3201496309 @default.
- W3204184292 hasRelatedWork W2959072671 @default.
- W3204184292 isParatext "false" @default.
- W3204184292 isRetracted "false" @default.
- W3204184292 magId "3204184292" @default.
- W3204184292 workType "book-chapter" @default.