Matches in SemOpenAlex for { <https://semopenalex.org/work/W3207589413> ?p ?o ?g. }
- W3207589413 abstract "Noise-contrastive estimation (NCE) is a statistically consistent method for learning unnormalized probabilistic models. It has been empirically observed that the choice of the noise distribution is crucial for NCE's performance. However, such observations have never been made formal or quantitative. In fact, it is not even clear whether the difficulties arising from a poorly chosen noise distribution are statistical or algorithmic in nature. In this work, we formally pinpoint reasons for NCE's poor performance when an inappropriate noise distribution is used. Namely, we prove these challenges arise due to an ill-behaved (more precisely, flat) loss landscape. To address this, we introduce a variant of NCE called eNCE which uses an exponential loss and for which normalized gradient descent addresses the landscape issues provably when the target and noise distributions are in a given exponential family." @default.
- W3207589413 created "2021-10-25" @default.
- W3207589413 creator A5007869267 @default.
- W3207589413 creator A5014972240 @default.
- W3207589413 creator A5053209283 @default.
- W3207589413 creator A5075587107 @default.
- W3207589413 date "2021-10-21" @default.
- W3207589413 modified "2023-09-26" @default.
- W3207589413 title "Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation" @default.
- W3207589413 cites W1483912251 @default.
- W3207589413 cites W1513873506 @default.
- W3207589413 cites W1768488313 @default.
- W3207589413 cites W2013164703 @default.
- W3207589413 cites W2097732278 @default.
- W3207589413 cites W2120340025 @default.
- W3207589413 cites W2120861206 @default.
- W3207589413 cites W2131939418 @default.
- W3207589413 cites W2138204974 @default.
- W3207589413 cites W2152790380 @default.
- W3207589413 cites W2159611450 @default.
- W3207589413 cites W2439299270 @default.
- W3207589413 cites W2841543429 @default.
- W3207589413 cites W2842511635 @default.
- W3207589413 cites W2897723259 @default.
- W3207589413 cites W2922772346 @default.
- W3207589413 cites W2949979820 @default.
- W3207589413 cites W2951873722 @default.
- W3207589413 cites W2962755094 @default.
- W3207589413 cites W2963762683 @default.
- W3207589413 cites W2963800509 @default.
- W3207589413 cites W2971899460 @default.
- W3207589413 cites W2994434574 @default.
- W3207589413 cites W2995024809 @default.
- W3207589413 cites W2995040292 @default.
- W3207589413 cites W3009318622 @default.
- W3207589413 cites W3035058308 @default.
- W3207589413 cites W3035166812 @default.
- W3207589413 cites W3036122622 @default.
- W3207589413 cites W3038081132 @default.
- W3207589413 cites W3148140980 @default.
- W3207589413 cites W3194546735 @default.
- W3207589413 cites W82247633 @default.
- W3207589413 cites W91088564 @default.
- W3207589413 doi "https://doi.org/10.48550/arxiv.2110.11271" @default.
- W3207589413 hasPublicationYear "2021" @default.
- W3207589413 type Work @default.
- W3207589413 sameAs 3207589413 @default.
- W3207589413 citedByCount "0" @default.
- W3207589413 crossrefType "posted-content" @default.
- W3207589413 hasAuthorship W3207589413A5007869267 @default.
- W3207589413 hasAuthorship W3207589413A5014972240 @default.
- W3207589413 hasAuthorship W3207589413A5053209283 @default.
- W3207589413 hasAuthorship W3207589413A5075587107 @default.
- W3207589413 hasBestOaLocation W32075894131 @default.
- W3207589413 hasConcept C110121322 @default.
- W3207589413 hasConcept C11413529 @default.
- W3207589413 hasConcept C114289077 @default.
- W3207589413 hasConcept C115961682 @default.
- W3207589413 hasConcept C119857082 @default.
- W3207589413 hasConcept C134306372 @default.
- W3207589413 hasConcept C151376022 @default.
- W3207589413 hasConcept C153258448 @default.
- W3207589413 hasConcept C154945302 @default.
- W3207589413 hasConcept C162324750 @default.
- W3207589413 hasConcept C163294075 @default.
- W3207589413 hasConcept C187612029 @default.
- W3207589413 hasConcept C187736073 @default.
- W3207589413 hasConcept C200378446 @default.
- W3207589413 hasConcept C28826006 @default.
- W3207589413 hasConcept C29265498 @default.
- W3207589413 hasConcept C33923547 @default.
- W3207589413 hasConcept C41008148 @default.
- W3207589413 hasConcept C49937458 @default.
- W3207589413 hasConcept C50644808 @default.
- W3207589413 hasConcept C55974624 @default.
- W3207589413 hasConcept C96250715 @default.
- W3207589413 hasConcept C99498987 @default.
- W3207589413 hasConceptScore W3207589413C110121322 @default.
- W3207589413 hasConceptScore W3207589413C11413529 @default.
- W3207589413 hasConceptScore W3207589413C114289077 @default.
- W3207589413 hasConceptScore W3207589413C115961682 @default.
- W3207589413 hasConceptScore W3207589413C119857082 @default.
- W3207589413 hasConceptScore W3207589413C134306372 @default.
- W3207589413 hasConceptScore W3207589413C151376022 @default.
- W3207589413 hasConceptScore W3207589413C153258448 @default.
- W3207589413 hasConceptScore W3207589413C154945302 @default.
- W3207589413 hasConceptScore W3207589413C162324750 @default.
- W3207589413 hasConceptScore W3207589413C163294075 @default.
- W3207589413 hasConceptScore W3207589413C187612029 @default.
- W3207589413 hasConceptScore W3207589413C187736073 @default.
- W3207589413 hasConceptScore W3207589413C200378446 @default.
- W3207589413 hasConceptScore W3207589413C28826006 @default.
- W3207589413 hasConceptScore W3207589413C29265498 @default.
- W3207589413 hasConceptScore W3207589413C33923547 @default.
- W3207589413 hasConceptScore W3207589413C41008148 @default.
- W3207589413 hasConceptScore W3207589413C49937458 @default.
- W3207589413 hasConceptScore W3207589413C50644808 @default.
- W3207589413 hasConceptScore W3207589413C55974624 @default.
- W3207589413 hasConceptScore W3207589413C96250715 @default.
- W3207589413 hasConceptScore W3207589413C99498987 @default.
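
The abstract of W3207589413 describes NCE as learning a model by discriminating data from noise, and mentions normalized gradient descent for the case where target and noise lie in a common exponential family. The following is a minimal sketch of that general idea, not the paper's eNCE method: it uses the standard logistic NCE loss, and the unit-variance Gaussian model, the N(0, I) noise distribution, the function names, and the step size are all assumptions chosen for illustration.

```python
import numpy as np


def log_sigmoid(z):
    # Numerically stable log(sigmoid(z)).
    return -np.logaddexp(0.0, -z)


def nce_loss_and_grad(theta, data, noise):
    """Logistic NCE loss and gradient for the model N(theta, I).

    Assumption: the noise distribution is N(0, I), so the log-ratio
    log p_theta(x) - log q(x) reduces to x . theta - 0.5 * ||theta||^2.
    """
    def logit(x):
        return x @ theta - 0.5 * (theta @ theta)

    z_data, z_noise = logit(data), logit(noise)
    # Classify data as "real" and noise as "fake".
    loss = -np.mean(log_sigmoid(z_data)) - np.mean(log_sigmoid(-z_noise))

    w_data = np.exp(log_sigmoid(-z_data))    # 1 - sigmoid(z_data)
    w_noise = np.exp(log_sigmoid(z_noise))   # sigmoid(z_noise)
    # d(logit)/d(theta) = x - theta for this model.
    grad = (-(w_data[:, None] * (data - theta)).mean(axis=0)
            + (w_noise[:, None] * (noise - theta)).mean(axis=0))
    return loss, grad


def normalized_gd(data, noise, steps=200, lr=0.05):
    """Gradient descent where each step has fixed length lr."""
    theta = np.zeros(data.shape[1])
    for _ in range(steps):
        _, g = nce_loss_and_grad(theta, data, noise)
        norm = np.linalg.norm(g)
        if norm < 1e-12:
            break
        # Only the gradient direction is used, so flat regions of the
        # loss do not shrink the step.
        theta = theta - lr * g / norm
    return theta


# Hypothetical toy data: target mean (1, -1), noise centered at the origin.
rng = np.random.default_rng(0)
mu_star = np.array([1.0, -1.0])
data = rng.normal(loc=mu_star, scale=1.0, size=(5000, 2))
noise = rng.normal(loc=0.0, scale=1.0, size=(5000, 2))
theta_hat = normalized_gd(data, noise)
print(theta_hat)
```

Because every step has length lr, the iterate ends up hovering within roughly lr of the empirical minimizer rather than converging exactly; a decaying step size would tighten this.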