Matches in SemOpenAlex for { <https://semopenalex.org/work/W2913930213> ?p ?o ?g. }
- W2913930213 abstract "We prove that for an $L$-layer fully-connected linear neural network, if the width of every hidden layer is $tildeOmega (L cdot r cdot d_{mathrm{out}} cdot kappa^3 )$, where $r$ and $kappa$ are the rank and the condition number of the input data, and $d_{mathrm{out}}$ is the output dimension, then gradient descent with Gaussian random initialization converges to a global minimum at a linear rate. The number of iterations to find an $epsilon$-suboptimal solution is $O(kappa log(frac{1}{epsilon}))$. Our polynomial upper bound on the total running time for wide deep linear networks and the $expleft(Omegaleft(Lright)right)$ lower bound for narrow deep linear neural networks [Shamir, 2018] together demonstrate that wide layers are necessary for optimizing deep models." @default.
- W2913930213 created "2019-02-21" @default.
- W2913930213 creator A5033061754 @default.
- W2913930213 creator A5062378128 @default.
- W2913930213 date "2019-01-24" @default.
- W2913930213 modified "2023-09-27" @default.
- W2913930213 title "Width Provably Matters in Optimization for Deep Linear Neural Networks" @default.
- W2913930213 cites W1522301498 @default.
- W2913930213 cites W1533861849 @default.
- W2913930213 cites W2146502635 @default.
- W2913930213 cites W2194775991 @default.
- W2913930213 cites W2399994860 @default.
- W2913930213 cites W2565538933 @default.
- W2913930213 cites W2591714514 @default.
- W2913930213 cites W2593380010 @default.
- W2913930213 cites W2593709294 @default.
- W2913930213 cites W2736030546 @default.
- W2913930213 cites W2746420172 @default.
- W2913930213 cites W2788800397 @default.
- W2913930213 cites W2788997738 @default.
- W2913930213 cites W2806265408 @default.
- W2913930213 cites W2886067286 @default.
- W2913930213 cites W2886685759 @default.
- W2913930213 cites W2891942459 @default.
- W2913930213 cites W2894972989 @default.
- W2913930213 cites W2899748887 @default.
- W2913930213 cites W2900959181 @default.
- W2913930213 cites W2951934643 @default.
- W2913930213 cites W2952817981 @default.
- W2913930213 cites W2962698540 @default.
- W2913930213 cites W2962767131 @default.
- W2913930213 cites W2962930448 @default.
- W2913930213 cites W2963092340 @default.
- W2913930213 cites W2963383839 @default.
- W2913930213 cites W2963417959 @default.
- W2913930213 cites W2963427613 @default.
- W2913930213 cites W2963446085 @default.
- W2913930213 cites W2963504252 @default.
- W2913930213 cites W2963519230 @default.
- W2913930213 cites W2963569411 @default.
- W2913930213 cites W2963651774 @default.
- W2913930213 cites W2963827833 @default.
- W2913930213 cites W2963837241 @default.
- W2913930213 cites W2964072429 @default.
- W2913930213 cites W2964106499 @default.
- W2913930213 cites W2964161337 @default.
- W2913930213 cites W2966228138 @default.
- W2913930213 hasPublicationYear "2019" @default.
- W2913930213 type Work @default.
- W2913930213 sameAs 2913930213 @default.
- W2913930213 citedByCount "25" @default.
- W2913930213 countsByYear W29139302132019 @default.
- W2913930213 countsByYear W29139302132020 @default.
- W2913930213 countsByYear W29139302132021 @default.
- W2913930213 crossrefType "posted-content" @default.
- W2913930213 hasAuthorship W2913930213A5033061754 @default.
- W2913930213 hasAuthorship W2913930213A5062378128 @default.
- W2913930213 hasConcept C114466953 @default.
- W2913930213 hasConcept C114614502 @default.
- W2913930213 hasConcept C118615104 @default.
- W2913930213 hasConcept C121332964 @default.
- W2913930213 hasConcept C134306372 @default.
- W2913930213 hasConcept C153258448 @default.
- W2913930213 hasConcept C154945302 @default.
- W2913930213 hasConcept C163716315 @default.
- W2913930213 hasConcept C164226766 @default.
- W2913930213 hasConcept C199360897 @default.
- W2913930213 hasConcept C2779557605 @default.
- W2913930213 hasConcept C33676613 @default.
- W2913930213 hasConcept C33923547 @default.
- W2913930213 hasConcept C41008148 @default.
- W2913930213 hasConcept C50644808 @default.
- W2913930213 hasConcept C62520636 @default.
- W2913930213 hasConcept C77553402 @default.
- W2913930213 hasConcept C90119067 @default.
- W2913930213 hasConceptScore W2913930213C114466953 @default.
- W2913930213 hasConceptScore W2913930213C114614502 @default.
- W2913930213 hasConceptScore W2913930213C118615104 @default.
- W2913930213 hasConceptScore W2913930213C121332964 @default.
- W2913930213 hasConceptScore W2913930213C134306372 @default.
- W2913930213 hasConceptScore W2913930213C153258448 @default.
- W2913930213 hasConceptScore W2913930213C154945302 @default.
- W2913930213 hasConceptScore W2913930213C163716315 @default.
- W2913930213 hasConceptScore W2913930213C164226766 @default.
- W2913930213 hasConceptScore W2913930213C199360897 @default.
- W2913930213 hasConceptScore W2913930213C2779557605 @default.
- W2913930213 hasConceptScore W2913930213C33676613 @default.
- W2913930213 hasConceptScore W2913930213C33923547 @default.
- W2913930213 hasConceptScore W2913930213C41008148 @default.
- W2913930213 hasConceptScore W2913930213C50644808 @default.
- W2913930213 hasConceptScore W2913930213C62520636 @default.
- W2913930213 hasConceptScore W2913930213C77553402 @default.
- W2913930213 hasConceptScore W2913930213C90119067 @default.
- W2913930213 hasLocation W29139302131 @default.
- W2913930213 hasOpenAccess W2913930213 @default.
- W2913930213 hasPrimaryLocation W29139302131 @default.
- W2913930213 hasRelatedWork W2125930537 @default.
- W2913930213 hasRelatedWork W2194775991 @default.
- W2913930213 hasRelatedWork W2788800397 @default.
- W2913930213 hasRelatedWork W2809090039 @default.