Matches in SemOpenAlex for { <https://semopenalex.org/work/W2247432958> ?p ?o ?g. }
Showing items 1 to 88 of 88 with 100 items per page.
- W2247432958 abstract "This work analyzes centered binary Restricted Boltzmann Machines (RBMs) and binary Deep Boltzmann Machines (DBMs), where centering is done by subtracting offset values from visible and hidden variables. We show analytically that (i) centering results in a different but equivalent parameterization for artificial neural networks in general, (ii) the expected performance of centered binary RBMs/DBMs is invariant under simultaneous flip of data and offsets, for any offset value in the range of zero to one, (iii) centering can be reformulated as a different update rule for normal binary RBMs/DBMs, and (iv) using the enhanced gradient is equivalent to setting the offset values to the average over model and data mean. Furthermore, numerical simulations suggest that (i) optimal generative performance is achieved by subtracting mean values from visible as well as hidden variables, (ii) centered RBMs/DBMs reach significantly higher log-likelihood values than normal binary RBMs/DBMs, (iii) centering variants whose offsets depend on the model mean, like the enhanced gradient, suffer from severe divergence problems, (iv) learning is stabilized if an exponentially moving average over the batch means is used for the offset values instead of the current batch mean, which also prevents the enhanced gradient from diverging, (v) centered RBMs/DBMs reach higher LL values than normal RBMs/DBMs while having a smaller norm of the weight matrix, (vi) centering leads to an update direction that is closer to the natural gradient, and the natural gradient is extremely efficient for training RBMs, (vii) centering dispenses with the need for greedy layer-wise pre-training of DBMs, (viii) pre-training often even worsens the results, independently of whether centering is used or not, and (ix) centering is also beneficial for autoencoders." @default.
- W2247432958 created "2016-06-24" @default.
- W2247432958 creator A5025349969 @default.
- W2247432958 creator A5026151059 @default.
- W2247432958 creator A5039663126 @default.
- W2247432958 date "2013-11-06" @default.
- W2247432958 modified "2023-09-27" @default.
- W2247432958 title "How to Center Binary Deep Boltzmann Machines" @default.
- W2247432958 cites W1562353105 @default.
- W2247432958 cites W1579917626 @default.
- W2247432958 cites W1592845119 @default.
- W2247432958 cites W193851967 @default.
- W2247432958 cites W2018168021 @default.
- W2247432958 cites W2051717488 @default.
- W2247432958 cites W2054912984 @default.
- W2247432958 cites W2064675550 @default.
- W2247432958 cites W2072128103 @default.
- W2247432958 cites W2124098876 @default.
- W2247432958 cites W2125113755 @default.
- W2247432958 cites W2132023809 @default.
- W2247432958 cites W2135181320 @default.
- W2247432958 cites W2183660452 @default.
- W2247432958 cites W2200708944 @default.
- W2247432958 cites W3140968660 @default.
- W2247432958 cites W44815768 @default.
- W2247432958 hasPublicationYear "2013" @default.
- W2247432958 type Work @default.
- W2247432958 sameAs 2247432958 @default.
- W2247432958 citedByCount "2" @default.
- W2247432958 countsByYear W22474329582013 @default.
- W2247432958 countsByYear W22474329582018 @default.
- W2247432958 crossrefType "posted-content" @default.
- W2247432958 hasAuthorship W2247432958A5025349969 @default.
- W2247432958 hasAuthorship W2247432958A5026151059 @default.
- W2247432958 hasAuthorship W2247432958A5039663126 @default.
- W2247432958 hasConcept C108583219 @default.
- W2247432958 hasConcept C121332964 @default.
- W2247432958 hasConcept C154945302 @default.
- W2247432958 hasConcept C175291020 @default.
- W2247432958 hasConcept C192576344 @default.
- W2247432958 hasConcept C199360897 @default.
- W2247432958 hasConcept C33923547 @default.
- W2247432958 hasConcept C35304006 @default.
- W2247432958 hasConcept C41008148 @default.
- W2247432958 hasConcept C48372109 @default.
- W2247432958 hasConcept C77088390 @default.
- W2247432958 hasConcept C94375191 @default.
- W2247432958 hasConcept C97355855 @default.
- W2247432958 hasConceptScore W2247432958C108583219 @default.
- W2247432958 hasConceptScore W2247432958C121332964 @default.
- W2247432958 hasConceptScore W2247432958C154945302 @default.
- W2247432958 hasConceptScore W2247432958C175291020 @default.
- W2247432958 hasConceptScore W2247432958C192576344 @default.
- W2247432958 hasConceptScore W2247432958C199360897 @default.
- W2247432958 hasConceptScore W2247432958C33923547 @default.
- W2247432958 hasConceptScore W2247432958C35304006 @default.
- W2247432958 hasConceptScore W2247432958C41008148 @default.
- W2247432958 hasConceptScore W2247432958C48372109 @default.
- W2247432958 hasConceptScore W2247432958C77088390 @default.
- W2247432958 hasConceptScore W2247432958C94375191 @default.
- W2247432958 hasConceptScore W2247432958C97355855 @default.
- W2247432958 hasLocation W22474329581 @default.
- W2247432958 hasOpenAccess W2247432958 @default.
- W2247432958 hasPrimaryLocation W22474329581 @default.
- W2247432958 hasRelatedWork W1567835346 @default.
- W2247432958 hasRelatedWork W1936866406 @default.
- W2247432958 hasRelatedWork W2123726451 @default.
- W2247432958 hasRelatedWork W2128785859 @default.
- W2247432958 hasRelatedWork W2128882956 @default.
- W2247432958 hasRelatedWork W2169218196 @default.
- W2247432958 hasRelatedWork W2517166081 @default.
- W2247432958 hasRelatedWork W2620508304 @default.
- W2247432958 hasRelatedWork W2797328513 @default.
- W2247432958 hasRelatedWork W2949480272 @default.
- W2247432958 hasRelatedWork W2950346378 @default.
- W2247432958 hasRelatedWork W2954097612 @default.
- W2247432958 hasRelatedWork W3011893970 @default.
- W2247432958 hasRelatedWork W3103902731 @default.
- W2247432958 hasRelatedWork W3137089243 @default.
- W2247432958 hasRelatedWork W3208036525 @default.
- W2247432958 hasRelatedWork W2701971652 @default.
- W2247432958 hasRelatedWork W2965235851 @default.
- W2247432958 hasRelatedWork W2968535302 @default.
- W2247432958 hasRelatedWork W3020099821 @default.
- W2247432958 isParatext "false" @default.
- W2247432958 isRetracted "false" @default.
- W2247432958 magId "2247432958" @default.
- W2247432958 workType "article" @default.
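
The abstract above describes centering as subtracting offset values from the visible and hidden variables, and reports that an exponentially moving average over the batch means stabilizes learning. The following is a minimal sketch of those two ideas only, not the authors' implementation. The function names, the `rate` parameter, and the sign convention of the energy are assumptions chosen for illustration.

```python
def ema_offset(offset, batch, rate=0.01):
    """Move each offset toward the current batch mean.

    `rate` is a hypothetical smoothing factor; rate=1.0 would use the
    current batch mean directly, with no averaging.
    """
    n = len(batch)
    means = [sum(row[i] for row in batch) / n for i in range(len(offset))]
    return [(1.0 - rate) * o + rate * m for o, m in zip(offset, means)]


def centered_energy(v, h, W, b, c, mu, lam):
    """Energy of a binary RBM with offsets mu and lam subtracted from v and h.

    Assumes the common convention E = -(v-mu).b - (h-lam).c - (v-mu)^T W (h-lam).
    With mu and lam set to zero this reduces to the usual binary RBM energy.
    """
    vc = [vi - m for vi, m in zip(v, mu)]    # centered visible units
    hc = [hj - l for hj, l in zip(h, lam)]   # centered hidden units
    interaction = sum(
        vc[i] * W[i][j] * hc[j]
        for i in range(len(vc))
        for j in range(len(hc))
    )
    visible_term = sum(x * bi for x, bi in zip(vc, b))
    hidden_term = sum(y * cj for y, cj in zip(hc, c))
    return -visible_term - hidden_term - interaction
```

In this sketch, a training loop would call `ema_offset` once per mini-batch to refresh the visible offsets (and analogously the hidden offsets from the hidden activations) before evaluating `centered_energy` or its gradients.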