Matches in SemOpenAlex for { <https://semopenalex.org/work/W3199147424> ?p ?o ?g. }
- W3199147424 abstract "Despite the powerful expressivity of neural networks with nonlinear activation functions, the underlying mechanism for deep neural networks still remains unclear. However, it can be proved that ultra-wide neural networks are equivalent to Gaussian processes, thus connecting the analysis on neural networks with Bayesian statistics and kernel methods. Moreover, recent studies on infinitely wide neural networks extend this correspondence to a specific kernel, named Neural Tangent Kernel (NTK), which governs the learning dynamics of related neural networks. Without weights and biases, the NTK recursively encodes the architecture information about the corresponding neural networks, including the activation function at each hidden layer. Inspired by this close relationship of Gaussian processes and neural networks, we propose a heuristic search method for activation functions of sufficiently wide neural networks in the NTK regime. To obtain an elegant and closed-form computation, activation functions are decomposed in the basis of Hermite polynomials, which converts the kernels in Gaussian processes into power series. Experiments show the outperformance of the obtained nonlinearities compared with other common activation functions. This work also reveals the potential utility of NTKs for guidance on neural network structure search in the future." @default.
- W3199147424 created "2021-09-27" @default.
- W3199147424 creator A5002732486 @default.
- W3199147424 creator A5014247469 @default.
- W3199147424 creator A5036765526 @default.
- W3199147424 date "2021-07-18" @default.
- W3199147424 modified "2023-09-24" @default.
- W3199147424 title "Heuristic Search for Activation Functions of Neural Networks Based on Gaussian Processes" @default.
- W3199147424 cites W1567512734 @default.
- W3199147424 cites W1665214252 @default.
- W3199147424 cites W1921523184 @default.
- W3199147424 cites W2112796928 @default.
- W3199147424 cites W2133864802 @default.
- W3199147424 cites W2145339207 @default.
- W3199147424 cites W2160815625 @default.
- W3199147424 cites W2294059674 @default.
- W3199147424 cites W2550980560 @default.
- W3199147424 cites W2618530766 @default.
- W3199147424 cites W2734358244 @default.
- W3199147424 cites W2736616523 @default.
- W3199147424 cites W2750384547 @default.
- W3199147424 cites W2752128027 @default.
- W3199147424 cites W2809090039 @default.
- W3199147424 cites W2902986194 @default.
- W3199147424 cites W2910655610 @default.
- W3199147424 cites W2952720005 @default.
- W3199147424 cites W2962939986 @default.
- W3199147424 cites W2963037989 @default.
- W3199147424 cites W2963285578 @default.
- W3199147424 cites W2963341956 @default.
- W3199147424 cites W2964052793 @default.
- W3199147424 cites W2970217468 @default.
- W3199147424 cites W2970239942 @default.
- W3199147424 cites W2970457724 @default.
- W3199147424 cites W2971043187 @default.
- W3199147424 cites W2994747787 @default.
- W3199147424 cites W3034845783 @default.
- W3199147424 cites W3034979923 @default.
- W3199147424 cites W3080914972 @default.
- W3199147424 cites W3099842580 @default.
- W3199147424 cites W3101069636 @default.
- W3199147424 cites W3103424281 @default.
- W3199147424 cites W3146803896 @default.
- W3199147424 cites W32532385 @default.
- W3199147424 doi "https://doi.org/10.1109/ijcnn52387.2021.9533641" @default.
- W3199147424 hasPublicationYear "2021" @default.
- W3199147424 type Work @default.
- W3199147424 sameAs 3199147424 @default.
- W3199147424 citedByCount "0" @default.
- W3199147424 crossrefType "proceedings-article" @default.
- W3199147424 hasAuthorship W3199147424A5002732486 @default.
- W3199147424 hasAuthorship W3199147424A5014247469 @default.
- W3199147424 hasAuthorship W3199147424A5036765526 @default.
- W3199147424 hasConcept C11413529 @default.
- W3199147424 hasConcept C118615104 @default.
- W3199147424 hasConcept C121332964 @default.
- W3199147424 hasConcept C147168706 @default.
- W3199147424 hasConcept C154945302 @default.
- W3199147424 hasConcept C163716315 @default.
- W3199147424 hasConcept C173801870 @default.
- W3199147424 hasConcept C33923547 @default.
- W3199147424 hasConcept C38365724 @default.
- W3199147424 hasConcept C41008148 @default.
- W3199147424 hasConcept C50644808 @default.
- W3199147424 hasConcept C61326573 @default.
- W3199147424 hasConcept C62520636 @default.
- W3199147424 hasConcept C7218915 @default.
- W3199147424 hasConcept C74193536 @default.
- W3199147424 hasConcept C86582703 @default.
- W3199147424 hasConceptScore W3199147424C11413529 @default.
- W3199147424 hasConceptScore W3199147424C118615104 @default.
- W3199147424 hasConceptScore W3199147424C121332964 @default.
- W3199147424 hasConceptScore W3199147424C147168706 @default.
- W3199147424 hasConceptScore W3199147424C154945302 @default.
- W3199147424 hasConceptScore W3199147424C163716315 @default.
- W3199147424 hasConceptScore W3199147424C173801870 @default.
- W3199147424 hasConceptScore W3199147424C33923547 @default.
- W3199147424 hasConceptScore W3199147424C38365724 @default.
- W3199147424 hasConceptScore W3199147424C41008148 @default.
- W3199147424 hasConceptScore W3199147424C50644808 @default.
- W3199147424 hasConceptScore W3199147424C61326573 @default.
- W3199147424 hasConceptScore W3199147424C62520636 @default.
- W3199147424 hasConceptScore W3199147424C7218915 @default.
- W3199147424 hasConceptScore W3199147424C74193536 @default.
- W3199147424 hasConceptScore W3199147424C86582703 @default.
- W3199147424 hasFunder F4320321001 @default.
- W3199147424 hasLocation W31991474241 @default.
- W3199147424 hasOpenAccess W3199147424 @default.
- W3199147424 hasPrimaryLocation W31991474241 @default.
- W3199147424 hasRelatedWork W11937450 @default.
- W3199147424 hasRelatedWork W12521158 @default.
- W3199147424 hasRelatedWork W3102522 @default.
- W3199147424 hasRelatedWork W4342771 @default.
- W3199147424 hasRelatedWork W5979161 @default.
- W3199147424 hasRelatedWork W6699078 @default.
- W3199147424 hasRelatedWork W6717794 @default.
- W3199147424 hasRelatedWork W8203384 @default.
- W3199147424 hasRelatedWork W9554121 @default.
- W3199147424 hasRelatedWork W14377099 @default.
- W3199147424 isParatext "false" @default.