Matches in SemOpenAlex for { <https://semopenalex.org/work/W3014334865> ?p ?o ?g. }
- W3014334865 abstract "We propose to incorporate neural architecture search (NAS) into general-purpose multi-task learning (GP-MTL). Existing NAS methods typically define different search spaces according to different tasks. In order to adapt to different task combinations (i.e., task sets), we disentangle the GP-MTL networks into single-task backbones (which optionally encode task priors), and a hierarchical and layerwise feature sharing/fusing scheme across them. This enables us to design a novel and general task-agnostic search space, which inserts cross-task edges (i.e., feature fusion connections) into fixed single-task network backbones. Moreover, we also propose a novel single-shot gradient-based search algorithm that closes the performance gap between the searched architectures and the final evaluation architecture. This is realized with a minimum entropy regularization on the architecture weights during the search phase, which makes the architecture weights converge to near-discrete values and therefore achieves a single model. As a result, our searched model can be directly used for evaluation without (re-)training from scratch. We perform extensive experiments using different single-task backbones on various task sets, demonstrating the promising performance obtained by exploiting the hierarchical and layerwise features, as well as the desirable generalizability to different i) task sets and ii) single-task backbones. The code of our paper is available at https://github.com/bhpfelix/MTLNAS." @default.
- W3014334865 created "2020-04-10" @default.
- W3014334865 creator A5025733574 @default.
- W3014334865 creator A5040010053 @default.
- W3014334865 creator A5065964089 @default.
- W3014334865 creator A5071037763 @default.
- W3014334865 creator A5074618663 @default.
- W3014334865 creator A5075329194 @default.
- W3014334865 date "2020-03-31" @default.
- W3014334865 modified "2023-09-24" @default.
- W3014334865 title "MTL-NAS: Task-Agnostic Neural Architecture Search towards General-Purpose Multi-Task Learning" @default.
- W3014334865 cites W125693051 @default.
- W3014334865 cites W1522301498 @default.
- W3014334865 cites W1536680647 @default.
- W3014334865 cites W1905829557 @default.
- W3014334865 cites W1955369839 @default.
- W3014334865 cites W2067912884 @default.
- W3014334865 cites W2102605133 @default.
- W3014334865 cites W2113207845 @default.
- W3014334865 cites W2163605009 @default.
- W3014334865 cites W2194775991 @default.
- W3014334865 cites W2290180618 @default.
- W3014334865 cites W2407277018 @default.
- W3014334865 cites W2520951797 @default.
- W3014334865 cites W2548228487 @default.
- W3014334865 cites W2549401308 @default.
- W3014334865 cites W2572745118 @default.
- W3014334865 cites W2613718673 @default.
- W3014334865 cites W2618011341 @default.
- W3014334865 cites W2624871570 @default.
- W3014334865 cites W2765119169 @default.
- W3014334865 cites W2769653148 @default.
- W3014334865 cites W2782417188 @default.
- W3014334865 cites W2785366763 @default.
- W3014334865 cites W2798441115 @default.
- W3014334865 cites W2810075754 @default.
- W3014334865 cites W2885311373 @default.
- W3014334865 cites W2885820039 @default.
- W3014334865 cites W2921495890 @default.
- W3014334865 cites W2925303509 @default.
- W3014334865 cites W2932077855 @default.
- W3014334865 cites W2936599103 @default.
- W3014334865 cites W2945445935 @default.
- W3014334865 cites W2948818729 @default.
- W3014334865 cites W2955051405 @default.
- W3014334865 cites W2955425717 @default.
- W3014334865 cites W2960010704 @default.
- W3014334865 cites W2961666066 @default.
- W3014334865 cites W2962746461 @default.
- W3014334865 cites W2962786991 @default.
- W3014334865 cites W2962835968 @default.
- W3014334865 cites W2962847160 @default.
- W3014334865 cites W2962850006 @default.
- W3014334865 cites W2962864421 @default.
- W3014334865 cites W2962919941 @default.
- W3014334865 cites W2963136578 @default.
- W3014334865 cites W2963216850 @default.
- W3014334865 cites W2963374479 @default.
- W3014334865 cites W2963446712 @default.
- W3014334865 cites W2963498646 @default.
- W3014334865 cites W2963677766 @default.
- W3014334865 cites W2963821229 @default.
- W3014334865 cites W2963877604 @default.
- W3014334865 cites W2963918968 @default.
- W3014334865 cites W2964070329 @default.
- W3014334865 cites W2964081403 @default.
- W3014334865 cites W2964081807 @default.
- W3014334865 cites W2964185501 @default.
- W3014334865 cites W2964217527 @default.
- W3014334865 cites W2964247799 @default.
- W3014334865 cites W2964259004 @default.
- W3014334865 cites W2964444661 @default.
- W3014334865 cites W2965658867 @default.
- W3014334865 cites W2966182616 @default.
- W3014334865 cites W2966540493 @default.
- W3014334865 cites W2967733054 @default.
- W3014334865 cites W2971107871 @default.
- W3014334865 cites W2971302835 @default.
- W3014334865 cites W2980270353 @default.
- W3014334865 cites W2981748264 @default.
- W3014334865 cites W2985228975 @default.
- W3014334865 cites W3034906194 @default.
- W3014334865 cites W582055897 @default.
- W3014334865 doi "https://doi.org/10.48550/arxiv.2003.14058" @default.
- W3014334865 hasPublicationYear "2020" @default.
- W3014334865 type Work @default.
- W3014334865 sameAs 3014334865 @default.
- W3014334865 citedByCount "2" @default.
- W3014334865 countsByYear W30143348652020 @default.
- W3014334865 countsByYear W30143348652021 @default.
- W3014334865 crossrefType "posted-content" @default.
- W3014334865 hasAuthorship W3014334865A5025733574 @default.
- W3014334865 hasAuthorship W3014334865A5040010053 @default.
- W3014334865 hasAuthorship W3014334865A5065964089 @default.
- W3014334865 hasAuthorship W3014334865A5071037763 @default.
- W3014334865 hasAuthorship W3014334865A5074618663 @default.
- W3014334865 hasAuthorship W3014334865A5075329194 @default.
- W3014334865 hasBestOaLocation W30143348651 @default.
- W3014334865 hasConcept C104317684 @default.
- W3014334865 hasConcept C105795698 @default.