Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385570518> ?p ?o ?g. }
Showing items 1 to 71 of
71
with 100 items per page.
- W4385570518 abstract "To improve the ability of language models to handle Natural Language Processing(NLP) tasks and intermediate step of pre-training has recently beenintroduced. In this setup, one takes a pre-trained language model, trains it ona (set of) NLP dataset(s), and then finetunes it for a target task. It isknown that the selection of relevant transfer tasks is important, but recentlysome work has shown substantial performance gains by doing intermediatetraining on a very large set of datasets. Most previous work uses generativelanguage models or only focuses on one or a couple of tasks and uses acarefully curated setup. We compare intermediate training with one or manytasks in a setup where the choice of datasets is more arbitrary; we use allSemEval 2023 text-based tasks. We reach performance improvements for most taskswhen using intermediate training. Gains are higher when doing intermediatetraining on single tasks than all tasks if the right transfer taskis identified. Dataset smoothing and heterogeneous batching did not lead torobust gains in our setup." @default.
- W4385570518 created "2023-08-05" @default.
- W4385570518 creator A5039811894 @default.
- W4385570518 date "2023-01-01" @default.
- W4385570518 modified "2023-09-24" @default.
- W4385570518 title "MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets." @default.
- W4385570518 doi "https://doi.org/10.18653/v1/2023.semeval-1.32" @default.
- W4385570518 hasPublicationYear "2023" @default.
- W4385570518 type Work @default.
- W4385570518 citedByCount "0" @default.
- W4385570518 crossrefType "proceedings-article" @default.
- W4385570518 hasAuthorship W4385570518A5039811894 @default.
- W4385570518 hasBestOaLocation W43855705181 @default.
- W4385570518 hasConcept C119857082 @default.
- W4385570518 hasConcept C121332964 @default.
- W4385570518 hasConcept C137293760 @default.
- W4385570518 hasConcept C150899416 @default.
- W4385570518 hasConcept C153294291 @default.
- W4385570518 hasConcept C154945302 @default.
- W4385570518 hasConcept C162324750 @default.
- W4385570518 hasConcept C173608175 @default.
- W4385570518 hasConcept C177264268 @default.
- W4385570518 hasConcept C187736073 @default.
- W4385570518 hasConcept C199360897 @default.
- W4385570518 hasConcept C204321447 @default.
- W4385570518 hasConcept C2776175482 @default.
- W4385570518 hasConcept C2777211547 @default.
- W4385570518 hasConcept C2780451532 @default.
- W4385570518 hasConcept C31972630 @default.
- W4385570518 hasConcept C3770464 @default.
- W4385570518 hasConcept C41008148 @default.
- W4385570518 hasConcept C44572571 @default.
- W4385570518 hasConcept C51632099 @default.
- W4385570518 hasConcept C81917197 @default.
- W4385570518 hasConceptScore W4385570518C119857082 @default.
- W4385570518 hasConceptScore W4385570518C121332964 @default.
- W4385570518 hasConceptScore W4385570518C137293760 @default.
- W4385570518 hasConceptScore W4385570518C150899416 @default.
- W4385570518 hasConceptScore W4385570518C153294291 @default.
- W4385570518 hasConceptScore W4385570518C154945302 @default.
- W4385570518 hasConceptScore W4385570518C162324750 @default.
- W4385570518 hasConceptScore W4385570518C173608175 @default.
- W4385570518 hasConceptScore W4385570518C177264268 @default.
- W4385570518 hasConceptScore W4385570518C187736073 @default.
- W4385570518 hasConceptScore W4385570518C199360897 @default.
- W4385570518 hasConceptScore W4385570518C204321447 @default.
- W4385570518 hasConceptScore W4385570518C2776175482 @default.
- W4385570518 hasConceptScore W4385570518C2777211547 @default.
- W4385570518 hasConceptScore W4385570518C2780451532 @default.
- W4385570518 hasConceptScore W4385570518C31972630 @default.
- W4385570518 hasConceptScore W4385570518C3770464 @default.
- W4385570518 hasConceptScore W4385570518C41008148 @default.
- W4385570518 hasConceptScore W4385570518C44572571 @default.
- W4385570518 hasConceptScore W4385570518C51632099 @default.
- W4385570518 hasConceptScore W4385570518C81917197 @default.
- W4385570518 hasLocation W43855705181 @default.
- W4385570518 hasOpenAccess W4385570518 @default.
- W4385570518 hasPrimaryLocation W43855705181 @default.
- W4385570518 hasRelatedWork W2128514324 @default.
- W4385570518 hasRelatedWork W2608096034 @default.
- W4385570518 hasRelatedWork W2963277000 @default.
- W4385570518 hasRelatedWork W2964177319 @default.
- W4385570518 hasRelatedWork W3082447286 @default.
- W4385570518 hasRelatedWork W3114100246 @default.
- W4385570518 hasRelatedWork W3116646283 @default.
- W4385570518 hasRelatedWork W3186948874 @default.
- W4385570518 hasRelatedWork W4301299980 @default.
- W4385570518 hasRelatedWork W4308262314 @default.
- W4385570518 isParatext "false" @default.
- W4385570518 isRetracted "false" @default.
- W4385570518 workType "article" @default.