Matches in SemOpenAlex for { <https://semopenalex.org/work/W3139080614> ?p ?o ?g. }
- W3139080614 abstract "While GPTs with traditional fine-tuning fail to achieve strong results on natural language understanding (NLU), we show that GPTs can be better than or comparable to similar-sized BERTs on NLU tasks with a novel method P-tuning -- which employs trainable continuous prompt embeddings. On the knowledge probing (LAMA) benchmark, the best GPT recovers 64% (P@1) of world knowledge without any additional text provided during test time, which substantially improves the previous best by 20+ percentage points. On the SuperGlue benchmark, GPTs achieve comparable and sometimes better performance to similar-sized BERTs in supervised learning. Importantly, we find that P-tuning also improves BERTs' performance in both few-shot and supervised settings while largely reducing the need for prompt engineering. Consequently, P-tuning outperforms the state-of-the-art approaches on the few-shot SuperGlue benchmark." @default.
- W3139080614 created "2021-03-29" @default.
- W3139080614 creator A5003621726 @default.
- W3139080614 creator A5016643106 @default.
- W3139080614 creator A5018336656 @default.
- W3139080614 creator A5021830722 @default.
- W3139080614 creator A5037034281 @default.
- W3139080614 creator A5044791875 @default.
- W3139080614 creator A5082893971 @default.
- W3139080614 date "2021-03-18" @default.
- W3139080614 modified "2023-09-23" @default.
- W3139080614 title "GPT Understands, Too" @default.
- W3139080614 cites W1532854728 @default.
- W3139080614 cites W1599016936 @default.
- W3139080614 cites W2145755360 @default.
- W3139080614 cites W2804897457 @default.
- W3139080614 cites W2898662126 @default.
- W3139080614 cites W2911109671 @default.
- W3139080614 cites W2945260553 @default.
- W3139080614 cites W2946659172 @default.
- W3139080614 cites W2949202705 @default.
- W3139080614 cites W2950613642 @default.
- W3139080614 cites W2953337107 @default.
- W3139080614 cites W2963341956 @default.
- W3139080614 cites W2964098911 @default.
- W3139080614 cites W2965373594 @default.
- W3139080614 cites W2970161131 @default.
- W3139080614 cites W2970597249 @default.
- W3139080614 cites W2971600926 @default.
- W3139080614 cites W2971869958 @default.
- W3139080614 cites W2981852735 @default.
- W3139080614 cites W2982399380 @default.
- W3139080614 cites W2990704537 @default.
- W3139080614 cites W3021533447 @default.
- W3139080614 cites W3026404337 @default.
- W3139080614 cites W3030163527 @default.
- W3139080614 cites W3044438666 @default.
- W3139080614 cites W3085177480 @default.
- W3139080614 cites W3093871960 @default.
- W3139080614 cites W3096331697 @default.
- W3139080614 cites W3102146042 @default.
- W3139080614 cites W3104163040 @default.
- W3139080614 cites W3119438769 @default.
- W3139080614 cites W3122241445 @default.
- W3139080614 cites W3126960149 @default.
- W3139080614 cites W3129576130 @default.
- W3139080614 cites W3130319171 @default.
- W3139080614 cites W3132736064 @default.
- W3139080614 cites W3141023492 @default.
- W3139080614 cites W2525127255 @default.
- W3139080614 hasPublicationYear "2021" @default.
- W3139080614 type Work @default.
- W3139080614 sameAs 3139080614 @default.
- W3139080614 citedByCount "72" @default.
- W3139080614 countsByYear W31390806142020 @default.
- W3139080614 countsByYear W31390806142021 @default.
- W3139080614 countsByYear W31390806142022 @default.
- W3139080614 crossrefType "posted-content" @default.
- W3139080614 hasAuthorship W3139080614A5003621726 @default.
- W3139080614 hasAuthorship W3139080614A5016643106 @default.
- W3139080614 hasAuthorship W3139080614A5018336656 @default.
- W3139080614 hasAuthorship W3139080614A5021830722 @default.
- W3139080614 hasAuthorship W3139080614A5037034281 @default.
- W3139080614 hasAuthorship W3139080614A5044791875 @default.
- W3139080614 hasAuthorship W3139080614A5082893971 @default.
- W3139080614 hasConcept C119857082 @default.
- W3139080614 hasConcept C127413603 @default.
- W3139080614 hasConcept C154945302 @default.
- W3139080614 hasConcept C185798385 @default.
- W3139080614 hasConcept C191897082 @default.
- W3139080614 hasConcept C192562407 @default.
- W3139080614 hasConcept C205649164 @default.
- W3139080614 hasConcept C2778344882 @default.
- W3139080614 hasConcept C2992734406 @default.
- W3139080614 hasConcept C41008148 @default.
- W3139080614 hasConcept C58640448 @default.
- W3139080614 hasConcept C78519656 @default.
- W3139080614 hasConceptScore W3139080614C119857082 @default.
- W3139080614 hasConceptScore W3139080614C127413603 @default.
- W3139080614 hasConceptScore W3139080614C154945302 @default.
- W3139080614 hasConceptScore W3139080614C185798385 @default.
- W3139080614 hasConceptScore W3139080614C191897082 @default.
- W3139080614 hasConceptScore W3139080614C192562407 @default.
- W3139080614 hasConceptScore W3139080614C205649164 @default.
- W3139080614 hasConceptScore W3139080614C2778344882 @default.
- W3139080614 hasConceptScore W3139080614C2992734406 @default.
- W3139080614 hasConceptScore W3139080614C41008148 @default.
- W3139080614 hasConceptScore W3139080614C58640448 @default.
- W3139080614 hasConceptScore W3139080614C78519656 @default.
- W3139080614 hasLocation W31390806141 @default.
- W3139080614 hasOpenAccess W3139080614 @default.
- W3139080614 hasPrimaryLocation W31390806141 @default.
- W3139080614 hasRelatedWork W2963310665 @default.
- W3139080614 hasRelatedWork W2963341956 @default.
- W3139080614 hasRelatedWork W2963403868 @default.
- W3139080614 hasRelatedWork W2964121744 @default.
- W3139080614 hasRelatedWork W2964303773 @default.
- W3139080614 hasRelatedWork W2965373594 @default.
- W3139080614 hasRelatedWork W2970476646 @default.
- W3139080614 hasRelatedWork W2982399380 @default.