Matches in SemOpenAlex for { <https://semopenalex.org/work/W4221165780> ?p ?o ?g. }
Showing items 1 to 85 of 85 with 100 items per page.
- W4221165780 endingPage "471" @default.
- W4221165780 startingPage "457" @default.
- W4221165780 abstract "Keyphrase extraction from a given document is the task of automatically extracting salient phrases that best describe the document. This paper proposes a novel unsupervised graph-based ranking method to extract high-quality phrases from a given document. We obtain the contextualized embeddings from pre-trained language models enriched with topic vectors from Latent Dirichlet Allocation (LDA) to represent the candidate phrases and the document. We introduce a scoring mechanism for the phrases using the information obtained from contextualized embeddings and the topic vectors. The salient phrases are extracted using a ranking algorithm on an undirected graph constructed for the given document. In the undirected graph, the nodes represent the phrases, and the edges between the phrases represent the semantic relatedness between them, weighted by a score obtained from the scoring mechanism. To demonstrate the efficacy of our proposed method, we perform several experiments on open source datasets in the science domain and observe that our novel method outperforms existing unsupervised embedding based keyphrase extraction methods. For instance, on the SemEval2017 dataset, our method advances the F1 score from 0.2195 (EmbedRank) to 0.2819 at the top 10 extracted keyphrases. Several variants of the proposed algorithm are investigated to determine their effect on the quality of keyphrases. We further demonstrate the ability of our proposed method to collect additional high-quality keyphrases that are not present in the document from external knowledge bases like Wikipedia for enriching the document with newly discovered keyphrases. We evaluate this step on a collection of annotated documents. The F1-score at the top 10 expanded keyphrases is 0.60, indicating that our algorithm can also be used for ‘concept’ expansion using external knowledge." @default.
- W4221165780 created "2022-04-03" @default.
- W4221165780 creator A5035621906 @default.
- W4221165780 creator A5047987914 @default.
- W4221165780 creator A5085585789 @default.
- W4221165780 date "2022-01-01" @default.
- W4221165780 modified "2023-09-30" @default.
- W4221165780 title "Topic Aware Contextualized Embeddings for High Quality Phrase Extraction" @default.
- W4221165780 cites W1975432235 @default.
- W4221165780 cites W1982242209 @default.
- W4221165780 cites W2045181608 @default.
- W4221165780 cites W2064418625 @default.
- W4221165780 cites W2167329753 @default.
- W4221165780 cites W2250539671 @default.
- W4221165780 cites W2493916176 @default.
- W4221165780 cites W2605035112 @default.
- W4221165780 cites W2740811004 @default.
- W4221165780 cites W2742094278 @default.
- W4221165780 cites W2890179025 @default.
- W4221165780 cites W2891177506 @default.
- W4221165780 cites W2962903510 @default.
- W4221165780 cites W2963245897 @default.
- W4221165780 cites W2970641574 @default.
- W4221165780 cites W2974528752 @default.
- W4221165780 doi "https://doi.org/10.1007/978-3-030-99736-6_31" @default.
- W4221165780 hasPublicationYear "2022" @default.
- W4221165780 type Work @default.
- W4221165780 citedByCount "1" @default.
- W4221165780 countsByYear W42211657802022 @default.
- W4221165780 crossrefType "book-chapter" @default.
- W4221165780 hasAuthorship W4221165780A5035621906 @default.
- W4221165780 hasAuthorship W4221165780A5047987914 @default.
- W4221165780 hasAuthorship W4221165780A5085585789 @default.
- W4221165780 hasBestOaLocation W42211657802 @default.
- W4221165780 hasConcept C132525143 @default.
- W4221165780 hasConcept C154945302 @default.
- W4221165780 hasConcept C162324750 @default.
- W4221165780 hasConcept C171686336 @default.
- W4221165780 hasConcept C187736073 @default.
- W4221165780 hasConcept C189430467 @default.
- W4221165780 hasConcept C204321447 @default.
- W4221165780 hasConcept C23123220 @default.
- W4221165780 hasConcept C2776224158 @default.
- W4221165780 hasConcept C2780288562 @default.
- W4221165780 hasConcept C2780451532 @default.
- W4221165780 hasConcept C2780719617 @default.
- W4221165780 hasConcept C41008148 @default.
- W4221165780 hasConcept C41608201 @default.
- W4221165780 hasConcept C500882744 @default.
- W4221165780 hasConcept C80444323 @default.
- W4221165780 hasConceptScore W4221165780C132525143 @default.
- W4221165780 hasConceptScore W4221165780C154945302 @default.
- W4221165780 hasConceptScore W4221165780C162324750 @default.
- W4221165780 hasConceptScore W4221165780C171686336 @default.
- W4221165780 hasConceptScore W4221165780C187736073 @default.
- W4221165780 hasConceptScore W4221165780C189430467 @default.
- W4221165780 hasConceptScore W4221165780C204321447 @default.
- W4221165780 hasConceptScore W4221165780C23123220 @default.
- W4221165780 hasConceptScore W4221165780C2776224158 @default.
- W4221165780 hasConceptScore W4221165780C2780288562 @default.
- W4221165780 hasConceptScore W4221165780C2780451532 @default.
- W4221165780 hasConceptScore W4221165780C2780719617 @default.
- W4221165780 hasConceptScore W4221165780C41008148 @default.
- W4221165780 hasConceptScore W4221165780C41608201 @default.
- W4221165780 hasConceptScore W4221165780C500882744 @default.
- W4221165780 hasConceptScore W4221165780C80444323 @default.
- W4221165780 hasLocation W42211657801 @default.
- W4221165780 hasLocation W42211657802 @default.
- W4221165780 hasOpenAccess W4221165780 @default.
- W4221165780 hasPrimaryLocation W42211657801 @default.
- W4221165780 hasRelatedWork W2086064646 @default.
- W4221165780 hasRelatedWork W2182671698 @default.
- W4221165780 hasRelatedWork W219090214 @default.
- W4221165780 hasRelatedWork W2291407110 @default.
- W4221165780 hasRelatedWork W2368542989 @default.
- W4221165780 hasRelatedWork W2369308426 @default.
- W4221165780 hasRelatedWork W2400490375 @default.
- W4221165780 hasRelatedWork W2947781947 @default.
- W4221165780 hasRelatedWork W36931920 @default.
- W4221165780 hasRelatedWork W4250382823 @default.
- W4221165780 isParatext "false" @default.
- W4221165780 isRetracted "false" @default.
- W4221165780 workType "book-chapter" @default.
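
The abstract above describes ranking candidate phrases on an undirected graph whose edges are weighted by semantic relatedness. The following is a minimal sketch of that general idea only, not the authors' method. It assumes cosine similarity as the edge weight and a plain weighted PageRank as the ranking algorithm, and the toy vectors stand in for the topic-enriched contextualized embeddings. The names `cosine`, `rank_phrases`, and `toy_vectors` are hypothetical and do not come from the paper or from SemOpenAlex.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_phrases(vectors, damping=0.85, iterations=50):
    """Rank phrases with weighted PageRank on an undirected similarity graph.

    vectors maps each candidate phrase to its embedding (assumed precomputed).
    Returns (phrases sorted by descending score, score dict).
    """
    phrases = list(vectors)
    # Build symmetric edge weights; only positive similarities become edges.
    weights = {p: {} for p in phrases}
    for i, p in enumerate(phrases):
        for q in phrases[i + 1:]:
            w = cosine(vectors[p], vectors[q])
            if w > 0:
                weights[p][q] = w
                weights[q][p] = w

    n = len(phrases)
    scores = {p: 1.0 / n for p in phrases}
    # Total edge weight per node, used to normalise outgoing contributions.
    strength = {p: sum(weights[p].values()) for p in phrases}

    for _ in range(iterations):
        updated = {}
        for p in phrases:
            incoming = sum(scores[q] * w / strength[q] for q, w in weights[p].items())
            updated[p] = (1 - damping) / n + damping * incoming
        scores = updated

    return sorted(phrases, key=scores.get, reverse=True), scores


# Toy embeddings, invented for illustration only.
toy_vectors = {
    "graph ranking": [1.0, 0.1, 0.0],
    "keyphrase extraction": [0.9, 0.3, 0.1],
    "topic vectors": [0.7, 0.6, 0.2],
    "coffee break": [0.0, 0.1, 1.0],
}

ranking, scores = rank_phrases(toy_vectors)
for phrase in ranking:
    print(f"{phrase}: {scores[phrase]:.4f}")
```

In this sketch a phrase that is only weakly related to the others receives little score from its neighbours and falls to the bottom of the ranking. An isolated phrase with no edges would keep only the baseline `(1 - damping) / n` share; a fuller implementation would redistribute that lost mass.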