Matches in SemOpenAlex for { <https://semopenalex.org/work/W3041171784> ?p ?o ?g. }
Showing items 1 to 90 of
90
with 100 items per page.
- W3041171784 abstract "Accurate parsing of citation reference strings is crucial to automatically construct scholarly databases such as Google Scholar or Semantic Scholar. Citation field extraction (CFE) is precisely this task---given a reference label which tokens refer to the authors, venue, title, editor, journal, pages, etc. Most methods for CFE are supervised and rely on training from labeled datasets that are quite small compared to the great variety of reference formats. BibTeX, the widely used reference management tool, provides a natural method to automatically generate and label training data for CFE. In this paper, we describe a technique for using BibTeX to generate, automatically, a large-scale 41M labeled strings), labeled dataset, that is four orders of magnitude larger than the current largest CFE dataset, namely the UMass Citation Field Extraction dataset [Anzaroot and McCallum, 2013]. We experimentally demonstrate how our dataset can be used to improve the performance of the UMass CFE using a RoBERTa-based [Liu et al., 2019] model. In comparison to previous SoTA, we achieve a 24.48% relative error reduction, achieving span level F1-scores of 96.3%." @default.
- W3041171784 created "2020-07-16" @default.
- W3041171784 creator A5008354502 @default.
- W3041171784 creator A5022749617 @default.
- W3041171784 creator A5029504586 @default.
- W3041171784 creator A5041743374 @default.
- W3041171784 creator A5076343254 @default.
- W3041171784 date "2020-06-09" @default.
- W3041171784 modified "2023-09-27" @default.
- W3041171784 title "Using BibTeX to Automatically Generate Labeled Data for Citation Field Extraction" @default.
- W3041171784 cites W1481355931 @default.
- W3041171784 cites W2147880316 @default.
- W3041171784 cites W2250539671 @default.
- W3041171784 cites W2787560479 @default.
- W3041171784 cites W2795835984 @default.
- W3041171784 cites W2891499495 @default.
- W3041171784 cites W2951099506 @default.
- W3041171784 cites W2963341956 @default.
- W3041171784 cites W2963552443 @default.
- W3041171784 cites W2964121744 @default.
- W3041171784 cites W2965373594 @default.
- W3041171784 hasPublicationYear "2020" @default.
- W3041171784 type Work @default.
- W3041171784 sameAs 3041171784 @default.
- W3041171784 citedByCount "0" @default.
- W3041171784 crossrefType "posted-content" @default.
- W3041171784 hasAuthorship W3041171784A5008354502 @default.
- W3041171784 hasAuthorship W3041171784A5022749617 @default.
- W3041171784 hasAuthorship W3041171784A5029504586 @default.
- W3041171784 hasAuthorship W3041171784A5041743374 @default.
- W3041171784 hasAuthorship W3041171784A5076343254 @default.
- W3041171784 hasConcept C124101348 @default.
- W3041171784 hasConcept C136764020 @default.
- W3041171784 hasConcept C154945302 @default.
- W3041171784 hasConcept C162324750 @default.
- W3041171784 hasConcept C186644900 @default.
- W3041171784 hasConcept C187736073 @default.
- W3041171784 hasConcept C199360897 @default.
- W3041171784 hasConcept C202444582 @default.
- W3041171784 hasConcept C204321447 @default.
- W3041171784 hasConcept C23123220 @default.
- W3041171784 hasConcept C2778805511 @default.
- W3041171784 hasConcept C2780451532 @default.
- W3041171784 hasConcept C2780801425 @default.
- W3041171784 hasConcept C33923547 @default.
- W3041171784 hasConcept C41008148 @default.
- W3041171784 hasConcept C9652623 @default.
- W3041171784 hasConceptScore W3041171784C124101348 @default.
- W3041171784 hasConceptScore W3041171784C136764020 @default.
- W3041171784 hasConceptScore W3041171784C154945302 @default.
- W3041171784 hasConceptScore W3041171784C162324750 @default.
- W3041171784 hasConceptScore W3041171784C186644900 @default.
- W3041171784 hasConceptScore W3041171784C187736073 @default.
- W3041171784 hasConceptScore W3041171784C199360897 @default.
- W3041171784 hasConceptScore W3041171784C202444582 @default.
- W3041171784 hasConceptScore W3041171784C204321447 @default.
- W3041171784 hasConceptScore W3041171784C23123220 @default.
- W3041171784 hasConceptScore W3041171784C2778805511 @default.
- W3041171784 hasConceptScore W3041171784C2780451532 @default.
- W3041171784 hasConceptScore W3041171784C2780801425 @default.
- W3041171784 hasConceptScore W3041171784C33923547 @default.
- W3041171784 hasConceptScore W3041171784C41008148 @default.
- W3041171784 hasConceptScore W3041171784C9652623 @default.
- W3041171784 hasLocation W30411717841 @default.
- W3041171784 hasOpenAccess W3041171784 @default.
- W3041171784 hasPrimaryLocation W30411717841 @default.
- W3041171784 hasRelatedWork W1537919408 @default.
- W3041171784 hasRelatedWork W1968281294 @default.
- W3041171784 hasRelatedWork W1972495172 @default.
- W3041171784 hasRelatedWork W2028821798 @default.
- W3041171784 hasRelatedWork W2384754881 @default.
- W3041171784 hasRelatedWork W2607557087 @default.
- W3041171784 hasRelatedWork W2613027510 @default.
- W3041171784 hasRelatedWork W2756118627 @default.
- W3041171784 hasRelatedWork W2786162033 @default.
- W3041171784 hasRelatedWork W2807932277 @default.
- W3041171784 hasRelatedWork W2913431579 @default.
- W3041171784 hasRelatedWork W2919502278 @default.
- W3041171784 hasRelatedWork W2958977883 @default.
- W3041171784 hasRelatedWork W2981611750 @default.
- W3041171784 hasRelatedWork W3033787882 @default.
- W3041171784 hasRelatedWork W3138414763 @default.
- W3041171784 hasRelatedWork W3160390043 @default.
- W3041171784 hasRelatedWork W3189050137 @default.
- W3041171784 hasRelatedWork W3196815667 @default.
- W3041171784 hasRelatedWork W2559815012 @default.
- W3041171784 isParatext "false" @default.
- W3041171784 isRetracted "false" @default.
- W3041171784 magId "3041171784" @default.
- W3041171784 workType "article" @default.