Matches in SemOpenAlex for { <https://semopenalex.org/work/W4289422179> ?p ?o ?g. }
Showing items 1 to 62 of
62
with 100 items per page.
- W4289422179 abstract "Programs for extracting structured information from text, namely information extractors, often operate separately on document segments obtained from a generic splitting operation such as sentences, paragraphs, k-grams, HTTP requests, and so on. An automated detection of this behavior of extractors, which we refer to as split-correctness, would allow text analysis systems to devise query plans with parallel evaluation on segments for accelerating the processing of large documents. Other applications include the incremental evaluation on dynamic content, where re-evaluation of information extractors can be restricted to revised segments, and debugging, where developers of information extractors are informed about potential boundary crossing of different semantic components. We propose a new formal framework for split-correctness within the formalism of document spanners. Our analysis studies the complexity of split-correctness over regular spanners. We also discuss different variants of split-correctness, for instance, in the presence of black-box extractors with split constraints." @default.
- W4289422179 created "2022-08-02" @default.
- W4289422179 creator A5006706357 @default.
- W4289422179 creator A5009133355 @default.
- W4289422179 creator A5016412428 @default.
- W4289422179 creator A5022264843 @default.
- W4289422179 creator A5082548231 @default.
- W4289422179 date "2018-10-08" @default.
- W4289422179 modified "2023-09-30" @default.
- W4289422179 title "Split-Correctness in Information Extraction" @default.
- W4289422179 doi "https://doi.org/10.48550/arxiv.1810.03367" @default.
- W4289422179 hasPublicationYear "2018" @default.
- W4289422179 type Work @default.
- W4289422179 citedByCount "0" @default.
- W4289422179 crossrefType "posted-content" @default.
- W4289422179 hasAuthorship W4289422179A5006706357 @default.
- W4289422179 hasAuthorship W4289422179A5009133355 @default.
- W4289422179 hasAuthorship W4289422179A5016412428 @default.
- W4289422179 hasAuthorship W4289422179A5022264843 @default.
- W4289422179 hasAuthorship W4289422179A5082548231 @default.
- W4289422179 hasBestOaLocation W42894221791 @default.
- W4289422179 hasConcept C124101348 @default.
- W4289422179 hasConcept C142362112 @default.
- W4289422179 hasConcept C153349607 @default.
- W4289422179 hasConcept C168065819 @default.
- W4289422179 hasConcept C195807954 @default.
- W4289422179 hasConcept C199360897 @default.
- W4289422179 hasConcept C23123220 @default.
- W4289422179 hasConcept C41008148 @default.
- W4289422179 hasConcept C55439883 @default.
- W4289422179 hasConcept C558565934 @default.
- W4289422179 hasConcept C73301696 @default.
- W4289422179 hasConcept C80444323 @default.
- W4289422179 hasConceptScore W4289422179C124101348 @default.
- W4289422179 hasConceptScore W4289422179C142362112 @default.
- W4289422179 hasConceptScore W4289422179C153349607 @default.
- W4289422179 hasConceptScore W4289422179C168065819 @default.
- W4289422179 hasConceptScore W4289422179C195807954 @default.
- W4289422179 hasConceptScore W4289422179C199360897 @default.
- W4289422179 hasConceptScore W4289422179C23123220 @default.
- W4289422179 hasConceptScore W4289422179C41008148 @default.
- W4289422179 hasConceptScore W4289422179C55439883 @default.
- W4289422179 hasConceptScore W4289422179C558565934 @default.
- W4289422179 hasConceptScore W4289422179C73301696 @default.
- W4289422179 hasConceptScore W4289422179C80444323 @default.
- W4289422179 hasLocation W42894221791 @default.
- W4289422179 hasLocation W42894221792 @default.
- W4289422179 hasOpenAccess W4289422179 @default.
- W4289422179 hasPrimaryLocation W42894221791 @default.
- W4289422179 hasRelatedWork W106084318 @default.
- W4289422179 hasRelatedWork W1578778518 @default.
- W4289422179 hasRelatedWork W1579177548 @default.
- W4289422179 hasRelatedWork W1587224678 @default.
- W4289422179 hasRelatedWork W1601811574 @default.
- W4289422179 hasRelatedWork W1798975336 @default.
- W4289422179 hasRelatedWork W2008310423 @default.
- W4289422179 hasRelatedWork W2165124476 @default.
- W4289422179 hasRelatedWork W379834542 @default.
- W4289422179 hasRelatedWork W4234604123 @default.
- W4289422179 isParatext "false" @default.
- W4289422179 isRetracted "false" @default.
- W4289422179 workType "article" @default.