Matches in SemOpenAlex for { <https://semopenalex.org/work/W2059395655> ?p ?o ?g. }
Showing items 1 to 76 of
76
with 100 items per page.
- W2059395655 abstract "During the last decade, a huge amount of OCRed historical texts has been made available on the Internet. For most of these documents meta data are missing that assign topic categories from library classification systems to texts. Data of this form would offer a much better access to these collections. We report on an experiment where we used a completely automated system for topic assignment, originally designed for modern texts, and apply it to OCRed texts from an 18th century German lexicon (Zedler). Lexicon pages/images used in the experiment lead to poor OCR quality and are full of historical spelling variants. In order to measure the influence of OCR errors and historical orthography on topic detection, we created ground truth versions and in addition ground truth versions with modernized orthography for all texts. We found that automated topic assignment leads to useful results for both the OCR output and the two ground truth versions. Difficulties arise from a changing world in a (e.g., technical) field as well as language changes beyond simple orthography." @default.
- W2059395655 created "2016-06-24" @default.
- W2059395655 creator A5001867012 @default.
- W2059395655 creator A5048047920 @default.
- W2059395655 creator A5082743420 @default.
- W2059395655 date "2014-05-19" @default.
- W2059395655 modified "2023-09-25" @default.
- W2059395655 title "Automated assignment of topics to OCRed historical texts" @default.
- W2059395655 cites W1880262756 @default.
- W2059395655 cites W1978841319 @default.
- W2059395655 cites W2041711198 @default.
- W2059395655 cites W2042980227 @default.
- W2059395655 doi "https://doi.org/10.1145/2595188.2595206" @default.
- W2059395655 hasPublicationYear "2014" @default.
- W2059395655 type Work @default.
- W2059395655 sameAs 2059395655 @default.
- W2059395655 citedByCount "0" @default.
- W2059395655 crossrefType "proceedings-article" @default.
- W2059395655 hasAuthorship W2059395655A5001867012 @default.
- W2059395655 hasAuthorship W2059395655A5048047920 @default.
- W2059395655 hasAuthorship W2059395655A5082743420 @default.
- W2059395655 hasConcept C110875604 @default.
- W2059395655 hasConcept C111472728 @default.
- W2059395655 hasConcept C136764020 @default.
- W2059395655 hasConcept C138885662 @default.
- W2059395655 hasConcept C146849305 @default.
- W2059395655 hasConcept C150670947 @default.
- W2059395655 hasConcept C154775046 @default.
- W2059395655 hasConcept C154945302 @default.
- W2059395655 hasConcept C202444582 @default.
- W2059395655 hasConcept C204321447 @default.
- W2059395655 hasConcept C23123220 @default.
- W2059395655 hasConcept C2777801307 @default.
- W2059395655 hasConcept C2778121359 @default.
- W2059395655 hasConcept C2780586882 @default.
- W2059395655 hasConcept C33923547 @default.
- W2059395655 hasConcept C41008148 @default.
- W2059395655 hasConcept C41895202 @default.
- W2059395655 hasConcept C554936623 @default.
- W2059395655 hasConcept C9652623 @default.
- W2059395655 hasConceptScore W2059395655C110875604 @default.
- W2059395655 hasConceptScore W2059395655C111472728 @default.
- W2059395655 hasConceptScore W2059395655C136764020 @default.
- W2059395655 hasConceptScore W2059395655C138885662 @default.
- W2059395655 hasConceptScore W2059395655C146849305 @default.
- W2059395655 hasConceptScore W2059395655C150670947 @default.
- W2059395655 hasConceptScore W2059395655C154775046 @default.
- W2059395655 hasConceptScore W2059395655C154945302 @default.
- W2059395655 hasConceptScore W2059395655C202444582 @default.
- W2059395655 hasConceptScore W2059395655C204321447 @default.
- W2059395655 hasConceptScore W2059395655C23123220 @default.
- W2059395655 hasConceptScore W2059395655C2777801307 @default.
- W2059395655 hasConceptScore W2059395655C2778121359 @default.
- W2059395655 hasConceptScore W2059395655C2780586882 @default.
- W2059395655 hasConceptScore W2059395655C33923547 @default.
- W2059395655 hasConceptScore W2059395655C41008148 @default.
- W2059395655 hasConceptScore W2059395655C41895202 @default.
- W2059395655 hasConceptScore W2059395655C554936623 @default.
- W2059395655 hasConceptScore W2059395655C9652623 @default.
- W2059395655 hasLocation W20593956551 @default.
- W2059395655 hasOpenAccess W2059395655 @default.
- W2059395655 hasPrimaryLocation W20593956551 @default.
- W2059395655 hasRelatedWork W1840154465 @default.
- W2059395655 hasRelatedWork W1986787436 @default.
- W2059395655 hasRelatedWork W1994041352 @default.
- W2059395655 hasRelatedWork W2065885317 @default.
- W2059395655 hasRelatedWork W2093100277 @default.
- W2059395655 hasRelatedWork W2166005393 @default.
- W2059395655 hasRelatedWork W2293456502 @default.
- W2059395655 hasRelatedWork W2403872937 @default.
- W2059395655 hasRelatedWork W3003948647 @default.
- W2059395655 hasRelatedWork W2890906110 @default.
- W2059395655 isParatext "false" @default.
- W2059395655 isRetracted "false" @default.
- W2059395655 magId "2059395655" @default.
- W2059395655 workType "article" @default.