Matches in SemOpenAlex for { <https://semopenalex.org/work/W2014468488> ?p ?o ?g. }
- W2014468488 abstract "Recent library digitization projects attempt to provide large collections of printed material from varying sources in a searchable format. The scanned documents are typically processed using Optical Character Recognition (OCR), which often introduces errors in the text. This paper proposes a technique for correction of OCR-degraded text that is independent of character-level OCR errors, and hence independent of scanned document source. It is based on language modeling in conjunction with a uniform character model that uses edit distance only. The technique compares well to state-of-the-art correction techniques that are based on language modeling and source-specific character error models. Although the proposed technique yielded lower correction effectiveness, its impact on retrieval effectiveness is statistically significant and on par with state-of-the-art correction techniques. The main requirement of the proposed technique is the training of a “good” language model matching genre, style, and temporal coverage. The advantage of being independent of character-level errors is clear in applications where printed documents vary in source, font, and degradation level." @default.
- W2014468488 created "2016-06-24" @default.
- W2014468488 creator A5006645360 @default.
- W2014468488 creator A5070783596 @default.
- W2014468488 date "2010-11-01" @default.
- W2014468488 modified "2023-09-23" @default.
- W2014468488 title "Omni font OCR error correction with effect on retrieval" @default.
- W2014468488 cites W1501437810 @default.
- W2014468488 cites W1510248492 @default.
- W2014468488 cites W1593368611 @default.
- W2014468488 cites W1631260214 @default.
- W2014468488 cites W1650656906 @default.
- W2014468488 cites W182831726 @default.
- W2014468488 cites W1985708392 @default.
- W2014468488 cites W2006477783 @default.
- W2014468488 cites W2011586660 @default.
- W2014468488 cites W2012191724 @default.
- W2014468488 cites W2015338694 @default.
- W2014468488 cites W2018616927 @default.
- W2014468488 cites W2033937535 @default.
- W2014468488 cites W2038218748 @default.
- W2014468488 cites W2056250865 @default.
- W2014468488 cites W2057900969 @default.
- W2014468488 cites W2070661748 @default.
- W2014468488 cites W2096550092 @default.
- W2014468488 cites W2103430523 @default.
- W2014468488 cites W2109556717 @default.
- W2014468488 cites W2126815469 @default.
- W2014468488 cites W2128254978 @default.
- W2014468488 cites W2135809805 @default.
- W2014468488 cites W2137970221 @default.
- W2014468488 cites W2155903443 @default.
- W2014468488 cites W2166968190 @default.
- W2014468488 cites W2436413986 @default.
- W2014468488 doi "https://doi.org/10.1109/isda.2010.5687228" @default.
- W2014468488 hasPublicationYear "2010" @default.
- W2014468488 type Work @default.
- W2014468488 sameAs 2014468488 @default.
- W2014468488 citedByCount "3" @default.
- W2014468488 countsByYear W20144684882014 @default.
- W2014468488 countsByYear W20144684882016 @default.
- W2014468488 crossrefType "proceedings-article" @default.
- W2014468488 hasAuthorship W2014468488A5006645360 @default.
- W2014468488 hasAuthorship W2014468488A5070783596 @default.
- W2014468488 hasConcept C103088060 @default.
- W2014468488 hasConcept C105795698 @default.
- W2014468488 hasConcept C11413529 @default.
- W2014468488 hasConcept C115961682 @default.
- W2014468488 hasConcept C137293760 @default.
- W2014468488 hasConcept C153180895 @default.
- W2014468488 hasConcept C154945302 @default.
- W2014468488 hasConcept C165064840 @default.
- W2014468488 hasConcept C204321447 @default.
- W2014468488 hasConcept C23123220 @default.
- W2014468488 hasConcept C2524010 @default.
- W2014468488 hasConcept C2777737414 @default.
- W2014468488 hasConcept C2779308522 @default.
- W2014468488 hasConcept C2780861071 @default.
- W2014468488 hasConcept C28490314 @default.
- W2014468488 hasConcept C31972630 @default.
- W2014468488 hasConcept C33923547 @default.
- W2014468488 hasConcept C41008148 @default.
- W2014468488 hasConcept C546480517 @default.
- W2014468488 hasConceptScore W2014468488C103088060 @default.
- W2014468488 hasConceptScore W2014468488C105795698 @default.
- W2014468488 hasConceptScore W2014468488C11413529 @default.
- W2014468488 hasConceptScore W2014468488C115961682 @default.
- W2014468488 hasConceptScore W2014468488C137293760 @default.
- W2014468488 hasConceptScore W2014468488C153180895 @default.
- W2014468488 hasConceptScore W2014468488C154945302 @default.
- W2014468488 hasConceptScore W2014468488C165064840 @default.
- W2014468488 hasConceptScore W2014468488C204321447 @default.
- W2014468488 hasConceptScore W2014468488C23123220 @default.
- W2014468488 hasConceptScore W2014468488C2524010 @default.
- W2014468488 hasConceptScore W2014468488C2777737414 @default.
- W2014468488 hasConceptScore W2014468488C2779308522 @default.
- W2014468488 hasConceptScore W2014468488C2780861071 @default.
- W2014468488 hasConceptScore W2014468488C28490314 @default.
- W2014468488 hasConceptScore W2014468488C31972630 @default.
- W2014468488 hasConceptScore W2014468488C33923547 @default.
- W2014468488 hasConceptScore W2014468488C41008148 @default.
- W2014468488 hasConceptScore W2014468488C546480517 @default.
- W2014468488 hasLocation W20144684881 @default.
- W2014468488 hasOpenAccess W2014468488 @default.
- W2014468488 hasPrimaryLocation W20144684881 @default.
- W2014468488 hasRelatedWork W1480066716 @default.
- W2014468488 hasRelatedWork W1510248492 @default.
- W2014468488 hasRelatedWork W176867939 @default.
- W2014468488 hasRelatedWork W1968290777 @default.
- W2014468488 hasRelatedWork W1990871427 @default.
- W2014468488 hasRelatedWork W2038218748 @default.
- W2014468488 hasRelatedWork W2044369666 @default.
- W2014468488 hasRelatedWork W2085763602 @default.
- W2014468488 hasRelatedWork W2086625764 @default.
- W2014468488 hasRelatedWork W2096550858 @default.
- W2014468488 hasRelatedWork W2134574126 @default.
- W2014468488 hasRelatedWork W2141960239 @default.
- W2014468488 hasRelatedWork W240009282 @default.
- W2014468488 hasRelatedWork W2795751515 @default.
- W2014468488 hasRelatedWork W2919697638 @default.
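
The abstract above describes correction that pairs a language model with a uniform character model based on edit distance only. A minimal sketch of that general idea follows; it is not the authors' implementation. It assumes a unigram word-frequency model standing in for the language model, and the names `edit_distance`, `correct`, `max_dist`, and `penalty` are hypothetical, as is the penalty value.

```python
import math
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: every insert, delete, or substitute costs 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def correct(word: str, counts: Counter, max_dist: int = 2,
            penalty: float = 4.0) -> str:
    """Pick the vocabulary word maximizing log P(candidate) - penalty * distance.

    The character model is uniform: all edits cost the same regardless of
    which characters are involved, so nothing depends on the OCR source.
    `penalty` is an assumed weight, not a value taken from the paper.
    """
    total = sum(counts.values())
    best, best_score = word, -math.inf
    for cand, count in counts.items():
        d = edit_distance(word, cand)
        if d > max_dist:
            continue  # too far away to be a plausible correction
        score = math.log(count / total) - penalty * d
        if score > best_score:
            best, best_score = cand, score
    return best  # unchanged input if no candidate is close enough


# Toy vocabulary with made-up counts, for illustration only.
vocab = Counter({"retrieval": 50, "character": 40, "recognition": 30})
corrected = correct("retrieva1", vocab)
```

Because every edit is charged the same cost, the sketch needs no per-font or per-scanner confusion table; the quality of the result rests on how well the word counts match the target text, which mirrors the abstract's remark about needing a well-matched language model.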