Matches in SemOpenAlex for { <https://semopenalex.org/work/W1594578975> ?p ?o ?g. }
- W1594578975 abstract "Since the dawn of the computing era, information has been represented digitally so that it can be processed by electronic computers. Paper books and documents were abundant and widely being published at that time; and hence, there was a need to convert them into digital format. OCR, short for Optical Character Recognition was conceived to translate paper-based books into digital e-books. Regrettably, OCR systems are still erroneous and inaccurate as they produce misspellings in the recognized text, especially when the source document is of low printing quality. This paper proposes a post-processing OCR context-sensitive error correction method for detecting and correcting non-word and real-word OCR errors. The cornerstone of this proposed approach is the use of Google Web 1T 5-gram data set as a dictionary of words to spell-check OCR text. The Google data set incorporates a very large vocabulary and word statistics entirely reaped from the Internet, making it a reliable source to perform dictionary-based error correction. The core of the proposed solution is a combination of three algorithms: The error detection, candidate spellings generator, and error correction algorithms, which all exploit information extracted from Google Web 1T 5-gram data set. Experiments conducted on scanned images written in different languages showed a substantial improvement in the OCR error correction rate. As future developments, the proposed algorithm is to be parallelised so as to support parallel and distributed computing architectures." @default.
- W1594578975 created "2016-06-24" @default.
- W1594578975 creator A5017217522 @default.
- W1594578975 creator A5035889457 @default.
- W1594578975 date "2012-04-01" @default.
- W1594578975 modified "2023-09-27" @default.
- W1594578975 title "OCR Context-Sensitive Error Correction Based on Google Web 1T 5-Gram Data Set" @default.
- W1594578975 cites W149212122 @default.
- W1594578975 cites W1647671624 @default.
- W1594578975 cites W1971138419 @default.
- W1594578975 cites W2010595692 @default.
- W1594578975 cites W2011117553 @default.
- W1594578975 cites W2017787659 @default.
- W1594578975 cites W2021795960 @default.
- W1594578975 cites W2024808224 @default.
- W1594578975 cites W2029189646 @default.
- W1594578975 cites W2040062114 @default.
- W1594578975 cites W2040304231 @default.
- W1594578975 cites W2054885500 @default.
- W1594578975 cites W2081366726 @default.
- W1594578975 cites W2082558327 @default.
- W1594578975 cites W2085763602 @default.
- W1594578975 cites W2093064489 @default.
- W1594578975 cites W2102834748 @default.
- W1594578975 cites W2106071847 @default.
- W1594578975 cites W2137421497 @default.
- W1594578975 cites W2144872023 @default.
- W1594578975 cites W2611456052 @default.
- W1594578975 cites W373434254 @default.
- W1594578975 hasPublicationYear "2012" @default.
- W1594578975 type Work @default.
- W1594578975 sameAs 1594578975 @default.
- W1594578975 citedByCount "10" @default.
- W1594578975 countsByYear W15945789752013 @default.
- W1594578975 countsByYear W15945789752016 @default.
- W1594578975 countsByYear W15945789752017 @default.
- W1594578975 countsByYear W15945789752019 @default.
- W1594578975 countsByYear W15945789752020 @default.
- W1594578975 crossrefType "posted-content" @default.
- W1594578975 hasAuthorship W1594578975A5017217522 @default.
- W1594578975 hasAuthorship W1594578975A5035889457 @default.
- W1594578975 hasConcept C103088060 @default.
- W1594578975 hasConcept C11413529 @default.
- W1594578975 hasConcept C115961682 @default.
- W1594578975 hasConcept C117884012 @default.
- W1594578975 hasConcept C137293760 @default.
- W1594578975 hasConcept C138885662 @default.
- W1594578975 hasConcept C151730666 @default.
- W1594578975 hasConcept C154945302 @default.
- W1594578975 hasConcept C177264268 @default.
- W1594578975 hasConcept C199360897 @default.
- W1594578975 hasConcept C204321447 @default.
- W1594578975 hasConcept C23123220 @default.
- W1594578975 hasConcept C2777601683 @default.
- W1594578975 hasConcept C2779343474 @default.
- W1594578975 hasConcept C2983335612 @default.
- W1594578975 hasConcept C41008148 @default.
- W1594578975 hasConcept C41895202 @default.
- W1594578975 hasConcept C546480517 @default.
- W1594578975 hasConcept C86803240 @default.
- W1594578975 hasConcept C90805587 @default.
- W1594578975 hasConceptScore W1594578975C103088060 @default.
- W1594578975 hasConceptScore W1594578975C11413529 @default.
- W1594578975 hasConceptScore W1594578975C115961682 @default.
- W1594578975 hasConceptScore W1594578975C117884012 @default.
- W1594578975 hasConceptScore W1594578975C137293760 @default.
- W1594578975 hasConceptScore W1594578975C138885662 @default.
- W1594578975 hasConceptScore W1594578975C151730666 @default.
- W1594578975 hasConceptScore W1594578975C154945302 @default.
- W1594578975 hasConceptScore W1594578975C177264268 @default.
- W1594578975 hasConceptScore W1594578975C199360897 @default.
- W1594578975 hasConceptScore W1594578975C204321447 @default.
- W1594578975 hasConceptScore W1594578975C23123220 @default.
- W1594578975 hasConceptScore W1594578975C2777601683 @default.
- W1594578975 hasConceptScore W1594578975C2779343474 @default.
- W1594578975 hasConceptScore W1594578975C2983335612 @default.
- W1594578975 hasConceptScore W1594578975C41008148 @default.
- W1594578975 hasConceptScore W1594578975C41895202 @default.
- W1594578975 hasConceptScore W1594578975C546480517 @default.
- W1594578975 hasConceptScore W1594578975C86803240 @default.
- W1594578975 hasConceptScore W1594578975C90805587 @default.
- W1594578975 hasLocation W15945789751 @default.
- W1594578975 hasOpenAccess W1594578975 @default.
- W1594578975 hasPrimaryLocation W15945789751 @default.
- W1594578975 hasRelatedWork W104170146 @default.
- W1594578975 hasRelatedWork W1647671624 @default.
- W1594578975 hasRelatedWork W1987366206 @default.
- W1594578975 hasRelatedWork W2010595692 @default.
- W1594578975 hasRelatedWork W2056471870 @default.
- W1594578975 hasRelatedWork W2116061846 @default.
- W1594578975 hasRelatedWork W2152162047 @default.
- W1594578975 hasRelatedWork W2168005840 @default.
- W1594578975 hasRelatedWork W2288418054 @default.
- W1594578975 hasRelatedWork W2405914773 @default.
- W1594578975 hasRelatedWork W2575782020 @default.
- W1594578975 hasRelatedWork W2786850497 @default.
- W1594578975 hasRelatedWork W2787747535 @default.
- W1594578975 hasRelatedWork W3003523497 @default.
- W1594578975 hasRelatedWork W3023805750 @default.
- W1594578975 hasRelatedWork W3107113245 @default.