Matches in SemOpenAlex for { <https://semopenalex.org/work/W2139097105> ?p ?o ?g. }
Showing items 1 to 90 of
90
with 100 items per page.
- W2139097105 endingPage "117" @default.
- W2139097105 startingPage "112" @default.
- W2139097105 abstract "Language classification is a preliminary step for most natural-language related processes. The significant quantity of multilingual documents poses a problem for traditional language-classification schemes and requires segmentation of the document to monolingual sections. This phenomenon is characteristic of classical and medieval Jewish literature, which frequently mixes Hebrew, Aramaic, Judeo-Arabic and other Hebrew-script languages. We propose a method for classification and segmentation of multi-lingual texts in the Hebrew character set, using bigram statistics. For texts, such as the manuscripts found in the Cairo Genizah, we are also forced to deal with a significant level of noise in OCR-processed text." @default.
- W2139097105 created "2016-06-24" @default.
- W2139097105 creator A5039993352 @default.
- W2139097105 creator A5047605219 @default.
- W2139097105 date "2012-04-24" @default.
- W2139097105 modified "2023-09-25" @default.
- W2139097105 title "Language Classification and Segmentation of Noisy Documents in Hebrew Scripts" @default.
- W2139097105 cites W1176919 @default.
- W2139097105 cites W1515020792 @default.
- W2139097105 cites W1626945812 @default.
- W2139097105 cites W2010595692 @default.
- W2139097105 cites W2118229299 @default.
- W2139097105 cites W2144430322 @default.
- W2139097105 cites W91776986 @default.
- W2139097105 hasPublicationYear "2012" @default.
- W2139097105 type Work @default.
- W2139097105 sameAs 2139097105 @default.
- W2139097105 citedByCount "0" @default.
- W2139097105 crossrefType "proceedings-article" @default.
- W2139097105 hasAuthorship W2139097105A5039993352 @default.
- W2139097105 hasAuthorship W2139097105A5047605219 @default.
- W2139097105 hasConcept C108757681 @default.
- W2139097105 hasConcept C109901321 @default.
- W2139097105 hasConcept C137546455 @default.
- W2139097105 hasConcept C138885662 @default.
- W2139097105 hasConcept C150152722 @default.
- W2139097105 hasConcept C154945302 @default.
- W2139097105 hasConcept C166957645 @default.
- W2139097105 hasConcept C199360897 @default.
- W2139097105 hasConcept C204321447 @default.
- W2139097105 hasConcept C2524010 @default.
- W2139097105 hasConcept C2780861071 @default.
- W2139097105 hasConcept C28490314 @default.
- W2139097105 hasConcept C33923547 @default.
- W2139097105 hasConcept C41008148 @default.
- W2139097105 hasConcept C41895202 @default.
- W2139097105 hasConcept C61423126 @default.
- W2139097105 hasConcept C89600930 @default.
- W2139097105 hasConcept C91304198 @default.
- W2139097105 hasConcept C95457728 @default.
- W2139097105 hasConcept C96455323 @default.
- W2139097105 hasConceptScore W2139097105C108757681 @default.
- W2139097105 hasConceptScore W2139097105C109901321 @default.
- W2139097105 hasConceptScore W2139097105C137546455 @default.
- W2139097105 hasConceptScore W2139097105C138885662 @default.
- W2139097105 hasConceptScore W2139097105C150152722 @default.
- W2139097105 hasConceptScore W2139097105C154945302 @default.
- W2139097105 hasConceptScore W2139097105C166957645 @default.
- W2139097105 hasConceptScore W2139097105C199360897 @default.
- W2139097105 hasConceptScore W2139097105C204321447 @default.
- W2139097105 hasConceptScore W2139097105C2524010 @default.
- W2139097105 hasConceptScore W2139097105C2780861071 @default.
- W2139097105 hasConceptScore W2139097105C28490314 @default.
- W2139097105 hasConceptScore W2139097105C33923547 @default.
- W2139097105 hasConceptScore W2139097105C41008148 @default.
- W2139097105 hasConceptScore W2139097105C41895202 @default.
- W2139097105 hasConceptScore W2139097105C61423126 @default.
- W2139097105 hasConceptScore W2139097105C89600930 @default.
- W2139097105 hasConceptScore W2139097105C91304198 @default.
- W2139097105 hasConceptScore W2139097105C95457728 @default.
- W2139097105 hasConceptScore W2139097105C96455323 @default.
- W2139097105 hasLocation W21390971051 @default.
- W2139097105 hasOpenAccess W2139097105 @default.
- W2139097105 hasPrimaryLocation W21390971051 @default.
- W2139097105 hasRelatedWork W1510410713 @default.
- W2139097105 hasRelatedWork W1591544191 @default.
- W2139097105 hasRelatedWork W1977618669 @default.
- W2139097105 hasRelatedWork W2011064408 @default.
- W2139097105 hasRelatedWork W2040856704 @default.
- W2139097105 hasRelatedWork W2107703355 @default.
- W2139097105 hasRelatedWork W2164394329 @default.
- W2139097105 hasRelatedWork W2250659129 @default.
- W2139097105 hasRelatedWork W2251053923 @default.
- W2139097105 hasRelatedWork W2357351332 @default.
- W2139097105 hasRelatedWork W2527844536 @default.
- W2139097105 hasRelatedWork W2793117819 @default.
- W2139097105 hasRelatedWork W2950533189 @default.
- W2139097105 hasRelatedWork W2978768392 @default.
- W2139097105 hasRelatedWork W2982242125 @default.
- W2139097105 hasRelatedWork W3003603507 @default.
- W2139097105 hasRelatedWork W3088908336 @default.
- W2139097105 hasRelatedWork W3159550228 @default.
- W2139097105 hasRelatedWork W829621552 @default.
- W2139097105 hasRelatedWork W97157637 @default.
- W2139097105 isParatext "false" @default.
- W2139097105 isRetracted "false" @default.
- W2139097105 magId "2139097105" @default.
- W2139097105 workType "article" @default.