Matches in SemOpenAlex for { <https://semopenalex.org/work/W2803771812> ?p ?o ?g. }
- W2803771812 endingPage "24" @default.
- W2803771812 startingPage "1" @default.
- W2803771812 abstract "It is of great interest to researchers and scholars in many disciplines (particularly those working on cultural heritage projects) to study parallel passages (i.e., identical or similar pieces of text describing the same thing) in digital text archives. Although there exist a few software tools for this purpose, they are restricted to a specific domain (e.g., the Bible) or a specific language (e.g., Hebrew). In this article, we present in detail how we build a digital infrastructure that can facilitate the search and discovery of parallel passages for any domain in any language. It is at the core of our Samtla (Search And Mining Tools with Linguistic Analysis) system designed in collaboration with historians and linguists. The system has already been used to support research on five large text corpora that span a number of different domains and languages. The key to such a domain-independent and language-independent digital infrastructure is a novel combination of a character-based n -gram language model, space-optimized suffix tree, and generalized edit distance. A comprehensive evaluation through crowdsourcing shows that the effectiveness of our system’s search functionality is on par with the human-level performance." @default.
- W2803771812 created "2018-06-01" @default.
- W2803771812 creator A5014086872 @default.
- W2803771812 creator A5015725705 @default.
- W2803771812 creator A5015784640 @default.
- W2803771812 creator A5067116669 @default.
- W2803771812 date "2018-08-22" @default.
- W2803771812 modified "2023-09-25" @default.
- W2803771812 title "Finding Parallel Passages in Cultural Heritage Archives" @default.
- W2803771812 cites W173640133 @default.
- W2803771812 cites W1972594981 @default.
- W2803771812 cites W1974360117 @default.
- W2803771812 cites W1974568263 @default.
- W2803771812 cites W1990190154 @default.
- W2803771812 cites W2000246295 @default.
- W2803771812 cites W2005892921 @default.
- W2803771812 cites W2027447543 @default.
- W2803771812 cites W2029097226 @default.
- W2803771812 cites W2029203225 @default.
- W2803771812 cites W2058896506 @default.
- W2803771812 cites W2069870183 @default.
- W2803771812 cites W2100506586 @default.
- W2803771812 cites W2116316001 @default.
- W2803771812 cites W2129444086 @default.
- W2803771812 cites W2151401338 @default.
- W2803771812 cites W2152263452 @default.
- W2803771812 cites W2158195707 @default.
- W2803771812 cites W2161563551 @default.
- W2803771812 cites W2168859760 @default.
- W2803771812 cites W2217516311 @default.
- W2803771812 cites W3004423609 @default.
- W2803771812 cites W3044876782 @default.
- W2803771812 cites W4252733585 @default.
- W2803771812 cites W4298872162 @default.
- W2803771812 doi "https://doi.org/10.1145/3195727" @default.
- W2803771812 hasPublicationYear "2018" @default.
- W2803771812 type Work @default.
- W2803771812 sameAs 2803771812 @default.
- W2803771812 citedByCount "7" @default.
- W2803771812 countsByYear W28037718122018 @default.
- W2803771812 countsByYear W28037718122019 @default.
- W2803771812 countsByYear W28037718122020 @default.
- W2803771812 countsByYear W28037718122021 @default.
- W2803771812 countsByYear W28037718122022 @default.
- W2803771812 countsByYear W28037718122023 @default.
- W2803771812 crossrefType "journal-article" @default.
- W2803771812 hasAuthorship W2803771812A5014086872 @default.
- W2803771812 hasAuthorship W2803771812A5015725705 @default.
- W2803771812 hasAuthorship W2803771812A5015784640 @default.
- W2803771812 hasAuthorship W2803771812A5067116669 @default.
- W2803771812 hasBestOaLocation W28037718122 @default.
- W2803771812 hasConcept C111919701 @default.
- W2803771812 hasConcept C134306372 @default.
- W2803771812 hasConcept C136764020 @default.
- W2803771812 hasConcept C138885662 @default.
- W2803771812 hasConcept C154945302 @default.
- W2803771812 hasConcept C164913051 @default.
- W2803771812 hasConcept C166957645 @default.
- W2803771812 hasConcept C199360897 @default.
- W2803771812 hasConcept C204321447 @default.
- W2803771812 hasConcept C23123220 @default.
- W2803771812 hasConcept C2524010 @default.
- W2803771812 hasConcept C26517878 @default.
- W2803771812 hasConcept C2777904410 @default.
- W2803771812 hasConcept C2778572836 @default.
- W2803771812 hasConcept C2779804580 @default.
- W2803771812 hasConcept C2780861071 @default.
- W2803771812 hasConcept C33923547 @default.
- W2803771812 hasConcept C36503486 @default.
- W2803771812 hasConcept C38652104 @default.
- W2803771812 hasConcept C41008148 @default.
- W2803771812 hasConcept C41895202 @default.
- W2803771812 hasConcept C513874922 @default.
- W2803771812 hasConcept C60671577 @default.
- W2803771812 hasConcept C62230096 @default.
- W2803771812 hasConcept C91304198 @default.
- W2803771812 hasConcept C95457728 @default.
- W2803771812 hasConceptScore W2803771812C111919701 @default.
- W2803771812 hasConceptScore W2803771812C134306372 @default.
- W2803771812 hasConceptScore W2803771812C136764020 @default.
- W2803771812 hasConceptScore W2803771812C138885662 @default.
- W2803771812 hasConceptScore W2803771812C154945302 @default.
- W2803771812 hasConceptScore W2803771812C164913051 @default.
- W2803771812 hasConceptScore W2803771812C166957645 @default.
- W2803771812 hasConceptScore W2803771812C199360897 @default.
- W2803771812 hasConceptScore W2803771812C204321447 @default.
- W2803771812 hasConceptScore W2803771812C23123220 @default.
- W2803771812 hasConceptScore W2803771812C2524010 @default.
- W2803771812 hasConceptScore W2803771812C26517878 @default.
- W2803771812 hasConceptScore W2803771812C2777904410 @default.
- W2803771812 hasConceptScore W2803771812C2778572836 @default.
- W2803771812 hasConceptScore W2803771812C2779804580 @default.
- W2803771812 hasConceptScore W2803771812C2780861071 @default.
- W2803771812 hasConceptScore W2803771812C33923547 @default.
- W2803771812 hasConceptScore W2803771812C36503486 @default.
- W2803771812 hasConceptScore W2803771812C38652104 @default.
- W2803771812 hasConceptScore W2803771812C41008148 @default.
- W2803771812 hasConceptScore W2803771812C41895202 @default.