Matches in SemOpenAlex for { <https://semopenalex.org/work/W2233998645> ?p ?o ?g. }
Showing items 1 to 72 of
72
with 100 items per page.
- W2233998645 abstract "We explore the usability of different bilingual corpora for the purpose of multilingual and cross-lingual natural language processing. The usability of bilingual corpus is evaluated by the lexical alignment score calculated for the bi-lexicon pair distributed in the aligned bilingual sentence pairs. We compare and contrast a number of bilingual corpora, ranging from parallel, to comparable, and to nonparallel corpora. We compare different methods of mining parallel sentences and bilingual lexicon from bilingual corpora. These methods make several sentence-level assumptions on the bilingual corpora. We have found that some of them are applicable to bilingual parallel documents but non-applicable to non-parallel, comparable documents. None of the sentence-level assumptions can be made about nonparallel and quasi-comparable corpora. The latter contain bilingual documents that may or may not be on the same topic. By postulating additional assumptions on comparable documents, we propose a completely unsupervised method to extract useful material, such as parallel sentences and bilexicons, from quasi-comparable corpora. The lexical alignment score for the comparable sentences extracted with our unsupervised method is found to be very close to that of the parallel corpus. This shows that our extraction method is effective." @default.
- W2233998645 created "2016-06-24" @default.
- W2233998645 creator A5065856469 @default.
- W2233998645 creator A5069889379 @default.
- W2233998645 date "2004-01-01" @default.
- W2233998645 modified "2023-09-26" @default.
- W2233998645 title "Sentence Alignment in Paralllel, Comparable and Quasi-comparable Corpora" @default.
- W2233998645 cites W136035420 @default.
- W2233998645 cites W1543039027 @default.
- W2233998645 cites W1570490112 @default.
- W2233998645 cites W1574901103 @default.
- W2233998645 cites W2041232209 @default.
- W2233998645 cites W2047295649 @default.
- W2233998645 cites W2061118075 @default.
- W2233998645 cites W2061235289 @default.
- W2233998645 cites W2097662711 @default.
- W2233998645 cites W2162059093 @default.
- W2233998645 cites W2166098990 @default.
- W2233998645 cites W2277993829 @default.
- W2233998645 cites W354027070 @default.
- W2233998645 hasPublicationYear "2004" @default.
- W2233998645 type Work @default.
- W2233998645 sameAs 2233998645 @default.
- W2233998645 citedByCount "4" @default.
- W2233998645 crossrefType "journal-article" @default.
- W2233998645 hasAuthorship W2233998645A5065856469 @default.
- W2233998645 hasAuthorship W2233998645A5069889379 @default.
- W2233998645 hasConcept C107457646 @default.
- W2233998645 hasConcept C154945302 @default.
- W2233998645 hasConcept C170130773 @default.
- W2233998645 hasConcept C203005215 @default.
- W2233998645 hasConcept C204321447 @default.
- W2233998645 hasConcept C2777530160 @default.
- W2233998645 hasConcept C2778121359 @default.
- W2233998645 hasConcept C2985367798 @default.
- W2233998645 hasConcept C41008148 @default.
- W2233998645 hasConceptScore W2233998645C107457646 @default.
- W2233998645 hasConceptScore W2233998645C154945302 @default.
- W2233998645 hasConceptScore W2233998645C170130773 @default.
- W2233998645 hasConceptScore W2233998645C203005215 @default.
- W2233998645 hasConceptScore W2233998645C204321447 @default.
- W2233998645 hasConceptScore W2233998645C2777530160 @default.
- W2233998645 hasConceptScore W2233998645C2778121359 @default.
- W2233998645 hasConceptScore W2233998645C2985367798 @default.
- W2233998645 hasConceptScore W2233998645C41008148 @default.
- W2233998645 hasLocation W22339986451 @default.
- W2233998645 hasOpenAccess W2233998645 @default.
- W2233998645 hasPrimaryLocation W22339986451 @default.
- W2233998645 hasRelatedWork W1489181569 @default.
- W2233998645 hasRelatedWork W1581740421 @default.
- W2233998645 hasRelatedWork W1626233182 @default.
- W2233998645 hasRelatedWork W1889220380 @default.
- W2233998645 hasRelatedWork W2006969979 @default.
- W2233998645 hasRelatedWork W2041232209 @default.
- W2233998645 hasRelatedWork W2047295649 @default.
- W2233998645 hasRelatedWork W2091889711 @default.
- W2233998645 hasRelatedWork W2101105183 @default.
- W2233998645 hasRelatedWork W2102749417 @default.
- W2233998645 hasRelatedWork W2131035834 @default.
- W2233998645 hasRelatedWork W2133837072 @default.
- W2233998645 hasRelatedWork W2138247936 @default.
- W2233998645 hasRelatedWork W2150028966 @default.
- W2233998645 hasRelatedWork W2153653739 @default.
- W2233998645 hasRelatedWork W1528631851 @default.
- W2233998645 hasRelatedWork W1548773254 @default.
- W2233998645 hasRelatedWork W1569645395 @default.
- W2233998645 hasRelatedWork W1785269842 @default.
- W2233998645 hasRelatedWork W1818145723 @default.
- W2233998645 isParatext "false" @default.
- W2233998645 isRetracted "false" @default.
- W2233998645 magId "2233998645" @default.
- W2233998645 workType "article" @default.