Matches in SemOpenAlex for { <https://semopenalex.org/work/W2149366535> ?p ?o ?g. }
- W2149366535 abstract "The overwhelming majority of the languages in the world are spoken by less than 50 million native speakers, and automatic translation of many of these languages is less investigated due to the lack of linguistic resources such as parallel corpora. In the ACCURAT project we will work on novel methods how comparable corpora can compensate for this shortage and improve machine translation systems of under-resourced languages. Translation systems on eighteen European language pairs will be investigated and methodologies in corpus linguistics will be greatly advanced. We will explore the use of preliminary SMT models to identify the parallel parts within comparable corpora, which will allow us to derive better SMT models via a bootstrapping loop. State-of-the-art machine translation based on the statistical approach is a data-driven process. The quality and quantity of the training data is crucial for the performance of a translation system. However, the increasing amount of training corpora can still not meet the demand of automatic translation on different language pairs and in various domains. Rich data are mostly available for few languages and only certain domains. There are still a great number of under-resourced languages. Thousands of languages are spoken by less than 50 million native speakers, with a big group of more than 200 languages that have between 1 and 50 million native speakers. Most of these languages are lacking sufficient linguistic resources. This brings difficulties to improve the translation qualities on these languages. For instance, the majority of the European languages are under-resourced and lack both parallel corpora and language technologies for MT.
The project ACCURAT (Analysis and Evaluation of Comparable Corpora for Under-Resourced Areas of Machine Translation) will focus on developing and evaluating language pairs of English-Latvian, English-Lithuanian, English-Estonian, English-Greek, English-Croatian, Croatian-English, English-Romanian, English-Slovenian, Slovenian-English, English-German, German-English, German-Romanian, Romanian-German, Greek-Romanian, Lithuanian-Romanian, Romanian-Greek, Romanian-English and Latvian-Lithuanian. We also work on the language pair of German and English which is well investigated previously. This can help us find the impact of comparable corpora on translations between language pairs with both rich and poor resources. More details can be found in (Skadina et al., 2010). The participants include organizations of Tilde," @default.
- W2149366535 created "2016-06-24" @default.
- W2149366535 creator A5063975847 @default.
- W2149366535 creator A5071924020 @default.
- W2149366535 date "2010-01-01" @default.
- W2149366535 modified "2023-09-23" @default.
- W2149366535 title "Improving Machine Translation Performance Using Comparable Corpora" @default.
- W2149366535 cites W1489181569 @default.
- W2149366535 cites W1586660424 @default.
- W2149366535 cites W158832470 @default.
- W2149366535 cites W1625582487 @default.
- W2149366535 cites W171093852 @default.
- W2149366535 cites W1819903106 @default.
- W2149366535 cites W1973152633 @default.
- W2149366535 cites W2006969979 @default.
- W2149366535 cites W2044488513 @default.
- W2149366535 cites W2104103102 @default.
- W2149366535 cites W2107695330 @default.
- W2149366535 cites W2113788796 @default.
- W2149366535 cites W2117652747 @default.
- W2149366535 cites W2123143128 @default.
- W2149366535 cites W2129840547 @default.
- W2149366535 cites W2132713736 @default.
- W2149366535 cites W2139812240 @default.
- W2149366535 cites W2145251161 @default.
- W2149366535 cites W2156985047 @default.
- W2149366535 cites W2161792612 @default.
- W2149366535 cites W2166098990 @default.
- W2149366535 cites W2168929382 @default.
- W2149366535 cites W22168010 @default.
- W2149366535 cites W2243304196 @default.
- W2149366535 cites W2395333260 @default.
- W2149366535 cites W2949089825 @default.
- W2149366535 cites W3099387691 @default.
- W2149366535 cites W3208719332 @default.
- W2149366535 cites W635530177 @default.
- W2149366535 cites W84079877 @default.
- W2149366535 cites W8895266 @default.
- W2149366535 cites W92412080 @default.
- W2149366535 hasPublicationYear "2010" @default.
- W2149366535 type Work @default.
- W2149366535 sameAs 2149366535 @default.
- W2149366535 citedByCount "5" @default.
- W2149366535 crossrefType "journal-article" @default.
- W2149366535 hasAuthorship W2149366535A5063975847 @default.
- W2149366535 hasAuthorship W2149366535A5071924020 @default.
- W2149366535 hasConcept C106159729 @default.
- W2149366535 hasConcept C138885662 @default.
- W2149366535 hasConcept C148526163 @default.
- W2149366535 hasConcept C154945302 @default.
- W2149366535 hasConcept C155092808 @default.
- W2149366535 hasConcept C162324750 @default.
- W2149366535 hasConcept C203005215 @default.
- W2149366535 hasConcept C204321447 @default.
- W2149366535 hasConcept C207609745 @default.
- W2149366535 hasConcept C24687705 @default.
- W2149366535 hasConcept C2985367798 @default.
- W2149366535 hasConcept C41008148 @default.
- W2149366535 hasConcept C41895202 @default.
- W2149366535 hasConceptScore W2149366535C106159729 @default.
- W2149366535 hasConceptScore W2149366535C138885662 @default.
- W2149366535 hasConceptScore W2149366535C148526163 @default.
- W2149366535 hasConceptScore W2149366535C154945302 @default.
- W2149366535 hasConceptScore W2149366535C155092808 @default.
- W2149366535 hasConceptScore W2149366535C162324750 @default.
- W2149366535 hasConceptScore W2149366535C203005215 @default.
- W2149366535 hasConceptScore W2149366535C204321447 @default.
- W2149366535 hasConceptScore W2149366535C207609745 @default.
- W2149366535 hasConceptScore W2149366535C24687705 @default.
- W2149366535 hasConceptScore W2149366535C2985367798 @default.
- W2149366535 hasConceptScore W2149366535C41008148 @default.
- W2149366535 hasConceptScore W2149366535C41895202 @default.
- W2149366535 hasLocation W21493665351 @default.
- W2149366535 hasOpenAccess W2149366535 @default.
- W2149366535 hasPrimaryLocation W21493665351 @default.
- W2149366535 hasRelatedWork W2066308426 @default.
- W2149366535 hasRelatedWork W2151521349 @default.
- W2149366535 hasRelatedWork W2156985047 @default.
- W2149366535 hasRelatedWork W2199789610 @default.
- W2149366535 hasRelatedWork W2277817821 @default.
- W2149366535 hasRelatedWork W2352286444 @default.
- W2149366535 hasRelatedWork W2577320159 @default.
- W2149366535 hasRelatedWork W2583877189 @default.
- W2149366535 hasRelatedWork W2796150856 @default.
- W2149366535 hasRelatedWork W2913434276 @default.
- W2149366535 hasRelatedWork W2951107454 @default.
- W2149366535 hasRelatedWork W3030042997 @default.
- W2149366535 hasRelatedWork W3092085609 @default.
- W2149366535 hasRelatedWork W3120112486 @default.
- W2149366535 hasRelatedWork W3123572135 @default.
- W2149366535 hasRelatedWork W3164607486 @default.
- W2149366535 hasRelatedWork W3181014475 @default.
- W2149366535 hasRelatedWork W614252839 @default.
- W2149366535 hasRelatedWork W754316637 @default.
- W2149366535 hasRelatedWork W2151968157 @default.
- W2149366535 isParatext "false" @default.
- W2149366535 isRetracted "false" @default.
- W2149366535 magId "2149366535" @default.
- W2149366535 workType "article" @default.