Matches in SemOpenAlex for { <https://semopenalex.org/work/W2904697032> ?p ?o ?g. }
Showing items 1 to 69 of 69 with 100 items per page.
- W2904697032 endingPage "73" @default.
- W2904697032 startingPage "61" @default.
- W2904697032 abstract "Recently, more data-driven approaches are demanding multilingual parallel resources primarily in the cross-language studies. To meet these demands, building multilingual parallel corpora are becoming the focus of many Natural Language Processing (NLP) scientific groups. Unlike monolingual corpora, the number of available multilingual parallel corpora is limited. In this paper, the MulTed, a corpus of subtitles extracted from TEDx talks is introduced. It is multilingual, Part of Speech (PoS) tagged, and bilingually sentence-aligned with English as a pivot language. This corpus is designed for many NLP applications, where the sentence-alignment, the PoS tagging, and the size of corpora are influential such as statistical machine translation, language recognition, and bilingual dictionary generation. Currently, the corpus has subtitles that cover 1100 talks available in over 100 languages. The subtitles are classified based on a variety of topics such as Business, Education, and Sport. Regarding the PoS tagging, the Treetagger, a language-independent PoS tagger, is used; then, to make the PoS tagging maximally useful, a mapping process to a universal common tagset is performed. Finally, we believe that making the MulTed corpus available for a public use can be a significant contribution to the literature of NLP and corpus linguistics, especially for under-resourced languages." @default.
- W2904697032 created "2018-12-22" @default.
- W2904697032 creator A5036610488 @default.
- W2904697032 creator A5043920765 @default.
- W2904697032 date "2020-07-17" @default.
- W2904697032 modified "2023-09-27" @default.
- W2904697032 title "MulTed: a multilingual aligned and tagged parallel corpus" @default.
- W2904697032 cites W1522263329 @default.
- W2904697032 cites W1967169497 @default.
- W2904697032 cites W2131134557 @default.
- W2904697032 cites W2296246537 @default.
- W2904697032 cites W2515295520 @default.
- W2904697032 cites W2769731047 @default.
- W2904697032 cites W354445564 @default.
- W2904697032 doi "https://doi.org/10.1016/j.aci.2018.12.003" @default.
- W2904697032 hasPublicationYear "2020" @default.
- W2904697032 type Work @default.
- W2904697032 sameAs 2904697032 @default.
- W2904697032 citedByCount "5" @default.
- W2904697032 countsByYear W29046970322020 @default.
- W2904697032 countsByYear W29046970322021 @default.
- W2904697032 countsByYear W29046970322023 @default.
- W2904697032 crossrefType "journal-article" @default.
- W2904697032 hasAuthorship W2904697032A5036610488 @default.
- W2904697032 hasAuthorship W2904697032A5043920765 @default.
- W2904697032 hasBestOaLocation W29046970321 @default.
- W2904697032 hasConcept C120665830 @default.
- W2904697032 hasConcept C121332964 @default.
- W2904697032 hasConcept C136197465 @default.
- W2904697032 hasConcept C154945302 @default.
- W2904697032 hasConcept C192209626 @default.
- W2904697032 hasConcept C203005215 @default.
- W2904697032 hasConcept C204321447 @default.
- W2904697032 hasConcept C2777530160 @default.
- W2904697032 hasConcept C2985367798 @default.
- W2904697032 hasConcept C41008148 @default.
- W2904697032 hasConcept C532629269 @default.
- W2904697032 hasConceptScore W2904697032C120665830 @default.
- W2904697032 hasConceptScore W2904697032C121332964 @default.
- W2904697032 hasConceptScore W2904697032C136197465 @default.
- W2904697032 hasConceptScore W2904697032C154945302 @default.
- W2904697032 hasConceptScore W2904697032C192209626 @default.
- W2904697032 hasConceptScore W2904697032C203005215 @default.
- W2904697032 hasConceptScore W2904697032C204321447 @default.
- W2904697032 hasConceptScore W2904697032C2777530160 @default.
- W2904697032 hasConceptScore W2904697032C2985367798 @default.
- W2904697032 hasConceptScore W2904697032C41008148 @default.
- W2904697032 hasConceptScore W2904697032C532629269 @default.
- W2904697032 hasIssue "1/2" @default.
- W2904697032 hasLocation W29046970321 @default.
- W2904697032 hasOpenAccess W2904697032 @default.
- W2904697032 hasPrimaryLocation W29046970321 @default.
- W2904697032 hasRelatedWork W1513695776 @default.
- W2904697032 hasRelatedWork W22168010 @default.
- W2904697032 hasRelatedWork W2250876691 @default.
- W2904697032 hasRelatedWork W2532135999 @default.
- W2904697032 hasRelatedWork W2574026469 @default.
- W2904697032 hasRelatedWork W3164405410 @default.
- W2904697032 hasRelatedWork W3185884227 @default.
- W2904697032 hasRelatedWork W3208033360 @default.
- W2904697032 hasRelatedWork W4379525811 @default.
- W2904697032 hasRelatedWork W4385065630 @default.
- W2904697032 hasVolume "18" @default.
- W2904697032 isParatext "false" @default.
- W2904697032 isRetracted "false" @default.
- W2904697032 magId "2904697032" @default.
- W2904697032 workType "article" @default.
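
The pattern in the header, `<https://semopenalex.org/work/W2904697032> ?p ?o ?g`, asks for every predicate and object attached to this work. As a minimal sketch, the same lookup could be written as a SPARQL query built from Python. The helper name `build_work_query` is hypothetical, and because every row above is marked `@default`, the sketch assumes a plain triple pattern over the default graph is enough. It only builds the query text; where and how the query is sent is left open.

```python
def build_work_query(work_iri: str, limit: int = 100) -> str:
    """Build a SPARQL SELECT listing every predicate/object pair for one subject IRI.

    Hypothetical helper for illustration; it is not part of any SemOpenAlex tooling.
    """
    # Reject characters that may not appear inside an IRI reference in SPARQL.
    forbidden = set('<>"{}|^`\\ ')
    if any(ch in forbidden for ch in work_iri):
        raise ValueError(f"not a safe IRI: {work_iri!r}")
    if limit < 1:
        raise ValueError("limit must be positive")
    # Assumption: all rows live in the default graph (shown as @default above),
    # so no GRAPH clause is used.
    return (
        "SELECT ?p ?o WHERE {\n"
        f"  <{work_iri}> ?p ?o .\n"
        "}\n"
        f"LIMIT {limit}"
    )


query = build_work_query("https://semopenalex.org/work/W2904697032")
print(query)
```

The default `limit` of 100 mirrors the page size reported in the header, so the 69 rows listed here would fit in a single result set.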