Matches in SemOpenAlex for { <https://semopenalex.org/work/W3210390771> ?p ?o ?g. }
- W3210390771 abstract "We introduce a high-quality and large-scale Vietnamese-English parallel dataset of 3.02M sentence pairs, which is 2.9M pairs larger than the benchmark Vietnamese-English machine translation corpus IWSLT15. We conduct experiments comparing strong neural baselines and well-known automatic translation engines on our dataset and find that in both automatic and human evaluations: the best performance is obtained by fine-tuning the pre-trained sequence-to-sequence denoising auto-encoder mBART. To our best knowledge, this is the first large-scale Vietnamese-English machine translation study. We hope our publicly available dataset and study can serve as a starting point for future research and applications on Vietnamese-English machine translation. We release our dataset at: https://github.com/VinAIResearch/PhoMT" @default.
- W3210390771 created "2021-11-08" @default.
- W3210390771 creator A5010886079 @default.
- W3210390771 creator A5011218050 @default.
- W3210390771 creator A5042805366 @default.
- W3210390771 creator A5060483283 @default.
- W3210390771 date "2021-01-01" @default.
- W3210390771 modified "2023-09-25" @default.
- W3210390771 title "PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation" @default.
- W3210390771 cites W1975879668 @default.
- W3210390771 cites W2101105183 @default.
- W3210390771 cites W2123442489 @default.
- W3210390771 cites W2148334774 @default.
- W3210390771 cites W2149327368 @default.
- W3210390771 cites W2250590325 @default.
- W3210390771 cites W2251654371 @default.
- W3210390771 cites W2419539795 @default.
- W3210390771 cites W2496235729 @default.
- W3210390771 cites W2525778437 @default.
- W3210390771 cites W2905927205 @default.
- W3210390771 cites W2933138175 @default.
- W3210390771 cites W2949303037 @default.
- W3210390771 cites W2962784628 @default.
- W3210390771 cites W2963403868 @default.
- W3210390771 cites W2963532001 @default.
- W3210390771 cites W2963626623 @default.
- W3210390771 cites W2963807318 @default.
- W3210390771 cites W2964121744 @default.
- W3210390771 cites W2964336292 @default.
- W3210390771 cites W2978927999 @default.
- W3210390771 cites W2978943549 @default.
- W3210390771 cites W3008897815 @default.
- W3210390771 cites W3100806282 @default.
- W3210390771 cites W3104453603 @default.
- W3210390771 cites W3106445907 @default.
- W3210390771 cites W3107826490 @default.
- W3210390771 cites W3152788712 @default.
- W3210390771 cites W630532510 @default.
- W3210390771 doi "https://doi.org/10.18653/v1/2021.emnlp-main.369" @default.
- W3210390771 hasPublicationYear "2021" @default.
- W3210390771 type Work @default.
- W3210390771 sameAs 3210390771 @default.
- W3210390771 citedByCount "1" @default.
- W3210390771 countsByYear W32103907712022 @default.
- W3210390771 crossrefType "proceedings-article" @default.
- W3210390771 hasAuthorship W3210390771A5010886079 @default.
- W3210390771 hasAuthorship W3210390771A5011218050 @default.
- W3210390771 hasAuthorship W3210390771A5042805366 @default.
- W3210390771 hasAuthorship W3210390771A5060483283 @default.
- W3210390771 hasBestOaLocation W32103907711 @default.
- W3210390771 hasConcept C103621254 @default.
- W3210390771 hasConcept C104317684 @default.
- W3210390771 hasConcept C105580179 @default.
- W3210390771 hasConcept C111472728 @default.
- W3210390771 hasConcept C111919701 @default.
- W3210390771 hasConcept C118505674 @default.
- W3210390771 hasConcept C121332964 @default.
- W3210390771 hasConcept C13280743 @default.
- W3210390771 hasConcept C138885662 @default.
- W3210390771 hasConcept C149364088 @default.
- W3210390771 hasConcept C154945302 @default.
- W3210390771 hasConcept C185592680 @default.
- W3210390771 hasConcept C185798385 @default.
- W3210390771 hasConcept C203005215 @default.
- W3210390771 hasConcept C204321447 @default.
- W3210390771 hasConcept C205649164 @default.
- W3210390771 hasConcept C2524010 @default.
- W3210390771 hasConcept C2777530160 @default.
- W3210390771 hasConcept C2778755073 @default.
- W3210390771 hasConcept C2779530757 @default.
- W3210390771 hasConcept C28490314 @default.
- W3210390771 hasConcept C28719098 @default.
- W3210390771 hasConcept C2985367798 @default.
- W3210390771 hasConcept C33923547 @default.
- W3210390771 hasConcept C41008148 @default.
- W3210390771 hasConcept C41895202 @default.
- W3210390771 hasConcept C55493867 @default.
- W3210390771 hasConcept C62520636 @default.
- W3210390771 hasConceptScore W3210390771C103621254 @default.
- W3210390771 hasConceptScore W3210390771C104317684 @default.
- W3210390771 hasConceptScore W3210390771C105580179 @default.
- W3210390771 hasConceptScore W3210390771C111472728 @default.
- W3210390771 hasConceptScore W3210390771C111919701 @default.
- W3210390771 hasConceptScore W3210390771C118505674 @default.
- W3210390771 hasConceptScore W3210390771C121332964 @default.
- W3210390771 hasConceptScore W3210390771C13280743 @default.
- W3210390771 hasConceptScore W3210390771C138885662 @default.
- W3210390771 hasConceptScore W3210390771C149364088 @default.
- W3210390771 hasConceptScore W3210390771C154945302 @default.
- W3210390771 hasConceptScore W3210390771C185592680 @default.
- W3210390771 hasConceptScore W3210390771C185798385 @default.
- W3210390771 hasConceptScore W3210390771C203005215 @default.
- W3210390771 hasConceptScore W3210390771C204321447 @default.
- W3210390771 hasConceptScore W3210390771C205649164 @default.
- W3210390771 hasConceptScore W3210390771C2524010 @default.
- W3210390771 hasConceptScore W3210390771C2777530160 @default.
- W3210390771 hasConceptScore W3210390771C2778755073 @default.
- W3210390771 hasConceptScore W3210390771C2779530757 @default.
- W3210390771 hasConceptScore W3210390771C28490314 @default.
- W3210390771 hasConceptScore W3210390771C28719098 @default.