Matches in SemOpenAlex for { <https://semopenalex.org/work/W3178733115> ?p ?o ?g. }
- W3178733115 abstract "Generating code-switched text is a problem of growing interest, especially given the scarcity of corpora containing large volumes of real code-switched text. In this work, we adapt a state-of-the-art neural machine translation model to generate Hindi-English code-switched sentences starting from monolingual Hindi sentences. We outline a carefully designed curriculum of pretraining steps, including the use of synthetic code-switched text, that enable the model to generate high-quality code-switched text. Using text generated from our model as data augmentation, we show significant reductions in perplexity on a language modeling task, compared to using text from other generative models of CS text. We also show improvements using our text for a downstream code-switched natural language inference task. Our generated text is further subjected to a rigorous evaluation using a human evaluation study and a range of objective metrics, where we show performance comparable (and sometimes even superior) to code-switched text obtained via crowd workers who are native Hindi speakers." @default.
- W3178733115 created "2021-07-19" @default.
- W3178733115 creator A5036738038 @default.
- W3178733115 creator A5050514068 @default.
- W3178733115 creator A5068894556 @default.
- W3178733115 date "2021-07-14" @default.
- W3178733115 modified "2023-09-27" @default.
- W3178733115 title "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" @default.
- W3178733115 cites W1969754958 @default.
- W3178733115 cites W2031292349 @default.
- W3178733115 cites W2034585809 @default.
- W3178733115 cites W2038116248 @default.
- W3178733115 cites W2061272101 @default.
- W3178733115 cites W2094655846 @default.
- W3178733115 cites W2098355803 @default.
- W3178733115 cites W2101105183 @default.
- W3178733115 cites W2107745473 @default.
- W3178733115 cites W2149778059 @default.
- W3178733115 cites W2153433699 @default.
- W3178733115 cites W2153579005 @default.
- W3178733115 cites W2419539795 @default.
- W3178733115 cites W2507756961 @default.
- W3178733115 cites W2585024066 @default.
- W3178733115 cites W2747481658 @default.
- W3178733115 cites W2762484717 @default.
- W3178733115 cites W2777647957 @default.
- W3178733115 cites W2798348125 @default.
- W3178733115 cites W2896068220 @default.
- W3178733115 cites W2962832505 @default.
- W3178733115 cites W2963206679 @default.
- W3178733115 cites W2963216553 @default.
- W3178733115 cites W2963341956 @default.
- W3178733115 cites W2963403868 @default.
- W3178733115 cites W2963456134 @default.
- W3178733115 cites W2963602293 @default.
- W3178733115 cites W2963877297 @default.
- W3178733115 cites W2963945964 @default.
- W3178733115 cites W2963952470 @default.
- W3178733115 cites W2964268978 @default.
- W3178733115 cites W2966263504 @default.
- W3178733115 cites W2972702443 @default.
- W3178733115 cites W2973082572 @default.
- W3178733115 cites W2989143494 @default.
- W3178733115 cites W2996403597 @default.
- W3178733115 cites W3034284720 @default.
- W3178733115 cites W3035464238 @default.
- W3178733115 cites W3088665616 @default.
- W3178733115 cites W3105813095 @default.
- W3178733115 cites W3107826490 @default.
- W3178733115 cites W3153869297 @default.
- W3178733115 cites W3156476125 @default.
- W3178733115 cites W3171500670 @default.
- W3178733115 cites W3171877493 @default.
- W3178733115 cites W46679369 @default.
- W3178733115 cites W630532510 @default.
- W3178733115 hasPublicationYear "2021" @default.
- W3178733115 type Work @default.
- W3178733115 sameAs 3178733115 @default.
- W3178733115 citedByCount "0" @default.
- W3178733115 crossrefType "posted-content" @default.
- W3178733115 hasAuthorship W3178733115A5036738038 @default.
- W3178733115 hasAuthorship W3178733115A5050514068 @default.
- W3178733115 hasAuthorship W3178733115A5068894556 @default.
- W3178733115 hasConcept C100279451 @default.
- W3178733115 hasConcept C111919701 @default.
- W3178733115 hasConcept C118505674 @default.
- W3178733115 hasConcept C133162039 @default.
- W3178733115 hasConcept C137293760 @default.
- W3178733115 hasConcept C138885662 @default.
- W3178733115 hasConcept C154945302 @default.
- W3178733115 hasConcept C162324750 @default.
- W3178733115 hasConcept C177264268 @default.
- W3178733115 hasConcept C18552078 @default.
- W3178733115 hasConcept C187736073 @default.
- W3178733115 hasConcept C199360897 @default.
- W3178733115 hasConcept C203005215 @default.
- W3178733115 hasConcept C204321447 @default.
- W3178733115 hasConcept C26517878 @default.
- W3178733115 hasConcept C2776214188 @default.
- W3178733115 hasConcept C2776760102 @default.
- W3178733115 hasConcept C2780451532 @default.
- W3178733115 hasConcept C38652104 @default.
- W3178733115 hasConcept C39890363 @default.
- W3178733115 hasConcept C41008148 @default.
- W3178733115 hasConcept C41895202 @default.
- W3178733115 hasConcept C519982507 @default.
- W3178733115 hasConceptScore W3178733115C100279451 @default.
- W3178733115 hasConceptScore W3178733115C111919701 @default.
- W3178733115 hasConceptScore W3178733115C118505674 @default.
- W3178733115 hasConceptScore W3178733115C133162039 @default.
- W3178733115 hasConceptScore W3178733115C137293760 @default.
- W3178733115 hasConceptScore W3178733115C138885662 @default.
- W3178733115 hasConceptScore W3178733115C154945302 @default.
- W3178733115 hasConceptScore W3178733115C162324750 @default.
- W3178733115 hasConceptScore W3178733115C177264268 @default.
- W3178733115 hasConceptScore W3178733115C18552078 @default.
- W3178733115 hasConceptScore W3178733115C187736073 @default.
- W3178733115 hasConceptScore W3178733115C199360897 @default.
- W3178733115 hasConceptScore W3178733115C203005215 @default.
- W3178733115 hasConceptScore W3178733115C204321447 @default.