Matches in SemOpenAlex for { <https://semopenalex.org/work/W2743816787> ?p ?o ?g. }
- W2743816787 abstract "We develop, analyze and combine features for the automatic detection of Anglicisms included in German and Afrikaans text which can improve automatic speech recognition, speech synthesis and other fields such as natural language processing. To evaluate our methods we collected and annotated two German word lists from different domains (IT, general news). We also applied our detection methods to an Afrikaans word list from the NCHLT corpus. Our features are based on grapheme perplexity, grapheme-to-phoneme (G2P) confidence, Google hits count as well as spell-checker dictionary and Wiktionary lookup. With our G2P confidence and Wiktionary features we introduce new approaches to detect Anglicisms. Comparing features based on English models and models of the matrix language allows us to refrain from determining thresholds in a supervised way. Furthermore we do not rely on training data that needs to be expensively annotated – instead we use available resources like word lists and pronunciation dictionaries. Our best single feature is based on the G2P confidence with an f-score of up to 70.39%. Combining our features using a voting, decision tree or support vector machine (SVM) gives us further improvements, especially where the single features performed poorly. We achieve up to 44% relative improvement in f-score on our Afrikaans data. Our best result with a combination is an f-score of 75.44%." @default.
- W2743816787 created "2017-08-17" @default.
- W2743816787 creator A5006665568 @default.
- W2743816787 date "2014-01-01" @default.
- W2743816787 modified "2023-09-27" @default.
- W2743816787 title "Single and Combined Features for the Detection of Anglicisms in German and Afrikaans" @default.
- W2743816787 cites W10658944 @default.
- W2743816787 cites W130850236 @default.
- W2743816787 cites W1489384387 @default.
- W2743816787 cites W1522653827 @default.
- W2743816787 cites W1528022942 @default.
- W2743816787 cites W1539594493 @default.
- W2743816787 cites W1591338396 @default.
- W2743816787 cites W1876715006 @default.
- W2743816787 cites W1972407269 @default.
- W2743816787 cites W1982666937 @default.
- W2743816787 cites W1983279103 @default.
- W2743816787 cites W1997575393 @default.
- W2743816787 cites W2000074744 @default.
- W2743816787 cites W2041614298 @default.
- W2743816787 cites W2084534958 @default.
- W2743816787 cites W2090755665 @default.
- W2743816787 cites W2142005091 @default.
- W2743816787 cites W2143005840 @default.
- W2743816787 cites W2153200389 @default.
- W2743816787 cites W2163918411 @default.
- W2743816787 cites W2168596788 @default.
- W2743816787 cites W2187612371 @default.
- W2743816787 cites W2250513555 @default.
- W2743816787 cites W2394754513 @default.
- W2743816787 cites W2399033582 @default.
- W2743816787 cites W2399168871 @default.
- W2743816787 cites W3089316880 @default.
- W2743816787 cites W3197026893 @default.
- W2743816787 cites W939440578 @default.
- W2743816787 hasPublicationYear "2014" @default.
- W2743816787 type Work @default.
- W2743816787 sameAs 2743816787 @default.
- W2743816787 citedByCount "0" @default.
- W2743816787 crossrefType "journal-article" @default.
- W2743816787 hasAuthorship W2743816787A5006665568 @default.
- W2743816787 hasConcept C100279451 @default.
- W2743816787 hasConcept C121332964 @default.
- W2743816787 hasConcept C12267149 @default.
- W2743816787 hasConcept C137293760 @default.
- W2743816787 hasConcept C138885662 @default.
- W2743816787 hasConcept C154775046 @default.
- W2743816787 hasConcept C154945302 @default.
- W2743816787 hasConcept C204321447 @default.
- W2743816787 hasConcept C2776401178 @default.
- W2743816787 hasConcept C2776779415 @default.
- W2743816787 hasConcept C2780844864 @default.
- W2743816787 hasConcept C28490314 @default.
- W2743816787 hasConcept C30080830 @default.
- W2743816787 hasConcept C41008148 @default.
- W2743816787 hasConcept C41895202 @default.
- W2743816787 hasConcept C62520636 @default.
- W2743816787 hasConcept C90805587 @default.
- W2743816787 hasConceptScore W2743816787C100279451 @default.
- W2743816787 hasConceptScore W2743816787C121332964 @default.
- W2743816787 hasConceptScore W2743816787C12267149 @default.
- W2743816787 hasConceptScore W2743816787C137293760 @default.
- W2743816787 hasConceptScore W2743816787C138885662 @default.
- W2743816787 hasConceptScore W2743816787C154775046 @default.
- W2743816787 hasConceptScore W2743816787C154945302 @default.
- W2743816787 hasConceptScore W2743816787C204321447 @default.
- W2743816787 hasConceptScore W2743816787C2776401178 @default.
- W2743816787 hasConceptScore W2743816787C2776779415 @default.
- W2743816787 hasConceptScore W2743816787C2780844864 @default.
- W2743816787 hasConceptScore W2743816787C28490314 @default.
- W2743816787 hasConceptScore W2743816787C30080830 @default.
- W2743816787 hasConceptScore W2743816787C41008148 @default.
- W2743816787 hasConceptScore W2743816787C41895202 @default.
- W2743816787 hasConceptScore W2743816787C62520636 @default.
- W2743816787 hasConceptScore W2743816787C90805587 @default.
- W2743816787 hasLocation W27438167871 @default.
- W2743816787 hasOpenAccess W2743816787 @default.
- W2743816787 hasPrimaryLocation W27438167871 @default.
- W2743816787 hasRelatedWork W1192346563 @default.
- W2743816787 hasRelatedWork W2014912547 @default.
- W2743816787 hasRelatedWork W2057389798 @default.
- W2743816787 hasRelatedWork W2100044354 @default.
- W2743816787 hasRelatedWork W2151504427 @default.
- W2743816787 hasRelatedWork W2167265720 @default.
- W2743816787 hasRelatedWork W2250493249 @default.
- W2743816787 hasRelatedWork W2394738290 @default.
- W2743816787 hasRelatedWork W2412793475 @default.
- W2743816787 hasRelatedWork W2502438517 @default.
- W2743816787 hasRelatedWork W2574133711 @default.
- W2743816787 hasRelatedWork W2577871717 @default.
- W2743816787 hasRelatedWork W2739575608 @default.
- W2743816787 hasRelatedWork W2786980120 @default.
- W2743816787 hasRelatedWork W2788762724 @default.
- W2743816787 hasRelatedWork W3003259128 @default.
- W2743816787 hasRelatedWork W3032676245 @default.
- W2743816787 hasRelatedWork W3037026767 @default.
- W2743816787 hasRelatedWork W3092282134 @default.
- W2743816787 hasRelatedWork W3192188649 @default.
- W2743816787 isParatext "false" @default.
- W2743816787 isRetracted "false" @default.