Matches in SemOpenAlex for { <https://semopenalex.org/work/W2964179938> ?p ?o ?g. }
- W2964179938 abstract "Nationality identification unlocks important demographic information, with many applications in biomedical and sociological research. Existing name-based nationality classifiers use name substrings as features and are trained on small, unrepresentative sets of labeled names, typically extracted from Wikipedia. As a result, these methods achieve limited performance and cannot support fine-grained classification. We exploit the phenomenon of homophily in communication patterns to learn name embeddings, a new representation that encodes gender, ethnicity, and nationality and that is readily applicable to building classifiers and other systems. Through our analysis of 57M contact lists from a major Internet company, we are able to design a fine-grained nationality classifier covering 39 groups representing over 90% of the world population. In an evaluation against other published systems over 13 common classes, our F1 score (0.795) is substantially better than that of our closest competitor, Ethnea (0.580). To the best of our knowledge, this is the most accurate, fine-grained nationality classifier available. As a social media application, we apply our classifiers to the followers of major Twitter celebrities over six different domains. We demonstrate stark differences in the ethnicities of the followers of Trump and Obama, and in the sports and entertainment favored by different groups. Finally, we identify an anomalous political figure whose presumably inflated following appears largely incapable of reading the language he posts in." @default.
- W2964179938 created "2019-07-30" @default.
- W2964179938 creator A5000245012 @default.
- W2964179938 creator A5016087081 @default.
- W2964179938 creator A5053106292 @default.
- W2964179938 creator A5053476652 @default.
- W2964179938 creator A5060741187 @default.
- W2964179938 creator A5062283585 @default.
- W2964179938 creator A5063211322 @default.
- W2964179938 date "2017-11-06" @default.
- W2964179938 modified "2023-10-15" @default.
- W2964179938 title "Nationality Classification Using Name Embeddings" @default.
- W2964179938 cites W137202426 @default.
- W2964179938 cites W1989224082 @default.
- W2964179938 cites W2030657688 @default.
- W2964179938 cites W2106187803 @default.
- W2964179938 cites W2111330212 @default.
- W2964179938 cites W2148005820 @default.
- W2964179938 cites W2167458573 @default.
- W2964179938 cites W2250539671 @default.
- W2964179938 cites W2330225546 @default.
- W2964179938 cites W2998704965 @default.
- W2964179938 cites W3104097132 @default.
- W2964179938 cites W3105705953 @default.
- W2964179938 doi "https://doi.org/10.1145/3132847.3133008" @default.
- W2964179938 hasPublicationYear "2017" @default.
- W2964179938 type Work @default.
- W2964179938 sameAs 2964179938 @default.
- W2964179938 citedByCount "50" @default.
- W2964179938 countsByYear W29641799382017 @default.
- W2964179938 countsByYear W29641799382018 @default.
- W2964179938 countsByYear W29641799382019 @default.
- W2964179938 countsByYear W29641799382020 @default.
- W2964179938 countsByYear W29641799382021 @default.
- W2964179938 countsByYear W29641799382022 @default.
- W2964179938 countsByYear W29641799382023 @default.
- W2964179938 crossrefType "proceedings-article" @default.
- W2964179938 hasAuthorship W2964179938A5000245012 @default.
- W2964179938 hasAuthorship W2964179938A5016087081 @default.
- W2964179938 hasAuthorship W2964179938A5053106292 @default.
- W2964179938 hasAuthorship W2964179938A5053476652 @default.
- W2964179938 hasAuthorship W2964179938A5060741187 @default.
- W2964179938 hasAuthorship W2964179938A5062283585 @default.
- W2964179938 hasAuthorship W2964179938A5063211322 @default.
- W2964179938 hasBestOaLocation W29641799382 @default.
- W2964179938 hasConcept C110875604 @default.
- W2964179938 hasConcept C136764020 @default.
- W2964179938 hasConcept C137403100 @default.
- W2964179938 hasConcept C144024400 @default.
- W2964179938 hasConcept C149923435 @default.
- W2964179938 hasConcept C154945302 @default.
- W2964179938 hasConcept C165696696 @default.
- W2964179938 hasConcept C166957645 @default.
- W2964179938 hasConcept C19165224 @default.
- W2964179938 hasConcept C204321447 @default.
- W2964179938 hasConcept C205649164 @default.
- W2964179938 hasConcept C2777138209 @default.
- W2964179938 hasConcept C2779812341 @default.
- W2964179938 hasConcept C2908647359 @default.
- W2964179938 hasConcept C36289849 @default.
- W2964179938 hasConcept C38652104 @default.
- W2964179938 hasConcept C41008148 @default.
- W2964179938 hasConcept C70036468 @default.
- W2964179938 hasConcept C95623464 @default.
- W2964179938 hasConceptScore W2964179938C110875604 @default.
- W2964179938 hasConceptScore W2964179938C136764020 @default.
- W2964179938 hasConceptScore W2964179938C137403100 @default.
- W2964179938 hasConceptScore W2964179938C144024400 @default.
- W2964179938 hasConceptScore W2964179938C149923435 @default.
- W2964179938 hasConceptScore W2964179938C154945302 @default.
- W2964179938 hasConceptScore W2964179938C165696696 @default.
- W2964179938 hasConceptScore W2964179938C166957645 @default.
- W2964179938 hasConceptScore W2964179938C19165224 @default.
- W2964179938 hasConceptScore W2964179938C204321447 @default.
- W2964179938 hasConceptScore W2964179938C205649164 @default.
- W2964179938 hasConceptScore W2964179938C2777138209 @default.
- W2964179938 hasConceptScore W2964179938C2779812341 @default.
- W2964179938 hasConceptScore W2964179938C2908647359 @default.
- W2964179938 hasConceptScore W2964179938C36289849 @default.
- W2964179938 hasConceptScore W2964179938C38652104 @default.
- W2964179938 hasConceptScore W2964179938C41008148 @default.
- W2964179938 hasConceptScore W2964179938C70036468 @default.
- W2964179938 hasConceptScore W2964179938C95623464 @default.
- W2964179938 hasFunder F4320306076 @default.
- W2964179938 hasLocation W29641799381 @default.
- W2964179938 hasLocation W29641799382 @default.
- W2964179938 hasOpenAccess W2964179938 @default.
- W2964179938 hasPrimaryLocation W29641799381 @default.
- W2964179938 hasRelatedWork W1483367581 @default.
- W2964179938 hasRelatedWork W2099030945 @default.
- W2964179938 hasRelatedWork W2293779257 @default.
- W2964179938 hasRelatedWork W2295435105 @default.
- W2964179938 hasRelatedWork W2752995228 @default.
- W2964179938 hasRelatedWork W2794181734 @default.
- W2964179938 hasRelatedWork W2944524354 @default.
- W2964179938 hasRelatedWork W2945445003 @default.
- W2964179938 hasRelatedWork W2964179938 @default.
- W2964179938 hasRelatedWork W2980771618 @default.
- W2964179938 isParatext "false" @default.
- W2964179938 isRetracted "false" @default.