Matches in SemOpenAlex for { <https://semopenalex.org/work/W3137361708> ?p ?o ?g. }
Showing items 1 to 93 of
93
with 100 items per page.
- W3137361708 abstract "Since the start of COVID-19, several relevant corpora from various sources are presented in the literature that contain millions of data points. While these corpora are valuable in supporting many analyses on this specific pandemic, researchers require additional benchmark corpora that contain other epidemics to facilitate cross-epidemic pattern recognition and trend analysis tasks. During our other efforts on COVID-19 related work, we discover very little disease related corpora in the literature that are sizable and rich enough to support such cross-epidemic analysis tasks. In this paper, we present EPIC30M, a large-scale epidemic corpus that contains 30 millions micro-blog posts, i.e., tweets crawled from Twitter, from year 2006 to 2020. EPIC30M contains a subset of 26.2 millions tweets related to three general diseases, namely Ebola, Cholera and Swine Flu, and another subset of 4.7 millions tweets of six global epidemic outbreaks, including 2009 H1N1 Swine Flu, 2010 Haiti Cholera, 2012 Middle-East Respiratory Syndrome (MERS), 2013 West African Ebola, 2016 Yemen Cholera and 2018 Kivu Ebola. Furthermore, we explore and discuss the properties of the corpus with statistics of key terms and hashtags and trends analysis for each subset. Finally, we demonstrate the value and impact that EPIC30M could create through a discussion of multiple use cases of cross-epidemic research topics that attract growing interest in recent years. These use cases span multiple research areas, such as epidemiological modeling, pattern recognition, natural language understanding and economical modeling." @default.
- W3137361708 created "2021-03-29" @default.
- W3137361708 creator A5002004426 @default.
- W3137361708 creator A5005406384 @default.
- W3137361708 creator A5033426375 @default.
- W3137361708 creator A5067002888 @default.
- W3137361708 creator A5067484594 @default.
- W3137361708 date "2020-12-10" @default.
- W3137361708 modified "2023-09-26" @default.
- W3137361708 title "EPIC30M: An Epidemics Corpus of Over 30 Million Relevant Tweets" @default.
- W3137361708 cites W1529574916 @default.
- W3137361708 cites W2004192095 @default.
- W3137361708 cites W2023666594 @default.
- W3137361708 cites W2027687511 @default.
- W3137361708 cites W2134020369 @default.
- W3137361708 cites W2139188905 @default.
- W3137361708 cites W2151098288 @default.
- W3137361708 cites W2164082612 @default.
- W3137361708 cites W2238265222 @default.
- W3137361708 cites W2250734828 @default.
- W3137361708 cites W2281420995 @default.
- W3137361708 cites W2290393813 @default.
- W3137361708 cites W235909202 @default.
- W3137361708 cites W2460513959 @default.
- W3137361708 cites W2469351952 @default.
- W3137361708 cites W2561761609 @default.
- W3137361708 cites W2742330194 @default.
- W3137361708 cites W2783064254 @default.
- W3137361708 cites W2785615365 @default.
- W3137361708 cites W2791544114 @default.
- W3137361708 cites W2798683079 @default.
- W3137361708 cites W2810266289 @default.
- W3137361708 cites W2887425423 @default.
- W3137361708 cites W2949179185 @default.
- W3137361708 cites W2966969968 @default.
- W3137361708 cites W2977413798 @default.
- W3137361708 cites W2979981023 @default.
- W3137361708 cites W2984129085 @default.
- W3137361708 cites W2987361372 @default.
- W3137361708 cites W3011089770 @default.
- W3137361708 cites W3015218641 @default.
- W3137361708 cites W3023793048 @default.
- W3137361708 cites W3033322679 @default.
- W3137361708 cites W3036188798 @default.
- W3137361708 cites W3041250660 @default.
- W3137361708 cites W3097121202 @default.
- W3137361708 cites W3099702889 @default.
- W3137361708 cites W3145795540 @default.
- W3137361708 cites W4241115065 @default.
- W3137361708 cites W425647333 @default.
- W3137361708 cites W641710284 @default.
- W3137361708 doi "https://doi.org/10.1109/bigdata50022.2020.9377739" @default.
- W3137361708 hasPublicationYear "2020" @default.
- W3137361708 type Work @default.
- W3137361708 sameAs 3137361708 @default.
- W3137361708 citedByCount "5" @default.
- W3137361708 countsByYear W31373617082020 @default.
- W3137361708 countsByYear W31373617082021 @default.
- W3137361708 crossrefType "proceedings-article" @default.
- W3137361708 hasAuthorship W3137361708A5002004426 @default.
- W3137361708 hasAuthorship W3137361708A5005406384 @default.
- W3137361708 hasAuthorship W3137361708A5033426375 @default.
- W3137361708 hasAuthorship W3137361708A5067002888 @default.
- W3137361708 hasAuthorship W3137361708A5067484594 @default.
- W3137361708 hasBestOaLocation W31373617082 @default.
- W3137361708 hasConcept C136764020 @default.
- W3137361708 hasConcept C154945302 @default.
- W3137361708 hasConcept C204321447 @default.
- W3137361708 hasConcept C2522767166 @default.
- W3137361708 hasConcept C41008148 @default.
- W3137361708 hasConceptScore W3137361708C136764020 @default.
- W3137361708 hasConceptScore W3137361708C154945302 @default.
- W3137361708 hasConceptScore W3137361708C204321447 @default.
- W3137361708 hasConceptScore W3137361708C2522767166 @default.
- W3137361708 hasConceptScore W3137361708C41008148 @default.
- W3137361708 hasLocation W31373617081 @default.
- W3137361708 hasLocation W31373617082 @default.
- W3137361708 hasOpenAccess W3137361708 @default.
- W3137361708 hasPrimaryLocation W31373617081 @default.
- W3137361708 hasRelatedWork W1552159754 @default.
- W3137361708 hasRelatedWork W2148757832 @default.
- W3137361708 hasRelatedWork W2151447942 @default.
- W3137361708 hasRelatedWork W2293457016 @default.
- W3137361708 hasRelatedWork W2368651715 @default.
- W3137361708 hasRelatedWork W2611614995 @default.
- W3137361708 hasRelatedWork W2748952813 @default.
- W3137361708 hasRelatedWork W2789919619 @default.
- W3137361708 hasRelatedWork W3107474891 @default.
- W3137361708 hasRelatedWork W3169305685 @default.
- W3137361708 isParatext "false" @default.
- W3137361708 isRetracted "false" @default.
- W3137361708 magId "3137361708" @default.
- W3137361708 workType "article" @default.