Matches in SemOpenAlex for { <https://semopenalex.org/work/W2536339198> ?p ?o ?g. }
Showing items 1 to 89 of 89 with 100 items per page.
- W2536339198 abstract "Several recent studies have presented different approaches for clustering and classifying machine-generated mail based on email headers. We propose to expand these approaches by considering email message bodies. We argue that our approach can help increase coverage and precision in several tasks, and is especially critical for mail extraction. Recall that mail extraction supports a variety of mail mining applications such as ad re-targeting, mail search, and mail summarization. We introduce new structural clustering methods that leverage the HTML structure that is common to messages generated by the same mass-sender script. We discuss how such structural clustering can be conducted at different levels of granularity, using either strict or flexible matching constraints, depending on the use cases. We present large-scale experiments carried out over real Yahoo mail traffic. For our first use case of automatic mail extraction, we describe novel flexible-matching clustering methods that meet the key requirements of high intra-cluster similarity, adequate cluster size, and a relatively small overall number of clusters. We identify the precise level of flexibility that is needed in order to achieve extremely high extraction precision (close to 100%), while producing a relatively small number of clusters. For our second use case, namely, mail classification, we show that strict structural matching is more adequate, achieving precision and recall rates between 85% and 90%, while converging to a stable classification after a short learning cycle. This represents an increase of 10%-20% compared to the sender-based method described in previous work, when run over the same period length. Our work has been deployed in production in the Yahoo mail backend." @default.
- W2536339198 created "2016-10-28" @default.
- W2536339198 creator A5020208985 @default.
- W2536339198 creator A5026632851 @default.
- W2536339198 creator A5032709945 @default.
- W2536339198 creator A5054778502 @default.
- W2536339198 creator A5066047568 @default.
- W2536339198 creator A5070677186 @default.
- W2536339198 creator A5089913322 @default.
- W2536339198 date "2016-10-24" @default.
- W2536339198 modified "2023-09-25" @default.
- W2536339198 title "Structural Clustering of Machine-Generated Mail" @default.
- W2536339198 cites W1603920809 @default.
- W2536339198 cites W1954804603 @default.
- W2536339198 cites W1970026646 @default.
- W2536339198 cites W1974922810 @default.
- W2536339198 cites W1982119153 @default.
- W2536339198 cites W1999361961 @default.
- W2536339198 cites W2002956097 @default.
- W2536339198 cites W2011632873 @default.
- W2536339198 cites W2015551056 @default.
- W2536339198 cites W2035836267 @default.
- W2536339198 cites W2039262760 @default.
- W2536339198 cites W2040757233 @default.
- W2536339198 cites W2049365470 @default.
- W2536339198 cites W2059014657 @default.
- W2536339198 cites W2069388662 @default.
- W2536339198 cites W2081193615 @default.
- W2536339198 cites W2085922539 @default.
- W2536339198 cites W2091858563 @default.
- W2536339198 cites W2098162425 @default.
- W2536339198 cites W2106568316 @default.
- W2536339198 cites W2134172329 @default.
- W2536339198 cites W2137313854 @default.
- W2536339198 cites W2143915064 @default.
- W2536339198 cites W2150721933 @default.
- W2536339198 cites W2152565070 @default.
- W2536339198 cites W2160189941 @default.
- W2536339198 cites W2160196229 @default.
- W2536339198 cites W2161861392 @default.
- W2536339198 cites W2171364811 @default.
- W2536339198 cites W2255862008 @default.
- W2536339198 cites W2264482454 @default.
- W2536339198 cites W2295816791 @default.
- W2536339198 cites W2474838075 @default.
- W2536339198 doi "https://doi.org/10.1145/2983323.2983350" @default.
- W2536339198 hasPublicationYear "2016" @default.
- W2536339198 type Work @default.
- W2536339198 sameAs 2536339198 @default.
- W2536339198 citedByCount "12" @default.
- W2536339198 countsByYear W25363391982017 @default.
- W2536339198 countsByYear W25363391982018 @default.
- W2536339198 countsByYear W25363391982019 @default.
- W2536339198 countsByYear W25363391982020 @default.
- W2536339198 crossrefType "proceedings-article" @default.
- W2536339198 hasAuthorship W2536339198A5020208985 @default.
- W2536339198 hasAuthorship W2536339198A5026632851 @default.
- W2536339198 hasAuthorship W2536339198A5032709945 @default.
- W2536339198 hasAuthorship W2536339198A5054778502 @default.
- W2536339198 hasAuthorship W2536339198A5066047568 @default.
- W2536339198 hasAuthorship W2536339198A5070677186 @default.
- W2536339198 hasAuthorship W2536339198A5089913322 @default.
- W2536339198 hasConcept C136764020 @default.
- W2536339198 hasConcept C154945302 @default.
- W2536339198 hasConcept C3020028006 @default.
- W2536339198 hasConcept C41008148 @default.
- W2536339198 hasConcept C73555534 @default.
- W2536339198 hasConceptScore W2536339198C136764020 @default.
- W2536339198 hasConceptScore W2536339198C154945302 @default.
- W2536339198 hasConceptScore W2536339198C3020028006 @default.
- W2536339198 hasConceptScore W2536339198C41008148 @default.
- W2536339198 hasConceptScore W2536339198C73555534 @default.
- W2536339198 hasLocation W25363391981 @default.
- W2536339198 hasOpenAccess W2536339198 @default.
- W2536339198 hasPrimaryLocation W25363391981 @default.
- W2536339198 hasRelatedWork W1849651648 @default.
- W2536339198 hasRelatedWork W1999627569 @default.
- W2536339198 hasRelatedWork W2000677594 @default.
- W2536339198 hasRelatedWork W2095737312 @default.
- W2536339198 hasRelatedWork W2097782160 @default.
- W2536339198 hasRelatedWork W2358668433 @default.
- W2536339198 hasRelatedWork W2390279801 @default.
- W2536339198 hasRelatedWork W2748952813 @default.
- W2536339198 hasRelatedWork W2899084033 @default.
- W2536339198 hasRelatedWork W763609066 @default.
- W2536339198 isParatext "false" @default.
- W2536339198 isRetracted "false" @default.
- W2536339198 magId "2536339198" @default.
- W2536339198 workType "article" @default.
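
The abstract of W2536339198 describes clustering messages by the HTML structure shared by mail from the same mass-sender script, under strict or flexible matching. The following is only a minimal sketch of the strict case, not the authors' method. It assumes that a message's structure can be approximated by its sequence of opening tags, and the names `TagSequenceParser`, `structural_signature`, and `cluster_by_structure` are hypothetical, introduced here for illustration.

```python
from collections import defaultdict
from html.parser import HTMLParser


class TagSequenceParser(HTMLParser):
    """Records opening tags in document order; text and attributes are ignored."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # Assumption: the tag name alone is enough to describe the structure.
        self.tags.append(tag)


def structural_signature(html):
    """Return a hashable signature built only from the sequence of opening tags."""
    parser = TagSequenceParser()
    parser.feed(html)
    parser.close()
    return tuple(parser.tags)


def cluster_by_structure(messages):
    """Group message bodies whose signatures are identical (strict matching)."""
    clusters = defaultdict(list)
    for body in messages:
        clusters[structural_signature(body)].append(body)
    return dict(clusters)


if __name__ == "__main__":
    sample = [
        "<html><body><p>Order 1 shipped</p></body></html>",
        "<html><body><p>Order 2 shipped</p></body></html>",
        "<html><body><div>Weekly digest</div></body></html>",
    ]
    for signature, bodies in cluster_by_structure(sample).items():
        print(signature, len(bodies))
```

In this sketch two messages fall into the same cluster only when their tag sequences are equal. A flexible variant would have to relax that equality, for example by comparing signatures with a similarity threshold; the abstract does not specify how, so that part is left out.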