Matches in SemOpenAlex for { <https://semopenalex.org/work/W3043285902> ?p ?o ?g. }
- W3043285902 abstract "Weighted sampling is a fundamental tool in data analysis and machine learning pipelines. Samples are used for efficient estimation of statistics or as sparse representations of the data. When weight distributions are skewed, as is often the case in practice, without-replacement (WOR) sampling is much more effective than with-replacement (WR) sampling: it provides a broader representation and higher accuracy for the same number of samples. We design novel composable sketches for WOR $ell_p$ sampling, weighted sampling of keys according to a power $pin[0,2]$ of their frequency (or for signed data, sum of updates). Our sketches have size that grows only linearly with the sample size. Our design is simple and practical, despite intricate analysis, and based on off-the-shelf use of widely implemented heavy hitters sketches such as CountSketch. Our method is the first to provide WOR sampling in the important regime of $p>1$ and the first to handle signed updates for $p>0$." @default.
- W3043285902 created "2020-07-23" @default.
- W3043285902 creator A5014293815 @default.
- W3043285902 creator A5024805876 @default.
- W3043285902 creator A5026385549 @default.
- W3043285902 date "2020-07-14" @default.
- W3043285902 modified "2023-09-27" @default.
- W3043285902 title "WOR and $p$'s: Sketches for $ell_p$-Sampling Without Replacement" @default.
- W3043285902 cites W1493892051 @default.
- W3043285902 cites W1553409264 @default.
- W3043285902 cites W1570406128 @default.
- W3043285902 cites W1571664355 @default.
- W3043285902 cites W1605301393 @default.
- W3043285902 cites W1645165697 @default.
- W3043285902 cites W1777225603 @default.
- W3043285902 cites W1965996575 @default.
- W3043285902 cites W1977141583 @default.
- W3043285902 cites W1979819093 @default.
- W3043285902 cites W1980242380 @default.
- W3043285902 cites W1981663184 @default.
- W3043285902 cites W1991099830 @default.
- W3043285902 cites W1993482412 @default.
- W3043285902 cites W1994945255 @default.
- W3043285902 cites W2001947543 @default.
- W3043285902 cites W2006355640 @default.
- W3043285902 cites W2013809345 @default.
- W3043285902 cites W2018456433 @default.
- W3043285902 cites W2027689065 @default.
- W3043285902 cites W2031492799 @default.
- W3043285902 cites W2031657002 @default.
- W3043285902 cites W2034417563 @default.
- W3043285902 cites W2045555847 @default.
- W3043285902 cites W2060385919 @default.
- W3043285902 cites W2068936568 @default.
- W3043285902 cites W2069980026 @default.
- W3043285902 cites W2080234606 @default.
- W3043285902 cites W2080745194 @default.
- W3043285902 cites W2092236286 @default.
- W3043285902 cites W2107917944 @default.
- W3043285902 cites W2109529118 @default.
- W3043285902 cites W2111806841 @default.
- W3043285902 cites W2119050385 @default.
- W3043285902 cites W2122929038 @default.
- W3043285902 cites W2127455097 @default.
- W3043285902 cites W2132069633 @default.
- W3043285902 cites W2136987366 @default.
- W3043285902 cites W2142035328 @default.
- W3043285902 cites W2142224482 @default.
- W3043285902 cites W2143606444 @default.
- W3043285902 cites W2144620640 @default.
- W3043285902 cites W2147717514 @default.
- W3043285902 cites W2153579005 @default.
- W3043285902 cites W2250539671 @default.
- W3043285902 cites W2294895103 @default.
- W3043285902 cites W2295428206 @default.
- W3043285902 cites W2296073425 @default.
- W3043285902 cites W2346474025 @default.
- W3043285902 cites W2479510968 @default.
- W3043285902 cites W2487095677 @default.
- W3043285902 cites W2752853835 @default.
- W3043285902 cites W2769644379 @default.
- W3043285902 cites W2885668447 @default.
- W3043285902 cites W2890924858 @default.
- W3043285902 cites W2893034441 @default.
- W3043285902 cites W2934975900 @default.
- W3043285902 cites W2950284340 @default.
- W3043285902 cites W2962904868 @default.
- W3043285902 cites W2963496590 @default.
- W3043285902 cites W2963799254 @default.
- W3043285902 cites W2967106834 @default.
- W3043285902 cites W2970817052 @default.
- W3043285902 cites W3034971881 @default.
- W3043285902 cites W3043924052 @default.
- W3043285902 hasPublicationYear "2020" @default.
- W3043285902 type Work @default.
- W3043285902 sameAs 3043285902 @default.
- W3043285902 citedByCount "0" @default.
- W3043285902 crossrefType "posted-content" @default.
- W3043285902 hasAuthorship W3043285902A5014293815 @default.
- W3043285902 hasAuthorship W3043285902A5024805876 @default.
- W3043285902 hasAuthorship W3043285902A5026385549 @default.
- W3043285902 hasConcept C105795698 @default.
- W3043285902 hasConcept C111472728 @default.
- W3043285902 hasConcept C11413529 @default.
- W3043285902 hasConcept C121332964 @default.
- W3043285902 hasConcept C124101348 @default.
- W3043285902 hasConcept C129848803 @default.
- W3043285902 hasConcept C138885662 @default.
- W3043285902 hasConcept C140779682 @default.
- W3043285902 hasConcept C144024400 @default.
- W3043285902 hasConcept C149923435 @default.
- W3043285902 hasConcept C163258240 @default.
- W3043285902 hasConcept C17744445 @default.
- W3043285902 hasConcept C185592680 @default.
- W3043285902 hasConcept C198531522 @default.
- W3043285902 hasConcept C199539241 @default.
- W3043285902 hasConcept C20353970 @default.
- W3043285902 hasConcept C2776359362 @default.
- W3043285902 hasConcept C2780586882 @default.
- W3043285902 hasConcept C2908647359 @default.