Matches in SemOpenAlex for { <https://semopenalex.org/work/W4298212522> ?p ?o ?g. }
Showing items 1 to 70 of
70
with 100 items per page.
- W4298212522 abstract "Sketching is a probabilistic data compression technique that has been largely developed in the computer science community. Numerical operations on big datasets can be intolerably slow; sketching algorithms address this issue by generating a smaller surrogate dataset. Typically, inference proceeds on the compressed dataset. Sketching algorithms generally use random projections to compress the original dataset and this stochastic generation process makes them amenable to statistical analysis. We argue that the sketched data can be modelled as a random sample, thus placing this family of data compression methods firmly within an inferential framework. In particular, we focus on the Gaussian, Hadamard and Clarkson-Woodruff sketches, and their use in single pass sketching algorithms for linear regression with huge $n$. We explore the statistical properties of sketched regression algorithms and derive new distributional results for a large class of sketched estimators. A key result is a conditional central limit theorem for data oblivious sketches. An important finding is that the best choice of sketching algorithm in terms of mean square error is related to the signal to noise ratio in the source dataset. Finally, we demonstrate the theory and the limits of its applicability on two real datasets." @default.
- W4298212522 created "2022-10-01" @default.
- W4298212522 creator A5003779182 @default.
- W4298212522 creator A5016383307 @default.
- W4298212522 creator A5067650620 @default.
- W4298212522 date "2017-06-12" @default.
- W4298212522 modified "2023-10-16" @default.
- W4298212522 title "Statistical properties of sketching algorithms" @default.
- W4298212522 doi "https://doi.org/10.48550/arxiv.1706.03665" @default.
- W4298212522 hasPublicationYear "2017" @default.
- W4298212522 type Work @default.
- W4298212522 citedByCount "0" @default.
- W4298212522 crossrefType "posted-content" @default.
- W4298212522 hasAuthorship W4298212522A5003779182 @default.
- W4298212522 hasAuthorship W4298212522A5016383307 @default.
- W4298212522 hasAuthorship W4298212522A5067650620 @default.
- W4298212522 hasBestOaLocation W42982125221 @default.
- W4298212522 hasConcept C105795698 @default.
- W4298212522 hasConcept C11413529 @default.
- W4298212522 hasConcept C115961682 @default.
- W4298212522 hasConcept C120665830 @default.
- W4298212522 hasConcept C121332964 @default.
- W4298212522 hasConcept C134261354 @default.
- W4298212522 hasConcept C134306372 @default.
- W4298212522 hasConcept C151201525 @default.
- W4298212522 hasConcept C154945302 @default.
- W4298212522 hasConcept C185429906 @default.
- W4298212522 hasConcept C192209626 @default.
- W4298212522 hasConcept C2776214188 @default.
- W4298212522 hasConcept C33923547 @default.
- W4298212522 hasConcept C41008148 @default.
- W4298212522 hasConcept C49937458 @default.
- W4298212522 hasConcept C60292330 @default.
- W4298212522 hasConcept C80444323 @default.
- W4298212522 hasConcept C99498987 @default.
- W4298212522 hasConceptScore W4298212522C105795698 @default.
- W4298212522 hasConceptScore W4298212522C11413529 @default.
- W4298212522 hasConceptScore W4298212522C115961682 @default.
- W4298212522 hasConceptScore W4298212522C120665830 @default.
- W4298212522 hasConceptScore W4298212522C121332964 @default.
- W4298212522 hasConceptScore W4298212522C134261354 @default.
- W4298212522 hasConceptScore W4298212522C134306372 @default.
- W4298212522 hasConceptScore W4298212522C151201525 @default.
- W4298212522 hasConceptScore W4298212522C154945302 @default.
- W4298212522 hasConceptScore W4298212522C185429906 @default.
- W4298212522 hasConceptScore W4298212522C192209626 @default.
- W4298212522 hasConceptScore W4298212522C2776214188 @default.
- W4298212522 hasConceptScore W4298212522C33923547 @default.
- W4298212522 hasConceptScore W4298212522C41008148 @default.
- W4298212522 hasConceptScore W4298212522C49937458 @default.
- W4298212522 hasConceptScore W4298212522C60292330 @default.
- W4298212522 hasConceptScore W4298212522C80444323 @default.
- W4298212522 hasConceptScore W4298212522C99498987 @default.
- W4298212522 hasLocation W42982125221 @default.
- W4298212522 hasLocation W42982125222 @default.
- W4298212522 hasOpenAccess W4298212522 @default.
- W4298212522 hasPrimaryLocation W42982125221 @default.
- W4298212522 hasRelatedWork W1974090045 @default.
- W4298212522 hasRelatedWork W2926374695 @default.
- W4298212522 hasRelatedWork W2949919985 @default.
- W4298212522 hasRelatedWork W2964209567 @default.
- W4298212522 hasRelatedWork W3026670957 @default.
- W4298212522 hasRelatedWork W3159730769 @default.
- W4298212522 hasRelatedWork W4221155167 @default.
- W4298212522 hasRelatedWork W4287197217 @default.
- W4298212522 hasRelatedWork W4293812554 @default.
- W4298212522 hasRelatedWork W3123288520 @default.
- W4298212522 isParatext "false" @default.
- W4298212522 isRetracted "false" @default.
- W4298212522 workType "article" @default.