Matches in SemOpenAlex for { <https://semopenalex.org/work/W4288288291> ?p ?o ?g. }
Showing items 1 to 73 of
73
with 100 items per page.
- W4288288291 abstract "Datasets that are terabytes in size are increasingly common, but computer bottlenecks often frustrate a complete analysis of the data. While more data are better than less, diminishing returns suggest that we may not need terabytes of data to estimate a parameter or test a hypothesis. But which rows of data should we analyze, and might an arbitrary subset of rows preserve the features of the original data? This paper reviews a line of work that is grounded in theoretical computer science and numerical linear algebra, and which finds that an algorithmically desirable sketch, which is a randomly chosen subset of the data, must preserve the eigenstructure of the data, a property known as a subspace embedding. Building on this work, we study how prediction and inference can be affected by data sketching within a linear regression setup. We show that the sketching error is small compared to the sample size effect which a researcher can control. As a sketch size that is algorithmically optimal may not be suitable for prediction and inference, we use statistical arguments to provide 'inference conscious' guides to the sketch size. When appropriately implemented, an estimator that pools over different sketches can be nearly as efficient as the infeasible one using the full sample." @default.
- W4288288291 created "2022-07-28" @default.
- W4288288291 creator A5028536256 @default.
- W4288288291 creator A5056976753 @default.
- W4288288291 date "2019-07-03" @default.
- W4288288291 modified "2023-09-30" @default.
- W4288288291 title "An Econometric Perspective on Algorithmic Subsampling" @default.
- W4288288291 doi "https://doi.org/10.48550/arxiv.1907.01954" @default.
- W4288288291 hasPublicationYear "2019" @default.
- W4288288291 type Work @default.
- W4288288291 citedByCount "0" @default.
- W4288288291 crossrefType "posted-content" @default.
- W4288288291 hasAuthorship W4288288291A5028536256 @default.
- W4288288291 hasAuthorship W4288288291A5056976753 @default.
- W4288288291 hasBestOaLocation W42882882911 @default.
- W4288288291 hasConcept C105795698 @default.
- W4288288291 hasConcept C111919701 @default.
- W4288288291 hasConcept C11413529 @default.
- W4288288291 hasConcept C124101348 @default.
- W4288288291 hasConcept C12713177 @default.
- W4288288291 hasConcept C129848803 @default.
- W4288288291 hasConcept C134261354 @default.
- W4288288291 hasConcept C135598885 @default.
- W4288288291 hasConcept C154945302 @default.
- W4288288291 hasConcept C185429906 @default.
- W4288288291 hasConcept C185592680 @default.
- W4288288291 hasConcept C198531522 @default.
- W4288288291 hasConcept C199683683 @default.
- W4288288291 hasConcept C2776214188 @default.
- W4288288291 hasConcept C2779231336 @default.
- W4288288291 hasConcept C32834561 @default.
- W4288288291 hasConcept C33923547 @default.
- W4288288291 hasConcept C41008148 @default.
- W4288288291 hasConcept C43617362 @default.
- W4288288291 hasConcept C77088390 @default.
- W4288288291 hasConcept C80444323 @default.
- W4288288291 hasConceptScore W4288288291C105795698 @default.
- W4288288291 hasConceptScore W4288288291C111919701 @default.
- W4288288291 hasConceptScore W4288288291C11413529 @default.
- W4288288291 hasConceptScore W4288288291C124101348 @default.
- W4288288291 hasConceptScore W4288288291C12713177 @default.
- W4288288291 hasConceptScore W4288288291C129848803 @default.
- W4288288291 hasConceptScore W4288288291C134261354 @default.
- W4288288291 hasConceptScore W4288288291C135598885 @default.
- W4288288291 hasConceptScore W4288288291C154945302 @default.
- W4288288291 hasConceptScore W4288288291C185429906 @default.
- W4288288291 hasConceptScore W4288288291C185592680 @default.
- W4288288291 hasConceptScore W4288288291C198531522 @default.
- W4288288291 hasConceptScore W4288288291C199683683 @default.
- W4288288291 hasConceptScore W4288288291C2776214188 @default.
- W4288288291 hasConceptScore W4288288291C2779231336 @default.
- W4288288291 hasConceptScore W4288288291C32834561 @default.
- W4288288291 hasConceptScore W4288288291C33923547 @default.
- W4288288291 hasConceptScore W4288288291C41008148 @default.
- W4288288291 hasConceptScore W4288288291C43617362 @default.
- W4288288291 hasConceptScore W4288288291C77088390 @default.
- W4288288291 hasConceptScore W4288288291C80444323 @default.
- W4288288291 hasLocation W42882882911 @default.
- W4288288291 hasOpenAccess W4288288291 @default.
- W4288288291 hasPrimaryLocation W42882882911 @default.
- W4288288291 hasRelatedWork W1710041548 @default.
- W4288288291 hasRelatedWork W1809688448 @default.
- W4288288291 hasRelatedWork W2345894225 @default.
- W4288288291 hasRelatedWork W2556976081 @default.
- W4288288291 hasRelatedWork W2954154946 @default.
- W4288288291 hasRelatedWork W2965434730 @default.
- W4288288291 hasRelatedWork W3106484983 @default.
- W4288288291 hasRelatedWork W4247364450 @default.
- W4288288291 hasRelatedWork W4293502472 @default.
- W4288288291 hasRelatedWork W4293542482 @default.
- W4288288291 isParatext "false" @default.
- W4288288291 isRetracted "false" @default.
- W4288288291 workType "article" @default.