Matches in SemOpenAlex for { <https://semopenalex.org/work/W2972447388> ?p ?o ?g. }
- W2972447388 abstract "In this paper, we consider the problem of classification of $M$ high dimensional queries $y^1,cdots,y^Min B^S$ to $N$ high dimensional classes $x^1,cdots,x^Nin A^S$ where $A$ and $B$ are discrete alphabets and the probabilistic model that relates data to the classes $P(x,y)$ is known. This problem has applications in various fields including the database search problem in mass spectrometry. The problem is analogous to the nearest neighbor search problem, where the goal is to find the data point in a database that is the most similar to a query point. The state of the art method for solving an approximate version of the nearest neighbor search problem in high dimensions is locality sensitive hashing (LSH). LSH is based on designing hash functions that map near points to the same buckets with a probability higher than random (far) points. To solve our high dimensional classification problem, we introduce distribution sensitive hashes that map jointly generated pairs $(x,y)sim P$ to the same bucket with probability higher than random pairs $xsim P^A$ and $ysim P^B$, where $P^A$ and $P^B$ are the marginal probability distributions of $P$. We design distribution sensitive hashes using a forest of decision trees and we show that the complexity of search grows with $O(N^{lambda^*(P)})$ where $lambda^*(P)$ is expressed in an analytical form. We further show that the proposed hashes perform faster than state of the art approximate nearest neighbor search methods for a range of probability distributions, in both theory and simulations. Finally, we apply our method to the spectral library search problem in mass spectrometry, and show that it is an order of magnitude faster than the state of the art methods." @default.
- W2972447388 created "2019-09-19" @default.
- W2972447388 creator A5010942187 @default.
- W2972447388 creator A5013024599 @default.
- W2972447388 creator A5024347525 @default.
- W2972447388 creator A5033781263 @default.
- W2972447388 creator A5071221873 @default.
- W2972447388 creator A5071500676 @default.
- W2972447388 date "2019-05-11" @default.
- W2972447388 modified "2023-09-27" @default.
- W2972447388 title "ForestDSH: A Universal Hash Design for Discrete Probability Distributions" @default.
- W2972447388 cites W1502916507 @default.
- W2972447388 cites W1503158680 @default.
- W2972447388 cites W1672197616 @default.
- W2972447388 cites W2012833704 @default.
- W2972447388 cites W2014096895 @default.
- W2972447388 cites W2017851434 @default.
- W2972447388 cites W2023096047 @default.
- W2972447388 cites W2024668293 @default.
- W2972447388 cites W2028673251 @default.
- W2972447388 cites W2068074736 @default.
- W2972447388 cites W2084809122 @default.
- W2972447388 cites W2097921974 @default.
- W2972447388 cites W2106854428 @default.
- W2972447388 cites W2118269922 @default.
- W2972447388 cites W2118323718 @default.
- W2972447388 cites W2121386739 @default.
- W2972447388 cites W2123485784 @default.
- W2972447388 cites W2138742001 @default.
- W2972447388 cites W2147717514 @default.
- W2972447388 cites W2148781362 @default.
- W2972447388 cites W2152926413 @default.
- W2972447388 cites W2165558283 @default.
- W2972447388 cites W2183087644 @default.
- W2972447388 cites W2194357627 @default.
- W2972447388 cites W2362855512 @default.
- W2972447388 cites W2461743311 @default.
- W2972447388 cites W2574633002 @default.
- W2972447388 cites W2739554176 @default.
- W2972447388 cites W2744136723 @default.
- W2972447388 cites W2751120573 @default.
- W2972447388 cites W2757686304 @default.
- W2972447388 cites W2794407684 @default.
- W2972447388 cites W2809514611 @default.
- W2972447388 cites W2963056065 @default.
- W2972447388 cites W2963964051 @default.
- W2972447388 cites W3017143921 @default.
- W2972447388 cites W1958328454 @default.
- W2972447388 cites W2562090743 @default.
- W2972447388 hasPublicationYear "2019" @default.
- W2972447388 type Work @default.
- W2972447388 sameAs 2972447388 @default.
- W2972447388 citedByCount "0" @default.
- W2972447388 crossrefType "posted-content" @default.
- W2972447388 hasAuthorship W2972447388A5010942187 @default.
- W2972447388 hasAuthorship W2972447388A5013024599 @default.
- W2972447388 hasAuthorship W2972447388A5024347525 @default.
- W2972447388 hasAuthorship W2972447388A5033781263 @default.
- W2972447388 hasAuthorship W2972447388A5071221873 @default.
- W2972447388 hasAuthorship W2972447388A5071500676 @default.
- W2972447388 hasConcept C113238511 @default.
- W2972447388 hasConcept C11413529 @default.
- W2972447388 hasConcept C114614502 @default.
- W2972447388 hasConcept C116738811 @default.
- W2972447388 hasConcept C118615104 @default.
- W2972447388 hasConcept C124101348 @default.
- W2972447388 hasConcept C154945302 @default.
- W2972447388 hasConcept C33923547 @default.
- W2972447388 hasConcept C38652104 @default.
- W2972447388 hasConcept C41008148 @default.
- W2972447388 hasConcept C48103436 @default.
- W2972447388 hasConcept C67388219 @default.
- W2972447388 hasConcept C74270461 @default.
- W2972447388 hasConcept C99138194 @default.
- W2972447388 hasConceptScore W2972447388C113238511 @default.
- W2972447388 hasConceptScore W2972447388C11413529 @default.
- W2972447388 hasConceptScore W2972447388C114614502 @default.
- W2972447388 hasConceptScore W2972447388C116738811 @default.
- W2972447388 hasConceptScore W2972447388C118615104 @default.
- W2972447388 hasConceptScore W2972447388C124101348 @default.
- W2972447388 hasConceptScore W2972447388C154945302 @default.
- W2972447388 hasConceptScore W2972447388C33923547 @default.
- W2972447388 hasConceptScore W2972447388C38652104 @default.
- W2972447388 hasConceptScore W2972447388C41008148 @default.
- W2972447388 hasConceptScore W2972447388C48103436 @default.
- W2972447388 hasConceptScore W2972447388C67388219 @default.
- W2972447388 hasConceptScore W2972447388C74270461 @default.
- W2972447388 hasConceptScore W2972447388C99138194 @default.
- W2972447388 hasLocation W29724473881 @default.
- W2972447388 hasOpenAccess W2972447388 @default.
- W2972447388 hasPrimaryLocation W29724473881 @default.
- W2972447388 hasRelatedWork W1492925305 @default.
- W2972447388 hasRelatedWork W1979848457 @default.
- W2972447388 hasRelatedWork W2050749090 @default.
- W2972447388 hasRelatedWork W2053993223 @default.
- W2972447388 hasRelatedWork W2071572981 @default.
- W2972447388 hasRelatedWork W2071866949 @default.
- W2972447388 hasRelatedWork W2077178480 @default.
- W2972447388 hasRelatedWork W2116487896 @default.
- W2972447388 hasRelatedWork W2279110065 @default.