Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000747708> ?p ?o ?g. }
- W2000747708 endingPage "251" @default.
- W2000747708 startingPage "241" @default.
- W2000747708 abstract "Abstract This paper reviews the use of measures of intermolecular similarity for processing databases of chemical structures, which play an important role in the discovery of new drugs by the pharmaceutical industry. The similarity measures considered here are based on the use of a fingerprint representation of molecular structure, where a fingerprint is a vector encoding the presence of fragment substructures in a molecule and where the similarity between pairs of such fingerprints is computed using an association coefficient such as the Tanimoto coefficient. The Similar Property Principle provides the basic rationale for the use of similarity methods in three important chemoinformatics applications—similarity searching, database clustering, and molecular diversity analysis. Similarity searching enables the identification of those molecules in a database that are most similar to a user‐defined, biologically active query molecule, with data fusion providing an effective way of combining the results of multiple similarity searches. Cluster analysis, typically using the Jarvis–Patrick, Ward, or divisive k ‐means clustering methods, enables the cost‐effective selection of molecules for biological testing, for property prediction and for investigating database overlap. Molecular diversity analysis, typically using cluster‐based, dissimilarity‐based, or optimization‐based approaches, enables the identification of structurally diverse sets of molecules, so as to ensure that the full chemical space spanned by a database is tested in the search for novel bioactive molecules. © 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011 1 241–251 DOI: 10.1002/widm.26 This article is categorized under: Application Areas > Business and Industry Application Areas > Industry Specific Applications" @default.
- W2000747708 created "2016-06-24" @default.
- W2000747708 creator A5028542544 @default.
- W2000747708 date "2011-03-24" @default.
- W2000747708 modified "2023-10-16" @default.
- W2000747708 title "Similarity‐based data mining in files of two‐dimensional chemical structures using fingerprint measures of molecular resemblance" @default.
- W2000747708 cites W1501603356 @default.
- W2000747708 cites W1514906651 @default.
- W2000747708 cites W1531524766 @default.
- W2000747708 cites W1553690220 @default.
- W2000747708 cites W1557618397 @default.
- W2000747708 cites W1574994167 @default.
- W2000747708 cites W1595350250 @default.
- W2000747708 cites W1603605260 @default.
- W2000747708 cites W19552341 @default.
- W2000747708 cites W1963790256 @default.
- W2000747708 cites W1965721706 @default.
- W2000747708 cites W1969462276 @default.
- W2000747708 cites W1972155638 @default.
- W2000747708 cites W1973175049 @default.
- W2000747708 cites W1973552820 @default.
- W2000747708 cites W1974880836 @default.
- W2000747708 cites W1977934535 @default.
- W2000747708 cites W1978827558 @default.
- W2000747708 cites W1982870104 @default.
- W2000747708 cites W1983610905 @default.
- W2000747708 cites W1984954112 @default.
- W2000747708 cites W1985067238 @default.
- W2000747708 cites W1988037271 @default.
- W2000747708 cites W1990451437 @default.
- W2000747708 cites W1991262449 @default.
- W2000747708 cites W1994440731 @default.
- W2000747708 cites W1999440281 @default.
- W2000747708 cites W2000376564 @default.
- W2000747708 cites W2002009218 @default.
- W2000747708 cites W2005685204 @default.
- W2000747708 cites W2006690943 @default.
- W2000747708 cites W2010857114 @default.
- W2000747708 cites W2012540065 @default.
- W2000747708 cites W2012839892 @default.
- W2000747708 cites W2012949016 @default.
- W2000747708 cites W2016381774 @default.
- W2000747708 cites W2017380317 @default.
- W2000747708 cites W2020118174 @default.
- W2000747708 cites W2021083597 @default.
- W2000747708 cites W2021748110 @default.
- W2000747708 cites W2023550615 @default.
- W2000747708 cites W2025984715 @default.
- W2000747708 cites W2026265322 @default.
- W2000747708 cites W2027065525 @default.
- W2000747708 cites W2032726880 @default.
- W2000747708 cites W2036387645 @default.
- W2000747708 cites W2039709045 @default.
- W2000747708 cites W2042007894 @default.
- W2000747708 cites W2051594610 @default.
- W2000747708 cites W2052367365 @default.
- W2000747708 cites W2052633277 @default.
- W2000747708 cites W2056881083 @default.
- W2000747708 cites W2057350662 @default.
- W2000747708 cites W2058119599 @default.
- W2000747708 cites W2060097376 @default.
- W2000747708 cites W2071200203 @default.
- W2000747708 cites W2072598797 @default.
- W2000747708 cites W2073579345 @default.
- W2000747708 cites W2075917770 @default.
- W2000747708 cites W2077946617 @default.
- W2000747708 cites W2078882389 @default.
- W2000747708 cites W2085890279 @default.
- W2000747708 cites W2087302012 @default.
- W2000747708 cites W2089923519 @default.
- W2000747708 cites W2091589454 @default.
- W2000747708 cites W2096729078 @default.
- W2000747708 cites W2100916935 @default.
- W2000747708 cites W2103626206 @default.
- W2000747708 cites W2112912103 @default.
- W2000747708 cites W2120540345 @default.
- W2000747708 cites W2121895929 @default.
- W2000747708 cites W2125253492 @default.
- W2000747708 cites W2125792789 @default.
- W2000747708 cites W2139731838 @default.
- W2000747708 cites W2143456020 @default.
- W2000747708 cites W2150155198 @default.
- W2000747708 cites W2151175139 @default.
- W2000747708 cites W2153341679 @default.
- W2000747708 cites W2156077095 @default.
- W2000747708 cites W2158024647 @default.
- W2000747708 cites W2159822007 @default.
- W2000747708 cites W2160114756 @default.
- W2000747708 cites W2172024214 @default.
- W2000747708 cites W2177658341 @default.
- W2000747708 cites W2201639514 @default.
- W2000747708 cites W2204197091 @default.
- W2000747708 cites W2486095760 @default.
- W2000747708 cites W2498752956 @default.
- W2000747708 cites W2522646202 @default.
- W2000747708 cites W2614215910 @default.
- W2000747708 cites W3010454804 @default.
- W2000747708 cites W4236248134 @default.