Matches in SemOpenAlex for { <https://semopenalex.org/work/W4225900888> ?p ?o ?g. }
- W4225900888 abstract "Fitness functions map biological sequences to a scalar property of interest. Accurate estimation of these functions yields biological insight and sets the foundation for model-based sequence design. However, the fitness datasets available to learn these functions are typically small relative to the large combinatorial space of sequences; characterizing how much data are needed for accurate estimation remains an open problem. There is a growing body of evidence demonstrating that empirical fitness functions display substantial sparsity when represented in terms of epistatic interactions. Moreover, the theory of Compressed Sensing provides scaling laws for the number of samples required to exactly recover a sparse function. Motivated by these results, we develop a framework to study the sparsity of fitness functions sampled from a generalization of the NK model, a widely used random field model of fitness functions. In particular, we present results that allow us to test the effect of the Generalized NK (GNK) model's interpretable parameters-sequence length, alphabet size, and assumed interactions between sequence positions-on the sparsity of fitness functions sampled from the model and, consequently, the number of measurements required to exactly recover these functions. We validate our framework by demonstrating that GNK models with parameters set according to structural considerations can be used to accurately approximate the number of samples required to recover two empirical protein fitness functions and an RNA fitness function. In addition, we show that these GNK models identify important higher-order epistatic interactions in the empirical fitness functions using only structural information." @default.
- W4225900888 created "2022-05-05" @default.
- W4225900888 creator A5035371524 @default.
- W4225900888 creator A5062650028 @default.
- W4225900888 creator A5081006679 @default.
- W4225900888 date "2021-12-22" @default.
- W4225900888 modified "2023-10-09" @default.
- W4225900888 title "On the sparsity of fitness functions and implications for learning" @default.
- W4225900888 cites W1523985187 @default.
- W4225900888 cites W1526446139 @default.
- W4225900888 cites W1616786285 @default.
- W4225900888 cites W1857209124 @default.
- W4225900888 cites W1966234892 @default.
- W4225900888 cites W1968961038 @default.
- W4225900888 cites W1969857637 @default.
- W4225900888 cites W1971902669 @default.
- W4225900888 cites W1989901670 @default.
- W4225900888 cites W1997080762 @default.
- W4225900888 cites W1999064315 @default.
- W4225900888 cites W1999104032 @default.
- W4225900888 cites W2014159272 @default.
- W4225900888 cites W2016666153 @default.
- W4225900888 cites W2020552506 @default.
- W4225900888 cites W2038774686 @default.
- W4225900888 cites W2057239104 @default.
- W4225900888 cites W2064164319 @default.
- W4225900888 cites W2073171022 @default.
- W4225900888 cites W2086561953 @default.
- W4225900888 cites W2094458603 @default.
- W4225900888 cites W2116963188 @default.
- W4225900888 cites W2122881260 @default.
- W4225900888 cites W2130289825 @default.
- W4225900888 cites W2137198385 @default.
- W4225900888 cites W2145096794 @default.
- W4225900888 cites W2156454061 @default.
- W4225900888 cites W2370303266 @default.
- W4225900888 cites W2379594833 @default.
- W4225900888 cites W2483469645 @default.
- W4225900888 cites W2490805901 @default.
- W4225900888 cites W2765744127 @default.
- W4225900888 cites W2795578759 @default.
- W4225900888 cites W2913423296 @default.
- W4225900888 cites W2917580301 @default.
- W4225900888 cites W2937251344 @default.
- W4225900888 cites W2950308590 @default.
- W4225900888 cites W2951433247 @default.
- W4225900888 cites W2951905689 @default.
- W4225900888 cites W2953058007 @default.
- W4225900888 cites W2956569764 @default.
- W4225900888 cites W2971227267 @default.
- W4225900888 cites W2972979941 @default.
- W4225900888 cites W2974767112 @default.
- W4225900888 cites W2980172883 @default.
- W4225900888 cites W2980610099 @default.
- W4225900888 cites W3009349262 @default.
- W4225900888 cites W3015295562 @default.
- W4225900888 cites W3043104427 @default.
- W4225900888 cites W3098330097 @default.
- W4225900888 cites W3098888484 @default.
- W4225900888 cites W3098920008 @default.
- W4225900888 cites W3127426316 @default.
- W4225900888 cites W3144239152 @default.
- W4225900888 cites W3198879354 @default.
- W4225900888 cites W4250955649 @default.
- W4225900888 doi "https://doi.org/10.1073/pnas.2109649118" @default.
- W4225900888 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34937698" @default.
- W4225900888 hasPublicationYear "2021" @default.
- W4225900888 type Work @default.
- W4225900888 citedByCount "14" @default.
- W4225900888 countsByYear W42259008882022 @default.
- W4225900888 countsByYear W42259008882023 @default.
- W4225900888 crossrefType "journal-article" @default.
- W4225900888 hasAuthorship W4225900888A5035371524 @default.
- W4225900888 hasAuthorship W4225900888A5062650028 @default.
- W4225900888 hasAuthorship W4225900888A5081006679 @default.
- W4225900888 hasBestOaLocation W42259008881 @default.
- W4225900888 hasConcept C104317684 @default.
- W4225900888 hasConcept C126255220 @default.
- W4225900888 hasConcept C134306372 @default.
- W4225900888 hasConcept C14036430 @default.
- W4225900888 hasConcept C144024400 @default.
- W4225900888 hasConcept C148392497 @default.
- W4225900888 hasConcept C149923435 @default.
- W4225900888 hasConcept C154945302 @default.
- W4225900888 hasConcept C176066374 @default.
- W4225900888 hasConcept C177148314 @default.
- W4225900888 hasConcept C2778112365 @default.
- W4225900888 hasConcept C2908647359 @default.
- W4225900888 hasConcept C2910387474 @default.
- W4225900888 hasConcept C33923547 @default.
- W4225900888 hasConcept C41008148 @default.
- W4225900888 hasConcept C54355233 @default.
- W4225900888 hasConcept C55493867 @default.
- W4225900888 hasConcept C61727976 @default.
- W4225900888 hasConcept C78458016 @default.
- W4225900888 hasConcept C81917197 @default.
- W4225900888 hasConcept C86803240 @default.
- W4225900888 hasConcept C8880873 @default.
- W4225900888 hasConcept C91852762 @default.
- W4225900888 hasConceptScore W4225900888C104317684 @default.