Matches in SemOpenAlex for { <https://semopenalex.org/work/W2610239507> ?p ?o ?g. }
Showing items 1 to 69 of
69
with 100 items per page.
- W2610239507 abstract "A new method for categorization of macromolecular sequences is developed by applying the Minimal Length Encoding principle. While the method is developed with the particular application in mind, the formal model may be applied to any categorization problem where the observations can be represented by using a finite set of attributes that take values from a finite set. The proposed method is shown to be more general than the categorization procedures based on Weighted Parsimony and Compatibility principles. The proposed method also provides a principled tradeoff between the number of inferred classes and their fit to data. A statistical significance test for the presence of classes is also proposed.The categorization problem is proven to be computationally hard even under various simplifying assumptions. To avoid the excessive computation time, an algorithm that is based on a local search strategy is proposed. The algorithm was applied to rediscover several known macromolecular sequence families as well as to discover new classes of Bacteriophage T7 promoters and Alu sequences. The algorithm was also tested on simulated data to determine the range of data parameters where it categorizes the sequences correctly.While the application of the Minimal Length Encoding principle was restricted to the categorization of macromolecules, a very general background on the process of categorization and the Minimal Length Encoding principle is also provided. It is proposed that categorization, and perhaps inductive inference in general, can be viewed as a process of minimal length encoding of observations." @default.
- W2610239507 created "2017-05-12" @default.
- W2610239507 creator A5031403410 @default.
- W2610239507 creator A5090849321 @default.
- W2610239507 date "1990-01-01" @default.
- W2610239507 modified "2023-09-26" @default.
- W2610239507 title "Categorization of macromolecular sequences by minimal length encoding" @default.
- W2610239507 hasPublicationYear "1990" @default.
- W2610239507 type Work @default.
- W2610239507 sameAs 2610239507 @default.
- W2610239507 citedByCount "1" @default.
- W2610239507 crossrefType "journal-article" @default.
- W2610239507 hasAuthorship W2610239507A5031403410 @default.
- W2610239507 hasAuthorship W2610239507A5090849321 @default.
- W2610239507 hasConcept C105795698 @default.
- W2610239507 hasConcept C11413529 @default.
- W2610239507 hasConcept C125411270 @default.
- W2610239507 hasConcept C153180895 @default.
- W2610239507 hasConcept C154945302 @default.
- W2610239507 hasConcept C177264268 @default.
- W2610239507 hasConcept C179518139 @default.
- W2610239507 hasConcept C199360897 @default.
- W2610239507 hasConcept C2776214188 @default.
- W2610239507 hasConcept C33923547 @default.
- W2610239507 hasConcept C41008148 @default.
- W2610239507 hasConcept C45374587 @default.
- W2610239507 hasConcept C80444323 @default.
- W2610239507 hasConcept C94124525 @default.
- W2610239507 hasConceptScore W2610239507C105795698 @default.
- W2610239507 hasConceptScore W2610239507C11413529 @default.
- W2610239507 hasConceptScore W2610239507C125411270 @default.
- W2610239507 hasConceptScore W2610239507C153180895 @default.
- W2610239507 hasConceptScore W2610239507C154945302 @default.
- W2610239507 hasConceptScore W2610239507C177264268 @default.
- W2610239507 hasConceptScore W2610239507C179518139 @default.
- W2610239507 hasConceptScore W2610239507C199360897 @default.
- W2610239507 hasConceptScore W2610239507C2776214188 @default.
- W2610239507 hasConceptScore W2610239507C33923547 @default.
- W2610239507 hasConceptScore W2610239507C41008148 @default.
- W2610239507 hasConceptScore W2610239507C45374587 @default.
- W2610239507 hasConceptScore W2610239507C80444323 @default.
- W2610239507 hasConceptScore W2610239507C94124525 @default.
- W2610239507 hasLocation W26102395071 @default.
- W2610239507 hasOpenAccess W2610239507 @default.
- W2610239507 hasPrimaryLocation W26102395071 @default.
- W2610239507 hasRelatedWork W139939135 @default.
- W2610239507 hasRelatedWork W1498281580 @default.
- W2610239507 hasRelatedWork W180409659 @default.
- W2610239507 hasRelatedWork W2045954434 @default.
- W2610239507 hasRelatedWork W2072780137 @default.
- W2610239507 hasRelatedWork W2086632933 @default.
- W2610239507 hasRelatedWork W2095785277 @default.
- W2610239507 hasRelatedWork W2143491739 @default.
- W2610239507 hasRelatedWork W2144796611 @default.
- W2610239507 hasRelatedWork W2151882249 @default.
- W2610239507 hasRelatedWork W2477323061 @default.
- W2610239507 hasRelatedWork W2515698080 @default.
- W2610239507 hasRelatedWork W2614833841 @default.
- W2610239507 hasRelatedWork W2947185384 @default.
- W2610239507 hasRelatedWork W2963338864 @default.
- W2610239507 hasRelatedWork W2968374616 @default.
- W2610239507 hasRelatedWork W3029785921 @default.
- W2610239507 hasRelatedWork W3186151557 @default.
- W2610239507 hasRelatedWork W3212629129 @default.
- W2610239507 hasRelatedWork W78983331 @default.
- W2610239507 isParatext "false" @default.
- W2610239507 isRetracted "false" @default.
- W2610239507 magId "2610239507" @default.
- W2610239507 workType "article" @default.