Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000937352> ?p ?o ?g. }
- W2000937352 abstract "A new algorithm has been developed for generating conservation profiles that reflect the evolutionary history of the subfamily associated with a query sequence. It is based on n-gram patterns (NP{n,m}) which are sets of n residues and m wildcards in windows of size n+m. The generation of conservation profiles is treated as a signal-to-noise problem where the signal is the count of n-gram patterns in target sequences that are similar to the query sequence and the noise is the count over all target sequences. The signal is differentiated from the noise by applying singular value decomposition to sets of target sequences rank ordered by similarity with respect to the query. The new algorithm was used to construct 4,248 profiles from 120 randomly selected Pfam-A families. These were compared to profiles generated from multiple alignments using the consensus approach. The two profiles were similar whenever the subfamily associated with the query sequence was well represented in the multiple alignment. It was possible to construct subfamily specific conservation profiles using the new algorithm for subfamilies with as few as five members. The speed of the new algorithm was comparable to the multiple alignment approach. Subfamily specific conservation profiles can be generated by the new algorithm without aprioi knowledge of family relationships or domain architecture. This is useful when the subfamily contains multiple domains with different levels of representation in protein databases. It may also be applicable when the subfamily sample size is too small for the multiple alignment approach." @default.
- W2000937352 created "2016-06-24" @default.
- W2000937352 creator A5026769965 @default.
- W2000937352 creator A5049401693 @default.
- W2000937352 date "2008-01-30" @default.
- W2000937352 modified "2023-10-14" @default.
- W2000937352 title "Subfamily specific conservation profiles for proteins based on n-gram patterns" @default.
- W2000937352 cites W1485790693 @default.
- W2000937352 cites W1493206838 @default.
- W2000937352 cites W1513332069 @default.
- W2000937352 cites W1567621547 @default.
- W2000937352 cites W1965582988 @default.
- W2000937352 cites W1966628412 @default.
- W2000937352 cites W1967408543 @default.
- W2000937352 cites W1973578915 @default.
- W2000937352 cites W1990412107 @default.
- W2000937352 cites W2018998500 @default.
- W2000937352 cites W2043122071 @default.
- W2000937352 cites W2045197143 @default.
- W2000937352 cites W2048241494 @default.
- W2000937352 cites W2101017457 @default.
- W2000937352 cites W2113398700 @default.
- W2000937352 cites W2128328428 @default.
- W2000937352 cites W2136280642 @default.
- W2000937352 cites W2143210482 @default.
- W2000937352 cites W2148279834 @default.
- W2000937352 cites W2158714788 @default.
- W2000937352 cites W2161505943 @default.
- W2000937352 doi "https://doi.org/10.1186/1471-2105-9-72" @default.
- W2000937352 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2267698" @default.
- W2000937352 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/18234090" @default.
- W2000937352 hasPublicationYear "2008" @default.
- W2000937352 type Work @default.
- W2000937352 sameAs 2000937352 @default.
- W2000937352 citedByCount "15" @default.
- W2000937352 countsByYear W20009373522012 @default.
- W2000937352 countsByYear W20009373522014 @default.
- W2000937352 countsByYear W20009373522015 @default.
- W2000937352 countsByYear W20009373522017 @default.
- W2000937352 countsByYear W20009373522018 @default.
- W2000937352 countsByYear W20009373522019 @default.
- W2000937352 countsByYear W20009373522020 @default.
- W2000937352 countsByYear W20009373522021 @default.
- W2000937352 countsByYear W20009373522022 @default.
- W2000937352 countsByYear W20009373522023 @default.
- W2000937352 crossrefType "journal-article" @default.
- W2000937352 hasAuthorship W2000937352A5026769965 @default.
- W2000937352 hasAuthorship W2000937352A5049401693 @default.
- W2000937352 hasBestOaLocation W20009373521 @default.
- W2000937352 hasConcept C104317684 @default.
- W2000937352 hasConcept C11413529 @default.
- W2000937352 hasConcept C153180895 @default.
- W2000937352 hasConcept C154945302 @default.
- W2000937352 hasConcept C167625842 @default.
- W2000937352 hasConcept C199216141 @default.
- W2000937352 hasConcept C2778112365 @default.
- W2000937352 hasConcept C3017666073 @default.
- W2000937352 hasConcept C41008148 @default.
- W2000937352 hasConcept C45484198 @default.
- W2000937352 hasConcept C50929876 @default.
- W2000937352 hasConcept C54355233 @default.
- W2000937352 hasConcept C70721500 @default.
- W2000937352 hasConcept C86803240 @default.
- W2000937352 hasConcept C88031987 @default.
- W2000937352 hasConceptScore W2000937352C104317684 @default.
- W2000937352 hasConceptScore W2000937352C11413529 @default.
- W2000937352 hasConceptScore W2000937352C153180895 @default.
- W2000937352 hasConceptScore W2000937352C154945302 @default.
- W2000937352 hasConceptScore W2000937352C167625842 @default.
- W2000937352 hasConceptScore W2000937352C199216141 @default.
- W2000937352 hasConceptScore W2000937352C2778112365 @default.
- W2000937352 hasConceptScore W2000937352C3017666073 @default.
- W2000937352 hasConceptScore W2000937352C41008148 @default.
- W2000937352 hasConceptScore W2000937352C45484198 @default.
- W2000937352 hasConceptScore W2000937352C50929876 @default.
- W2000937352 hasConceptScore W2000937352C54355233 @default.
- W2000937352 hasConceptScore W2000937352C70721500 @default.
- W2000937352 hasConceptScore W2000937352C86803240 @default.
- W2000937352 hasConceptScore W2000937352C88031987 @default.
- W2000937352 hasIssue "1" @default.
- W2000937352 hasLocation W20009373521 @default.
- W2000937352 hasLocation W20009373522 @default.
- W2000937352 hasLocation W20009373523 @default.
- W2000937352 hasLocation W20009373524 @default.
- W2000937352 hasLocation W20009373525 @default.
- W2000937352 hasOpenAccess W2000937352 @default.
- W2000937352 hasPrimaryLocation W20009373521 @default.
- W2000937352 hasRelatedWork W1578673849 @default.
- W2000937352 hasRelatedWork W2004613702 @default.
- W2000937352 hasRelatedWork W2023479323 @default.
- W2000937352 hasRelatedWork W2026660542 @default.
- W2000937352 hasRelatedWork W2028741573 @default.
- W2000937352 hasRelatedWork W2056619794 @default.
- W2000937352 hasRelatedWork W2105375733 @default.
- W2000937352 hasRelatedWork W2124718920 @default.
- W2000937352 hasRelatedWork W2514476306 @default.
- W2000937352 hasRelatedWork W3000366369 @default.
- W2000937352 hasVolume "9" @default.
- W2000937352 isParatext "false" @default.
- W2000937352 isRetracted "false" @default.