Matches in SemOpenAlex for { <https://semopenalex.org/work/W4364381330> ?p ?o ?g. }
- W4364381330 abstract "Here we frame the cis-regulatory code (that connects the regulatory functions of non-coding regions, such as promoters and UTRs, to their DNA sequences) as a representation building problem. Representation learning has emerged as a new approach to understand the function of DNA and proteins, by projecting sequences into high-dimensional feature spaces, where the features are learned from data by a neural network. Inspired by these approaches, we seek to define a feature space where non-coding regions with similar regulatory functions are near each other. As a first attempt, we engineered features based on matches to biochemically characterized regulatory motifs in the DNA sequences of non-coding regions. Remarkably, we found that functionally similar promoters and 3’ UTRs could be grouped together in a feature space defined by simple averages of the best match scores in (unaligned) orthologous non-coding regions, which we refer to as phylogenetic average motif scores. Perhaps most importantly, because this feature space is based on known motifs and not fit to any data, it is fully interpretable and not limited to any particular cell type or experimental context. We find that we can read off known regulatory relationships and evolutionary rewiring from visualizations of phylogenetic average motif score representations, and that predicted regulatory interactions based on neighbors in the feature space are borne out in transcription factor deletion experiments. Phylogenetic averages of match scores to known motifs are a baseline for representation learning applied to non-coding sequences, and may continue to improve as databases of motifs become more complete." @default.
- W4364381330 created "2023-04-12" @default.
- W4364381330 creator A5002438634 @default.
- W4364381330 creator A5007939943 @default.
- W4364381330 creator A5023272254 @default.
- W4364381330 creator A5051419097 @default.
- W4364381330 date "2023-04-11" @default.
- W4364381330 modified "2023-09-26" @default.
- W4364381330 title "Functional similarity of non-coding regions is revealed in phylogenetic average motif score representations" @default.
- W4364381330 cites W110198413 @default.
- W4364381330 cites W1621181998 @default.
- W4364381330 cites W1962034131 @default.
- W4364381330 cites W1968569455 @default.
- W4364381330 cites W1969673449 @default.
- W4364381330 cites W1987122345 @default.
- W4364381330 cites W1989419347 @default.
- W4364381330 cites W1994267372 @default.
- W4364381330 cites W1996923044 @default.
- W4364381330 cites W2015312982 @default.
- W4364381330 cites W2015416439 @default.
- W4364381330 cites W2017017064 @default.
- W4364381330 cites W2020758057 @default.
- W4364381330 cites W2030337983 @default.
- W4364381330 cites W2033169664 @default.
- W4364381330 cites W2037652809 @default.
- W4364381330 cites W2037992536 @default.
- W4364381330 cites W2046111047 @default.
- W4364381330 cites W2049402769 @default.
- W4364381330 cites W2050833533 @default.
- W4364381330 cites W2059126678 @default.
- W4364381330 cites W2062592181 @default.
- W4364381330 cites W2082291109 @default.
- W4364381330 cites W2084728047 @default.
- W4364381330 cites W2087410568 @default.
- W4364381330 cites W2087672655 @default.
- W4364381330 cites W2089806726 @default.
- W4364381330 cites W2097311877 @default.
- W4364381330 cites W2097467142 @default.
- W4364381330 cites W2099189600 @default.
- W4364381330 cites W2101133358 @default.
- W4364381330 cites W2102159699 @default.
- W4364381330 cites W2102221598 @default.
- W4364381330 cites W2104262114 @default.
- W4364381330 cites W2105329815 @default.
- W4364381330 cites W2105694586 @default.
- W4364381330 cites W2107233900 @default.
- W4364381330 cites W2109314645 @default.
- W4364381330 cites W2109407861 @default.
- W4364381330 cites W2109818073 @default.
- W4364381330 cites W2118581428 @default.
- W4364381330 cites W2137683543 @default.
- W4364381330 cites W2143431735 @default.
- W4364381330 cites W2144454685 @default.
- W4364381330 cites W2144857133 @default.
- W4364381330 cites W2146206618 @default.
- W4364381330 cites W2150926065 @default.
- W4364381330 cites W2151233592 @default.
- W4364381330 cites W2151682452 @default.
- W4364381330 cites W2153562609 @default.
- W4364381330 cites W2158714788 @default.
- W4364381330 cites W2159686118 @default.
- W4364381330 cites W2171192413 @default.
- W4364381330 cites W2173548494 @default.
- W4364381330 cites W2180876606 @default.
- W4364381330 cites W2259938310 @default.
- W4364381330 cites W2307041907 @default.
- W4364381330 cites W2309916540 @default.
- W4364381330 cites W2331530999 @default.
- W4364381330 cites W2345512687 @default.
- W4364381330 cites W2601509136 @default.
- W4364381330 cites W2616964127 @default.
- W4364381330 cites W2759304130 @default.
- W4364381330 cites W2767222135 @default.
- W4364381330 cites W2804381698 @default.
- W4364381330 cites W2911390449 @default.
- W4364381330 cites W2911989762 @default.
- W4364381330 cites W2944545583 @default.
- W4364381330 cites W2952239877 @default.
- W4364381330 cites W2990395719 @default.
- W4364381330 cites W2995175313 @default.
- W4364381330 cites W3006349006 @default.
- W4364381330 cites W3012029968 @default.
- W4364381330 cites W3035587935 @default.
- W4364381330 cites W3083240949 @default.
- W4364381330 cites W3094755456 @default.
- W4364381330 cites W3099848476 @default.
- W4364381330 cites W3105951520 @default.
- W4364381330 cites W3112376646 @default.
- W4364381330 cites W3120635680 @default.
- W4364381330 cites W3127238141 @default.
- W4364381330 cites W3130819465 @default.
- W4364381330 cites W3131672097 @default.
- W4364381330 cites W3133944950 @default.
- W4364381330 cites W3135803532 @default.
- W4364381330 cites W3146944767 @default.
- W4364381330 cites W3184729641 @default.
- W4364381330 cites W3203588026 @default.
- W4364381330 cites W3215596355 @default.
- W4364381330 cites W4200135473 @default.
- W4364381330 cites W4221128536 @default.
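
The abstract of W4364381330 describes a feature built from "simple averages of the best match scores in (unaligned) orthologous non-coding regions". A minimal sketch of that idea follows. It is not the authors' implementation: the position weight matrix format (a list of per-position dicts mapping base to score), the forward-strand-only scan, and all function names are assumptions made for illustration.

```python
from statistics import mean

def best_match_score(seq, pwm):
    """Return the highest motif score over all windows of seq.

    pwm is assumed to be a list of dicts, one per motif position,
    mapping each base (A, C, G, T) to a score. Only the forward
    strand is scanned in this sketch.
    """
    width = len(pwm)
    if len(seq) < width:
        raise ValueError("sequence is shorter than the motif")
    return max(
        sum(pwm[i][seq[start + i]] for i in range(width))
        for start in range(len(seq) - width + 1)
    )

def phylogenetic_average_motif_score(orthologs, pwm):
    """Average the best match score across unaligned orthologous sequences."""
    return mean(best_match_score(seq, pwm) for seq in orthologs)

def feature_vector(orthologs, motifs):
    """One phylogenetic average motif score per motif, in the order given."""
    return [phylogenetic_average_motif_score(orthologs, pwm) for pwm in motifs]

# Toy two-position motif that rewards "AC" (hypothetical scores).
toy_pwm = [
    {"A": 1.0, "C": 0.0, "G": 0.0, "T": 0.0},
    {"A": 0.0, "C": 1.0, "G": 0.0, "T": 0.0},
]
toy_orthologs = ["GGAC", "GGAT"]  # hypothetical orthologous regions
toy_features = feature_vector(toy_orthologs, [toy_pwm])
```

Because each ortholog contributes only its single best match, no alignment between the orthologous regions is needed, which is the property the abstract highlights.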