Matches in SemOpenAlex for { <https://semopenalex.org/work/W2945019988> ?p ?o ?g. }
- W2945019988 abstract "Genomic databases are substantially biased towards European ancestry populations, and this bias contributes to health disparities. Here, we quantify how well 66,971 experimentally characterized human protein 3D structures represent the diversity of protein sequences observed across the 1000 Genomes Project. More than 85% of available structures do not match a sequence observed in at least one individual, and on average structures match the sequence of 74% of individuals. Nearly 23% of human structures do not match any observed sequences; however, after masking engineered/known mutations, this decreases to ~4%. African ancestry sequences are modestly, but significantly, less likely to be represented by structures (73.5% vs. 74.0%). These differences are mainly driven by the greater genetic diversity of African populations. We identify thousands of variants unrepresented in available structures that influence protein structure and function. Thus, the use of a single structure as representative of “the wild type” protein will often bias results against many individuals. The diversity of protein sequence and structure must be considered to enable accurate, reproducible, and generalizable conclusions from structural analyses." @default.
- W2945019988 created "2019-05-29" @default.
- W2945019988 creator A5001777415 @default.
- W2945019988 creator A5007437458 @default.
- W2945019988 creator A5017645655 @default.
- W2945019988 creator A5025842562 @default.
- W2945019988 creator A5034773030 @default.
- W2945019988 creator A5035697464 @default.
- W2945019988 creator A5037381151 @default.
- W2945019988 date "2019-05-17" @default.
- W2945019988 modified "2023-09-23" @default.
- W2945019988 title "Do available protein 3D structures reflect human genetic and functional diversity?" @default.
- W2945019988 cites W1508885706 @default.
- W2945019988 cites W1528805424 @default.
- W2945019988 cites W1538925875 @default.
- W2945019988 cites W1762366815 @default.
- W2945019988 cites W1839757354 @default.
- W2945019988 cites W1935098740 @default.
- W2945019988 cites W1940319397 @default.
- W2945019988 cites W1951899808 @default.
- W2945019988 cites W1966503742 @default.
- W2945019988 cites W1969735540 @default.
- W2945019988 cites W1970519289 @default.
- W2945019988 cites W1971248251 @default.
- W2945019988 cites W1974599488 @default.
- W2945019988 cites W1982516282 @default.
- W2945019988 cites W1984068087 @default.
- W2945019988 cites W1987754412 @default.
- W2945019988 cites W1997476591 @default.
- W2945019988 cites W2001842251 @default.
- W2945019988 cites W2009313526 @default.
- W2945019988 cites W2010427019 @default.
- W2945019988 cites W2014342098 @default.
- W2945019988 cites W2018518196 @default.
- W2945019988 cites W2024405748 @default.
- W2945019988 cites W2026924192 @default.
- W2945019988 cites W2034699378 @default.
- W2945019988 cites W2036574792 @default.
- W2945019988 cites W2036742645 @default.
- W2945019988 cites W2044740902 @default.
- W2945019988 cites W2046626895 @default.
- W2945019988 cites W2052934239 @default.
- W2945019988 cites W2059145105 @default.
- W2945019988 cites W2061098698 @default.
- W2945019988 cites W2063274819 @default.
- W2945019988 cites W2075378764 @default.
- W2945019988 cites W2075468691 @default.
- W2945019988 cites W2084168484 @default.
- W2945019988 cites W2087588809 @default.
- W2945019988 cites W2093916754 @default.
- W2945019988 cites W2098432123 @default.
- W2945019988 cites W2102270144 @default.
- W2945019988 cites W2102517327 @default.
- W2945019988 cites W2104549677 @default.
- W2945019988 cites W2108237355 @default.
- W2945019988 cites W2110513645 @default.
- W2945019988 cites W2112092292 @default.
- W2945019988 cites W2114029728 @default.
- W2945019988 cites W2114162221 @default.
- W2945019988 cites W2114850508 @default.
- W2945019988 cites W2128865876 @default.
- W2945019988 cites W2129952088 @default.
- W2945019988 cites W2130479394 @default.
- W2945019988 cites W2132629607 @default.
- W2945019988 cites W2138230100 @default.
- W2945019988 cites W2145187337 @default.
- W2945019988 cites W2145203699 @default.
- W2945019988 cites W2145355174 @default.
- W2945019988 cites W2145809383 @default.
- W2945019988 cites W2153118028 @default.
- W2945019988 cites W2153457180 @default.
- W2945019988 cites W2154139219 @default.
- W2945019988 cites W2155699197 @default.
- W2945019988 cites W2163722216 @default.
- W2945019988 cites W2164004777 @default.
- W2945019988 cites W2168141575 @default.
- W2945019988 cites W2170187925 @default.
- W2945019988 cites W2170463736 @default.
- W2945019988 cites W2171777347 @default.
- W2945019988 cites W2256016639 @default.
- W2945019988 cites W2304402346 @default.
- W2945019988 cites W2329592726 @default.
- W2945019988 cites W2332540610 @default.
- W2945019988 cites W2343497810 @default.
- W2945019988 cites W2344561059 @default.
- W2945019988 cites W2416319244 @default.
- W2945019988 cites W2417483443 @default.
- W2945019988 cites W2437351093 @default.
- W2945019988 cites W2461009071 @default.
- W2945019988 cites W2469640117 @default.
- W2945019988 cites W2494596175 @default.
- W2945019988 cites W2507362179 @default.
- W2945019988 cites W2512464499 @default.
- W2945019988 cites W2516720184 @default.
- W2945019988 cites W2520345180 @default.
- W2945019988 cites W2522993044 @default.
- W2945019988 cites W2529098614 @default.
- W2945019988 cites W2529924676 @default.
- W2945019988 cites W2531208577 @default.
- W2945019988 cites W2531587846 @default.