Matches in SemOpenAlex for { <https://semopenalex.org/work/W2487091181> ?p ?o ?g. }
- W2487091181 endingPage "160275" @default.
- W2487091181 startingPage "160275" @default.
- W2487091181 abstract "The sizes of paralogues—gene families produced by ancestral duplication—are known to follow a power-law distribution. We examine the size distribution of gene sets or gene families where genes are grouped by a similar function or share a common property. The size distribution of Human Gene Nomenclature Committee (HGNC) gene sets deviate from the power-law, and can be fitted much better by a beta rank function. We propose a simple mechanism to break a power-law size distribution by a combination of splitting and merging operations. The largest gene sets are split into two to account for the subfunctional categories, and a small proportion of other gene sets are merged into larger sets as new common themes might be realized. These operations are not uncommon for a curator of gene sets. A simulation shows that iteration of these operations changes the size distribution of Ensembl paralogues and could lead to a distribution fitted by a rank beta function. We further illustrate application of beta rank function by the example of distribution of transcription factors and drug target genes among HGNC gene families." @default.
- W2487091181 created "2016-08-23" @default.
- W2487091181 creator A5022125219 @default.
- W2487091181 creator A5057425369 @default.
- W2487091181 creator A5066931410 @default.
- W2487091181 date "2016-08-01" @default.
- W2487091181 modified "2023-10-03" @default.
- W2487091181 title "Size distribution of function-based human gene sets and the split–merge model" @default.
- W2487091181 cites W1503101850 @default.
- W2487091181 cites W1511106231 @default.
- W2487091181 cites W1513618424 @default.
- W2487091181 cites W156481159 @default.
- W2487091181 cites W1581554695 @default.
- W2487091181 cites W1774711529 @default.
- W2487091181 cites W1914610732 @default.
- W2487091181 cites W1964903151 @default.
- W2487091181 cites W1965744864 @default.
- W2487091181 cites W1968971678 @default.
- W2487091181 cites W1969460352 @default.
- W2487091181 cites W1970853141 @default.
- W2487091181 cites W1971825415 @default.
- W2487091181 cites W1971921744 @default.
- W2487091181 cites W1980548655 @default.
- W2487091181 cites W1988714210 @default.
- W2487091181 cites W1991871385 @default.
- W2487091181 cites W1992375859 @default.
- W2487091181 cites W2001966208 @default.
- W2487091181 cites W2004759596 @default.
- W2487091181 cites W2006624552 @default.
- W2487091181 cites W2013571746 @default.
- W2487091181 cites W2017407156 @default.
- W2487091181 cites W2018990052 @default.
- W2487091181 cites W2027895157 @default.
- W2487091181 cites W2029161716 @default.
- W2487091181 cites W2032157052 @default.
- W2487091181 cites W2040640450 @default.
- W2487091181 cites W2048912867 @default.
- W2487091181 cites W2052794428 @default.
- W2487091181 cites W2055801034 @default.
- W2487091181 cites W2056179629 @default.
- W2487091181 cites W2059920749 @default.
- W2487091181 cites W2064231816 @default.
- W2487091181 cites W2072271719 @default.
- W2487091181 cites W2075286485 @default.
- W2487091181 cites W2076048958 @default.
- W2487091181 cites W2077399441 @default.
- W2487091181 cites W2078284062 @default.
- W2487091181 cites W2078817043 @default.
- W2487091181 cites W2082773101 @default.
- W2487091181 cites W2084394280 @default.
- W2487091181 cites W2084738599 @default.
- W2487091181 cites W2090939716 @default.
- W2487091181 cites W2091369243 @default.
- W2487091181 cites W2095829109 @default.
- W2487091181 cites W2096465161 @default.
- W2487091181 cites W2098412979 @default.
- W2487091181 cites W2103581172 @default.
- W2487091181 cites W2112493377 @default.
- W2487091181 cites W2116335962 @default.
- W2487091181 cites W2125406108 @default.
- W2487091181 cites W2127517102 @default.
- W2487091181 cites W2128296317 @default.
- W2487091181 cites W2130410032 @default.
- W2487091181 cites W2131620351 @default.
- W2487091181 cites W2136578623 @default.
- W2487091181 cites W2138280629 @default.
- W2487091181 cites W2139001898 @default.
- W2487091181 cites W2142635246 @default.
- W2487091181 cites W2144199168 @default.
- W2487091181 cites W2145100181 @default.
- W2487091181 cites W2150249052 @default.
- W2487091181 cites W2151763757 @default.
- W2487091181 cites W2152876917 @default.
- W2487091181 cites W2155095346 @default.
- W2487091181 cites W2155651473 @default.
- W2487091181 cites W2156564520 @default.
- W2487091181 cites W2162136869 @default.
- W2487091181 cites W2166085889 @default.
- W2487091181 cites W2168175751 @default.
- W2487091181 cites W2169239268 @default.
- W2487091181 cites W2170522882 @default.
- W2487091181 cites W2170570327 @default.
- W2487091181 cites W2183162925 @default.
- W2487091181 cites W2191975304 @default.
- W2487091181 cites W2194756172 @default.
- W2487091181 cites W2317633973 @default.
- W2487091181 cites W2915667579 @default.
- W2487091181 cites W2950627632 @default.
- W2487091181 cites W2962821542 @default.
- W2487091181 cites W3099005200 @default.
- W2487091181 cites W3103362336 @default.
- W2487091181 cites W3106889297 @default.
- W2487091181 cites W3126101131 @default.
- W2487091181 cites W319819633 @default.
- W2487091181 cites W4229487902 @default.
- W2487091181 cites W4230266413 @default.
- W2487091181 cites W765531983 @default.
- W2487091181 doi "https://doi.org/10.1098/rsos.160275" @default.