Matches in SemOpenAlex for { <https://semopenalex.org/work/W4387358414> ?p ?o ?g. }
- W4387358414 abstract "Abstract The application of machine learning techniques in biological research, especially when dealing with limited data availability, poses significant challenges. In this study, we leveraged advancements in method development for predicting protein-protein binding strength to conduct a systematic investigation into the application of machine learning on limited data. The binding strength, quantitatively measured as binding affinity, is vital for understanding the processes of recognition, association, and dysfunction that occur within protein complexes. By incorporating transfer learning, integrating domain knowledge, and employing both deep learning and traditional machine learning algorithms, we mitigate the impact of data limitations and make significant advancements in predicting protein-protein binding affinity. In particular, we developed over 20 models, ultimately selecting three representative best-performing ones that belong to distinct categories. The first model is structure-based, consisting of a random forest regression and thirteen handcrafted features. The second model is sequence-based, employing an architecture that combines transferred embedding features with a multilayer perceptron. Finally, we created an ensemble model by averaging the predictions of the two aforementioned models. The comparison with other predictors on three independent datasets confirmed the significant improvements achieved by our models in predicting protein-protein binding affinity. The source codes for these three models are available at https://github.com/minghuilab/BindPPI ." @default.
- W4387358414 created "2023-10-06" @default.
- W4387358414 creator A5003705537 @default.
- W4387358414 creator A5029039847 @default.
- W4387358414 creator A5045677847 @default.
- W4387358414 creator A5051405817 @default.
- W4387358414 creator A5071684691 @default.
- W4387358414 date "2023-10-05" @default.
- W4387358414 modified "2023-10-18" @default.
- W4387358414 title "Systematic Investigation of Machine Learning on Limited Data: A Study on Predicting Protein-Protein Binding Strength" @default.
- W4387358414 cites W1030883578 @default.
- W4387358414 cites W1570416110 @default.
- W4387358414 cites W1915490970 @default.
- W4387358414 cites W1948562438 @default.
- W4387358414 cites W1968430567 @default.
- W4387358414 cites W1978287590 @default.
- W4387358414 cites W1981244178 @default.
- W4387358414 cites W1997870466 @default.
- W4387358414 cites W2008708467 @default.
- W4387358414 cites W2010490135 @default.
- W4387358414 cites W2016385245 @default.
- W4387358414 cites W2027408247 @default.
- W4387358414 cites W2028286374 @default.
- W4387358414 cites W2029667189 @default.
- W4387358414 cites W2031772330 @default.
- W4387358414 cites W2034542871 @default.
- W4387358414 cites W2036309115 @default.
- W4387358414 cites W2038264143 @default.
- W4387358414 cites W2043338013 @default.
- W4387358414 cites W2049275721 @default.
- W4387358414 cites W2050526879 @default.
- W4387358414 cites W2065838297 @default.
- W4387358414 cites W2084493621 @default.
- W4387358414 cites W2087485814 @default.
- W4387358414 cites W2087714175 @default.
- W4387358414 cites W2089335658 @default.
- W4387358414 cites W2092285329 @default.
- W4387358414 cites W2092387745 @default.
- W4387358414 cites W2093327248 @default.
- W4387358414 cites W2098312592 @default.
- W4387358414 cites W2099861795 @default.
- W4387358414 cites W2102245393 @default.
- W4387358414 cites W2110505462 @default.
- W4387358414 cites W2112320038 @default.
- W4387358414 cites W2115161731 @default.
- W4387358414 cites W2117960073 @default.
- W4387358414 cites W2120984924 @default.
- W4387358414 cites W2126103104 @default.
- W4387358414 cites W2129454543 @default.
- W4387358414 cites W2130479394 @default.
- W4387358414 cites W2130859704 @default.
- W4387358414 cites W2131063642 @default.
- W4387358414 cites W2145793759 @default.
- W4387358414 cites W2150981663 @default.
- W4387358414 cites W2151581834 @default.
- W4387358414 cites W2152830137 @default.
- W4387358414 cites W2153457180 @default.
- W4387358414 cites W2156698976 @default.
- W4387358414 cites W2157733131 @default.
- W4387358414 cites W2160868144 @default.
- W4387358414 cites W2162166182 @default.
- W4387358414 cites W2166496883 @default.
- W4387358414 cites W2171681733 @default.
- W4387358414 cites W2311607323 @default.
- W4387358414 cites W2328176404 @default.
- W4387358414 cites W2345331290 @default.
- W4387358414 cites W2474747771 @default.
- W4387358414 cites W2512372054 @default.
- W4387358414 cites W2554019130 @default.
- W4387358414 cites W2560233715 @default.
- W4387358414 cites W2612814374 @default.
- W4387358414 cites W2748251752 @default.
- W4387358414 cites W2774509494 @default.
- W4387358414 cites W2845275279 @default.
- W4387358414 cites W2886522305 @default.
- W4387358414 cites W2887441413 @default.
- W4387358414 cites W2904602199 @default.
- W4387358414 cites W2919115771 @default.
- W4387358414 cites W2950954328 @default.
- W4387358414 cites W2953103783 @default.
- W4387358414 cites W2979986977 @default.
- W4387358414 cites W2982586377 @default.
- W4387358414 cites W3007500173 @default.
- W4387358414 cites W3016114432 @default.
- W4387358414 cites W3047233668 @default.
- W4387358414 cites W3117622225 @default.
- W4387358414 cites W3136963718 @default.
- W4387358414 cites W3177828909 @default.
- W4387358414 cites W3207204660 @default.
- W4387358414 cites W3213922610 @default.
- W4387358414 cites W4200321201 @default.
- W4387358414 cites W4206007286 @default.
- W4387358414 cites W4221040627 @default.
- W4387358414 cites W4234914623 @default.
- W4387358414 cites W4281627698 @default.
- W4387358414 cites W4281742422 @default.
- W4387358414 cites W4319164454 @default.
- W4387358414 cites W4327550249 @default.
- W4387358414 cites W4383216409 @default.
- W4387358414 doi "https://doi.org/10.1101/2023.10.03.560786" @default.