Matches in SemOpenAlex for { <https://semopenalex.org/work/W4225263025> ?p ?o ?g. }
- W4225263025 endingPage "4478" @default.
- W4225263025 startingPage "4469" @default.
- W4225263025 abstract "Detecting early-stage lung cancer is critical to reduce the lung cancer mortality rate; however, existing models based on germline variants perform poorly, and new models are needed. This study aimed to use extreme gradient boosting to develop a predictive model for the early diagnosis of lung cancer in a multicenter case-control study. A total of 974 cases and 1005 controls in Shanghai and Taizhou were recruited, and 61 single nucleotide polymorphisms (SNPs) were genotyped. Multivariate logistic regression was used to calculate the association between signal SNPs and lung cancer risk. Logistic regression (LR) and extreme gradient boosting (XGBoost) algorithms, a large-scale machine learning algorithm, were adopted to build the lung cancer risk model. In both models, 10-fold cross-validation was performed, and model predictive performance was evaluated by the area under the curve (AUC). After FDR adjustment, TYMS rs3819102 and BAG6 rs1077393 were significantly associated with lung cancer risk (p < 0.05). For lung cancer risk prediction, the model predicted only with epidemiology attained an AUC of 0.703 for LR and 0.744 for XGBoost. Compared with the LR model predicted only with epidemiology, further adding SNPs and applying XGBoost increased the AUC to 0.759 (p < 0.001) in the XGBoost model. BAG6 rs1077393 was the most important predictor among all SNPs in the lung cancer prediction XGBoost model, followed by TERT rs2735845 and CAMKK1 rs7214723. Further stratification in lung adenocarcinoma (ADC) showed a significantly elevated performance from 0.639 to 0.699 (p = 0.009) when applying XGBoost and adding SNPs to the model, while the best model for lung squamous cell carcinoma (SCC) prediction was the LR model predicted with epidemiology and SNPs (AUC = 0.833), compared with the XGBoost model (AUC = 0.816). Our lung cancer risk prediction models in the Chinese population have a strong predictive ability, especially for SCC. Adding SNPs and applying the XGBoost algorithm to the epidemiologic-based logistic regression risk prediction model significantly improves model performance." @default.
- W4225263025 created "2022-05-04" @default.
- W4225263025 creator A5001019377 @default.
- W4225263025 creator A5023740597 @default.
- W4225263025 creator A5029636776 @default.
- W4225263025 creator A5029667848 @default.
- W4225263025 creator A5047419908 @default.
- W4225263025 creator A5048792086 @default.
- W4225263025 creator A5050842690 @default.
- W4225263025 creator A5050874906 @default.
- W4225263025 creator A5051169949 @default.
- W4225263025 creator A5054331179 @default.
- W4225263025 creator A5057513774 @default.
- W4225263025 creator A5063297268 @default.
- W4225263025 creator A5069574081 @default.
- W4225263025 date "2022-05-02" @default.
- W4225263025 modified "2023-10-11" @default.
- W4225263025 title "Prediction of lung cancer risk in Chinese population with genetic‐environment factor using extreme gradient boosting" @default.
- W4225263025 cites W1723129825 @default.
- W4225263025 cites W1964955321 @default.
- W4225263025 cites W1972020976 @default.
- W4225263025 cites W1972759648 @default.
- W4225263025 cites W1982631803 @default.
- W4225263025 cites W2005312273 @default.
- W4225263025 cites W2021927695 @default.
- W4225263025 cites W2023026302 @default.
- W4225263025 cites W2046267064 @default.
- W4225263025 cites W2065121466 @default.
- W4225263025 cites W2079675254 @default.
- W4225263025 cites W2090934237 @default.
- W4225263025 cites W2096904202 @default.
- W4225263025 cites W2100393058 @default.
- W4225263025 cites W2106323628 @default.
- W4225263025 cites W2135820395 @default.
- W4225263025 cites W2145042701 @default.
- W4225263025 cites W2151232293 @default.
- W4225263025 cites W2157425429 @default.
- W4225263025 cites W2158143121 @default.
- W4225263025 cites W2166521422 @default.
- W4225263025 cites W2171697262 @default.
- W4225263025 cites W2275877493 @default.
- W4225263025 cites W2345128305 @default.
- W4225263025 cites W2346025119 @default.
- W4225263025 cites W2393905053 @default.
- W4225263025 cites W2401102263 @default.
- W4225263025 cites W2423356630 @default.
- W4225263025 cites W2465992929 @default.
- W4225263025 cites W2476435664 @default.
- W4225263025 cites W2580900181 @default.
- W4225263025 cites W2585225471 @default.
- W4225263025 cites W2612427157 @default.
- W4225263025 cites W2752287273 @default.
- W4225263025 cites W2883386660 @default.
- W4225263025 cites W2910178013 @default.
- W4225263025 cites W2910517555 @default.
- W4225263025 cites W2911188335 @default.
- W4225263025 cites W2915804716 @default.
- W4225263025 cites W2946657092 @default.
- W4225263025 cites W2990044667 @default.
- W4225263025 cites W3017142950 @default.
- W4225263025 cites W3128646645 @default.
- W4225263025 cites W4225263025 @default.
- W4225263025 cites W4297900711 @default.
- W4225263025 doi "https://doi.org/10.1002/cam4.4800" @default.
- W4225263025 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35499292" @default.
- W4225263025 hasPublicationYear "2022" @default.
- W4225263025 type Work @default.
- W4225263025 citedByCount "6" @default.
- W4225263025 countsByYear W42252630252022 @default.
- W4225263025 countsByYear W42252630252023 @default.
- W4225263025 crossrefType "journal-article" @default.
- W4225263025 hasAuthorship W4225263025A5001019377 @default.
- W4225263025 hasAuthorship W4225263025A5023740597 @default.
- W4225263025 hasAuthorship W4225263025A5029636776 @default.
- W4225263025 hasAuthorship W4225263025A5029667848 @default.
- W4225263025 hasAuthorship W4225263025A5047419908 @default.
- W4225263025 hasAuthorship W4225263025A5048792086 @default.
- W4225263025 hasAuthorship W4225263025A5050842690 @default.
- W4225263025 hasAuthorship W4225263025A5050874906 @default.
- W4225263025 hasAuthorship W4225263025A5051169949 @default.
- W4225263025 hasAuthorship W4225263025A5054331179 @default.
- W4225263025 hasAuthorship W4225263025A5057513774 @default.
- W4225263025 hasAuthorship W4225263025A5063297268 @default.
- W4225263025 hasAuthorship W4225263025A5069574081 @default.
- W4225263025 hasBestOaLocation W42252630251 @default.
- W4225263025 hasConcept C104317684 @default.
- W4225263025 hasConcept C105795698 @default.
- W4225263025 hasConcept C126322002 @default.
- W4225263025 hasConcept C135763542 @default.
- W4225263025 hasConcept C143998085 @default.
- W4225263025 hasConcept C151956035 @default.
- W4225263025 hasConcept C153209595 @default.
- W4225263025 hasConcept C161584116 @default.
- W4225263025 hasConcept C2776256026 @default.
- W4225263025 hasConcept C33923547 @default.
- W4225263025 hasConcept C45804977 @default.
- W4225263025 hasConcept C54355233 @default.
- W4225263025 hasConcept C71924100 @default.