Matches in SemOpenAlex for { <https://semopenalex.org/work/W4292604031> ?p ?o ?g. }
- W4292604031 endingPage "1300" @default.
- W4292604031 startingPage "1281" @default.
- W4292604031 abstract "In marine remote sensing, supervised learning can link variables measured in-situ near the ocean surface to variables that can be measured from space. However, the in-situ data used for training and validating such empirical satellite algorithms are often spatially auto-correlated and clustered, giving rise to various statistical challenges such as overfitting to spatial structures. Furthermore, co-located in-situ and satellite measurements are rare in the oceans because of the cost of data collection from research vessels and frequent cloud cover. We propose two methods to mitigate these challenges. The first method builds on spatial leave-one-out cross-validation (SLOOCV), an approach designed to provide sound error estimates when data are spatially auto-correlated by enforcing a minimum separation distance between training and test observations. However, estimating this distance may be impossible with sparse and spatially clustered data. We hence propose to iterate and integrate error estimates over a range of separation distances (iSLOOCV). To address the often-small size of labeled data sets based on marine in-situ data, we tested if increasing the number of observations for algorithm training by means of cloud-filling algorithms for marine satellite data improved predictions. The potential of these two methods is demonstrated by developing empirical algorithms for mapping the proportions of seven diagnostic pigments (DPs) that serve as proxies for phytoplankton community composition in the northern Gulf of Mexico. We estimated the prediction accuracy of 13 algorithms with iSLOOCV, using various sets of satellite data products as input, and found adequate algorithms for 4 of the 7 DPs. Random forests combining ocean color and environmental variables as input had the lowest prediction errors overall. Correlations between predictions and observations estimated by iSLOOCV ranged from 0.69 to 0.85 and mean absolute errors from 0.02 to 0.13. Daily maps and longer-term composites of these DPs were broadly consistent with previously published results. Overall, errors increased when extrapolating over larger distances, highlighting how iSLOOCV can illuminate changes in algorithm performance based on sub-regional data coverage. Generating larger training sets by prior gap-filling substantially improved all error measures for 3 of the 7 DPs, with mixed results for the others. Therefore, data augmentation by gap-filling of satellite data should not be used as a default approach but can be a useful tool when supervised learning applications are suspected to be limited by the size of the training set." @default.
- W4292604031 created "2022-08-22" @default.
- W4292604031 creator A5022537700 @default.
- W4292604031 creator A5081593945 @default.
- W4292604031 date "2022-08-22" @default.
- W4292604031 modified "2023-09-30" @default.
- W4292604031 title "Iterative spatial leave-one-out cross-validation and gap-filling based data augmentation for supervised learning applications in marine remote sensing" @default.
- W4292604031 cites W1832367887 @default.
- W4292604031 cites W1970370394 @default.
- W4292604031 cites W1973755634 @default.
- W4292604031 cites W1979058049 @default.
- W4292604031 cites W1984338772 @default.
- W4292604031 cites W1990985374 @default.
- W4292604031 cites W1996537271 @default.
- W4292604031 cites W1998025025 @default.
- W4292604031 cites W1999676742 @default.
- W4292604031 cites W2007101051 @default.
- W4292604031 cites W2007873570 @default.
- W4292604031 cites W2012471223 @default.
- W4292604031 cites W2024643119 @default.
- W4292604031 cites W2028755967 @default.
- W4292604031 cites W2031113870 @default.
- W4292604031 cites W2034013489 @default.
- W4292604031 cites W2038720636 @default.
- W4292604031 cites W2040277622 @default.
- W4292604031 cites W2046772901 @default.
- W4292604031 cites W2048985333 @default.
- W4292604031 cites W2056891759 @default.
- W4292604031 cites W2065853418 @default.
- W4292604031 cites W2069337962 @default.
- W4292604031 cites W2076514432 @default.
- W4292604031 cites W2088665921 @default.
- W4292604031 cites W2090373435 @default.
- W4292604031 cites W2092351456 @default.
- W4292604031 cites W2101270983 @default.
- W4292604031 cites W2104987933 @default.
- W4292604031 cites W2106416836 @default.
- W4292604031 cites W2112260599 @default.
- W4292604031 cites W2115359280 @default.
- W4292604031 cites W2128876958 @default.
- W4292604031 cites W2130956715 @default.
- W4292604031 cites W2133454557 @default.
- W4292604031 cites W2133910563 @default.
- W4292604031 cites W2135695572 @default.
- W4292604031 cites W2137572869 @default.
- W4292604031 cites W2144276939 @default.
- W4292604031 cites W2159400622 @default.
- W4292604031 cites W2163330578 @default.
- W4292604031 cites W2169678197 @default.
- W4292604031 cites W2176494706 @default.
- W4292604031 cites W2261059368 @default.
- W4292604031 cites W2316541453 @default.
- W4292604031 cites W2328503207 @default.
- W4292604031 cites W2416221158 @default.
- W4292604031 cites W2422544722 @default.
- W4292604031 cites W2560136348 @default.
- W4292604031 cites W2565108701 @default.
- W4292604031 cites W2589592374 @default.
- W4292604031 cites W2594738842 @default.
- W4292604031 cites W2613806236 @default.
- W4292604031 cites W2625303305 @default.
- W4292604031 cites W2729033468 @default.
- W4292604031 cites W2789701861 @default.
- W4292604031 cites W2790063226 @default.
- W4292604031 cites W2792528062 @default.
- W4292604031 cites W2803867753 @default.
- W4292604031 cites W2804749422 @default.
- W4292604031 cites W2810320009 @default.
- W4292604031 cites W2887085544 @default.
- W4292604031 cites W2891441096 @default.
- W4292604031 cites W2904381509 @default.
- W4292604031 cites W2910353990 @default.
- W4292604031 cites W2911964244 @default.
- W4292604031 cites W2914377344 @default.
- W4292604031 cites W2925730803 @default.
- W4292604031 cites W2942895301 @default.
- W4292604031 cites W2952516441 @default.
- W4292604031 cites W2955038187 @default.
- W4292604031 cites W2963959434 @default.
- W4292604031 cites W2964110223 @default.
- W4292604031 cites W2980094333 @default.
- W4292604031 cites W2981516216 @default.
- W4292604031 cites W2984947547 @default.
- W4292604031 cites W3004942926 @default.
- W4292604031 cites W3007598347 @default.
- W4292604031 cites W3011664049 @default.
- W4292604031 cites W3017346894 @default.
- W4292604031 cites W3040058291 @default.
- W4292604031 cites W3045569511 @default.
- W4292604031 cites W3048814641 @default.
- W4292604031 cites W3104887532 @default.
- W4292604031 cites W3106984187 @default.
- W4292604031 cites W3120014374 @default.
- W4292604031 cites W3127443316 @default.
- W4292604031 cites W3175797661 @default.
- W4292604031 cites W4220808180 @default.
- W4292604031 cites W4235751205 @default.
- W4292604031 cites W4253770764 @default.