Matches in SemOpenAlex for { <https://semopenalex.org/work/W4224311959> ?p ?o ?g. }
- W4224311959 endingPage "e35734" @default.
- W4224311959 startingPage "e35734" @default.
- W4224311959 abstract "A regular task by developers and users of synthetic data generation (SDG) methods is to evaluate and compare the utility of these methods. Multiple utility metrics have been proposed and used to evaluate synthetic data. However, they have not been validated in general or for comparing SDG methods.This study evaluates the ability of common utility metrics to rank SDG methods according to performance on a specific analytic workload. The workload of interest is the use of synthetic data for logistic regression prediction models, which is a very frequent workload in health research.We evaluated 6 utility metrics on 30 different health data sets and 3 different SDG methods (a Bayesian network, a Generative Adversarial Network, and sequential tree synthesis). These metrics were computed by averaging across 20 synthetic data sets from the same generative model. The metrics were then tested on their ability to rank the SDG methods based on prediction performance. Prediction performance was defined as the difference between each of the area under the receiver operating characteristic curve and area under the precision-recall curve values on synthetic data logistic regression prediction models versus real data models.The utility metric best able to rank SDG methods was the multivariate Hellinger distance based on a Gaussian copula representation of real and synthetic joint distributions.This study has validated a generative model utility metric, the multivariate Hellinger distance, which can be used to reliably rank competing SDG methods on the same data set. The Hellinger distance metric can be used to evaluate and compare alternate SDG methods." @default.
- W4224311959 created "2022-04-26" @default.
- W4224311959 creator A5024567177 @default.
- W4224311959 creator A5035823866 @default.
- W4224311959 creator A5070248034 @default.
- W4224311959 creator A5071095124 @default.
- W4224311959 date "2022-04-07" @default.
- W4224311959 modified "2023-09-30" @default.
- W4224311959 title "Utility Metrics for Evaluating Synthetic Health Data Generation Methods: Validation Study" @default.
- W4224311959 cites W149976140 @default.
- W4224311959 cites W1530664329 @default.
- W4224311959 cites W1912982817 @default.
- W4224311959 cites W1976526581 @default.
- W4224311959 cites W1980515664 @default.
- W4224311959 cites W2003559619 @default.
- W4224311959 cites W2032938251 @default.
- W4224311959 cites W2088431433 @default.
- W4224311959 cites W2114268578 @default.
- W4224311959 cites W2150291618 @default.
- W4224311959 cites W2300859829 @default.
- W4224311959 cites W2574537113 @default.
- W4224311959 cites W2593512180 @default.
- W4224311959 cites W2622579191 @default.
- W4224311959 cites W2806276686 @default.
- W4224311959 cites W2884433501 @default.
- W4224311959 cites W2888003578 @default.
- W4224311959 cites W2889164652 @default.
- W4224311959 cites W2906805076 @default.
- W4224311959 cites W2913997948 @default.
- W4224311959 cites W2961396908 @default.
- W4224311959 cites W2963679759 @default.
- W4224311959 cites W2965372408 @default.
- W4224311959 cites W2965527189 @default.
- W4224311959 cites W2966525657 @default.
- W4224311959 cites W2980100290 @default.
- W4224311959 cites W298769045 @default.
- W4224311959 cites W2996889063 @default.
- W4224311959 cites W3002098238 @default.
- W4224311959 cites W3011243144 @default.
- W4224311959 cites W3022574011 @default.
- W4224311959 cites W3033682799 @default.
- W4224311959 cites W3071470454 @default.
- W4224311959 cites W3091951858 @default.
- W4224311959 cites W3103102495 @default.
- W4224311959 cites W3104912879 @default.
- W4224311959 cites W3112508023 @default.
- W4224311959 cites W3135275361 @default.
- W4224311959 cites W3135544356 @default.
- W4224311959 cites W3155972762 @default.
- W4224311959 cites W3175134484 @default.
- W4224311959 cites W4210488825 @default.
- W4224311959 cites W4245253886 @default.
- W4224311959 cites W4293508270 @default.
- W4224311959 doi "https://doi.org/10.2196/35734" @default.
- W4224311959 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35389366" @default.
- W4224311959 hasPublicationYear "2022" @default.
- W4224311959 type Work @default.
- W4224311959 citedByCount "11" @default.
- W4224311959 countsByYear W42243119592022 @default.
- W4224311959 countsByYear W42243119592023 @default.
- W4224311959 crossrefType "journal-article" @default.
- W4224311959 hasAuthorship W4224311959A5024567177 @default.
- W4224311959 hasAuthorship W4224311959A5035823866 @default.
- W4224311959 hasAuthorship W4224311959A5070248034 @default.
- W4224311959 hasAuthorship W4224311959A5071095124 @default.
- W4224311959 hasBestOaLocation W42243119591 @default.
- W4224311959 hasConcept C119857082 @default.
- W4224311959 hasConcept C124101348 @default.
- W4224311959 hasConcept C154945302 @default.
- W4224311959 hasConcept C160920958 @default.
- W4224311959 hasConcept C162324750 @default.
- W4224311959 hasConcept C176217482 @default.
- W4224311959 hasConcept C21547014 @default.
- W4224311959 hasConcept C41008148 @default.
- W4224311959 hasConcept C58489278 @default.
- W4224311959 hasConceptScore W4224311959C119857082 @default.
- W4224311959 hasConceptScore W4224311959C124101348 @default.
- W4224311959 hasConceptScore W4224311959C154945302 @default.
- W4224311959 hasConceptScore W4224311959C160920958 @default.
- W4224311959 hasConceptScore W4224311959C162324750 @default.
- W4224311959 hasConceptScore W4224311959C176217482 @default.
- W4224311959 hasConceptScore W4224311959C21547014 @default.
- W4224311959 hasConceptScore W4224311959C41008148 @default.
- W4224311959 hasConceptScore W4224311959C58489278 @default.
- W4224311959 hasIssue "4" @default.
- W4224311959 hasLocation W42243119591 @default.
- W4224311959 hasLocation W42243119592 @default.
- W4224311959 hasLocation W42243119593 @default.
- W4224311959 hasOpenAccess W4224311959 @default.
- W4224311959 hasPrimaryLocation W42243119591 @default.
- W4224311959 hasRelatedWork W2161264592 @default.
- W4224311959 hasRelatedWork W2961085424 @default.
- W4224311959 hasRelatedWork W3046775127 @default.
- W4224311959 hasRelatedWork W3170094116 @default.
- W4224311959 hasRelatedWork W4205958290 @default.
- W4224311959 hasRelatedWork W4285260836 @default.
- W4224311959 hasRelatedWork W4286629047 @default.
- W4224311959 hasRelatedWork W4306321456 @default.