Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386751617> ?p ?o ?g. }
Showing items 1 to 63 of 63 with 100 items per page.
- W4386751617 abstract "Objectives: Synthetic data reproduces features of a dataset without disclosing sensitive information, allowing researchers to explore data structures and test code without requiring access to real, potentially sensitive, data. We produced a low-fidelity synthetic data generation tool, accompanied by extensive documentation, allowing novice and expert users to produce such data.
 Methods: Our tool, consisting of a Python notebook and a user guide, takes a dataset as input and produces a ‘low-fidelity’ synthetic copy of this dataset, recreating the data fields (or columns) of a dataset, as well as the data types and statistical relationships within these fields, but not between them. It has been tested using real-world administrative data sets and with several users, looking at the quality of the data generated, inspecting whether the data is indeed low-fidelity (i.e. statistical relationships between fields are not recreated) and the usability of the tool.
 Results: Our tool successfully created synthetic datasets from administrative datasets. Users were positive about its usability and the generated data. Tests indicated that computational memory is a main constraint on the size of datatable that can be read in by the tool. We have since implemented improvements to the memory efficiency of the tool to partially address this and have also added procedures that allow for using subsets instead of complete datasets, allowing for the use of datasets which would have otherwise been too large to be used. Testing further indicated that, while the tool by design does not preserve any relationships between fields, they can be reproduced by coincidence, and a limited disclosure process may be required when correlations from the original data are reproduced.
 Conclusions: The tool is easy to use and therefore a useful introduction to synthetic data, providing users with a foundation before using more sophisticated synthetic data tools like Synthpop. Future work could include the development of a Python library and extension of the tool to handle linked datatables." @default.
- W4386751617 created "2023-09-15" @default.
- W4386751617 creator A5000625466 @default.
- W4386751617 creator A5052807655 @default.
- W4386751617 date "2023-09-14" @default.
- W4386751617 modified "2023-09-26" @default.
- W4386751617 title "An Introductory Synthetic Data Tool" @default.
- W4386751617 doi "https://doi.org/10.23889/ijpds.v8i2.2255" @default.
- W4386751617 hasPublicationYear "2023" @default.
- W4386751617 type Work @default.
- W4386751617 citedByCount "0" @default.
- W4386751617 crossrefType "journal-article" @default.
- W4386751617 hasAuthorship W4386751617A5000625466 @default.
- W4386751617 hasAuthorship W4386751617A5052807655 @default.
- W4386751617 hasBestOaLocation W43867516171 @default.
- W4386751617 hasConcept C107457646 @default.
- W4386751617 hasConcept C124101348 @default.
- W4386751617 hasConcept C162324750 @default.
- W4386751617 hasConcept C170130773 @default.
- W4386751617 hasConcept C176217482 @default.
- W4386751617 hasConcept C199360897 @default.
- W4386751617 hasConcept C21547014 @default.
- W4386751617 hasConcept C24756922 @default.
- W4386751617 hasConcept C2776459999 @default.
- W4386751617 hasConcept C2780977526 @default.
- W4386751617 hasConcept C36464697 @default.
- W4386751617 hasConcept C41008148 @default.
- W4386751617 hasConcept C519991488 @default.
- W4386751617 hasConcept C56666940 @default.
- W4386751617 hasConcept C76155785 @default.
- W4386751617 hasConceptScore W4386751617C107457646 @default.
- W4386751617 hasConceptScore W4386751617C124101348 @default.
- W4386751617 hasConceptScore W4386751617C162324750 @default.
- W4386751617 hasConceptScore W4386751617C170130773 @default.
- W4386751617 hasConceptScore W4386751617C176217482 @default.
- W4386751617 hasConceptScore W4386751617C199360897 @default.
- W4386751617 hasConceptScore W4386751617C21547014 @default.
- W4386751617 hasConceptScore W4386751617C24756922 @default.
- W4386751617 hasConceptScore W4386751617C2776459999 @default.
- W4386751617 hasConceptScore W4386751617C2780977526 @default.
- W4386751617 hasConceptScore W4386751617C36464697 @default.
- W4386751617 hasConceptScore W4386751617C41008148 @default.
- W4386751617 hasConceptScore W4386751617C519991488 @default.
- W4386751617 hasConceptScore W4386751617C56666940 @default.
- W4386751617 hasConceptScore W4386751617C76155785 @default.
- W4386751617 hasIssue "2" @default.
- W4386751617 hasLocation W43867516171 @default.
- W4386751617 hasOpenAccess W4386751617 @default.
- W4386751617 hasPrimaryLocation W43867516171 @default.
- W4386751617 hasRelatedWork W179869519 @default.
- W4386751617 hasRelatedWork W1870207351 @default.
- W4386751617 hasRelatedWork W2032439740 @default.
- W4386751617 hasRelatedWork W2032959135 @default.
- W4386751617 hasRelatedWork W2069790458 @default.
- W4386751617 hasRelatedWork W2100369842 @default.
- W4386751617 hasRelatedWork W2627071553 @default.
- W4386751617 hasRelatedWork W4281714194 @default.
- W4386751617 hasRelatedWork W4297659148 @default.
- W4386751617 hasRelatedWork W4312626352 @default.
- W4386751617 hasVolume "8" @default.
- W4386751617 isParatext "false" @default.
- W4386751617 isRetracted "false" @default.
- W4386751617 workType "article" @default.
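
The abstract above describes "low-fidelity" synthesis as recreating each field's own values and types while ignoring relationships between fields. A minimal sketch of that idea, assuming the simplest possible approach (resampling each column independently from its observed values), might look like the following. This is not the authors' notebook; the function name `low_fidelity_copy` and the toy records are hypothetical and exist only for illustration.

```python
import random


def low_fidelity_copy(rows, n, seed=0):
    """Build n synthetic rows by sampling every column independently.

    Assumption: each column is resampled (with replacement) from its own
    observed values, so per-column values and types are kept while
    relationships between columns are not deliberately preserved.
    """
    rng = random.Random(seed)
    # Collect the observed values of each column separately.
    columns = {name: [row[name] for row in rows] for name in rows[0]}
    # Draw each field of each synthetic row on its own.
    return [
        {name: rng.choice(values) for name, values in columns.items()}
        for _ in range(n)
    ]


# Hypothetical toy records standing in for a real dataset.
real = [
    {"age": 34, "region": "North"},
    {"age": 51, "region": "South"},
    {"age": 27, "region": "North"},
]
synthetic = low_fidelity_copy(real, n=5)
```

Because the columns are drawn independently, any pairing of `age` and `region` in `synthetic` arises by chance, which is consistent with the abstract's note that original relationships can still be reproduced by coincidence.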