Matches in SemOpenAlex for { <https://semopenalex.org/work/W2895924946> ?p ?o ?g. }
Showing items 1 to 79 of
79
with 100 items per page.
- W2895924946 abstract "As scientific data repositories and filesystems grow in size and complexity, they become increasingly disorganized. The coupling of massive quantities of data with poor organization makes it challenging for scientists to locate and utilize relevant data, thus slowing the process of analyzing data of interest. To address these issues, we explore an automated clustering approach for quantifying the organization of data repositories. Our parallel pipeline processes heterogeneous filetypes (e.g., text and tabular data), automatically clusters files based on content and metadata similarities, and computes a novel score from the resulting clustering. We demonstrate the generation and accuracy of our cleanliness measure using both synthetic and real datasets, and conclude that it is more consistent than other potential cleanliness measures." @default.
- W2895924946 created "2018-10-26" @default.
- W2895924946 creator A5020842808 @default.
- W2895924946 creator A5041264166 @default.
- W2895924946 creator A5065464552 @default.
- W2895924946 creator A5087018424 @default.
- W2895924946 date "2018-10-13" @default.
- W2895924946 modified "2023-09-26" @default.
- W2895924946 title "Measuring Swampiness: Quantifying Chaos in Large Heterogeneous Data Repositories." @default.
- W2895924946 cites W1987971958 @default.
- W2895924946 cites W2069897123 @default.
- W2895924946 cites W2424304400 @default.
- W2895924946 cites W2624356872 @default.
- W2895924946 cites W2910663247 @default.
- W2895924946 cites W343945789 @default.
- W2895924946 hasPublicationYear "2018" @default.
- W2895924946 type Work @default.
- W2895924946 sameAs 2895924946 @default.
- W2895924946 citedByCount "0" @default.
- W2895924946 crossrefType "posted-content" @default.
- W2895924946 hasAuthorship W2895924946A5020842808 @default.
- W2895924946 hasAuthorship W2895924946A5041264166 @default.
- W2895924946 hasAuthorship W2895924946A5065464552 @default.
- W2895924946 hasAuthorship W2895924946A5087018424 @default.
- W2895924946 hasConcept C111919701 @default.
- W2895924946 hasConcept C124101348 @default.
- W2895924946 hasConcept C136764020 @default.
- W2895924946 hasConcept C154945302 @default.
- W2895924946 hasConcept C199360897 @default.
- W2895924946 hasConcept C23123220 @default.
- W2895924946 hasConcept C2522767166 @default.
- W2895924946 hasConcept C2780009758 @default.
- W2895924946 hasConcept C41008148 @default.
- W2895924946 hasConcept C43521106 @default.
- W2895924946 hasConcept C73555534 @default.
- W2895924946 hasConcept C77088390 @default.
- W2895924946 hasConcept C93518851 @default.
- W2895924946 hasConcept C98045186 @default.
- W2895924946 hasConceptScore W2895924946C111919701 @default.
- W2895924946 hasConceptScore W2895924946C124101348 @default.
- W2895924946 hasConceptScore W2895924946C136764020 @default.
- W2895924946 hasConceptScore W2895924946C154945302 @default.
- W2895924946 hasConceptScore W2895924946C199360897 @default.
- W2895924946 hasConceptScore W2895924946C23123220 @default.
- W2895924946 hasConceptScore W2895924946C2522767166 @default.
- W2895924946 hasConceptScore W2895924946C2780009758 @default.
- W2895924946 hasConceptScore W2895924946C41008148 @default.
- W2895924946 hasConceptScore W2895924946C43521106 @default.
- W2895924946 hasConceptScore W2895924946C73555534 @default.
- W2895924946 hasConceptScore W2895924946C77088390 @default.
- W2895924946 hasConceptScore W2895924946C93518851 @default.
- W2895924946 hasConceptScore W2895924946C98045186 @default.
- W2895924946 hasLocation W28959249461 @default.
- W2895924946 hasOpenAccess W2895924946 @default.
- W2895924946 hasPrimaryLocation W28959249461 @default.
- W2895924946 hasRelatedWork W1570335843 @default.
- W2895924946 hasRelatedWork W2017403776 @default.
- W2895924946 hasRelatedWork W2256117198 @default.
- W2895924946 hasRelatedWork W2342363408 @default.
- W2895924946 hasRelatedWork W2342619577 @default.
- W2895924946 hasRelatedWork W2405489628 @default.
- W2895924946 hasRelatedWork W2465290353 @default.
- W2895924946 hasRelatedWork W2784378868 @default.
- W2895924946 hasRelatedWork W2889878814 @default.
- W2895924946 hasRelatedWork W2904105011 @default.
- W2895924946 hasRelatedWork W2907066266 @default.
- W2895924946 hasRelatedWork W2950127400 @default.
- W2895924946 hasRelatedWork W2951388332 @default.
- W2895924946 hasRelatedWork W2961272359 @default.
- W2895924946 hasRelatedWork W3168784810 @default.
- W2895924946 hasRelatedWork W3169843875 @default.
- W2895924946 hasRelatedWork W55154484 @default.
- W2895924946 hasRelatedWork W562683932 @default.
- W2895924946 hasRelatedWork W2289981285 @default.
- W2895924946 hasRelatedWork W2339783699 @default.
- W2895924946 isParatext "false" @default.
- W2895924946 isRetracted "false" @default.
- W2895924946 magId "2895924946" @default.
- W2895924946 workType "article" @default.