Matches in SemOpenAlex for { <https://semopenalex.org/work/W2912374424> ?p ?o ?g. }
Showing items 1 to 68 of 68 with 100 items per page.
- W2912374424 abstract "Data lakes have emerged as an alternative to data warehouses for the storage, exploration and analysis of big data. In a data lake, data are stored in a raw state and bear no explicit schema. Thence, an efficient metadata system is essential to avoid the data lake turning to a so-called data swamp. Existing works about managing data lake metadata mostly focus on structured and semi-structured data, with little research on unstructured data. Thus, we propose in this paper a methodological approach to build and manage a metadata system that is specific to textual documents in data lakes. First, we make an inventory of usual and meaningful metadata to extract. Then, we apply some specific techniques from the text mining and information retrieval domains to extract, store and reuse these metadata within the COREL research project, in order to validate our proposals." @default.
- W2912374424 created "2019-02-21" @default.
- W2912374424 creator A5056111523 @default.
- W2912374424 creator A5059780425 @default.
- W2912374424 creator A5080624394 @default.
- W2912374424 date "2019-01-01" @default.
- W2912374424 modified "2023-10-07" @default.
- W2912374424 title "Metadata Management for Textual Documents in Data Lakes" @default.
- W2912374424 doi "https://doi.org/10.5220/0007706300720083" @default.
- W2912374424 hasPublicationYear "2019" @default.
- W2912374424 type Work @default.
- W2912374424 sameAs 2912374424 @default.
- W2912374424 citedByCount "17" @default.
- W2912374424 countsByYear W29123744242019 @default.
- W2912374424 countsByYear W29123744242020 @default.
- W2912374424 countsByYear W29123744242021 @default.
- W2912374424 countsByYear W29123744242022 @default.
- W2912374424 crossrefType "proceedings-article" @default.
- W2912374424 hasAuthorship W2912374424A5056111523 @default.
- W2912374424 hasAuthorship W2912374424A5059780425 @default.
- W2912374424 hasAuthorship W2912374424A5080624394 @default.
- W2912374424 hasBestOaLocation W29123744241 @default.
- W2912374424 hasConcept C136764020 @default.
- W2912374424 hasConcept C136976847 @default.
- W2912374424 hasConcept C153048206 @default.
- W2912374424 hasConcept C158746014 @default.
- W2912374424 hasConcept C1668388 @default.
- W2912374424 hasConcept C193150823 @default.
- W2912374424 hasConcept C23123220 @default.
- W2912374424 hasConcept C2522767166 @default.
- W2912374424 hasConcept C30872290 @default.
- W2912374424 hasConcept C41008148 @default.
- W2912374424 hasConcept C77088390 @default.
- W2912374424 hasConcept C93518851 @default.
- W2912374424 hasConceptScore W2912374424C136764020 @default.
- W2912374424 hasConceptScore W2912374424C136976847 @default.
- W2912374424 hasConceptScore W2912374424C153048206 @default.
- W2912374424 hasConceptScore W2912374424C158746014 @default.
- W2912374424 hasConceptScore W2912374424C1668388 @default.
- W2912374424 hasConceptScore W2912374424C193150823 @default.
- W2912374424 hasConceptScore W2912374424C23123220 @default.
- W2912374424 hasConceptScore W2912374424C2522767166 @default.
- W2912374424 hasConceptScore W2912374424C30872290 @default.
- W2912374424 hasConceptScore W2912374424C41008148 @default.
- W2912374424 hasConceptScore W2912374424C77088390 @default.
- W2912374424 hasConceptScore W2912374424C93518851 @default.
- W2912374424 hasLocation W29123744241 @default.
- W2912374424 hasLocation W29123744242 @default.
- W2912374424 hasLocation W29123744243 @default.
- W2912374424 hasLocation W29123744244 @default.
- W2912374424 hasLocation W29123744245 @default.
- W2912374424 hasLocation W29123744246 @default.
- W2912374424 hasOpenAccess W2912374424 @default.
- W2912374424 hasPrimaryLocation W29123744241 @default.
- W2912374424 hasRelatedWork W2170906434 @default.
- W2912374424 hasRelatedWork W2349281696 @default.
- W2912374424 hasRelatedWork W2358476326 @default.
- W2912374424 hasRelatedWork W2359886447 @default.
- W2912374424 hasRelatedWork W2374379029 @default.
- W2912374424 hasRelatedWork W2555822790 @default.
- W2912374424 hasRelatedWork W2907247951 @default.
- W2912374424 hasRelatedWork W4386544240 @default.
- W2912374424 hasRelatedWork W769818350 @default.
- W2912374424 hasRelatedWork W2904359197 @default.
- W2912374424 isParatext "false" @default.
- W2912374424 isRetracted "false" @default.
- W2912374424 magId "2912374424" @default.
- W2912374424 workType "article" @default.
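
Each row above has the shape `- subject predicate object @graph.`, matching the `?p ?o ?g` pattern in the query. As a minimal sketch of how such rows could be read into tuples, the Python below parses one row at a time. The `Quad` type and `parse_line` function are hypothetical names introduced here for illustration and are not part of SemOpenAlex. The sketch assumes that the object is either a double-quoted literal or a single bare token, and that every row ends with `@<graph>.`.

```python
import re
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    """One listing row (hypothetical helper type, not a SemOpenAlex API)."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed row shape: "- <subject> <predicate> <object> @<graph>."
# The object is either a double-quoted literal or a bare token.
_ROW = re.compile(
    r'^-\s+(\S+)\s+(\S+)\s+(?:"(.*)"|(\S+))\s+@(\w+)\.$',
    re.DOTALL,
)


def parse_line(line: str) -> Optional[Quad]:
    """Return a Quad for a listing row, or None for header or other lines."""
    match = _ROW.match(line.strip())
    if match is None:
        return None
    subject, predicate, literal, token, graph = match.groups()
    # Quoted literals lose their surrounding quotes; bare tokens are kept as-is.
    obj = literal if literal is not None else token
    return Quad(subject, predicate, obj, graph)


if __name__ == "__main__":
    print(parse_line('- W2912374424 citedByCount "17" @default.'))
    print(parse_line('- W2912374424 type Work @default.'))
```

Lines that do not start with `- `, such as the query header and the paging line, yield `None` under this sketch, so they can be filtered out before further processing.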