Matches in SemOpenAlex for { <https://semopenalex.org/work/W4310398100> ?p ?o ?g. }
- W4310398100 endingPage "9333" @default.
- W4310398100 startingPage "9319" @default.
- W4310398100 abstract "This paper presents a new approach to retrieve and further integrate tabular datasets (collections of rows and columns) using union and join operations. In this work, both processes were carried out using a similarity measure based on contextual word embeddings, which allows finding semantically similar tables and overcoming the recall problem of lexical approaches based on string similarity. This work is the first attempt to use contextual word embeddings in the whole pipeline of table search and integration, including for the first time their use in the join operation. A comprehensive analysis of their performance was carried out on both retrieving and integrating tabular datasets, comparing them with context-free models. Column headings and cell values were used as contextual information, and their impact on each task was evaluated. The results revealed that contextual models significantly outperform context-free models and a traditional weighting schema in ad hoc table retrieval. In the data integration task, contextual models also improved the results on the union operation compared to context-free approaches." @default.
- W4310398100 created "2022-12-10" @default.
- W4310398100 creator A5014858583 @default.
- W4310398100 creator A5015502433 @default.
- W4310398100 creator A5066901043 @default.
- W4310398100 creator A5068367881 @default.
- W4310398100 date "2022-11-30" @default.
- W4310398100 modified "2023-10-18" @default.
- W4310398100 title "Contextual word embeddings for tabular data search and integration" @default.
- W4310398100 cites W1576228976 @default.
- W4310398100 cites W1969621019 @default.
- W4310398100 cites W2046325278 @default.
- W4310398100 cites W2108223890 @default.
- W4310398100 cites W2162020046 @default.
- W4310398100 cites W2188138540 @default.
- W4310398100 cites W2250539671 @default.
- W4310398100 cites W2292351001 @default.
- W4310398100 cites W2398606196 @default.
- W4310398100 cites W2788550262 @default.
- W4310398100 cites W2798664493 @default.
- W4310398100 cites W2888329843 @default.
- W4310398100 cites W2889003264 @default.
- W4310398100 cites W2899286282 @default.
- W4310398100 cites W2962739339 @default.
- W4310398100 cites W2962784628 @default.
- W4310398100 cites W2963026768 @default.
- W4310398100 cites W2963341956 @default.
- W4310398100 cites W2998914929 @default.
- W4310398100 cites W3007024586 @default.
- W4310398100 cites W3007429250 @default.
- W4310398100 cites W3008881932 @default.
- W4310398100 cites W3034944976 @default.
- W4310398100 cites W3035140194 @default.
- W4310398100 cites W3035231859 @default.
- W4310398100 cites W3099965312 @default.
- W4310398100 cites W3101556001 @default.
- W4310398100 cites W3162752841 @default.
- W4310398100 cites W4205997760 @default.
- W4310398100 doi "https://doi.org/10.1007/s00521-022-08066-8" @default.
- W4310398100 hasPublicationYear "2022" @default.
- W4310398100 type Work @default.
- W4310398100 citedByCount "0" @default.
- W4310398100 crossrefType "journal-article" @default.
- W4310398100 hasAuthorship W4310398100A5014858583 @default.
- W4310398100 hasAuthorship W4310398100A5015502433 @default.
- W4310398100 hasAuthorship W4310398100A5066901043 @default.
- W4310398100 hasAuthorship W4310398100A5068367881 @default.
- W4310398100 hasBestOaLocation W43103981001 @default.
- W4310398100 hasConcept C124101348 @default.
- W4310398100 hasConcept C126838900 @default.
- W4310398100 hasConcept C138885662 @default.
- W4310398100 hasConcept C151730666 @default.
- W4310398100 hasConcept C154945302 @default.
- W4310398100 hasConcept C162324750 @default.
- W4310398100 hasConcept C165696696 @default.
- W4310398100 hasConcept C183115368 @default.
- W4310398100 hasConcept C187736073 @default.
- W4310398100 hasConcept C199360897 @default.
- W4310398100 hasConcept C204321447 @default.
- W4310398100 hasConcept C23123220 @default.
- W4310398100 hasConcept C2779343474 @default.
- W4310398100 hasConcept C2780451532 @default.
- W4310398100 hasConcept C38652104 @default.
- W4310398100 hasConcept C41008148 @default.
- W4310398100 hasConcept C41895202 @default.
- W4310398100 hasConcept C43521106 @default.
- W4310398100 hasConcept C45235069 @default.
- W4310398100 hasConcept C52146309 @default.
- W4310398100 hasConcept C71924100 @default.
- W4310398100 hasConcept C86803240 @default.
- W4310398100 hasConcept C90805587 @default.
- W4310398100 hasConceptScore W4310398100C124101348 @default.
- W4310398100 hasConceptScore W4310398100C126838900 @default.
- W4310398100 hasConceptScore W4310398100C138885662 @default.
- W4310398100 hasConceptScore W4310398100C151730666 @default.
- W4310398100 hasConceptScore W4310398100C154945302 @default.
- W4310398100 hasConceptScore W4310398100C162324750 @default.
- W4310398100 hasConceptScore W4310398100C165696696 @default.
- W4310398100 hasConceptScore W4310398100C183115368 @default.
- W4310398100 hasConceptScore W4310398100C187736073 @default.
- W4310398100 hasConceptScore W4310398100C199360897 @default.
- W4310398100 hasConceptScore W4310398100C204321447 @default.
- W4310398100 hasConceptScore W4310398100C23123220 @default.
- W4310398100 hasConceptScore W4310398100C2779343474 @default.
- W4310398100 hasConceptScore W4310398100C2780451532 @default.
- W4310398100 hasConceptScore W4310398100C38652104 @default.
- W4310398100 hasConceptScore W4310398100C41008148 @default.
- W4310398100 hasConceptScore W4310398100C41895202 @default.
- W4310398100 hasConceptScore W4310398100C43521106 @default.
- W4310398100 hasConceptScore W4310398100C45235069 @default.
- W4310398100 hasConceptScore W4310398100C52146309 @default.
- W4310398100 hasConceptScore W4310398100C71924100 @default.
- W4310398100 hasConceptScore W4310398100C86803240 @default.
- W4310398100 hasConceptScore W4310398100C90805587 @default.
- W4310398100 hasFunder F4320311011 @default.
- W4310398100 hasFunder F4320336489 @default.
- W4310398100 hasIssue "13" @default.
- W4310398100 hasLocation W43103981001 @default.
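
Each entry in the listing above follows the shape `- <subject> <predicate> <object> @<graph>.`, so it can be read back into structured records. The following is a minimal sketch, assuming that line shape holds for every entry; the names `Quad` and `parse_line` are hypothetical helpers introduced here for illustration and are not part of SemOpenAlex or any library.

```python
import re
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    """One listing entry: subject, predicate, object and named graph."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed line shape: "- <subject> <predicate> <object> @<graph>."
_LINE = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$')


def parse_line(line: str) -> Optional[Quad]:
    """Parse one listing line; return None for lines that are not entries."""
    match = _LINE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literal values are wrapped in double quotes; identifiers are bare.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return Quad(subject, predicate, obj, graph)


if __name__ == "__main__":
    sample = [
        'Matches in SemOpenAlex for { <https://semopenalex.org/work/W4310398100> ?p ?o ?g. }',
        '- W4310398100 hasIssue "13" @default.',
        '- W4310398100 type Work @default.',
    ]
    for raw in sample:
        print(parse_line(raw))
```

The header line does not start with `- `, so it yields `None` and can be filtered out; quoted literals such as the issue number are returned without their surrounding quotes.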