Matches in SemOpenAlex for { <https://semopenalex.org/work/W2121791507> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W2121791507 abstract "Modern organizations are accumulating huge volumes of textual documents. To turn archives into valuable knowledge sources, textual content must become explicit and able to be queried. Semantic tagging with markup languages such as XML satisfies both requirements. We thus introduce the DIAsDEM* framework for extracting semantics from structural text units (e.g., sentences), assigning XML tags to them and deriving a flat XML DTD for the archive. DIAsDEM focuses on archives characterized by a peculiar terminology and by an implicit structure such as court filings and company reports. In the knowledge discovery phase, text units are iteratively clustered by similarity of their content. Each iteration outputs clusters satisfying a set of quality criteria. Text units contained in these clusters are tagged with semiautomatically determined cluster labels and XML tags respectively. Additionally, extracted named entities (e.g., persons) serve as attributes of XML tags. We apply the framework in a case study on the German Commercial Register." @default.
- W2121791507 created "2016-06-24" @default.
- W2121791507 creator A5009384696 @default.
- W2121791507 creator A5083456364 @default.
- W2121791507 creator A5085688321 @default.
- W2121791507 date "2002-11-14" @default.
- W2121791507 modified "2023-09-23" @default.
- W2121791507 title "The DIAsDEM framework for converting domain-specific texts into XML documents with data mining techniques" @default.
- W2121791507 cites W147727067 @default.
- W2121791507 cites W1505320839 @default.
- W2121791507 cites W1535627680 @default.
- W2121791507 cites W1601717103 @default.
- W2121791507 cites W1978394996 @default.
- W2121791507 cites W2001615232 @default.
- W2121791507 cites W2010500834 @default.
- W2121791507 cites W2091009885 @default.
- W2121791507 cites W2112378479 @default.
- W2121791507 cites W2139528831 @default.
- W2121791507 cites W2161077873 @default.
- W2121791507 cites W2163307056 @default.
- W2121791507 cites W2294792814 @default.
- W2121791507 cites W2912297594 @default.
- W2121791507 cites W1663587272 @default.
- W2121791507 doi "https://doi.org/10.1109/icdm.2001.989515" @default.
- W2121791507 hasPublicationYear "2002" @default.
- W2121791507 type Work @default.
- W2121791507 sameAs 2121791507 @default.
- W2121791507 citedByCount "13" @default.
- W2121791507 countsByYear W21217915072014 @default.
- W2121791507 crossrefType "proceedings-article" @default.
- W2121791507 hasAuthorship W2121791507A5009384696 @default.
- W2121791507 hasAuthorship W2121791507A5083456364 @default.
- W2121791507 hasAuthorship W2121791507A5085688321 @default.
- W2121791507 hasBestOaLocation W21217915072 @default.
- W2121791507 hasConcept C11508877 @default.
- W2121791507 hasConcept C136764020 @default.
- W2121791507 hasConcept C137441365 @default.
- W2121791507 hasConcept C138885662 @default.
- W2121791507 hasConcept C184337299 @default.
- W2121791507 hasConcept C199360897 @default.
- W2121791507 hasConcept C23123220 @default.
- W2121791507 hasConcept C34716815 @default.
- W2121791507 hasConcept C40713593 @default.
- W2121791507 hasConcept C41008148 @default.
- W2121791507 hasConcept C41895202 @default.
- W2121791507 hasConcept C45874996 @default.
- W2121791507 hasConcept C547195049 @default.
- W2121791507 hasConcept C55348073 @default.
- W2121791507 hasConcept C68699486 @default.
- W2121791507 hasConcept C84314905 @default.
- W2121791507 hasConcept C8797682 @default.
- W2121791507 hasConceptScore W2121791507C11508877 @default.
- W2121791507 hasConceptScore W2121791507C136764020 @default.
- W2121791507 hasConceptScore W2121791507C137441365 @default.
- W2121791507 hasConceptScore W2121791507C138885662 @default.
- W2121791507 hasConceptScore W2121791507C184337299 @default.
- W2121791507 hasConceptScore W2121791507C199360897 @default.
- W2121791507 hasConceptScore W2121791507C23123220 @default.
- W2121791507 hasConceptScore W2121791507C34716815 @default.
- W2121791507 hasConceptScore W2121791507C40713593 @default.
- W2121791507 hasConceptScore W2121791507C41008148 @default.
- W2121791507 hasConceptScore W2121791507C41895202 @default.
- W2121791507 hasConceptScore W2121791507C45874996 @default.
- W2121791507 hasConceptScore W2121791507C547195049 @default.
- W2121791507 hasConceptScore W2121791507C55348073 @default.
- W2121791507 hasConceptScore W2121791507C68699486 @default.
- W2121791507 hasConceptScore W2121791507C84314905 @default.
- W2121791507 hasConceptScore W2121791507C8797682 @default.
- W2121791507 hasLocation W21217915071 @default.
- W2121791507 hasLocation W21217915072 @default.
- W2121791507 hasOpenAccess W2121791507 @default.
- W2121791507 hasPrimaryLocation W21217915071 @default.
- W2121791507 hasRelatedWork W1005420331 @default.
- W2121791507 hasRelatedWork W1486977603 @default.
- W2121791507 hasRelatedWork W1498913585 @default.
- W2121791507 hasRelatedWork W1581299483 @default.
- W2121791507 hasRelatedWork W2031866803 @default.
- W2121791507 hasRelatedWork W2097388954 @default.
- W2121791507 hasRelatedWork W2100125501 @default.
- W2121791507 hasRelatedWork W2100605496 @default.
- W2121791507 hasRelatedWork W2108083852 @default.
- W2121791507 hasRelatedWork W2117707685 @default.
- W2121791507 hasRelatedWork W2119242410 @default.
- W2121791507 hasRelatedWork W2161974882 @default.
- W2121791507 hasRelatedWork W2189592544 @default.
- W2121791507 hasRelatedWork W2246695077 @default.
- W2121791507 hasRelatedWork W2381406408 @default.
- W2121791507 hasRelatedWork W2412718107 @default.
- W2121791507 hasRelatedWork W3177762090 @default.
- W2121791507 hasRelatedWork W181731162 @default.
- W2121791507 hasRelatedWork W2108584347 @default.
- W2121791507 hasRelatedWork W2184673227 @default.
- W2121791507 isParatext "false" @default.
- W2121791507 isRetracted "false" @default.
- W2121791507 magId "2121791507" @default.
- W2121791507 workType "article" @default.