Matches in SemOpenAlex for { <https://semopenalex.org/work/W236085609> ?p ?o ?g. }
Showing items 1 to 90 of 90 with 100 items per page.
- W236085609 abstract "Proceedings of the 2008 SIAM International Conference on Data Mining (SDM). Exploiting Structured Reference Data for Unsupervised Text Segmentation with Conditional Random Fields. Chang Zhao, Jalal Mahmud, and I.V. Ramakrishnan. pp. 420-431. Chapter DOI: https://doi.org/10.1137/1.9781611972788.38. Abstract: Text segmentation is the process of converting information in unstructured text into structured records. This is an important problem since structured data is amenable to efficient query processing. CRFs are a class of discriminative probabilistic models that are gaining acceptance as an effective computing machinery for text segmentation. An important aspect of CRFs is learning model parameters from labeled training data. Labeling can be a labor intensive process. One can avoid the labeling step by using structured reference tables whose data domains and that of the input text data given for segmentation, coincide. In other words the labels in the training data drawn from reference tables “come for free”. Inspired by recent work on their use for training HMMs, we developed an unsupervised technique for text segmentation with CRFs using reference tables. Assuming text sequences to be segmented come in batches and sequences in a batch conform to the same attribute order, we build CRF models for each attribute in the reference table, use them to decide the attribute order of a batch of input sequences, derive labeled training data from the reference table according to that order, and train a global CRF model to segment the input sequences in the batch. Preliminary experimental results indicate that our technique works well in practice. Published: 2008. ISBN: 978-0-89871-654-2. eISBN: 978-1-61197-278-8. Book DOI: https://doi.org/10.1137/1.9781611972788. Book Series Name: Proceedings. Book Code: PR130. Book Pages: 1-869" @default.
- W236085609 created "2016-06-24" @default.
- W236085609 creator A5003322619 @default.
- W236085609 creator A5005648942 @default.
- W236085609 creator A5032509530 @default.
- W236085609 date "2008-04-24" @default.
- W236085609 modified "2023-09-25" @default.
- W236085609 title "Exploiting Structured Reference Data for Unsupervised Text Segmentation with Conditional Random Fields" @default.
- W236085609 cites W1557074680 @default.
- W236085609 cites W1568339100 @default.
- W236085609 cites W1934019294 @default.
- W236085609 cites W2029873015 @default.
- W236085609 cites W2034797903 @default.
- W236085609 cites W2048468185 @default.
- W236085609 cites W2067326963 @default.
- W236085609 cites W2108126629 @default.
- W236085609 cites W2124410446 @default.
- W236085609 cites W2125838338 @default.
- W236085609 cites W2140327372 @default.
- W236085609 cites W2143349571 @default.
- W236085609 cites W2147880316 @default.
- W236085609 cites W2156515921 @default.
- W236085609 cites W2158188757 @default.
- W236085609 cites W2158823144 @default.
- W236085609 cites W2160842254 @default.
- W236085609 cites W2162340487 @default.
- W236085609 cites W2169546346 @default.
- W236085609 doi "https://doi.org/10.1137/1.9781611972788.38" @default.
- W236085609 hasPublicationYear "2008" @default.
- W236085609 type Work @default.
- W236085609 sameAs 236085609 @default.
- W236085609 citedByCount "24" @default.
- W236085609 countsByYear W2360856092012 @default.
- W236085609 countsByYear W2360856092013 @default.
- W236085609 countsByYear W2360856092015 @default.
- W236085609 countsByYear W2360856092016 @default.
- W236085609 countsByYear W2360856092017 @default.
- W236085609 countsByYear W2360856092018 @default.
- W236085609 countsByYear W2360856092019 @default.
- W236085609 crossrefType "proceedings-article" @default.
- W236085609 hasAuthorship W236085609A5003322619 @default.
- W236085609 hasAuthorship W236085609A5005648942 @default.
- W236085609 hasAuthorship W236085609A5032509530 @default.
- W236085609 hasConcept C111919701 @default.
- W236085609 hasConcept C119857082 @default.
- W236085609 hasConcept C124101348 @default.
- W236085609 hasConcept C152565575 @default.
- W236085609 hasConcept C153180895 @default.
- W236085609 hasConcept C154945302 @default.
- W236085609 hasConcept C204321447 @default.
- W236085609 hasConcept C2775953691 @default.
- W236085609 hasConcept C41008148 @default.
- W236085609 hasConcept C45235069 @default.
- W236085609 hasConcept C49937458 @default.
- W236085609 hasConcept C89600930 @default.
- W236085609 hasConcept C97931131 @default.
- W236085609 hasConcept C98045186 @default.
- W236085609 hasConcept C98501671 @default.
- W236085609 hasConceptScore W236085609C111919701 @default.
- W236085609 hasConceptScore W236085609C119857082 @default.
- W236085609 hasConceptScore W236085609C124101348 @default.
- W236085609 hasConceptScore W236085609C152565575 @default.
- W236085609 hasConceptScore W236085609C153180895 @default.
- W236085609 hasConceptScore W236085609C154945302 @default.
- W236085609 hasConceptScore W236085609C204321447 @default.
- W236085609 hasConceptScore W236085609C2775953691 @default.
- W236085609 hasConceptScore W236085609C41008148 @default.
- W236085609 hasConceptScore W236085609C45235069 @default.
- W236085609 hasConceptScore W236085609C49937458 @default.
- W236085609 hasConceptScore W236085609C89600930 @default.
- W236085609 hasConceptScore W236085609C97931131 @default.
- W236085609 hasConceptScore W236085609C98045186 @default.
- W236085609 hasConceptScore W236085609C98501671 @default.
- W236085609 hasLocation W2360856091 @default.
- W236085609 hasOpenAccess W236085609 @default.
- W236085609 hasPrimaryLocation W2360856091 @default.
- W236085609 hasRelatedWork W1989442767 @default.
- W236085609 hasRelatedWork W2023784932 @default.
- W236085609 hasRelatedWork W2126384842 @default.
- W236085609 hasRelatedWork W2126807146 @default.
- W236085609 hasRelatedWork W2340365369 @default.
- W236085609 hasRelatedWork W2395488739 @default.
- W236085609 hasRelatedWork W2433259561 @default.
- W236085609 hasRelatedWork W2510758617 @default.
- W236085609 hasRelatedWork W2554282444 @default.
- W236085609 hasRelatedWork W2787080132 @default.
- W236085609 isParatext "false" @default.
- W236085609 isRetracted "false" @default.
- W236085609 magId "236085609" @default.
- W236085609 workType "article" @default.
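
The abstract above says that labels drawn from a reference table "come for free" once the attribute order of a batch is known. A minimal Python sketch of that one step, under stated assumptions: the helper name `label_from_reference`, the column names, the whitespace tokenization, and the sample row are all hypothetical and are not taken from the paper. It does not cover the per-attribute CRF models, the order-detection step, or the training of the global CRF.

```python
from typing import Dict, List, Sequence, Tuple

LabeledSequence = List[Tuple[str, str]]


def label_from_reference(
    rows: Sequence[Dict[str, str]],
    attribute_order: Sequence[str],
) -> List[LabeledSequence]:
    """Turn reference-table rows into (token, label) training sequences.

    Each row is concatenated in the given attribute order, and every
    token inherits the name of the column it came from as its label.
    """
    labeled: List[LabeledSequence] = []
    for row in rows:
        sequence: LabeledSequence = []
        for attribute in attribute_order:
            # Whitespace tokenization is an assumption made for brevity.
            for token in row[attribute].split():
                sequence.append((token, attribute))
        labeled.append(sequence)
    return labeled


# Hypothetical reference table with three attributes.
reference_table = [
    {"author": "Chang Zhao", "title": "Text Segmentation", "year": "2008"},
]

# Assume the batch was found to follow the order author, title, year.
training = label_from_reference(reference_table, ["author", "title", "year"])
print(training)
```

Sequences produced this way could then be fed to any sequence labeler as training data; which features and which CRF implementation to use is left open here.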