Matches in SemOpenAlex for { <https://semopenalex.org/work/W2240823976> ?p ?o ?g. }
Showing items 1 to 55 of
55
with 100 items per page.
- W2240823976 abstract "The interlinear glossed text (IGT) is a complex object, the complexity of its structure depending on factors such as origin, intended use, languages involved etc. Developing tools and workflows for integrated linguistic analysis environments calls for particular attention to those aspects which in many common cases can be disregarded as insignificant; thus, collaborating for ELAN–FLEx integration was particularly motivating for this paper.IGT is often conceived of as a tree: the root node corresponds to the whole text, subdivided into smaller units (sentences, words, morphemes). Each unit has a number of associated annotations, generally one per information type, like sentence translation, part-of-speech label, morpheme gloss.However, an IGT can easily amount to a large set of trees. Unresolved ambiguities of all kinds are one reason for it. Each pair of alternative analyses (e.g. two concurrent parses of a word) implies two distinct trees, identical except for the node in question and all its descendants. The more ambiguities arise, the more underlying trees should be posited. Still, all trees in such a tree family stem from a single analyzed object (transcript, original orthographic representation). Storing entire trees for each combination of relevant alternatives being utterly inefficient, a more compact storage model is needed.Turning to the media dimension, an accurate transcript of a spontaneous discourse is most often unsuitable for a grammatical analysis without some preprocessing (normalization) dealing with various speech errors, incomprehensible fragments etc. to produce a grammatically correct and coherent text for subsequent grammatical analysis – whereas the “raw” transcript feeds phonological and possibly discourse analysis. We thus get two distinct texts, interconnected but giving rise to independent (families of) analysis trees; only one of them is linked directly to the media timeline.In some scenarios, more than one media-based timeline emerge which need to be interlinked (cf. BOLD framework: sound annotations to sound events; retelling experiments, e.g. pear stories; sign languages translated from/into spoken languages). The reference axis may not be properly a timeline (text, path through a complex graphic image).One should mention further complicating factors such as multi-speaker and multi-lingual settings, collaboration and versioning.The overall structure (an XML sketch will be presented) might grow unreasonably complex for any specialized analysis component to handle. It may thus be efficient to use an intermediate repository, e.g. a unified underlying RDF representation [Nakhimovsky et al. 2012], to which all changes made in specific tools are merged.ReferencesBow, Cathy, Baden Hughes and Steven Bird. 2003. Towards a General Model of Interlinear Text.Nakhimovsky, Alexander, Jeff Good, Tom Myers. 2012. Interoperability of Language Documentation Tools and Materials for Local Communities // Digital Humanities 2012." @default.
- W2240823976 created "2016-06-24" @default.
- W2240823976 creator A5068760934 @default.
- W2240823976 date "2013-03-01" @default.
- W2240823976 modified "2023-09-27" @default.
- W2240823976 title "Towards a more general model of interlinear text" @default.
- W2240823976 hasPublicationYear "2013" @default.
- W2240823976 type Work @default.
- W2240823976 sameAs 2240823976 @default.
- W2240823976 citedByCount "0" @default.
- W2240823976 crossrefType "journal-article" @default.
- W2240823976 hasAuthorship W2240823976A5068760934 @default.
- W2240823976 hasConcept C138885662 @default.
- W2240823976 hasConcept C154945302 @default.
- W2240823976 hasConcept C165297611 @default.
- W2240823976 hasConcept C204321447 @default.
- W2240823976 hasConcept C2777530160 @default.
- W2240823976 hasConcept C34736171 @default.
- W2240823976 hasConcept C41008148 @default.
- W2240823976 hasConcept C41895202 @default.
- W2240823976 hasConceptScore W2240823976C138885662 @default.
- W2240823976 hasConceptScore W2240823976C154945302 @default.
- W2240823976 hasConceptScore W2240823976C165297611 @default.
- W2240823976 hasConceptScore W2240823976C204321447 @default.
- W2240823976 hasConceptScore W2240823976C2777530160 @default.
- W2240823976 hasConceptScore W2240823976C34736171 @default.
- W2240823976 hasConceptScore W2240823976C41008148 @default.
- W2240823976 hasConceptScore W2240823976C41895202 @default.
- W2240823976 hasLocation W22408239761 @default.
- W2240823976 hasOpenAccess W2240823976 @default.
- W2240823976 hasPrimaryLocation W22408239761 @default.
- W2240823976 hasRelatedWork W1571385809 @default.
- W2240823976 hasRelatedWork W1886172393 @default.
- W2240823976 hasRelatedWork W1993755899 @default.
- W2240823976 hasRelatedWork W2064024982 @default.
- W2240823976 hasRelatedWork W208946482 @default.
- W2240823976 hasRelatedWork W2251722147 @default.
- W2240823976 hasRelatedWork W2328937216 @default.
- W2240823976 hasRelatedWork W2403210192 @default.
- W2240823976 hasRelatedWork W2486336489 @default.
- W2240823976 hasRelatedWork W2504393653 @default.
- W2240823976 hasRelatedWork W2552053157 @default.
- W2240823976 hasRelatedWork W2745539108 @default.
- W2240823976 hasRelatedWork W2747551352 @default.
- W2240823976 hasRelatedWork W2902414924 @default.
- W2240823976 hasRelatedWork W3001447330 @default.
- W2240823976 hasRelatedWork W76162884 @default.
- W2240823976 hasRelatedWork W84197106 @default.
- W2240823976 hasRelatedWork W180252976 @default.
- W2240823976 hasRelatedWork W2336424592 @default.
- W2240823976 hasRelatedWork W3175175769 @default.
- W2240823976 isParatext "false" @default.
- W2240823976 isRetracted "false" @default.
- W2240823976 magId "2240823976" @default.
- W2240823976 workType "article" @default.