Matches in SemOpenAlex for { <https://semopenalex.org/work/W1933993361> ?p ?o ?g. }
Showing items 1 to 74 of
74
with 100 items per page.
- W1933993361 abstract "Tagged and parsed corpora (LOB, Brown, London-Lund, ICE, Lancaster-IBM, PoW, Nijmegen, UPenn, BNC, etc) are used as training data for statistical syntactic constraint models to improve recognition accuracy in speech and handwriting recognisers. However, linguists developing these linguistic resources have used quite different wordtagging and parse-tree labelling schemes in each of these annotated corpora. This restricts the accessibility of each corpus, making it impossible for speech and handwriting researchers to collate them into a single very large training set. This is particularly problematic as there is evidence that one of these parsed corpora on its own is too small for a general statistical model of higher-level syntactic structure, but the combined size of all the above annotated corpora should deliver a much more reliable model. We are developing a set of mapping algorithms to map between the main tagsets and phrase structure grammar schemes used in the above corpora. We will develop a Multi-tagged Corpus and a MultiTreebank, a single text-set annotated with all the above tagging and parsing schemes. The text-set is the Spoken English Corpus; this is already annotated with two syntax schemes, and we plan to have added at least one more by the AISB Workshop. However, the main deliverable to the speech and handwriting research community is not the SEC-based MultiTreebank, but the mapping suite used to produce it - this can be used to combine currently-incompatible syntactic training sets into a large unified multicorpus. Our development of the mapping algorithms aims to distinguish notational from substantive differences in the annotation schemes, and we will be able to evaluate tagging schemes in terms of how well they fit standard statistical language models such as n-pos (Markov) models." @default.
- W1933993361 created "2016-06-24" @default.
- W1933993361 creator A5068339489 @default.
- W1933993361 creator A5074817028 @default.
- W1933993361 creator A5078117473 @default.
- W1933993361 date "1994-01-01" @default.
- W1933993361 modified "2023-09-26" @default.
- W1933993361 title "A unified multicorpus for training syntactic constraint models" @default.
- W1933993361 cites W115535506 @default.
- W1933993361 cites W1518059413 @default.
- W1933993361 cites W1526508180 @default.
- W1933993361 cites W1532436150 @default.
- W1933993361 cites W180841865 @default.
- W1933993361 cites W2006715244 @default.
- W1933993361 cites W274360350 @default.
- W1933993361 hasPublicationYear "1994" @default.
- W1933993361 type Work @default.
- W1933993361 sameAs 1933993361 @default.
- W1933993361 citedByCount "0" @default.
- W1933993361 crossrefType "journal-article" @default.
- W1933993361 hasAuthorship W1933993361A5068339489 @default.
- W1933993361 hasAuthorship W1933993361A5074817028 @default.
- W1933993361 hasAuthorship W1933993361A5078117473 @default.
- W1933993361 hasConcept C127413603 @default.
- W1933993361 hasConcept C154945302 @default.
- W1933993361 hasConcept C177264268 @default.
- W1933993361 hasConcept C186644900 @default.
- W1933993361 hasConcept C199360897 @default.
- W1933993361 hasConcept C204321447 @default.
- W1933993361 hasConcept C2776036281 @default.
- W1933993361 hasConcept C2776224158 @default.
- W1933993361 hasConcept C28490314 @default.
- W1933993361 hasConcept C41008148 @default.
- W1933993361 hasConcept C60048249 @default.
- W1933993361 hasConcept C78519656 @default.
- W1933993361 hasConceptScore W1933993361C127413603 @default.
- W1933993361 hasConceptScore W1933993361C154945302 @default.
- W1933993361 hasConceptScore W1933993361C177264268 @default.
- W1933993361 hasConceptScore W1933993361C186644900 @default.
- W1933993361 hasConceptScore W1933993361C199360897 @default.
- W1933993361 hasConceptScore W1933993361C204321447 @default.
- W1933993361 hasConceptScore W1933993361C2776036281 @default.
- W1933993361 hasConceptScore W1933993361C2776224158 @default.
- W1933993361 hasConceptScore W1933993361C28490314 @default.
- W1933993361 hasConceptScore W1933993361C41008148 @default.
- W1933993361 hasConceptScore W1933993361C60048249 @default.
- W1933993361 hasConceptScore W1933993361C78519656 @default.
- W1933993361 hasLocation W19339933611 @default.
- W1933993361 hasOpenAccess W1933993361 @default.
- W1933993361 hasPrimaryLocation W19339933611 @default.
- W1933993361 hasRelatedWork W1540959437 @default.
- W1933993361 hasRelatedWork W1551773846 @default.
- W1933993361 hasRelatedWork W1568589650 @default.
- W1933993361 hasRelatedWork W1983541059 @default.
- W1933993361 hasRelatedWork W2007367068 @default.
- W1933993361 hasRelatedWork W2074064986 @default.
- W1933993361 hasRelatedWork W2079597242 @default.
- W1933993361 hasRelatedWork W2098419420 @default.
- W1933993361 hasRelatedWork W2111080068 @default.
- W1933993361 hasRelatedWork W2120973168 @default.
- W1933993361 hasRelatedWork W2131940753 @default.
- W1933993361 hasRelatedWork W2189337307 @default.
- W1933993361 hasRelatedWork W2396902797 @default.
- W1933993361 hasRelatedWork W2400988636 @default.
- W1933993361 hasRelatedWork W2588225431 @default.
- W1933993361 hasRelatedWork W2742917097 @default.
- W1933993361 hasRelatedWork W2757814986 @default.
- W1933993361 hasRelatedWork W2785657704 @default.
- W1933993361 hasRelatedWork W2901548275 @default.
- W1933993361 hasRelatedWork W425689430 @default.
- W1933993361 isParatext "false" @default.
- W1933993361 isRetracted "false" @default.
- W1933993361 magId "1933993361" @default.
- W1933993361 workType "article" @default.