Matches in SemOpenAlex for { <https://semopenalex.org/work/W3089214111> ?p ?o ?g. }
Showing items 1 to 86 of 86 with 100 items per page.
- W3089214111 endingPage "40" @default.
- W3089214111 startingPage "31" @default.
- W3089214111 abstract "Technical documents with complex structures and orthography present special difficulties for current parsing technology. These include technical notation such as subscripts, superscripts and numeric and algebraic expressions as well as Greek letters, italics, small capitals, brackets and punctuation marks. Structural elements such as references to figures, tables and bibliographic items also cause problems. We first hand-code documents in Standard Generalized Markup Language (SGML) to specify the document’s logical structure (paragraphs, sentences, etc.) and capture significant orthography. Next, a regular expression analyzer produced by LEX is used to tokenize the SGML text. Then a token-based phrasal lexicon is used to identify the longest token sequences in the input that represent single lexical items. This lookup is efficient because limits on lookahead are precomputed for every item. After this, the Alvey Tools parser with specialized subgrammars is used to discover items such as floating-point numbers. The product of these preprocessing stages is a text that is acceptable to a full natural language parser. This work is directed towards automating the building of knowledge bases from research articles in the field of bacterial chemotaxis, but the techniques should be of wide applicability." @default.
- W3089214111 created "2020-10-01" @default.
- W3089214111 creator A5031432453 @default.
- W3089214111 creator A5075084809 @default.
- W3089214111 creator A5087202099 @default.
- W3089214111 creator A5089039914 @default.
- W3089214111 date "1991-02-13" @default.
- W3089214111 modified "2023-09-23" @default.
- W3089214111 title "Preprocessing and lexicon design for parsing technical text" @default.
- W3089214111 cites W131643106 @default.
- W3089214111 cites W1491178396 @default.
- W3089214111 cites W1974731480 @default.
- W3089214111 cites W1985896407 @default.
- W3089214111 cites W2114200727 @default.
- W3089214111 hasPublicationYear "1991" @default.
- W3089214111 type Work @default.
- W3089214111 sameAs 3089214111 @default.
- W3089214111 citedByCount "6" @default.
- W3089214111 crossrefType "journal-article" @default.
- W3089214111 hasAuthorship W3089214111A5031432453 @default.
- W3089214111 hasAuthorship W3089214111A5075084809 @default.
- W3089214111 hasAuthorship W3089214111A5087202099 @default.
- W3089214111 hasAuthorship W3089214111A5089039914 @default.
- W3089214111 hasConcept C136764020 @default.
- W3089214111 hasConcept C138885662 @default.
- W3089214111 hasConcept C150670947 @default.
- W3089214111 hasConcept C154945302 @default.
- W3089214111 hasConcept C186644900 @default.
- W3089214111 hasConcept C199360897 @default.
- W3089214111 hasConcept C204321447 @default.
- W3089214111 hasConcept C2778121359 @default.
- W3089214111 hasConcept C2778790839 @default.
- W3089214111 hasConcept C41008148 @default.
- W3089214111 hasConcept C41895202 @default.
- W3089214111 hasConcept C540372491 @default.
- W3089214111 hasConcept C554936623 @default.
- W3089214111 hasConcept C60048249 @default.
- W3089214111 hasConcept C62701983 @default.
- W3089214111 hasConcept C68699486 @default.
- W3089214111 hasConcept C8797682 @default.
- W3089214111 hasConceptScore W3089214111C136764020 @default.
- W3089214111 hasConceptScore W3089214111C138885662 @default.
- W3089214111 hasConceptScore W3089214111C150670947 @default.
- W3089214111 hasConceptScore W3089214111C154945302 @default.
- W3089214111 hasConceptScore W3089214111C186644900 @default.
- W3089214111 hasConceptScore W3089214111C199360897 @default.
- W3089214111 hasConceptScore W3089214111C204321447 @default.
- W3089214111 hasConceptScore W3089214111C2778121359 @default.
- W3089214111 hasConceptScore W3089214111C2778790839 @default.
- W3089214111 hasConceptScore W3089214111C41008148 @default.
- W3089214111 hasConceptScore W3089214111C41895202 @default.
- W3089214111 hasConceptScore W3089214111C540372491 @default.
- W3089214111 hasConceptScore W3089214111C554936623 @default.
- W3089214111 hasConceptScore W3089214111C60048249 @default.
- W3089214111 hasConceptScore W3089214111C62701983 @default.
- W3089214111 hasConceptScore W3089214111C68699486 @default.
- W3089214111 hasConceptScore W3089214111C8797682 @default.
- W3089214111 hasLocation W30892141111 @default.
- W3089214111 hasOpenAccess W3089214111 @default.
- W3089214111 hasPrimaryLocation W30892141111 @default.
- W3089214111 hasRelatedWork W1525486918 @default.
- W3089214111 hasRelatedWork W1612698827 @default.
- W3089214111 hasRelatedWork W188116024 @default.
- W3089214111 hasRelatedWork W1919254941 @default.
- W3089214111 hasRelatedWork W2146561283 @default.
- W3089214111 hasRelatedWork W22409890 @default.
- W3089214111 hasRelatedWork W2243140998 @default.
- W3089214111 hasRelatedWork W2245077632 @default.
- W3089214111 hasRelatedWork W2751651526 @default.
- W3089214111 hasRelatedWork W284628447 @default.
- W3089214111 hasRelatedWork W2985833574 @default.
- W3089214111 hasRelatedWork W391760326 @default.
- W3089214111 hasRelatedWork W40977747 @default.
- W3089214111 hasRelatedWork W659105873 @default.
- W3089214111 hasRelatedWork W1480620881 @default.
- W3089214111 hasRelatedWork W2109718064 @default.
- W3089214111 hasRelatedWork W2229240293 @default.
- W3089214111 hasRelatedWork W2276338555 @default.
- W3089214111 hasRelatedWork W2738667047 @default.
- W3089214111 hasRelatedWork W2834522630 @default.
- W3089214111 isParatext "false" @default.
- W3089214111 isRetracted "false" @default.
- W3089214111 magId "3089214111" @default.
- W3089214111 workType "article" @default.
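
The abstract above describes a token-based phrasal lexicon that finds the longest token sequences forming single lexical items, with lookahead limits precomputed per item. The record does not give the authors' implementation, so the following is only a minimal sketch of that general idea under stated assumptions: phrases are whitespace-separated token strings, the lookahead limit is stored per first token, and the names `build_lexicon` and `longest_match` are hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

Phrase = Tuple[str, ...]


def build_lexicon(phrases: Iterable[str]) -> Tuple[Dict[str, Set[Phrase]], Dict[str, int]]:
    """Index phrases by first token and precompute a lookahead limit.

    Assumption: the "limit on lookahead" is the length (in tokens) of the
    longest phrase that starts with a given token.
    """
    entries: Dict[str, Set[Phrase]] = {}
    lookahead: Dict[str, int] = {}
    for phrase in phrases:
        tokens = tuple(phrase.split())
        if not tokens:
            continue  # ignore empty entries
        first = tokens[0]
        entries.setdefault(first, set()).add(tokens)
        lookahead[first] = max(lookahead.get(first, 0), len(tokens))
    return entries, lookahead


def longest_match(tokens: List[str],
                  entries: Dict[str, Set[Phrase]],
                  lookahead: Dict[str, int]) -> List[str]:
    """Group tokens into lexical items, preferring the longest phrase."""
    items: List[str] = []
    i = 0
    while i < len(tokens):
        first = tokens[i]
        # Never look further ahead than the precomputed limit or the input end.
        limit = min(lookahead.get(first, 0), len(tokens) - i)
        matched = None
        for n in range(limit, 0, -1):  # try the longest candidate first
            candidate = tuple(tokens[i:i + n])
            if candidate in entries[first]:
                matched = candidate
                break
        if matched is None:
            items.append(first)  # token is not the start of any known phrase
            i += 1
        else:
            items.append(" ".join(matched))
            i += len(matched)
    return items


# Hypothetical lexicon entries, chosen only for illustration.
entries, lookahead = build_lexicon([
    "bacterial chemotaxis",
    "bacterial chemotaxis receptor",
    "floating point",
])
print(longest_match("the bacterial chemotaxis receptor binds".split(), entries, lookahead))
```

Because the limit is known before scanning, a token that starts no phrase costs a single dictionary lookup, and a token that does start one is checked against at most as many candidates as its longest phrase has tokens.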