Matches in SemOpenAlex for { <https://semopenalex.org/work/W1590753278> ?p ?o ?g. }
Showing items 1 to 72 of 72 with 100 items per page.
- W1590753278 abstract "Most information retrieval systems today use only the presence or absence of keywords to classify and retrieve texts. But simple word searches and frequency distributions do not provide these systems with any understanding of those texts. We call this limit the keyword barrier. To go beyond this barrier, information retrieval systems must at least partially understand the texts they retrieve. Although full natural language parsers are capable today of deep understanding within limited domains, they are still too restrictive and slow for general information retrieval. Text skimming parsers, such as DeJong's FRUMP, are capable of coarse-level understanding, but they require large amounts of domain-specific knowledge in each application domain. This dissertation describes FERRET: a full text, conceptual information retrieval system that uses a partial understanding of its texts to provide greater precision and recall performance than keyword search techniques. FERRET parses its input documents by text skimming and then stores their representations as canonical case frames (called abstracts). User queries are similarly converted to case frames, and are matched to the abstracts using a case frame matcher. FERRET uses the McFRUMP parser, a derivative of FRUMP with two important additions. First, it is able to access an on-line English dictionary (Webster's Seventh) to handle unknown words, using script-based expectations to resolve multiple-meaning ambiguities. Second, a script learning component based on Holland's genetic algorithms updates the script database, augmenting McFRUMP's episodic world knowledge. Comparison studies of FERRET's retrieval performance on 1065 astronomy texts show significant improvement in both recall and precision versus the standard boolean keyword search. Precision increased from 35 to 48 percent, and recall more than doubled, from 19 to 52 percent. The script learning component generated new scripts that significantly improved the recall performance of the basic FERRET system without significant effects on precision. The robust parsing abilities demonstrated, with the depth and flexibility provided by on-line dictionary access and script learning, make text skimming a useful foundation for many applications. The partial understanding provided by a canonical case frame representation is useful for tasks as diverse as information filtering, routing, categorization and summarization." @default.
- W1590753278 created "2016-06-24" @default.
- W1590753278 creator A5032540953 @default.
- W1590753278 date "1989-01-01" @default.
- W1590753278 modified "2023-09-26" @default.
- W1590753278 title "Information retrieval by text skimming" @default.
- W1590753278 hasPublicationYear "1989" @default.
- W1590753278 type Work @default.
- W1590753278 sameAs 1590753278 @default.
- W1590753278 citedByCount "27" @default.
- W1590753278 countsByYear W15907532782013 @default.
- W1590753278 crossrefType "journal-article" @default.
- W1590753278 hasAuthorship W1590753278A5032540953 @default.
- W1590753278 hasConcept C126042441 @default.
- W1590753278 hasConcept C134306372 @default.
- W1590753278 hasConcept C138885662 @default.
- W1590753278 hasConcept C154945302 @default.
- W1590753278 hasConcept C186644900 @default.
- W1590753278 hasConcept C195324797 @default.
- W1590753278 hasConcept C204321447 @default.
- W1590753278 hasConcept C23123220 @default.
- W1590753278 hasConcept C33923547 @default.
- W1590753278 hasConcept C36503486 @default.
- W1590753278 hasConcept C41008148 @default.
- W1590753278 hasConcept C41895202 @default.
- W1590753278 hasConcept C44291984 @default.
- W1590753278 hasConcept C554936623 @default.
- W1590753278 hasConcept C76155785 @default.
- W1590753278 hasConcept C90805587 @default.
- W1590753278 hasConceptScore W1590753278C126042441 @default.
- W1590753278 hasConceptScore W1590753278C134306372 @default.
- W1590753278 hasConceptScore W1590753278C138885662 @default.
- W1590753278 hasConceptScore W1590753278C154945302 @default.
- W1590753278 hasConceptScore W1590753278C186644900 @default.
- W1590753278 hasConceptScore W1590753278C195324797 @default.
- W1590753278 hasConceptScore W1590753278C204321447 @default.
- W1590753278 hasConceptScore W1590753278C23123220 @default.
- W1590753278 hasConceptScore W1590753278C33923547 @default.
- W1590753278 hasConceptScore W1590753278C36503486 @default.
- W1590753278 hasConceptScore W1590753278C41008148 @default.
- W1590753278 hasConceptScore W1590753278C41895202 @default.
- W1590753278 hasConceptScore W1590753278C44291984 @default.
- W1590753278 hasConceptScore W1590753278C554936623 @default.
- W1590753278 hasConceptScore W1590753278C76155785 @default.
- W1590753278 hasConceptScore W1590753278C90805587 @default.
- W1590753278 hasLocation W15907532781 @default.
- W1590753278 hasOpenAccess W1590753278 @default.
- W1590753278 hasPrimaryLocation W15907532781 @default.
- W1590753278 hasRelatedWork W129750103 @default.
- W1590753278 hasRelatedWork W1485548522 @default.
- W1590753278 hasRelatedWork W1956559956 @default.
- W1590753278 hasRelatedWork W1971605129 @default.
- W1590753278 hasRelatedWork W199271499 @default.
- W1590753278 hasRelatedWork W1995343657 @default.
- W1590753278 hasRelatedWork W2075141984 @default.
- W1590753278 hasRelatedWork W2083605078 @default.
- W1590753278 hasRelatedWork W2107634432 @default.
- W1590753278 hasRelatedWork W2125713050 @default.
- W1590753278 hasRelatedWork W2131911084 @default.
- W1590753278 hasRelatedWork W2520484982 @default.
- W1590753278 hasRelatedWork W2539178971 @default.
- W1590753278 hasRelatedWork W2618134747 @default.
- W1590753278 hasRelatedWork W2758582739 @default.
- W1590753278 hasRelatedWork W2792379413 @default.
- W1590753278 hasRelatedWork W1499025187 @default.
- W1590753278 hasRelatedWork W2099794778 @default.
- W1590753278 hasRelatedWork W2795060791 @default.
- W1590753278 hasRelatedWork W3141054140 @default.
- W1590753278 isParatext "false" @default.
- W1590753278 isRetracted "false" @default.
- W1590753278 magId "1590753278" @default.
- W1590753278 workType "article" @default.