Matches in SemOpenAlex for { <https://semopenalex.org/work/W1630666026> ?p ?o ?g. }
Showing items 1 to 58 of
58
with 100 items per page.
- W1630666026 abstract "Argument from the Poverty of Stimulus (APS) is the great epistemological debate arena between simbolic and statistical paradigms in computational linguistics. Since 2000, several works inside statistical paradigm have been published, attacking APS as they present some unsupervised general-purpose algorithm for language acquisition. Among the most important contributions, Clark’s Ph.D. thesis (2001) appeals to diverse statistical techniques in order to come up with an unsupervised general-purpose algorithm for inducing language and, more precisely, a complete Context-Free Grammar (CFG) for English. Clark (2001) works with several induction techniques for each linguistic phenomenon modelized: morphology from Hidden Markovian Models (HMM), POS-tagging from clustering, etc. Particularly, in this current paper we are interested in the induction of syntax constituency, given a POS-tagged corpus, as a previous step towards the whole process of inducing a complete CFG. In his own thesis, the author admits that more crosslinguistic evidence is needed, so as to support the psycholinguistic plausibility of an approach such as his. Currently, there is no work that have proposed to prove Clark ’s approach in very inflected languages with free-order constituents like Spanish. Thus, our work is intended to contribute with that crosslinguistic evidence, analyzing the feasibilty of the application of Clark ’s algorithm for inducing constituency on Spanish. Clark (2001) entails the application of K-means clustering to group sequences of morpho-syntactic labels, according to their distributional information. Then, there is a stage of filtering out the clusters, through a mutual-information-based criterion between the symbols that co-occur immediately before and after the sequences. This criterion prevents from the typical bias in sparsed corpora, and in turn, succeeds in distinguishing the co-ocurrence of adyacent symbols above the threshold of default entropy for short-distance (Li 1990). Our implementation has been tested on a prototypical corpus, obtaining interesting results. We have verified recall=74%, precision=58% and F-measure=65% for this prototypical stage. These results encourage us to continue with our long-term research, the goal of developing an algorithm for complete acquisition of Spanish." @default.
- W1630666026 created "2016-06-24" @default.
- W1630666026 creator A5042091136 @default.
- W1630666026 creator A5069261956 @default.
- W1630666026 date "2010-06-01" @default.
- W1630666026 modified "2023-09-26" @default.
- W1630666026 title "Inducción de constituyentes sintácticos en español con técnicas de clustering y filtrado por información mutua" @default.
- W1630666026 cites W1495446613 @default.
- W1630666026 cites W1514872638 @default.
- W1630666026 cites W1574901103 @default.
- W1630666026 cites W160235627 @default.
- W1630666026 cites W1969005071 @default.
- W1630666026 cites W1978470410 @default.
- W1630666026 cites W2025210087 @default.
- W1630666026 cites W2036682528 @default.
- W1630666026 cites W2053599613 @default.
- W1630666026 cites W2078828996 @default.
- W1630666026 cites W2095223181 @default.
- W1630666026 cites W2124656564 @default.
- W1630666026 cites W2129882630 @default.
- W1630666026 cites W2142708806 @default.
- W1630666026 cites W2152810530 @default.
- W1630666026 cites W2153568660 @default.
- W1630666026 cites W2162693531 @default.
- W1630666026 cites W2170716495 @default.
- W1630666026 cites W2294835092 @default.
- W1630666026 cites W2586920128 @default.
- W1630666026 cites W3109437689 @default.
- W1630666026 cites W652518244 @default.
- W1630666026 cites W170911924 @default.
- W1630666026 hasPublicationYear "2010" @default.
- W1630666026 type Work @default.
- W1630666026 sameAs 1630666026 @default.
- W1630666026 citedByCount "0" @default.
- W1630666026 crossrefType "journal-article" @default.
- W1630666026 hasAuthorship W1630666026A5042091136 @default.
- W1630666026 hasAuthorship W1630666026A5069261956 @default.
- W1630666026 hasConcept C121332964 @default.
- W1630666026 hasConcept C41008148 @default.
- W1630666026 hasConceptScore W1630666026C121332964 @default.
- W1630666026 hasConceptScore W1630666026C41008148 @default.
- W1630666026 hasLocation W16306660261 @default.
- W1630666026 hasOpenAccess W1630666026 @default.
- W1630666026 hasPrimaryLocation W16306660261 @default.
- W1630666026 hasRelatedWork W1536502753 @default.
- W1630666026 hasRelatedWork W2748952813 @default.
- W1630666026 hasRelatedWork W2899084033 @default.
- W1630666026 hasRelatedWork W2902782467 @default.
- W1630666026 hasRelatedWork W2935759653 @default.
- W1630666026 hasRelatedWork W3105167352 @default.
- W1630666026 hasRelatedWork W54078636 @default.
- W1630666026 hasRelatedWork W1501425562 @default.
- W1630666026 hasRelatedWork W2954470139 @default.
- W1630666026 hasRelatedWork W3084825885 @default.
- W1630666026 isParatext "false" @default.
- W1630666026 isRetracted "false" @default.
- W1630666026 magId "1630666026" @default.
- W1630666026 workType "article" @default.