Matches in SemOpenAlex for { <https://semopenalex.org/work/W2394605776> ?p ?o ?g. }
Showing items 1 to 62 of
62
with 100 items per page.
- W2394605776 abstract "LPC analysis is one of the most powerful techniques in speech analysis. Spectral zeros during consonant or consonant-vowel transition regions introduce difficulties in estimating LPC parameters. In this paper, we propose to estimate formant frequencies from LPC model by MUSIC (Multiple Signal Classification) and ESPRIT (Estimation of Signal Parameters via Rotational Invariance Techniques). Formant candidates estimated by LS (Least Square), MUSIC and ESPRIT are combined to find an optimal solution. The effectiveness of this algorithm is verified by place classification task of stop consonants. 1. OVERVIEW Classification of stop consonants remains one of the most challenging problems in speech recognition. Halberstadt (1998) [3] reported classification of phones in the TIMIT database using heterogeneous acoustic measurements, they found that: for vowel classification, listener-labeler error and machine error produce very similar performance; for place classification of stop consonants, machine classification results lag by a factor of 1.8 − 5.1. Sussman (1991) [9] investigated the locus equation and applied it to the place classification of stop consonants.They found that discriminant analysis using F2onset and F2vowel as predictors showed 76% classification accuracy. And they achieved 100% classification accuracy using derived slope and intercept values as predictors. It is generally agreed that relative invariance cues of stop consonant place are coded in dynamic spectral shape starting from stop release. Sussman’s result suggested that a compact representation of dynamic spectra could be found by accurate formant estimation. Formant frequencies are estimated from the LPC model. Spectral zeros during consonant or consonant-vowel transition regions introduce difficulties in estimating LPC parameters. This paper proposes an algorithm to improve formant estimation by combining formant candidates from different estimators. The rest of the paper is organized as follows: Section 2 reviews important properties of the LPC model; in Section 3, MUSIC and ESPRIT are proposed for formant estimation; in Section 4, an algorithm combining formant estimation of LS, MUSIC and ESPRIT is proposed, and the effectiveness of this algorithm is demonstrated by place classification of stop consonants. This work was supported by NSF award number 0132900. Statements in this paper reflect the opinions and conclusions of the authors, and are not endorsed by the NSF. 2. LPC MODEL Discrete-time speech production model can be described by [5]: Y (z) = G(z)T (z)R(z) (1) where G(z) is z-transform of source, R(z) is the radiation impedance, and T (z) is the transfer function of the vocal tract taking the form of an ARMA model. In the time-domain, for stationary process {yt} with E[yt] = 0, the ARMA model of Eq. (1) is: p" @default.
- W2394605776 created "2016-06-24" @default.
- W2394605776 creator A5004778663 @default.
- W2394605776 creator A5023177815 @default.
- W2394605776 creator A5050462311 @default.
- W2394605776 date "2004-10-04" @default.
- W2394605776 modified "2023-09-25" @default.
- W2394605776 title "Stop consonant classification by dynamic formant trajectory" @default.
- W2394605776 cites W1554944419 @default.
- W2394605776 cites W1973499212 @default.
- W2394605776 cites W2048239603 @default.
- W2394605776 cites W2126231666 @default.
- W2394605776 cites W2135034868 @default.
- W2394605776 cites W2136172079 @default.
- W2394605776 cites W2164578337 @default.
- W2394605776 cites W30596880 @default.
- W2394605776 cites W653761051 @default.
- W2394605776 doi "https://doi.org/10.21437/interspeech.2004-403" @default.
- W2394605776 hasPublicationYear "2004" @default.
- W2394605776 type Work @default.
- W2394605776 sameAs 2394605776 @default.
- W2394605776 citedByCount "10" @default.
- W2394605776 countsByYear W23946057762014 @default.
- W2394605776 crossrefType "proceedings-article" @default.
- W2394605776 hasAuthorship W2394605776A5004778663 @default.
- W2394605776 hasAuthorship W2394605776A5023177815 @default.
- W2394605776 hasAuthorship W2394605776A5050462311 @default.
- W2394605776 hasConcept C121332964 @default.
- W2394605776 hasConcept C1276947 @default.
- W2394605776 hasConcept C13662910 @default.
- W2394605776 hasConcept C154945302 @default.
- W2394605776 hasConcept C158215666 @default.
- W2394605776 hasConcept C2778203577 @default.
- W2394605776 hasConcept C2779581591 @default.
- W2394605776 hasConcept C28490314 @default.
- W2394605776 hasConcept C41008148 @default.
- W2394605776 hasConceptScore W2394605776C121332964 @default.
- W2394605776 hasConceptScore W2394605776C1276947 @default.
- W2394605776 hasConceptScore W2394605776C13662910 @default.
- W2394605776 hasConceptScore W2394605776C154945302 @default.
- W2394605776 hasConceptScore W2394605776C158215666 @default.
- W2394605776 hasConceptScore W2394605776C2778203577 @default.
- W2394605776 hasConceptScore W2394605776C2779581591 @default.
- W2394605776 hasConceptScore W2394605776C28490314 @default.
- W2394605776 hasConceptScore W2394605776C41008148 @default.
- W2394605776 hasLocation W23946057761 @default.
- W2394605776 hasOpenAccess W2394605776 @default.
- W2394605776 hasPrimaryLocation W23946057761 @default.
- W2394605776 hasRelatedWork W1524863866 @default.
- W2394605776 hasRelatedWork W1595232633 @default.
- W2394605776 hasRelatedWork W1655196610 @default.
- W2394605776 hasRelatedWork W2045785976 @default.
- W2394605776 hasRelatedWork W2100027023 @default.
- W2394605776 hasRelatedWork W2124268178 @default.
- W2394605776 hasRelatedWork W2131669600 @default.
- W2394605776 hasRelatedWork W2142764841 @default.
- W2394605776 hasRelatedWork W2366172759 @default.
- W2394605776 hasRelatedWork W2394605776 @default.
- W2394605776 isParatext "false" @default.
- W2394605776 isRetracted "false" @default.
- W2394605776 magId "2394605776" @default.
- W2394605776 workType "article" @default.