Matches in SemOpenAlex for { <https://semopenalex.org/work/W3093129615> ?p ?o ?g. }
Showing items 1 to 69 of
69
with 100 items per page.
- W3093129615 abstract "Currently, speaker recognition research is mainly based on phonetics and speech signal processing. This research addresses speaker recognition from a new perspective, analysing the transcription of a fragment of speech with text analysis methods. Since text analysis is based on the transcription text only, it can be assumed independent from current automatic speaker recognition software. Hence, it would contribute significantly to the overall evidential value. The analysis is based on the frequencies of non-content, highly frequent words. We study whether information about the identity of the speaker is contained in the transcription of spoken text. The value of evidence is quantified using a score-based likelihood ratio. The score-based approach is chosen because in most forensic cases, there is not enough data from the suspect or of the disputed speech fragment available to model a robust feature-based likelihood ratio. Different methods to model the system from feature vector over score to likelihood ratio have been compared. As a baseline, a distance based method is used, where the score is the distance between the feature vectors. To improve upon this baseline, machine learning algorithms are implemented. The results from SVM and XGBoost are explored. As a third method a feature-based likelihood ratio is calculated and used as a score instead of as a direct likelihood ratio. With this method, both similarity and typicality are taken into account. The model is trained and tested on the FRIDA data set from the Netherlands Forensic Institute, consisting of Dutch conversations from a homogeneous group of 250 individuals. The performance of the likelihood ratio system is evaluated through computing the cost log-likelihood-ratio (Cllr), which is a measure for the accuracy and quality of the likelihood ratios, and the accuracy (A) of the likelihood ratios solely. The performance is also evaluated by inspecting the Tippett, empirical cross-entropy and pool-adjacent-violators plots. Different values for parameters used in the calculation of the likelihood ratios are investigated: the length of the sample, the number of frequent words (number of features) and the number of samples needed to train the model. The distance method showed a strong baseline, with good performance for large sample lengths. The SVM method outperformed the distance method for all parameter settings, with a peak performance of A=0.94 and Cllr=0.24. The XGBoost method showed promising results for smaller samples lengths, but a too large amount of data is needed to obtain good performance for larger sample lengths. The LR score method showed moderate results, but no improvements due to the necessity to estimate high-dimensional distributions. This thesis shows that information about the identity of the speaker is contained in transcriptions of speech. The complete process from data to likelihood ratio is constructed, where the likelihood ratio quantifies the evidential value of a transcribed speech fragment." @default.
- W3093129615 created "2020-10-22" @default.
- W3093129615 creator A5086938496 @default.
- W3093129615 date "2020-01-01" @default.
- W3093129615 modified "2023-09-26" @default.
- W3093129615 title "Forensic speaker recognition: Based on text analysis of transcribed speech fragments" @default.
- W3093129615 hasPublicationYear "2020" @default.
- W3093129615 type Work @default.
- W3093129615 sameAs 3093129615 @default.
- W3093129615 citedByCount "0" @default.
- W3093129615 crossrefType "journal-article" @default.
- W3093129615 hasAuthorship W3093129615A5086938496 @default.
- W3093129615 hasConcept C133892786 @default.
- W3093129615 hasConcept C138885662 @default.
- W3093129615 hasConcept C149838564 @default.
- W3093129615 hasConcept C153180895 @default.
- W3093129615 hasConcept C154945302 @default.
- W3093129615 hasConcept C15744967 @default.
- W3093129615 hasConcept C179926584 @default.
- W3093129615 hasConcept C204321447 @default.
- W3093129615 hasConcept C2776401178 @default.
- W3093129615 hasConcept C2778223634 @default.
- W3093129615 hasConcept C28490314 @default.
- W3093129615 hasConcept C41008148 @default.
- W3093129615 hasConcept C41895202 @default.
- W3093129615 hasConcept C73484699 @default.
- W3093129615 hasConcept C83665646 @default.
- W3093129615 hasConceptScore W3093129615C133892786 @default.
- W3093129615 hasConceptScore W3093129615C138885662 @default.
- W3093129615 hasConceptScore W3093129615C149838564 @default.
- W3093129615 hasConceptScore W3093129615C153180895 @default.
- W3093129615 hasConceptScore W3093129615C154945302 @default.
- W3093129615 hasConceptScore W3093129615C15744967 @default.
- W3093129615 hasConceptScore W3093129615C179926584 @default.
- W3093129615 hasConceptScore W3093129615C204321447 @default.
- W3093129615 hasConceptScore W3093129615C2776401178 @default.
- W3093129615 hasConceptScore W3093129615C2778223634 @default.
- W3093129615 hasConceptScore W3093129615C28490314 @default.
- W3093129615 hasConceptScore W3093129615C41008148 @default.
- W3093129615 hasConceptScore W3093129615C41895202 @default.
- W3093129615 hasConceptScore W3093129615C73484699 @default.
- W3093129615 hasConceptScore W3093129615C83665646 @default.
- W3093129615 hasLocation W30931296151 @default.
- W3093129615 hasOpenAccess W3093129615 @default.
- W3093129615 hasPrimaryLocation W30931296151 @default.
- W3093129615 hasRelatedWork W1491460777 @default.
- W3093129615 hasRelatedWork W1524196777 @default.
- W3093129615 hasRelatedWork W1768248362 @default.
- W3093129615 hasRelatedWork W1932968309 @default.
- W3093129615 hasRelatedWork W2000963911 @default.
- W3093129615 hasRelatedWork W2055873919 @default.
- W3093129615 hasRelatedWork W2092219257 @default.
- W3093129615 hasRelatedWork W2116375618 @default.
- W3093129615 hasRelatedWork W2127563803 @default.
- W3093129615 hasRelatedWork W2131618289 @default.
- W3093129615 hasRelatedWork W2395673847 @default.
- W3093129615 hasRelatedWork W2585644098 @default.
- W3093129615 hasRelatedWork W2740923123 @default.
- W3093129615 hasRelatedWork W2952783970 @default.
- W3093129615 hasRelatedWork W57306581 @default.
- W3093129615 hasRelatedWork W2244621498 @default.
- W3093129615 hasRelatedWork W2758240959 @default.
- W3093129615 hasRelatedWork W2821517580 @default.
- W3093129615 hasRelatedWork W2829466848 @default.
- W3093129615 hasRelatedWork W2851777596 @default.
- W3093129615 isParatext "false" @default.
- W3093129615 isRetracted "false" @default.
- W3093129615 magId "3093129615" @default.
- W3093129615 workType "article" @default.