Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification

Dijana Petrovska-Delacrétaz, Houssemeddine Khemiri

2017

Abstract

We present a text-dependent speaker verification system based on unsupervised data-driven Hidden Markov Models (HMMs) in order to take into account the temporal information of speech data. The originality of our proposal is to train unsupervised HMMs with only raw speech without transcriptions, that provide pseudo phonetic segmentation of speech data. The proposed text-dependent system is composed of the following steps. First, generic unsupervised HMMs are trained. Then the enrollment speech data for each target speaker is segmented with the generic models, and further processing is done in order to obtain speaker and text adapted HMMs, that will represent each speaker. During the test phase, in order to verify the claimed identity of the speaker, the test speech is segmented with the generic and the speaker dependent HMMs. Finally, two approaches based on log-likelihood ratio and concurrent scoring are proposed to compute the score between the test utterance and the speaker’s model. The system is evaluated on Part1 of the RSR2015 database with Equal Error Rate (EER) on the development set, and Half Total Error Rate (HTER) on the evaluation set. An average EER of 1.29% is achieved on the development set, while for the evaluation part the average HTER is equal to 1.32%.

Download


Paper Citation


in Harvard Style

Petrovska-Delacrétaz D. and Khemiri H. (2017). Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification . In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-222-6, pages 199-207. DOI: 10.5220/0006202001990207

in Bibtex Style

@conference{icpram17,
author={Dijana Petrovska-Delacrétaz and Houssemeddine Khemiri},
title={Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification},
booktitle={Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2017},
pages={199-207},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006202001990207},
isbn={978-989-758-222-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification
SN - 978-989-758-222-6
AU - Petrovska-Delacrétaz D.
AU - Khemiri H.
PY - 2017
SP - 199
EP - 207
DO - 10.5220/0006202001990207