Completed Project 4 – Electrical and Computer Engineering

Text-dependent Speaker Verification and Identification (1 April 2018 – 31 March 2019)

Speaker verification aims to authenticate a person based on the voice example. Broadly it can be categorized as two categories, text-dependent and text-independent. From the view of application based systems text-dependent speaker verification is preferred due to involvement of short pass phrases during training and testing. This project aims to develop novel and effective methods for text-dependent speaker verification and identification to use in real-world applications.

Some of our explorations for text-dependent speaker verification include utterance compensation framework and unified framework for speaker and utterance verification. In the utterance compensation framework, a background utterance model is trained that is used to compensate the lexical content information as we are interested towards the speaker-specific information. Utterance verification is another that co-exists with text-dependent speaker verification. The general way of utterance verification is to have a separate framework from the speaker verification system. We propose a unified framework that can perform both speaker and utterance verification together. It is referred to as unified framework as speaker-utterance-verification. Further, we also aim to explore near-field vs. far-field and wake up word based speaker verification under the scope of this project.

Project Duration: 1 April 2018 – 31 March 2019

PUBLICATIONS

Conference Articles

Tianchi Liu, Rohan Kumar Das, Maulik Madhavi, Shengmei Shen and Haizhou Li, “Speaker-Utterance Dual Attention for Speaker and Utterance Verification”, in Proc. INTERSPEECH, Shanghai, China, October 2020.
Tianchi Liu, Maulik Madhavi, Rohan Kumar Das and Haizhou Li “A Unified Framework for Speaker and Utterance Verification” in Proc. Interspeech 2019, Graz, Austria, September 2019. [link]
Wei Rao, Chenglin Xu, Eng Siong Chng and Haizhou Li, “Target Speaker Extraction for Multi-Talker Speaker Verification”, in Proc. Interspeech, Graz, Austria, September 2019. [link]
Rohan Kumar Das, Maulik Madhavi and Haizhou Li “Compensating Utterance Information in Fixed Phrase Speaker Verification” in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC) 2018, Honolulu, Hawaii, USA, November 2018. [link]