Conference Papers – HLT – Electrical and Computer Engineering

Conference Papers – HLT

2024

EMNLP 2024

Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li, "TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models" The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), Miami, Florida, November 12 –16, 2024

Yiming Chen, Xianghu Yue, Xiaoxue Gao, Chen Zhang, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li, "Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models" The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), Miami, Florida, November 12 –16, 2024

ACM MULTIMEDIA 2024

Ruijie Tao, Zhan Shi, Yidi Jiang, Duc-Tuan Truong, Eng-Siong Chng, Massimo Alioto, Haizhou Li, "Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization" ACM MULTIMEDIA 2024, Melbourne, Australia, Oct 28 - Nov 1, 2024

Xianghu Yue, Xueyi Zhang, Yiming Chen, Chengwei Zhang, Mingrui Lao, Huiping Zhuang, Xinyuan Qian, Haizhou Li, "MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks" ACM MULTIMEDIA 2024, Melbourne, Australia, Oct 28 - Nov 1, 2024

IJCAI

Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li, "LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization", International Joint Conference on Artificial Intelligence (IJCAI) in Jeju, Korea, August 3 - 9, 2024.
Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang, "Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition" International Joint Conference on Artificial Intelligence (IJCAI) in Jeju, Korea, August 3 - 9, 2024.

LREC-COLING

Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li, "CrossTune: Black-Box Few-Shot Classification with Label Enhancement" LREC-COLING 2024 - The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation Lingotto Conference Centre - Torino (Italia), 20-25 May, 2024

ICASSP

AAAI

Shimin Zhang*, Qu Yang*, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan, "TC-LIF: A Two-Compartment Spiking Neuron Model for Long-term Sequential Modelling" in the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, Canada. (* Equal Contribution)
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li, "Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling" in the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, Canada.
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Malu Zhang, Haizhou Li, "A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators" in the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, Canada.
Jiadong Wang, Zexu Pan, Malu Zhang, Robby T. Tan, Haizhou Li, "Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition" in the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, Canada.

2023

EMNLP

Chen Zhang, Luis Fernando D'haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li, "xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark" In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 6–10, 2023, Singapore, Resorts World Convention Centre
Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li, "How Well Do Text Embedding Models Understand Syntax?" In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 6–10, 2023, Singapore, Resorts World Convention Centre
Qinyi Wang, Haizhou Li, "Text-Derived Language Identity Incorporation for End-to-End Code-Switching Speech Recognition", In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 6–10, 2023, Singapore, Resorts World Convention Centre

NeurIPS

Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li, “Disentangling Voice and Content with Self-Supervision for Speaker Recognition”, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023), December 10, 2023 – December 16, 2023, New Orleans, Louisiana, U.S.A

Engineering in Medicine and Biology Society (EMBC)

Siqi Cai, Jia Li, Hongmeng Yang, and Haizhou Li, " RGCnet: An Efficient Recursive Gated Convolutional Network for EEG-based Auditory Attention Detection", in 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, July 24 to 27, 2023.

INTERSPEECH

Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li, "Target Active Speaker Detection with Audio-visual Cues", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Jingru Lin, Xianghu Yue, Junyi Ao, Haizhou Li, "Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Ke Zhang, Marvin Borsdorf, Zexu Pan, Haizhou Li, Yangjie Wei, Yi Wang, "Speaker Extraction with Detection of Presence and Absence of Target Speakers", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Ruicong Wang, Siqi Cai and Haizhou Li, "EEG-based Auditory Attention Detection with Spatiotemporal Graph and Graph Convolutional Network", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li, "Explicit Intensity Control for Accented Text-to-speech", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li, "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.
Lu Junchen, Berrak Sisman, Mingyang Zhang, Haizhou Li, "High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units", in Proc. Interspeech 2023, Convention Centre Dublin, Ireland, August 20 to 24, 2023.

IJCAI

Shuang Lian, Jiangrong Shen, Qianhui Liu, Ziming Wang, Rui Yan, Huajin Tang, "Learnable Surrogate Gradient for Direct Training Spiking Neural Networks", International Joint Conference on Artificial Intelligence (IJCAI) in Macau, August 19 - 25, 2023.

EMBC

Siqi Cai, Jia Li, Hongmeng Yang, and Haizhou Li, "RGCnet: An Efficient Recursive Gated Convolutional Network for EEG-based Auditory Attention Detection", Annual International Conference of the IEEE Engineering in Medicine and Biology Society in Sydney, Australia, July 24 - 27, 2023.

ACL

Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li, "Dynamic Transformers Provide a False Sense of Efficiency", Annual Meeting of the Association for Computational Linguistics (ACL’23) in Toronto, Canada, July 9 to 14, 2023.

CVPR

Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li, "Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert", Computer Vision and Pattern Recognition Conference (CVPR) in Vancouver, Canada. June 18 to 22, 2023.
Jiawei Du*, Yidi Jiang*, Vincent TF Tan, Joey Tianyi Zhou, Haizhou Li (*equal contribution), "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation", Computer Vision and Pattern Recognition Conference (CVPR) in Vancouver, Canada. June 18 to 22, 2023.

ICASSP

NER

Saurav Pahuja, Siqi Cai, Tanja Schultz, and Haizhou Li, "XAnet: Cross-Attention Between EEG of Left and Right Brain for Auditory Attention Decoding", International IEEE EMBS Conference on Neural Engineering, Baltimore, MD, USA, April 25 - 27, 2023

2022

Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie, and Haizhou Li, “ESAA: An Eeg-Speech Auditory Attention Detection Database,” 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Hanoi, Vietnam, November 24-26, 2022, pp. 1-6, DOI: 10.1109/O-COCOSDA202257103.2022.9997944
Xiaoxue Gao, Chitralekha Gupta and Haizhou Li, “Music-robust Automatic Lyrics Transcription of Polyphonic Music”, Music Technology and Design, June 5-12, 2022, Saint-Etienne (France)

EMNLP

Bin Wang, Chen Zhang, Yan Zhang, Yiming Chen, Haizhou Li, "Analyzing and Evaluating Faithfulness in Dialogue Summarization", In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 7–11, 2022, pages 4897–4908, Abu Dhabi, United Arab Emirates
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li, “FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation", In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 7–11, 2022, pages 3336–3355, Abu Dhabi, United Arab Emirates
Yiming Chen, Yan Zhang, Bin Wang, Zuozhu Liu, Haizhou Li, "Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework", In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), December 7–11, 2022, pages 8150–8161, Abu Dhabi, United Arab Emirates

NeurIPS

Qu Yang, Jibin Wu, Malu Zhang, Yansong Chua, Xinchao Wang, Haizhou Li, “Training Spiking Neural Networks with Local Tandem Learning”, Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS 2022), November 27, 2022 – December 3, 2022, New Orleans, Louisiana, (U.S.A)

INTERSPEECH

Zexu Pan, Meng Ge, Haizhou Li, “A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction”, in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.
Zeyang Song, Qi Liu, Qu Yang and Haizhou Li, “Knowledge distillation for In-memory keyword spotting model”, in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.
Qu Yang, Qi Liu, Haizhou Li, "Deep Residual Spiking Neural Network for Keyword Spotting in Low-Resource Settings", in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.
Zongyang Du, Berrak Sisman, Kun Zhou and Haizhou Li, “Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion”, in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.
Rui Liu, Berrak Sisman, Bj ̈orn W. Schuller, Guanglai Gao, Haizhou Li, “Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning”, in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.
Marvin Borsdorf, Kevin Scheck, Haizhou Li and Tanja Schultz, “Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language”, in Proc. Interspeech 2022, Songdo ConvensiA, in Incheon, Korea, September 18 to 22, 2022.

ACL

Bin Wang, C.-C. Jay Kuo, and Haizhou Li, "Rethinking Evaluation with Word and Sentence Similarities", In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), 22nd - 27th May 2022 (Volume 1: Long Papers), pages 6060–6077, Dublin, Ireland, DOI: 10.18653/v1/2022.acl-long.419

ICASSP

2021

ASRU

Marvin Borsdorf, Haizhou Li, and Tanja Schultz, “Target Language Extraction at Multilingual Cocktail Parties”, in Proc. IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Cartagena, Colombia, September 2021.
Zongyang Du, Berrak Sisman, Kun Zhou, and Haizhou Li, “Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer”, in Proc. IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Cartagena, Colombia, September 2021.
Yi Ma, Kong Aik Lee, Ville Hautamaki, and Haizhou Li, “PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction”, in Proc. IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Cartagena, Colombia, September 2021.
Bidisha Sharma, Maulik Madhavi, Xuehao Zhou, and Haizhou Li, “Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification”, in Proc. IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Cartagena, Colombia, September 2021.
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li, “Deepa: A Deep Neural Analyzer for Speech and Singing Vocoding”, in Proc. IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, Cartagena, Colombia, September 2021.

INTERSPEECH

ICASSP

2020

APSIPA-ASC

SPEAKER ODYSSEY

Xiaohai Tian, Rohan Kumar Das and Haizhou Li, “Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion” in Proc. Speaker Odyssey, Tokyo, Japan, November 2020, pp. 159-164.
Xiaoxue Gao, Xiaohai Tian, Yi Zhou, Rohan Kumar Das and Haizhou Li, “Personalized Singing Voice Generation Using WaveRNN” in Proc. Speaker Odyssey, Tokyo, Japan, November 2020, pp. 252-258.
Kun Zhou, Berrak Sisman and Haizhou Li, “Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data” in Proc. Speaker Odyssey, Tokyo, Japan, November 2020, pp. 230-237.
Berrak Sisman and Haizhou Li, “Generative Adversarial Networks for Singing Voice Conversion with and without Parallel Data” in Proc. Speaker Odyssey, Tokyo, Japan, November 2020, pp. 238-244.
Rui Liu, Sisman Berrak, Feilong Bao, Guanglai Gao and Haizhou Li, “WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss” in Proc. Speaker Odyssey, Tokyo, Japan, November 2020, pp. 245-251.

INTERSPEECH

ICASSP

2019

Rohan Sheelvant, Bidisha Sharma, Maulik Madhavi, Rohan Kumar Das, S.R.M. Prasanna and Haizhou Li “RSL2019: A Realistic Speech Localization Corpus” in Proc. International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (COCOSDA), Cebu City, Philippines, October 2019, pp. 1-6.
Jibin Wu, Yansong Chua, Malu Zhang, Qu Yang, Guoqi Li and Haizhou Li, “Deep Spiking Neural Network with Novel Spike Count based Learning Rule”, In. Proc. International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, July 2019, pp. 1-6.
Jibin Wu, Yansong Chua, Malu Zhang and Haizhou Li, “Competitive STDP-based Feature Representation Learning for Sound Event Classification”, In. Proc. International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, July 2019, pp. pp. 1-8.
Zihan Pan, Jibin Wu, Yansong Chua, Malu Zhang and Haizhou Li, “Neural Population Coding for Effective Temporal Classification”, In. Proc.International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, July 2019, pp. 1-8.
Maulik Madhavi, Tong Zhan, Haizhou Li and Min Yuan, “First Leap Towards Development of Dialogue System for Autonomous Bus”, In. Proc. International Workshop on Spoken Dialogue Systems Technology (IWSDS), Sicily, Italy, April 2019, pp. 1-6.
Malu Zhang, Jibin Wu, Yansong Chua, Xiaolin Luo, Zihan Pan, Dan Liu, and Haizhou Li, “MPD-AL: An Efficient Membrane Potential Driven Aggregate-Label Learning Algorithm for Spiking Neurons”, In. Proc. Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), Hawaii, USA, 2019, pp. 1327-1334.

ASRU

Berrak Sisman, Mingyang Zhang, Minghui Dong, and Haizhou Li, “On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 144-151.
Hongqiang Du, Xiaohai Tian, Lei Xie and Haizhou Li, “Wavenet Factorization with Singular Value Decomposition for Voice Conversion”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 152-159.
Yi Zhou, Xiaohai Tian, Emre Yılmaz, Rohan Kumar Das and Haizhou Li, “A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 160-167.
Chenglin Xu, Wei Rao, Eng Siong Chng and Haizhou Li, “Time-Domain Speaker Extraction Network”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 327-334.
Rohan Kumar Das, Jichen Yang and Haizhou Li, “Long Range Acoustic and Deep Features Perspective on ASVspoof 2019”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 1018-1025.
Xianghu Yue, Grandee Lee, Emre Yılmaz, Fang Deng and Haizhou Li, “End-to-End Code-Switching ASR for Low-Resourced Language Pairs”, in Proc. IEEE Automatic Speech Recognition Understanding (ASRU) Workshop, Sentosa Island, Singapore, December 2019, pp. 972-979.

APSIPA-ASC

Yitong Liu, Rohan Kumar Das and Haizhou Li, “Multi-band Spectral Entropy Information for Detection of Replay Attacks”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Lanzhou, China, November 2019, pp. 838-843.
Rohan Kumar Das, Jichen Yang and Hazhou Li “Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Lanzhou, China, November 2019, pp. 1630-1635.
Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das, Yi Zhou and Haizhou Li, “Speaker-Independent Spectral Mapping for Speech-to-Singing Conversion”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Lanzhou, China, November 2019, pp. 159-164.
Karthika Vijayan, Kodukula Sri Rama Murty and Haizhou Li, “Allpass Modeling of Phase Spectrum of Speech Signals for Formant Tracking”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Lanzhou, China, November 2019, pp. 1190-1196.
Yi Zhou, Xiaohai Tian, Rohan Kumar Das and Haizhou Li, “Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Lanzhou, China, November 2019, pp. 1282-1287.

INTERSPEECH

ICASSP

Bidisha Sharma, Chitralekha Gupta, Haizhou Li, and Ye Wang, “Automatic Lyrics-to-Audio Alignment on Polyphonic Music using Singing-Adapted Acoustic Models”, in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019, pp. 396-400.
Yi Zhou, Xiaohai Tian, Haihua Xu, Rohan Kumar Das and Haizhou Li “Cross-Lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019, pp. 6790-6794.
Grandee Lee and Haizhou Li “Word and Class Common Space Embedding for Code-switch Language Modeling”, In. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019, pp. 6086-6090.
Chenglin Xu, Wei Rao, Eng Siong Chng and Haizhou Li, “Optimization of Speaker Extraction Neural network with Magnitude and Temporal Spectrum Approximation Loss”, In. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, United Kingdom, May 2019, pp. 6990-6994.

2018

ICASSP

Chenglin Xu, Wei Rao, Xiong Xiao, Eng Siong Chng and Haizhou Li, “Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Calgary, Alberta, Canada, April 2018, pp. 6-10.
Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng and Haizhou Li, “Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Calgary, Alberta, Canada, April 2018, pp. 4889-4893.
Karthika Vijayan, Haizhou Li, Hanwu Sun and Kong-Aik Lee, “On the Importance of Analytic Phase of Speech Signals in Spoken Language Recognition,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Calgary, Alberta, Canada, April 2018, pp. 5194-5198.
Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah and Haizhou Li, “End-to-End Hierarchical Language Identification System,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Calgary, Alberta, Canada, April 2018, pp. 5199-5203.

APSIPA-ASC

INTERSPEECH

2017

Berrak Sisman, Grandee Lee, Haizhou Li and Kay Chen Tan, “On the Analysis and Evaluation of Prosody Conversion Techniques,” International Conference on Asian Language Processing (IALP), Singapore, December 2017, pp. 44-47.
Grandee Lee, Thi-Nga Ho, Eng-Siong Chng and Haizhou Li, “A Review of the Mandarin-English Code-Switching Corpus: SEAME,” International Conference on Asian Language Processing (IALP), Singapore, December 2017, pp. 210-213.

APSIPA-ASC

INTERSPEECH

D.Y. Huang, Wan Ding, Mingyu Xu, Huaiping Ming, Minghui Dong, Xinguo Yu and Haizhou Li, “Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques”, in Proc. INTERSPEECH, Stockholm, Sweden, August 2017, pp. 162-165.
Kong Aik Lee and Haizhou Li, “Gain Compensation for Fast i-Vector Extraction Over Short Duration”, in Proc. INTERSPEECH, Stockholm, Sweden, August 2017, pp. 1527-1531.
Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng and Haizhou Li, “Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source”, in Proc. INTERSPEECH, Stockholm, Sweden, August 2017, pp. 1894-1898.
Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah and Haizhou Li, “Investigating Scalability in Hierarchical Language Identification System”, in Proc. INTERSPEECH, Stockholm, Sweden, August 2017, pp. 2581-2585.
Jie Wu, D.-Y. Huang, Lei Xie and Haizhou Li, “Denoising Recurrent Neural Network for Deep Bidirectional LSTM Based Voice Conversion, in Proc. INTERSPEECH, Stockholm, Sweden, August 2017, pp. 3379-3383.

ASRU

Berrak Sisman, Haizhou Li and Kay Chen Tan, “Sparse Representation of Phonetic Features for Voice Conversion with and Without Parallel Data”, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017, pp. 677-684.
Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang and Haizhou Li, “Statistical Parametric Speech Synthesis using Generative Adversarial Networks Under a Multi-Task Learning Framework”, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017, pp. 685-691.
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, “Multilingual bottle-neck feature learning from Untranscribed Speech”, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017, pp. 727-733.
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma and Haizhou Li, “Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech from Feature Representation”, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017, pp. 734-739.
Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang and Haizhou Li, “Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under a Multi-Task Learning Framework”, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017, pp. 685-691.

2016

Seokhwan Kim, Rafael E. Banchs and Haizhou Li, “Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling for Dialogue Topic Tracking”, in Proc. 54th Annual Meeting of the Association for Computational Linguistics (ACL), Berlin, Germany, August 2016, pp. 963-973.
Wan Ding, Mingyu Xu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Xinguo Yu and Haizhou Li, “Audio and Face Video Emotion Recognition in the Wild Using Deep Neural Networks and Small Datasets”, in Proc. 18^th International Conference on Multimodal Interaction (ICMI), Tokyo, Japan, November 2016, pp. 506-513.

APSIPA-ASC

Nancy F. Chen and Haizhou Li, “Computer-Assisted Pronunciation Training: From Pronunciation Scoring Towards Spoken Language Learning”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Jeju, Korea, December 2016, pp. 1-7.
Xiaohai Tian, Xiong Xiao, Eng Siong Chng and Haizhou Li, “Spoofing Speech Detection using Temporal Convolutional Neural Network”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Jeju, Korea, December 2016, pp. 1-6.
Xiong Xiao, Shinji Watanabe, Eng Siong Chng and Haizhou Li, “Beamforming Networks using Spatial Covariance Features for Far-Field Speech Recognition”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Jeju, Korea, December 2016, pp. 1-6.
Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng and Haizhou Li, “I-Vector Based Deep Neural Network Acoustic Model Adaptation using Multilingual Language Resource”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Jeju, Korea, December 2016, pp. 1-5.

ICASSP

INTERSPEECH

2015

APSIPA-ASC

Van Hai Do, Xiong Xiao, Eng Siong Chng and Haizhou Li “Distance Metric Learning for Kernel Density-Based Acoustic Model Under Limited Training Data Conditions”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 54-58.
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng and Haizhou Li, “A Density Peak Clustering Approach to Unsupervised Acoustic Subword Units Discovery”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 178-183.
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li and Minghui Dong, “Non-Negative Matrix Factorization Using Stable Alternating Direction Method of Multipliers for Source Separation”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 222-228.
Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng and Haizhou Li, “On the Study of Very Low-Resource Language Keyword Search”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 358-364.
Minghui Dong, Chenyu Yang, Yanfeng Lu, Jochen Walter Ehnes, Dong-Yan Huang, Huaiping Ming, Rong Tong, Siu Wa Lee and Haizhou Li, “Mapping Frames with DNN-HMM Recognizer for Non-Parallel Voice Conversion” in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 488-494.
Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng and Haizhou Li, “Multilingual Exemplar-Based Acoustic Model for the NIST Open KWS 2015 Evaluation”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Hong Kong, December 2015, pp. 594-98.

ASRU

Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Eng Siong Chng and Haizhou Li, “Robust Speech Recognition Using Beamforming with Adaptive Microphone Gains and Multichannel Noise Reduction”, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Scottsdale, AZ, USA, December 2015, pp. 460-467.

ICASSP

INTERSPEECH

2014

VCSR”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014, pp. 4883-4887.
Rong Tong, Boon Pang Lim, Nancy F. Chen, Bin Ma and Haizhou Li, “Subspace Gaussian Mixture Model for Computer-Assisted Language Learning”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014, pp.5347-5351.
Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Eng Siong Chng and Haizhou Li, “Discriminative Score Normalization for Keyword Search Decision”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, May 2014, pp.7078-7082.

INTERSPEECH

2013

Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng and Haizhou Li, “Modeling of Term-Distance and Term-Occurrence Information for Improving n-Gram Language model performance”, in Proc. Annual Meeting of the Association for Computational Linguistics (ACL), Sofia, Bulgaria, August 2013, pp.233-237.
Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, “Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions”, in Proc. Annual Meeting of the Association for Computational Linguistics (ACL), Sofia, Bulgaria, August 2013, pp. 190-195.
Zhizheng Wu, Eng Siong Chng and Haizhou Li, “Restricted Machine for Voice Conversion”, in Proc. IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), Beijing, China, July 2013, pp. 104-108.
Yanan Li, Keng Peng Tee, Shuzhi Sam Ge and Haizhou Li, “Building Companionship through Human-Robot Collaboration”, in Proc. International Conference of Social Robotics (ICSR), Bristol, UK, October 2013.

APSIPA-ASC

Zhizheng Wu and Haizhou Li, “Voice conversion and spoofing attack on speaker verification systems”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Kaohsiung, Taiwan, November 2013. pp. 1-9 (Invited paper)
Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Eng Siong Chng, Haizhou Li and Chin-Hui Lee, “A Particle Filter Compensation Approach to Robust LVCSR”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), Kaohsiung, Taiwan, November 2013, pp. 1-7.

INTERSPEECH

ICASSP

2012

Tze Yuang Chong, Xiong Xiao, Tien-Ping Tan, Eng Siong Chng, and Haizhou Li, “Collection and annotation of Malay Conversational Speech Corpus”, in Proc. The International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), Macau, China, December 2012, pp. 30-35.
Deyi Xiong, Min Zhang, and Haizhou Li, “Modeling the Translation of Predicate-Argument Structure for SMT”, in Proc. Annual Meeting of the Association for Computational Linguistics (ACL), Jeju, Korea, July 2012, pp. 902-911.
Wenliang Chen, Min Zhang, and Haizhou Li, “Utilizing Dependency Language Models for Graph-based Dependency Parsing Models”, in Proc. Annual Meeting of the Association for Computational Linguistics (ACL), Jeju, Korea, July 2012, pp. 213-222.
Rafael E. Banchs and Haizhou Li, “IRIS: a Chat-oriented Dialogue System based on the Vector Space Model”, in Proc. Annual Meeting of the Association for Computational Linguistics (ACL), (System Demonstrations), Jeju, Korea, July 2012, pp. 37-42.
Van Hai Do, Xiong Xiao, Eng Siong Chng and Haizhou Li, “A Phone Mapping Technique for Acoustic Modeling of Under-Resourced Languages”, in Proc. International Conference on Asian Language Processing (IALP), Hanoi, Vietnam, November 2012, pp. 233-236.
Liyuan Li, Xinguo Yu, Jun Li, Gang Wang, Ji Yu Shi, Yeow Kee Tan and Haizhou Li, “Vision-based attention estimation and selection for social robot to perform natural interaction in the open world”, in Proc. Seventh Annual Conference on Human-Robot Interaction (HRI), Boston, Massachusetts, USA, March 2012, pp. 183-184.
Keng Peng Tee, Shuzhi Sam Ge, Rui Yan and Haizhou Li, “Adaptive control for robot manipulators under ellipsoidal task space constraints”, in Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vilamoura, Algarve, Portugal, October 2012, pp. 1167-1172.

APSIPA-ASC

Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li and Eliathamby Ambikairajah, “A Study on Spoofing Attack in State-of-the-Art Speaker Verification: the Telephone Speech Case”, in Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC), California, USA, December 2012. (Best Paper Award)

ICASSP

INTERSPEECH

Ye Jiang, Kong Aik Lee, Zhenmin Tang, Bin Ma, Anthony Larcher and Haizhou Li, “PLDA Modeling in I-Vector and Supervector Space for Speaker Verification”, in Proc. INTERSPEECH, Portland, Oregon, September 2012, pp. 1680-1683.
Anthony Larcher, Kong Aik Lee, Bin Ma and Haizhou Li, “RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases”, in Proc. INTERSPEECH, Portland, Oregon, September 2012, pp. 1580-1583.
You Changhuai, Li Haizhou, Ma Bin and Lee Kong Aik, “Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition”, in Proc. INTERSPEECH, Portland, Oregon, September 2012, pp. 2065-2068.

ISCSLP

Van Hai Do, Xiong Xiao, Eng Siong Chng and Haizhou Li, “Context dependent phone mapping for cross-lingual acoustic modelling”, in Proc. 8^th International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, December 2012, pp. 16-20.
Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers”, in Proc. 8^th International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, December 2012, pp. 108-111.
Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong and Haizhou Li, “An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition”, in Proc. 8^th International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, December 2012, pp. 131-135.
Siu Wa Lee, Minghui Dong and Haizhou Li, “A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis”, in Proc. 8^th International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, December 2012, pp. 150-154.

2011

Deyi Xiong, Min Zhang and Haizhou Li, “Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers”, in Proc. Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT), Portland, Oregon, June 2011, pp. 1288-1297.
Rafael E. Banchs and Haizhou Li, “AM-FM: A Semantic Framework for Translation Quality Assessment”, in Proc. Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT), Portland, Oregon, June 2011, pp. 153-158.
Wenliang Chen, Junichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisaws, and Haizhou Li, “SMT Helps Bitext Dependency Parsing”, in Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP), Edinburgh, UK, July 2011, pp. 73–83.
Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen and Haizhou Li, “Joint Models for Chinese POS Tagging and Dependency Parsing”, in Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP), Edinburgh, UK, July 2011, pp. 1180-1191.
Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia and Haizhou Li, “Joint Alignment and Artificial Data Generation: An Empirical Study of Pivot-based Machine Transliteration”, in Proc. Fifth International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, November 2011, pp. 1207-1215.
Guoyu Tang, Yunqing Xia, Min Zhang, Haizhou Li and Fang Zhang, “CLGVSM: Adapting Generalized Vector Space Model to Cross-lingual Document Clustering”, in Proc. Fifth International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, November 2011, pp. 580–588.

ICASSP

Huy Dat Tran and Haizhou Li, “Probabilistic Distance SVM with Hellinger-Exponential Kernel for Sound Event Classification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 2272-2275.
Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Overlapping Audio Event Classification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 3696-3699.
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li, “Score Fusion and Calibration in Multiple Language Detectors with Large Performance Variation”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 4404-4407.
Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong Aik Lee and Haizhou Li, “Classifier Subset Selection and Fusion for Speaker Verification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 4544-4547.
Eryu Wang, Kong Aik Lee, Bin Ma, Haizhou Li, Wu Guo and Li-Rong Dai, “Factored Covariance Modeling for Text-Independent Speaker Verification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 4856-4859.
Xiong Xiao, Jinyu Li, Eng Siong Chng and Haizhou Li, “Maximum Likelihood Adaptation of Histogram Equalization with Constraint for Robust Speech Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech, May 2011, pp. 5480-5483.

INTERSPEECH

2010

Min Zhang, Hui Zhang, and Haizhou Li, “Convolution Kernel over Packed Parse Forest”, in Proc. Association for Computational Linguistics (ACL), Uppsala, Sweden, July 2010, pp. 875-885.
Deyi Xiong, Min Zhang and Haizhou Li, “Error Detection for Statistical Machine Translation Using Linguistic Features”, in Proc. Association for Computational Linguistics (ACL), Uppsala, Sweden, July 2010, Pp. 604-611.
Xiangyu Duan, Min Zhang and Haizhou Li. “Pseudo-word for Phrase-based Machine Translation”, in Proc. Association for Computational Linguistics (ACL), Uppsala, Sweden, July 2010, pp 148-156.
Deyi Xiong, Min Zhang and Haizhou Li, “Learning Translation Boundaries for Phrase-Based Decoding”, in Proc. North American Chapter of the Association for Computational Linguistics – Human Language Technologies: (NAACL-HLT), Los Angeles, CA, June 2010, pp 136-144.
Lianhau Lee, Aiti Aw, Min Zhang and Haizhou Li, “EM-based Hybrid Model for Bilingual Terminology Extraction from Comparable Corpora”, in Proc. International Conference on Computational Linguistics (COLING), Beijing, China, August 2010, pp. 639–646.
Vladimir Pervouchine, Min Zhang, Ming Liu and Haizhou Li, “Improving Name Origin Recognition with Context Features and Unlabelled Data”, in Proc. International Conference on Computational Linguistics (COLING), Beijing, China, August 2010, pp. 972–978.
Min Zhang, Xiangyu Duan, Vladimir Pervouchine and Haizhou Li, “Machine Transliteration: Leveraging on Third Languages”, in Proc. International Conference on Computational Linguistics (COLING), Beijing, China, August 2010, pp. 1444–1452.

INTERSPEECH

ICASSP

2009

INTERSPEECH

Rong Tong, Bin Ma, Haizhou Li, Eng Siong Chng, and Kong-Aik Lee, “Target-Aware Language Models for Spoken Language Recognition”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 200-203.
Hanwu Sun, Tin Lay Nwe, Bin Ma, and Haizhou Li, “Speaker Diarization for Meeting Room Audio”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 900-903.
Ling Cen, Minghui Dong, Paul Chan, and Haizhou Li, “Unit Selection Based Speech Synthesis for Poor Channel Condition”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 2075-2078.
Donglai Zhu, Bin Ma, and Haizhou Li, “Large Margin Estimation of Gaussian Mixture Model Parameters with Extended Baum-Welch for Spoken Language Recognition”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 2179-2182.
Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Discriminative Feature Transformation Using Output Coding for Speech Recognition”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 2979-2982.
Khe Chai Sim and Haizhou Li, “Stream-Based Context-Sensitive Phone Mapping for Cross-Lingual Speech Recognition”, in Proc. INTERSPEECH, Brighton, UK, September 2009, pp. 3019-3022.

ICASSP

2008

INTERSPEECH

ICASSP

Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Target-Oriented Phone Tokenizers for Spoken Language Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 4221-4224.
Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Discriminative Learning for Optimizing Detection Performance in Spoken Language Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 4161-4164.
Tin Lay Nwe and Haizhou Li, “On Fusion of Timbre-Motivated Features for Singing Voice Detection And Singer Identification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 2225-2228.
Swe Zin Kalayar Khine, Tin Lay Nwe, and Haizhou Li, “Singing Voice Detection In Pop Songs Using Co-Training Algorithm”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 1629-1632.
Khe Chai Sim and Haizhou Li, “Robust Phone Set Mapping Using Decision Tree Clustering for Cross-Lingual Phone Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 4309-4312.
Kong-Aik Lee, Changhuai You, and Haizhou Li, “Spoken Language Recognition Using Support Vector Machines with Generative Front-End”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 4153-4156.
Tran Huy Dat and Haizhou Li, “Jump Function Komogorov and Its Application for Audio Stream”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, Nevada, March- April 2008, pp. 3353-3356.

2007

Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, and Minghui Dong, “Semantic Transliteration of Personal Names”, in Proc. Association for Computational Linguistics (ACL), Prague, Czech Republic, June 2007, pp. 120-127.
Hendra Setiawan, Min-Yen Kan, and Haizhou Li, “Ordering Phrases with Function Words”, The in Proc. Association for Computational Linguistics (ACL), Prague, Czech Republic, June 2007, pp. 712-719.
Tee Kiah Chia, Haizhou Li, and Hwee Tou Ng, “A Statistical Language Modeling Approach to Lattice-based Spoken Document Retrieval”, in Proc. Joint Meeting Conference on Empirical Methods in Natural Language Processing, and Conference on Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, June 2007, pp. 810–818.
Tin Lay Nwe and Haizhou Li, “Singing Voice Detection using Perceptually-Motivated Features”, in Proc. ACM Annual Conference on Multimedia (ACM), Augsburg, Germany, September 2007, pp. 309-312.
Lei Wang, Eng Siong Chng, and Haizhou Li, “A vector-based approach to broadcast audio database indexing and retrieval”, in Proc. IEEE International Conference on Multimedia and Expo (ICME), Beijing, China, July 2007. pp. 512-515.

ICASSP

Bin Ma, Rong Tong, and Haizhou Li, “Discriminative Vector for Spoken Language Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hawaii, USA, April 2007, pp. pp. 1001-1004.
Rong Tong, Haizhou Li, Bin Ma, and Eng Siong Chng, “Spoken Language Recognition with Relevance Feedback”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hawaii, USA, April 2007, pp. 861-864.
Donglai Zhu, Bin Ma, Haizhou Li, and Qiang Huo, “A Generalized Feature Transformation Approach for Channel Robust Speaker Verification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hawaii, USA, April 2007, pp. 61-64.
Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalizing the Speech Modulation Spectrum for Robust Speech Recognition”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hawaii, USA, April 2007, pp. 1021-1024.

INTERSPEECH

Kong Aik Kee, Changhuai You, Haizhou Li, and Tomi Kinnunen, “A GMM-based Probabilistic Sequence Kernel for Speaker Verification”, in Proc. INTERSPEECH, Antwerp, Belgium, August 2007, pp. 294-297.
Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-Siong Chng, Haizhou Li, and Susanto Rahardja, “Using Direction of Arrival Estimate and Acoustic Feature Information in Speaker Diarization”, in Proc. INTERSPEECH, Antwerp, Belgium, August 2007, pp. 2149-2152.
Khe Chai Sim and Haizhou Li, “Fusion of Contrastive Acoustic Models for Parallel Phonotactic Spoken Language Identification”, in Proc. INTERSPEECH, Antwerp, Belgium, August 2007, pp. 170-173.
Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Evaluating the Temporal Structure Normalisation Technique on the Aurora-4 Task”, in Proc. INTERSPEECH, Antwerp, Belgium, August 2007, pp. 1070-1073.

2006

Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Learning Transliteration Lexicons from the Web”, in Proc. Association for Computational Linguistics (COLING-ACL), Sydney, Australia, July 2006, pp. 1129 – 1136.
Namunu Maddage, Haizhou Li, and Mohan Kankanhalli, “Music Structure-based Vector Space Retrieval”, in Proc. Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR), Seattle, Washington, August 2006, pp. 67-74.
Denny Iskandar, Ye Wang, Min -Yen Kan, and Haizhou Li, “Syllabic Level Automatic Synchronization of Music Signals and Text Lyrics”, in Proc. ACM Multimedia Conference, Santa Barbara, USA, October 2006, pp. 659-662.
Namunu C Maddage, Mohan S. Kankanhalli, and Haizhou Li, “A Hirarchical Approach for Music Chord Modeling based on the Analysis of Tonal Characteristics”, in Proc. IEEE International Conference on Multimedia and Expo (ICME), Toronto, Canada, July 2006.
Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, and Haizhou Li, “Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion”, in Proc. IEEE Odyssey 2006 – The Speaker and Language Recognition Workshop, San Juan, Puerto Rico, June 2006, pp. 1-5.

ICASSP

Shuanhu Bai and Haizhou Li, “Bayesian Learning of N-gram Statistical Language Modeling”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, May 2006, pp. I-I.
Haizhou Li and Tin Lay Nwe, “Vibrato-Motivated Acoustic Features for Singer Identification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, May 2006, pp. V-V.
Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, and Eng Siong Chng, “Integrating Acoustic, Prosodic and Phonotactic features for Spoken language identification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, May 2006, pp. I-I.

INTERSPEECH

Tin Lay Nwe, Haizhou Li, and Minghui Dong, “Analysis and Detection of Speech under Sleep Deprivation”, in Proc. INTERSPEECH, Pittsburgh, USA, September 2006, pp. 1846-1849.
Haizhou Li, Bin Ma, and Rong Tong, “Vector-Based Spoken Language Recognition using Output Coding”, in Proc. of INTERSPEECH, Pittsburgh, USA, September 2006.
Minghui Dong, Haizhou Li, and Tin Lay Nwe, “Evaluating Prosody of Mandarin Speech for Language Learning”, in Proc. of INTERSPEECH, Pittsburgh, USA, September 2006, pp. 429-432.
Ma Bin, Donglai Zhu, Rong Tong, and Haizhou Li, “Speaker Cluster-based GMM Tokenization for Speaker Recognition”, in Proc. INTERSPEECH, Pittsburgh, USA, September 2006, pp. 505-508.

2005

Min Zhang, Haizhou Li, Jian Su, and Hendra Setiawan, “A Phrase-based Context-dependent Joint Probability”, in Proc. International Joint Conference on Natural Language Processing (IJCNLP), Jeju, South Korea, October 2005, pp. 600-611.
Hendra Setiawan, Haizhou Li, Min Zhang, and Beng Chin Ooi, “Phrase-based Statistical Machine Translation: A Level of Detail Approach”, in Proc. International Joint Conference on Natural Language Processing (IJCNLP), Jeju, South Korea, October 2005, pp. 576-587.
Haizhou Li and Bin Ma, “A Phonotactic Language Model for Spoken Language Identification”, in Proc. Association for Computational Linguistics (ACL), Ann Arbor, USA, June 2005, pp. 515-522.
Bin Ma and Haizhou Li, “A Phonotactic-Semantic Paradigm for Automatic Spoken Document Classification”, in Proc. International ACM SIGIR Conference (SIGIR), Salvador, Brazil, August 2005, pp. 369-376.
Minghui Dong, Kim Teng Lua, and Haizhou Li, “A Unit Selection based Speech Synthesis Approach for Chinese Mandarin Text-to-Speech”, in Proc. of the International Conference on Chinese Computing (ICCC), Singapore, March 2005, pp. 135-144.
Bin Ma and Haizhou Li, “Spoken Language Identification Using Bag-of-Sounds”, in Proc. International Conference on Chinese Computing 2005 (ICCC 2005), Singapore, March 2005.
Manickam K and Haizhou Li, “Complexity Analysis of Normal and Deaf Infant Cry Acoustic Waves”, in Proc. International Workshop on Model and Analysis of Vocal Emission for Biomedical Applications (MAVEBA 2005), Florence, Italy, 2005.
Boon Pang Lim, Bin Ma, and Haizhou Li, “Using Semantic Context to Improve Voice Keyword Mining”, in Proc. International Conference on Chinese Computing 2005 (ICCC 2005), Singapore, March 2005.

ICASSP

Tin Lay Nwe and Haizhou Li, “Broadcast News Segmentation by Audio Type Analysis”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Philadelphia, PA, March 2005, pp. 1065-1068.
Boon Pang Lim, Haizhou Li, and Bin Ma, “Using Local and Global Phonotactical Features in Chinese Dialect Identification”, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Philadelphia, PA, March 2005, pp. 577-580.

INTERSPEECH

Santhosh C. Kumar, V.P. Mohandas, and Haizhou Li, “Multilingual Speech Recognition: A Unified Approach”, in Proc. INTERSPEECH, Lisboa, Portugal, September 2005, pp. 3357-3360.
Tin Lay Nwe and Haizhou Li, “Identifying Singers of Popular Songs”, in Proc. INTERSPEECH, Lisboa, Portugal, September 2005, pp. 129-132.
Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Probabilistic Approach to Prosodic Word Prediction for Mandarin Chinese TTS”, in Proc. INTERSPEECH, Lisboa, Portugal, September 2005, pp. 3245-3248.
Sheng Gao, Bin Ma, Haizhou Li, and Chin-Hui Lee, “A Text Categorization Approach to Automatic Language Identification”, in Proc. INTERSPEECH, Lisboa, Portugal, September 2005, pp. 2837-2840.
Bin Ma, Haizhou Li, and Chin-Hui Lee, “An Acoustic Segment Modeling Approach to Automatic Language Identification”, in Proc. INTERSPEECH, Lisboa, Portugal, September 2005, pp. 2829-2832.

2004

Haizhou Li, Min Zhang, and Jian Su, “A Joint Source-Channel Model for Machine Transliteration”, in Proc. Association for Computational Linguistics (ACL), Barcelona, Spain, July 2004, pp. 160-167.
Min Zhang, Haizhou Li, and Jian Su, “Direct Orthographical Mapping for Machine Transliteration”, in Proc. International Conference on Computational Linguistics (COLING), Geneva, Switzerland, August 2004.
Boon Pang Lim, Haizhou Li, and Yu Chen, “Language Identification through Large Vocabulary Continuous Speech Recognition”, in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, December 2004.
Yeow Kee Tan, Boon Seong Teoh, and Haizhou Li, “A Grapheme to Phoneme Conversion for Standard Malay”, in Proc. International Conference on Speech and Language System for Human Communication and Workshop on Oriental COCOSDA (ICSLT-OCOCOSDA), New Delhi, India, November 2004.
C. S. Kumar and Haizhou Li, “Language identification System for Multilingual Speech Recognition Systems”, in Proc. International Conference Speech and Computer (SPECOM), St. Petersburg, Russia, September 2004.

INTERSPEECH

Jun Xu, Guohong Fu, and Haizhou Li, “Grapheme-to-Phoneme Conversion for Chinese Text-to-Speech Session Code”, in Proc. INTERSPEECH, Jeju Island, Korea, October 2004.

Return to HLT Main Page