aslp@nwpu

Last Modified: Dec 2022

Collaborators

Tencent

Alibaba

Huawei

Sogou

Xiaomi

Microsoft

iQiyi

JD.com

ByteDance

Baidu

OPPO

I2R

Lei Xie

Ph.D., Professor, IEEE Senior Member

Audio, Speech & Language Processing Group (ASLP@NPU)

Shaanxi Provincial Key Laboratory of Speech & Image Information Processing (SAIIP)

School of Computer Science, Northwestern Polytechnical University, Xi'an, China

E-mail: lxie (at) nwpu.edu.cn, xielei21st (at) gmail.com, lxie (at) nwpu-aslp.org (Convert AT to @)

Biosketch

Lei Xie received the Ph.D. degree in computer science from Northwestern Polytechnical University, Xi'an, China, in 2004. From 2001 to 2002, he was with the Department of Electronics and Information Processing, Vrije Universiteit Brussel (VUB), Brussels, Belgium, as a Visiting Scientist. From 2004 to 2006, he was a Senior Research Associate with the Center for Media Technology, School of Creative Media, City University of Hong Kong, Hong Kong, China. From 2006 to 2007, he was a Postdoctoral Fellow with the Human-Computer Communications Laboratory (HCCL), The Chinese University of Hong Kong, Hong Kong, China. He is currently a Professor with the School of Computer Science, Northwestern Polytechnical University, Xi'an, China, and leads the Audio, Speech and Language Processing Group (ASLP@NPU). He has published over 280 papers in refereed journals and conferences, such as IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE Transactions on Multimedia, Interspeech, ICASSP, ASRU, SLT, ACL and ACM Multimedia. His team collaborates extensively with industry, including Microsoft, Alibaba, Tencent, Huawei, Xiaomi, Sogou, JD.com, ByteDance, iQiyi and Meituan. His current research interests include general topics in speech and language processing, multimedia and human-computer interaction. Dr. Xie is currently a Senior Area Editor (SAE) of IEEE/ACM Transactions on Audio, Speech and Language Processing. He is a member of the IEEE Speech and Language Technical Committee (SLTC) and the Vice Chairperson of ISCA SIG-CSLP. He has actively served as a chair for many conferences and technical committees.

Research Interests

  • Audio, Speech and Language Processing, including but not limited to: Speech Enhancement and Separation, Robust Speech Recognition, Speech Synthesis, Voice Conversion, Speaker Recognition, Keyword Spotting, Low-resource Speech Processing
  • Multimedia/Multimodal Information Processing
  • Machine Learning
  • Human Computer Interaction

Recent Selected Publications

Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su, UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis, AAAI2023 PDF

Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement, IEEE Transactions on Multimedia PDF

Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Odette Scharenborg, AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons, IEEE Transactions on Multimedia, 2022 PDF

Xiang Hao, Chenglin Xu, Lei Xie, Neural speech enhancement with unsupervised pre-training and mixture training, Neural Networks Volume 158, January 2023, Pages 216-227 PDF

Yi Lei, Shan Yang, Xinfa Zhu, Lei Xie, Dan Su, Cross-speaker Emotion Transfer through Information Perturbation in Emotional Speech Synthesis, IEEE Signal Processing Letters, Volume 29, Page(s) 1948 - 1952, September 2022 PDF

Liumeng Xue, Frank K. Soong, Shaofei Zhang, Lei Xie, ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS, IEEE/ACM Trans. on Audio, Speech and Language Processing, 2022 PDF

Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie, MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario, SLT2022, January 9 to 12, 2023, Doha, Qatar PDF

Shubo Lv, Yihui Fu, Yukai Jv, Lei Xie, Weixin Zhu, Wei Rao, Yannan Wang, Spatial-DCCRN: DCCRN Equipped with Frame-level Angle Feature and Hybrid Filtering for Multi-channel Speech Enhancement, SLT2022, January 9 to 12, 2023, Doha, Qatar PDF

Yukai Ju, Shimin Zhang, Wei Rao, Yannan Wang, Tao Yu, Lei Xie, Shidong Shang, TEA-PSE 2.0: SUB-BAND NETWORK FOR REAL-TIME PERSONALIZED SPEECH ENHANCEMENT, SLT2022, January 9 to 12, 2023, Doha, Qatar PDF

Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu, The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Yuhao Liang, Peikun Chen, Fan Yu, Xinfa Zhu, Tianyi Xu, Yingying Gao, Lei Xie, The NPU-ASLP System for The ISCSLP 2022 Magichub Code-Switching ASR Challenge, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, Qing Wang, Lei Xie, TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su, AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Yongmao Zhang, Zhichao Wang, Peiji Yang, Hongshen Sun, Zhisheng Wang, Lei Xie, AccentSpeech: Learning Accent from Crowd-sourced Data for Target Speaker TTS with Accents, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, Haiying Wu, Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Qicong Xie, Shan Yang, Yi Lei, Lei Xie, Dan Su, End-to-End Voice Conversion with Information Perturbation, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan, Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios, ISCSLP2022, December 11 to 14, 2022, Singapore PDF

Yue Li, Li Zhang, Namin Wang, Jie Liu, MSV Challenge 2022: NPU-HC Speaker Verification System for Low-resource Indian Languages, O-COCOSDA MSV Challenge, 2022 PDF

Li Zhang, Yue Li, Namin Wang, Jie Liu, Lei Xie, NPU-HC Speaker Verification System for Far-field Speaker Verification, Interspeech Far-field Speaker Verification Challenge (FFSVC), 2022 PDF

Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie, NWPU-ASLP System for the VoicePrivacy 2022 Challenge, Interspeech Voice Privacy Challenge Workshop, 2022 PDF

Li Zhang, Yue Li, Huan Zhao, Qing Wang, Lei Xie, Backend Ensemble for Speaker Verification and Spoofing Countermeasure, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Shimin Zhang, Ziteng Wang, Yukai Ju, Yihui Fu, Yueyue Na, Qiang Fu, Lei Xie, Personalized Acoustic Echo Cancellation for Full-duplex Communications, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Liumeng Xue, Shan Yang, Na Hu, Dan Su, Lei Xie, Learning noise-independent speech representations for high-quality voice conversion for noisy target speakers, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie, Pengcheng Zhu, Mengxiao Bi, Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by learning from Singing Teachers, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Zhanheng Yang, Sining Sun, Jin Li, Xiaoming Zhang, Xiong Wang, Long Ma, Lei Xie, CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Zhanheng Yang, Hang Lv, Xiong Wang, Ao Zhang, Lei Xie, Minimizing Sequential Confusion Error in Speech Command Recognition, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie, A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Yu Wang, Xinsheng Wang, Pengcheng Zhu, Jie Wu, Hanzhao Li, Heyang Xue, Yongmao Zhang, Lei Xie, Mengxiao Bi, Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Binbin Zhang, Di Wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu, WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie, Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, Lei Xie, Linguistic-Acoustic Bimodal Shift Based Accent Recognition, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Kun Wei, Pengcheng Guo, Ning Jiang, Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma, Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Yi Lei, Shan Yang, Jian Cong, Lei Xie, Dan Su, Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion, INTERSPEECH2022, September 18 to 22, 2022, Korea PDF

Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Lei Xie, Controllable cross-speaker emotion transfer for end-to-end speech synthesis, IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022 PDF

Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu, Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge, ICASSP2022, May 22-27, 2022, Singapore PDF

Jingyong Hou, Lei Xie, Shilei Zhang, Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution, Neural Networks, 150 (2022) 28-42 PDF

Yi Lei, Shan Yang, Xinsheng Wang, and Lei Xie, MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 30, 2022 PDF

Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu, M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge, ICASSP2022, May 22-27, 2022, Singapore PDF

Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi, VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis, ICASSP2022, May 22-27, 2022, Singapore PDF

Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, Yannan Wang, Tao Yu, S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement, ICASSP2022, May 22-27, 2022, Singapore PDF

Yukai Ju, Wei Rao, Xiaopeng Yan, Yihui Fu, Shubo Lv, Luyao Cheng, Yannan Wang, Lei Xie, Shidong Shang, TEA-PSE: Tencent-ethereal-audiolab personalized speech enhancement system for ICASSP 2022 DNS CHALLENGE, ICASSP2022, May 22-27, 2022, Singapore PDF

Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma, Conversational Speech Recognition by Learning Conversation-level Characteristics, ICASSP2022, May 22-27, 2022, Singapore PDF

Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng, WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition, ICASSP2022, May 22-27, 2022, Singapore PDF

Yihui Fu, Yun Liu, Jingdong Li, Dawei Luo, Shubo Lv, Yukai Jv, Lei Xie, Uformer: A unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation, ICASSP2022, May 22-27, 2022, Singapore PDF

Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Pengcheng Zhu, Mengxiao Bi, One-shot Voice Conversion for Style Transfer Based on Speaker Adaptation, ICASSP2022, May 22-27, 2022, Singapore PDF

Shimin Zhang, Ziteng Wang, Jiayao Sun, Yihui Fu, Biao Tian, Qiang Fu, Lei Xie, Multi-Task Deep Residual Echo Suppression with Echo-aware Loss, ICASSP2022, May 22-27, 2022, Singapore PDF

Hongqiang Du, Lei Xie, Haizhou Li, Noise-robust voice conversion with domain adversarial training, Neural Networks, Volume 148, April 2022, Pages 74-84 PDF

Xiaochun An, Frank K. Soong and Lei Xie, Disentangling Style and Speaker Attributes for TTS Style Transfer, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 30, 2022 PDF

Li Zhang, Qing Wang, Lei Xie, Duality Temporal-channel-frequency Attention Enhanced Speaker Representation Learning, ASRU2021, Dec 13-17, 2021, Cartagena, Colombia PDF

Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang, Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR, ASRU2021, Dec 13-17, 2021, Cartagena, Colombia PDF

Li Zhang, Huan Zhao, Qinglin Meng, Yanli Chen, Min Liu, Lei Xie, Beijing ZKJ-NPU Speaker Verification System for VoxCeleb Speaker Recognition Challenge 2021, The VoxSRC Workshop 2021 (Rank 2 in Track 1 and 2) PDF

Qijie Shao, Jingyong Hou, Yanxin Hu, Qing Wang, Lei Xie and Xin Lei, Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting, APSIPA ASC, Dec 14 - 17, 2021, Tokyo, Japan PDF

Xian Shi, Pan Zhou, Wei Chen, and Lei Xie, Efficient Gradient-Based Neural Architecture Search For End-to-End ASR, 2021 International Conference on Multimodal Interaction (ICMI2021), October 18 - 22, 2021, Montreal, Canada PDF

Yi Chen, Shan Yang, Na Hu, Lei Xie and Dan Su, TeNC: Low Bit-Rate Speech Coding with VQ-VAE and GAN, 2021 International Conference on Multimodal Interaction (ICMI2021), October 18 - 22, 2021, Montreal, Canada PDF

Heyang Xue, Xiao Zhang, Jie Wu, Jian Luan, Yujun Wang, and Lei Xie, Noise Robust Singing Voice Synthesis Using Gaussian Mixture Variational Autoencoder, 2021 International Conference on Multimodal Interaction (ICMI2021), October 18 - 22, 2021, Montreal, QC, Canada PDF

Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su, Controllable Context-aware Conversational Speech Synthesis, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Jian Cong, Shan Yang, Lei Xie, Dan Su, Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, Wendong Gan, Haitao Chen, Hai Li, Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Xiong Wang, Sining Sun, Lei Xie, Long Ma, Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie, Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Shimin Zhang, Yuxiang Kong, Shubo Lv, Yanxin Hu, Lei Xie, F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen, AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li, Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Hongqiang Du, Lei Xie, Dan Su, Improving robustness of one-shot voice conversion with deep discriminative speaker encoder, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Xiaochun An, Frank K. Soong, Lei Xie, Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Shubo Lv, Yanxin Hu, Shimin Zhang, Lei Xie, DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Jingsong Wang, Yuxuan He, Chunyu Zhao, Qijie Shao, Wei-Wei Tu, Tom Ko, Hung-yi Lee, Lei Xie, Auto-KWS 2021 Challenge: Task, Datasets, and Baselines, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Zhuoyuan Yao, Di Wu, Xiong Wang, Binbin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, WeNet: Production Oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, Factorized WaveNet for voice conversion with limited data, Speech Communication, 130 (2021), 45-54 PDF

Xiaochun An, Frank K. Soong, Shan Yang, Lei Xie, Effective and direct control of neural TTS prosody by removing interactions between different attributes, Neural Networks 143 (2021) 250–260 PDF

Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur, An asynchronous WFST-based decoder for automatic speech recognition, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur, Wake word detection with streaming transformers, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu, THE MULTI-SPEAKER MULTI-STYLE VOICE CLONING CHALLENGE 2021, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie, The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Hang Lv, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang, Lei Xie, Sanjeev Khudanpur, LET-Decoder: A WFST-based lazy-evaluation token-group decoder with exact lattice generation, IEEE Signal Processing Letters PDF

Jingyong Hou, Li Zhang, Yihui Fu, Qing Wang, Zhanheng Yang, Qijie Shao, Lei Xie, The NPU System for the 2020 Personalized Voice Trigger Challenge, ISCSLP2021 Personalized Voice Trigger Challenge PDF

Liumeng Xue, Shifeng Pan, Lei He, Lei Xie and Frank K. Soong, Cycle consistent network for end-to-end style transfer TTS training, Neural Networks, vol. 140, August 2021, pages 223-236 PDF

Xiong Wang, Zhuoyuan Yao, Xian Shi, Lei Xie, Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, Lei Xie, Multi-Channel Automatic Speech Recognition Using Deep Complex Unet, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Haoneng Luo, Shiliang Zhang, Ming Lei, Lei Xie, Simplified Self-Attention for Transformer-based End-to-End Speech Recognition, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yihui Fu, Jian Wu, Yanxin Hu, Mengtao Xing, Lei Xie, DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, Dongyan Huang, Hui Bu, Petr Motlicek, Jean-Marc Odobez, IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao, The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yi Lei, Shan Yang, Lei Xie, Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Heyang Xue, Shan Yang, Yi Lei, Lei Xie, Xiulin Li, Learn2Sing: Target Speaker Singing Voice Synthesis by Learning from a Singing Teacher, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, Lei Xie, Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie, Conversational End-to-End TTS for Voice Agent, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, Optimizing voice conversion network with cycle consistency loss of speaker identity, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li, The NUS & NWPU system for Voice Conversion Challenge 2020, Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 30 October 2020, Shanghai, China PDF

Tao Li, Shan Yang, Liumeng Xue, Lei Xie, Controllable Emotion Transfer For End-to-End Speech Synthesis, ISCSLP2021, January 24-26, Hong Kong, China PDF

Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, Hai Li, Lei Xie, Xiulin Li, Accent and Speaker Disentanglement in Many-to-many Voice Conversion, ISCSLP2021, January 24-26, Hong Kong, China PDF

Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie, Xiulin Li, Context-aware RNNLM Rescoring for Conversational Speech Recognition, ISCSLP2021, January 24-26, Hong Kong, China PDF

Qing Wang, Wei Rao, Pengcheng Guo, Lei Xie, Adversarial Training for Multi-domain Speaker Recognition, ISCSLP2021, January 24-26, Hong Kong, China PDF

Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, Bo Xu, Lei Xie, Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals, NeurIPS 2020 PDF

Shan Yang, Yuxuan Wang, Lei Xie, Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise, IEEE Signal Processing Letters, 2020 PDF

Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie, DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement, Interspeech2020, October 25-29, Shanghai, China PDF

Fengyu Yang, Shan Yang, Qinghua Wu, Yujun Wang, Lei Xie, Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis, Interspeech2020, October 25-29, Shanghai, China PDF

Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie, An End-to-end Architecture of Online Multi-channel Speech Separation, Interspeech2020, October 25-29, Shanghai, China PDF

Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie, Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition, Interspeech2020, October 25-29, Shanghai, China PDF

Qing Wang, Pengcheng Guo, Lei Xie, Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition, Interspeech2020, October 25-29, Shanghai, China PDF

Li Zhang, Jian Wu, Lei Xie, NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge, Interspeech2020, October 25-29, Shanghai, China PDF

Haohe Liu, Lei Xie, Jian Wu, Geng Yang, Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music, Interspeech2020, October 25-29, Shanghai, China PDF

Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan, Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training, Interspeech2020, October 25-29, Shanghai, China PDF

Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur, Wake Word Detection with Alignment-Free Lattice-Free MMI, Interspeech2020, October 25-29, Shanghai, China PDF

Jingsong Wang, Tom Ko, Zhen Xu, Xiawei Guo, Souxiang Liu, Wei-Wei Tu, Lei Xie, AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification, Interspeech2020, October 25-29, Shanghai, China PDF

Xian Shi, Qiangze Feng, Lei Xie, The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results, First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020, October 30-31, 2020 PDF

Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma, "Fast Query-by-example Speech Search using Attention-based Deep Binary Embeddings", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2020 PDF

Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, "MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, "EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li, "TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Shan Yang, Heng Lu, Shiyin Kang, Liumeng Xue, Jinba Xiao, Dan Su, Lei Xie, Dong Yu, "On the localness modeling for the self-attention based end-to-end speech synthesis", Neural Networks, Elsevier, 2020 PDF

Chenggang Mi, Lei Xie and Yanning Zhang, "Improving Adversarial Neural Machine Translation for Morphologically Rich Language", IEEE Transactions on Emerging Topics in Computational Intelligence, 2020 PDF

Chenggang Mi, Lei Xie and Yanning Zhang, "Loanword Identification in Low-resource Languages with Minimal Supervision", ACM Transactions on Asian and Low-Resource Language Information Processing, 2020 PDF

Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, Dong Yu, "Time Domain Audio Visual Speech Separation", ASRU2019, 14-18 December 2019, Singapore PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, "Wavenet Factorization with Singular Value Decomposition for Voice Conversion", ASRU2019, 14-18 December 2019, Singapore PDF

Fengyu Yang, Shan Yang, Pengcheng Zhu, Pengju Yan, Lei Xie, "Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias", ASRU2019, 14-18 December 2019, Singapore PDF

Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie, "Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings", ASRU2019, 14-18 December 2019, Singapore PDF

Xiaolian Zhu, Shan Yang, Geng Yang, Lei Xie, "Controlling Emotion Strength with Relative Attribute for End-To-End Speech Synthesis", ASRU2019, 14-18 December 2019, Singapore PDF

Xiaochun An, Yuxuan Wang, Shan Yang, Zejun Ma, Lei Xie, "Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis", ASRU2019, 14-18 December 2019, Singapore PDF

Xiong Wang, Sining Sun, Lei Xie, "Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting", ASRU2019, 14-18 December 2019, Singapore PDF

Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur, "ESPRESSO: A FAST END-TO-END NEURAL SPEECH RECOGNITION TOOLKIT", ASRU2019, 14-18 December 2019, Singapore PDF

Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur, "INCREMENTAL LATTICE DETERMINIZATION FOR WFST DECODERS", ASRU2019, 14-18 December 2019, Singapore PDF

Yougen Yuan, Wei Tang, Minhao Fan, Yue Chao, Peng Zhang, Lei Xie, "Deep Audio-visual System for Closed-set Word-level Speech Recognition", The 21st ACM International Conference on Multimodal Interaction (ICMI 2019), Suzhou, China (Top 1 system in the 1st Mandarin Audio-Visual Speech Recognition Challenge) PDF

Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie, "Exploring RNN-Transducer for Chinese Speech Recognition", APSIPA ASC 2019, 18-21 November, 2019, Lanzhou, China PDF

Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, Xin Lei, "Multiple Fixed Beamformers with a Spacial Wiener-form Postfilter for Far-Field Speech Recognition", APSIPA ASC 2019, 18-21 November, 2019, Lanzhou, China PDF

Sining Sun, Pengcheng Guo, Lei Xie and Mei-Yuh Hwang, Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, vol. 27, no. 11, November 2019 PDF

Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, Region Proposal Network Based Small-Footprint Keyword Spotting, IEEE Signal Processing Letters, 2019 PDF

Xiaolian Zhu, Yuchao Zhang, Shan Yang, Liumeng Xue, Lei Xie, Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis, IEEE Access, vol. 7, 2019 PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, "Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context", IEEE Access, vol. 7, 2019 PDF

Haohan Guo, Frank K. Soong, Lei He, Lei Xie, "A New GAN-based End-to-End TTS Training Algorithm", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Haohan Guo, Frank K. Soong, Lei He, Lei Xie, "Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu, "Building a mixed-lingual neural TTS system with only monolingual data", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Pengcheng Guo, Sining Sun, Lei Xie, "Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, Dong Yu, "Improved Speaker-Dependent Separation for CHiME-5 Challenge", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Qing Wang, Pengcheng Guo, Sining Sun, Lei Xie, John H.L. Hansen, "Adversarial Regularization for End-to-end Robust Speaker Verification", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Shiliang Zhang, Yuan Liu, Ming Lei, Bin Ma, Lei Xie, "Towards Language-Universal Mandarin-English Speech Recognition", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Shan Yang, Heng Lu, Shiyin Kang, Lei Xie, Dong Yu, "ENHANCING HYBRID SELF-ATTENTION STRUCTURE WITH RELATIVE-POSITION-AWARE BIAS FOR SPEECH SYNTHESIS", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie, "INVESTIGATING END-TO-END SPEECH RECOGNITION FOR MANDARIN-ENGLISH CODE-SWITCHING", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie, "COMPONENT FUSION: LEARNING REPLACEABLE LANGUAGE MODEL COMPONENT FOR END-TO-END SPEECH RECOGNITION SYSTEM", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Ke Wang, Frank Soong, Lei Xie, "A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Jingyong Hou, Pengcheng Guo, Sining Sun, Frank K. Soong, Wenping Hu, Lei Xie, "DOMAIN ADVERSARIAL TRAINING FOR IMPROVING KEYWORD SPOTTING PERFORMANCE OF ESL SPEECH", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Xiang Hao, Changhao Shan, Yong Xu, Sining Sun, Lei Xie, "AN ATTENTION-BASED NEURAL NETWORK APPROACH FOR SINGLE CHANNEL SPEECH ENHANCEMENT", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Xiong Wang, Sining Sun, Changhao Shan, Jingyong Hou, Lei Xie, Shen Li, Xin Lei, "ADVERSARIAL EXAMPLES FOR IMPROVING END-TO-END ATTENTION-BASED SMALL-FOOTPRINT KEYWORD SPOTTING", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Shiliang Zhang, Ming Lei, Bin Ma, Lei Xie, "ROBUST AUDIO-VISUAL SPEECH RECOGNITION USING BIMODAL DFSMN WITH MULTI-CONDITION TRAINING AND DROPOUT REGULARIZATION", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Zhiwei Zhao, Jian Wu, Lei Xie, "The NWPU System for CHiME-5 Challenge", The 5th CHiME Speech Separation and Recognition Challenge (CHiME-5), September 7, 2018, Hyderabad, India PDF

Sining Sun, Yangyang Shi, Ching-Feng Yeh, Suliang Bu, Mei-Yuh Hwang, Lei Xie, "Multiple Beamformers with ROVER for the CHiME-5 Challenge", The 5th CHiME Speech Separation and Recognition Challenge (CHiME-5), September 7, 2018, Hyderabad, India PDF

Jingyong Hou, Wenping Hu, Frank K. Soong, Lei Xie, "A Refined Query-by-Example Approach to Spoken Term Detection on ESL Learners' Speech", International Symposium on Chinese Spoken Language Processing (ISCSLP2018), November 26-29, 2018, Taipei, Taiwan PDF

Xiaochun An, Yuchao Zhang, Bing Liu, Liumeng Xue, Lei Xie, "A Kullback-Leibler Divergence Based Recurrent Mixture Density Network for Acoustic Modeling in Emotional Statistical Parametric Speech Synthesis", ACM Multimedia ASMMC Workshop, 26 October 2018, Seoul, Korea PDF

Liumeng Xue, Xiaolian Zhu, Xiaochun An, Lei Xie, "A Comparison of Expressive Speech Synthesis Approaches based on Neural Network", ACM Multimedia ASMMC Workshop, 26 October 2018, Seoul, Korea PDF

Sining Sun, Ching-Feng Yeh, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, "Training Augmentation with Adversarial Examples for Robust Speech Recognition", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie, "Attention-based End-to-End Models for Small-Footprint Keyword Spotting", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Ke Wang, Junbo Zhang, Sining Sun, Yujun Wang, Fei Xiang, Lei Xie, "Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Ke Wang, Junbo Zhang, Yujun Wang, Lei Xie, "Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng, "Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Lei Xie, Tan Lee, Man-Wai Mak, "Guest Editorial: Advances in Deep Learning for Speech Processing", Journal of Signal Processing Systems, 2018 PDF

Sining Sun, Ching-Feng Yeh, Mei-Yuh Hwang, Mari Ostendorf, Lei Xie, "DOMAIN ADVERSARIAL TRAINING FOR ACCENTED SPEECH RECOGNITION", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, Haizhou Li, "UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie, "ATTENTION-BASED END-TO-END SPEECH RECOGNITION ON VOICE SEARCH", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Multi-Task Feature Learning for Low-Resource Query-by-Example Spoken Term Detection", IEEE Journal of Selected Topics in Signal Processing, 2017 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "MULTILINGUAL BOTTLE-NECK FEATURE LEARNING FROM UNTRANSCRIBED SPEECH", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li, "Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "EXTRACTING BOTTLENECK FEATURES AND WORD-LIKE PAIRS FROM UNTRANSCRIBED SPEECH FOR FEATURE REPRESENTATION", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "An End-to-End Neural Network Approach to Story Segmentation", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "Topic Embedding of Sentences for Story Segmentation", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Jie Yan, Lei Xie, Guangsen Wang, Zhong-Hua Fu, "A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Chenglin Xu, Lei Xie, Xiong Xiao, "A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection", Journal of Signal Processing Systems, Springer, 2017 PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "Learning Distributed Sentence Representations for Story Segmentation", Signal Processing, 2017 PDF

Wenpeng Li, BinBin Zhang, Lei Xie, Dong Yu, "Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling", Interspeech2017, August 20-24, Stockholm, Sweden. PDF

Jie Wu, Dongyan Huang, Lei Xie and Haizhou Li, "Denoising Recurrent Neural Network for Deep Bidirectional LSTM based Voice Conversion", Interspeech2017, August 20-24, Stockholm, Sweden. PDF

Yanfeng Lu, Zhengchen Zhang, Chenyu Yang, Huaiping Ming, Xiaolian Zhu, Yuchao Zhang, Shan Yang, Dongyan Huang, Lei Xie, Minghui Dong, "The I2R-NWPU Text-to-Speech System for Blizzard Challenge 2017", Blizzard Challenge 2017 Workshop, August 2017, Stockholm, Sweden PDF

Yougen Yuan, Lei Xie, Zhong-Hua Fu, Qi Cong, "Sound image externalization for headphone based real-time 3D audio", Frontiers of Computer Science, June 2017, Volume 11, Issue 3, pp 419-428.

Lei Xie, Lijuan Wang and Shan Yang, "Visual Speech Animation", Book Chapter in Handbook of Human Motion, Springer, 2017 PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection", ICASSP 2017, March 5-9, 2017, New Orleans, USA. PDF

Hongjie Chen, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News", IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, no. 1, January 2017 PDF

Sining Sun, Binbin Zhang, Lei Xie and Yanning Zhang, "An unsupervised deep domain adaptation approach for robust speech recognition", Neurocomputing, 2017 PDF

Jingyong Hou, Lei Xie, Zhonghua Fu, "Investigating Neural Network based Query-by-Example Keyword Spotting Approach for Personalized Wake-up Word Detection in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Changhao Shan, Lei Xie, Kaisheng Yao, "A Bi-directional LSTM Approach for Polyphone Disambiguation in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Kaituo Xu, Lei Xie, Kaisheng Yao, "Investigating LSTM for Punctuation Prediction", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Zhengchen Zhang, Mei Li, Yuchao Zhang, Weini Zhang, Yang Liu, Shan Yang, Yanfeng Lu, Van Tung Pham, Lei Xie, Minghui Dong, "The I2R-NWPU-NTU Text-to-Speech System at Blizzard Challenge 2016", Blizzard Challenge 2016 Workshop, September 16, 2016, Apple Inc., Cupertino, CA, USA PDF

Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Haizhou Li, "An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity", the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13-15, 2016, Sunnyvale, CA, USA PDF

Mei Li, Zhizheng Wu, Lei Xie, "On the impact of phoneme alignment in DNN-based speech synthesis", the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13-15, 2016, Sunnyvale, CA, USA PDF

Jie Wu, Zhizheng Wu, Lei Xie, "On the Use of I-vectors and Average Voice Model for Voice Conversion without Parallel Data", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Shan Yang, Zhizheng Wu, Lei Xie, "On the training of DNN-based average voice model for speech synthesis", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Zhen Wei, Zhizheng Wu, Lei Xie, "Predicting Articulatory Movement from Text Using Deep Architecture with Stacked Bottleneck Features", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Xiong Xiao, Chenglin Xu, Zhaofeng Zhang, Shengkui Zhao, Sining Sun, Shinji Watanabe, Longbiao Wang, Lei Xie, Douglas L. Jones, Eng Siong Chng, Haizhou Li, "Investigation of Neural Networks Based Beamforming Approaches for Speech Recognition: The NTU Systems for CHiME-4 Evaluation", the 4th International Workshop on Speech Processing in Everyday Environments (CHiME), San Francisco, September 13, 2016 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Learning Neural Network Representations using Cross-lingual Bottleneck Features with Word-pair Information", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng and Haizhou Li, "A DNN-HMM Approach to Story Segmentation", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong and Haizhou Li, "Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li, "Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang and Mingli Song, "Deep neural network derived bottleneck features for accurate audio classification", ICME2016, July 11-15, 2016, Seattle, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Shaofei Zhang, Minghui Dong and Haizhou Li, "Exemplar-based Sparse Representation of Timbre and Prosody for Voice Conversion", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Approximate Search of Audio Queries using DTW with Phone Time Boundary and Data Augmentation", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Chuang Ding, Lei Xie, Jie Yan, Weini Zhang and Yang Liu, "Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features", 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2015), Dec 13-17, 2015, Scottsdale, Arizona PDF

Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Eng Siong Chng, Haizhou Li, "The NNI Query-by-Example System for MediaEval 2015", MediaEval 2015 Workshop, Wurzen, Germany, Sept 14-15, 2015 PDF (Best performing system in the MediaEval2015 QUESST Evaluation)

Xiangzeng Zhou, Lei Xie, Peng Zhang and Yanning Zhang, "Online Object Tracking based on CNN with Metropolis-Hasting Re-sampling", ACM Multimedia 2015, Brisbane, Australia, Oct 26-30, 2015 PDF

Bo Fan, Lei Xie, Shan Yang, Lijuan Wang and Frank K. Soong, "A Deep Bidirectional LSTM Approach for Video-Realistic Talking Head", Multimedia Tools and Applications, Springer, 2016 PDF

Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie and Minghui Dong, "A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li, "A Density Peak Clustering Approach to Unsupervised Acoustic Subword Units Discovery", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li and Minghui Dong, "Non-negative Matrix Factorization using Stable Alternating Direction Method of Multipliers for Source Separation", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study", Interspeech2015, September 6-10, 2015, Dresden, Germany PDF (Interspeech2015 Zerospeech Challenge Best Paper Award)

Huaiping Ming, Dongyan Huang, Lei Xie, Haizhou Li and Minghui Dong, "An Alternating Optimization Approach for Phase Retrieval", Interspeech2015, September 6-10, 2015, Dresden, Germany PDF

Pengcheng Zhu, Lei Xie, Yunlin Chen, "Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks and Word/Phone Embeddings", Interspeech2015, September 6-10, 2015, Dresden, Germany PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong, "Regularized Non-negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source Separation", Interspeech2015, September 6-10, 2015, Dresden, Germany PDF

Xiangzeng Zhou, Lei Xie, Qiang Huang, Stephen Cox and Yanning Zhang, "Tennis Ball Tracking using a Two-Layered Data Association Approach", IEEE Transactions on Multimedia, 2014 PDF

Bo Fan, Lijuan Wang, Frank K. Soong and Lei Xie, "Photo-real Talking Head with Deep Bidirectional LSTM", ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, "Language Independent Query-by-Example Spoken Term Detection using N-Best Phone Sequences and Partial Matching", ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, "The NNI Query-by-Example System for MediaEval 2014", MediaEval 2014 Workshop, Barcelona, Spain, Oct 16-17, 2014 PDF

Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li, "Multi-View Features in a DNN-CRF Model for Improved Sentence Unit Detection on English Broadcast News", APSIPA ASC 2014, Siem Reap, Cambodia, December 9-12, 2014

Chuang Ding, Pengcheng Zhu, Lei Xie, Dongmei Jiang and Zhonghua Fu, "Speech-Driven Head Motion Synthesis Using Neural Networks," Interspeech, Singapore, 14-18 September 2014 PDF

Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng and Haizhou Li, "A Deep Neural Network Approach for Sentence Boundary Detection in Broadcast News," Interspeech, Singapore, 14-18 September 2014 PDF

Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Intrinsic Spectral Analysis Based on Temporal Context Features for Query by Example Spoken Term Detection," Interspeech, Singapore, 14-18 September 2014 (Best Student Paper Finalist) PDF

Zhong-hua Fu, Lei Xie, "Stereo Acoustic Echo Suppression Using Widely Linear Filtering in the Frequency Domain," Interspeech, Singapore, 14-18 September 2014

Shaofei Zhang, Lei Xie, Zhong-hua Fu, "A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency," ISCSLP, Singapore, 12-14 September 2014

Zhong-hua Fu, Lei Xie, "Experimental Study on Dereverberation and Noise Reduction for Distant Speech Recognition," ISCSLP, Singapore, 12-14 September 2014

Hongjie Chen, Lei Xie, Wei Feng, Lilei Zheng and Yanning Zhang, "Topic Segmentation on Spoken Documents Using Self-Validated Acoustic Cuts," Soft Computing, Springer, accepted, June 2014

Xiangzeng Zhou, Lei Xie, Peng Zhang, Yanning Zhang, "An Ensemble of Deep Neural Networks for Object Tracking", ICIP2014, October 27-30, 2014, Paris, France PDF

Chuang Ding, Lei Xie, Pengcheng Zhu, "Head Motion Synthesis From Speech Using Deep Neural Networks", Multimedia Tools and Applications, Springer, accepted, 2014

Chao Yang, Lei Xie and Xiangzeng Zhou, "Unsupervised Broadcast News Story Segmentation Using Distance Dependent Chinese Restaurant Processes", ICASSP2014, May 4-9, 2014, Florence, Italy PDF

Huaiping Ming, Dongyan Huang, Lei Xie and Haizhou Li, "Learning Optimal Features for Music Transcription", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Chenglin Xu, Lei Xie and Zhonghua Fu, "Sentence Boundary Detection in Chinese Broadcast News using Conditional Random Fields and Prosodic Features", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Huaiping Ming, Lei Xie and Haizhou Li, "Filter Bank Design for Automatic Music Transcription", the 2013 Young Engineers and Scientists Conference on Multimedia, Communication and Mobile Application Technologies (YES2013), Nov. 8, 2013, Singapore

Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions", ACL2013, 4-9 August, 2013, Sofia, Bulgaria. PDF

Jianwei Niu, Lei Xie, Lei Jia and Na Hu, "Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013. PDF

Haoran Liang, Mingli Song, Lei Xie and Ronghua Liang, "Personalized 3-D Facial Expression Synthesis based on Landmark Constraint", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Ling Tang, Zhong-Hua Fu and Lei Xie, "Numerical Calculation of the Head-Related Transfer Functions with Chinese Dummy Head", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Lei Xie, Zhigang Deng and Stephen Cox, "Multimodal joint information processing in human machine interaction: recent advances", Multimedia Tools and Applications, Guest Editorial, Springer, November, 2013.

Lei Xie, Naicai Sun and Bo Fan, "A Statistical Parametric Approach to Video-Realistic Text-driven Talking Avatar", Multimedia Tools and Applications, Springer, August 2013.

Peng Yang, Lei Xie, Qiao Luan and Wei Feng, "A Tighter Lower Bound Estimate for Dynamic Time Warping", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiangzeng Zhou, Qiang Huang, Lei Xie and Stephen Cox, "A Two Layered Data Association Approach for Ball Tracking", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Broadcast News Story Segmentation Using Latent Topics on Data Manifold", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xuecheng Nie, Wei Feng, Liang Wan, Lei Xie, "Measuring Similarity by Contextual Word Connections in Chinese News Story Segmentation", ICASSP2013, May 26-31, 2013, Vancouver, Canada

Bingfeng Li, Lei Xie, Pengcheng Zhu and Bo Fan, "Head Motion Generation for Speech-driven Talking Avatar", NCMMSC2013, Journal of Tsinghua University (Sci and Tech), No.6, 2013 PDF

Peng Yang, Lei Xie and Hongjie Chen, "Speech Pattern Discovery using Segmental Dynamic Time Warping and Posteriorgram Features", NCMMSC2013, Journal of Tsinghua University (Sci and Tech), No.6, 2013 PDF

Lei Xie, Lilei Zheng, Zihan Liu and Yanning Zhang, "Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp 264-277, January 2012. PDF Bib

Lei Xie, Yinqing Xu, Lilei Zheng, Qiang Huang and Bingfeng Li, "Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Yali Zhao, Lei Xie and Zhonghua Fu, "A Two Stage Mask Estimation Approach to Robust Speaker Verification", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Wei Feng, Xuecheng Nie, Liang Wan, Lei Xie and Jianmin Jiang, "Lexical Story Co-Segmentation of Chinese Broadcast News", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib

Lei Xie, Chenglin Xu and Xiaoxuan Wang, "Prosody-based Sentence Boundary Detection in Chinese Broadcast News", The 8th International Symposium on Chinese Spoken Language Processing (ISCSLP2012), Hong Kong, China, December 5-8, 2012 PDF Bib

Qiang Huang, Stephen Cox, Xiangzeng Zhou and Lei Xie, "Detection of Ball Hits in a Tennis Game Using Audio and Visual Information", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Yang Liang, Mingli Song, Lei Xie, Jiajun Bu and Chun Chen, "Face Sketch-to-Photo Synthesis from Simple Line Drawing", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Acoustic Texttiling For Story Segmentation Of Spoken Documents", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2012), March 25 - 30, Kyoto, Japan, 2012. PDF Bib Poster

Yali Zhao, Zhong-Hua Fu, Lei Xie, Jian Zhang, Yanning Zhang, "Dual-microphone based binary mask estimation for robust speaker verification", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16-18, 2012.

Dan Li, Zhong-Hua Fu and Lei Xie, "Comprehensive Comparison of the Least Mean Square Algorithm and the Fast Deconvolution Algorithm for Crosstalk Cancellation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16-18, 2012.

Lei Xie, Yulian Yang and Zhi-Qiang Liu, "On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News", Information Sciences, 181(13):2873–2891, Elsevier, 2011. PDF Bib

Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation", Interspeech2011, Florence, Italy, August, 2011. (Interspeech Grant) PDF Bib Slides

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng and Haizhou Li, "Broadcast News Story Segmentation Using Conditional Random Fields and Multi-modal Features", IEICE Transactions on Information and Systems, Vol. E95-D, No. 5, pp. 1206-1215, May 2012. PDF Bib

Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011. PDF Bib

Xiaoyu Chen, Zhonghua Fu and Lei Xie, "Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Jian Zhang, Zhonghua Fu and Lei Xie, "A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo, "Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News", ACM/Springer Multimedia Systems Journal, 17(2):101-112, 2011. PDF Bib

LI Bingfeng, XIE Lei, ZHOU Xiangzeng, FU Zhonghua and ZHANG Yanning, "Real-time speech driven talking avatar", Journal of Tsinghua University, 2011, 51(9):1180-1186. (In Chinese, selected paper from NCMMSC2011, Best Student Paper Nomination Award) PDF Bib

ZHANG Jian, FU Zhonghua, XIE Lei and ZHAO Yali, "Semi-blind dual-microphone noise reduction with known target localization", Journal of Tsinghua University. 2011, 51(9):1215-1219. (In Chinese, selected paper from NCMMSC2011)

ZHENG Li-lei, XIE Lei, LU Mi-mi, WANG Xiao-xuan, YANG Yu-lian and ZHANG Yan-ning, "An Automatic Caption Generator for Mandarin Broadcast News", Chinese Journal of Electronics, 39(3A): 69-74, 2011. PDF Bib

Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei Jiang, "Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010. PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation", APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010. PDF Bib

Zihan Liu, Lei Xie, Wei Feng, "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. (Interspeech Best Student Paper Award Finalist) PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Phoneme Lattice based TextTiling towards Multilingual Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. PDF Bib

Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, "Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach," International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Zihan Liu, Lei Xie and Lilei Zheng, "Laplacian Eigenmaps for Automatic News Story Segmentation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Lei Xie et al., "Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications," Demo for The 7th International Conference on Ubiquitous Intelligence and Computing (UIC), October 26-29, 2010, Xi'an, China

Xiaohai Tian, Zhonghua Fu and Lei Xie, "An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects," The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26-29 September 2010, Beijing, China.

Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, "Minimizing the Expected Complete Influence Time of a Social Network," Information Sciences, 180(13): 2514-2527, 2010.

Wei Feng, Lei Xie and Zhi-Qiang Liu, "Multicue Graph Mincut for Image Segmentation", Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.

Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, "A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.

Wei Feng, Lei Xie, Jia Zeng and Zhi-Qiang Liu, "Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models," Journal of Visual Languages and Computing, invited paper, 20(3):188-195, 2009.

Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Cascade Markov random fields for stroke extraction of Chinese characters," Information Sciences, 180(2):301-311, 2009.

Lilei Zheng, Lei Xie, Xiaoxuan Wang, Mimi Lu, Yulian Yang and Yanning Zhang, "An Automatic Caption Generator for Mandarin Broadcast News," 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009 (Best Paper Award)

Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, "Anchor Labeling System for Broadcast News using Alize toolkit", 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009

Zhonghua Fu, Jhing-Fa Wang and Lei Xie, "Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication", IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.

Lei Xie, "Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news", ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.

Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models", Pattern Recognition, 41, 2008, pp 3636-3643.

Lei Xie and Guangsen Wang, "A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008. PDF Bib

Yulian Yang and Lei Xie, "Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008. (Microsoft Student Grant. This paper is also presented in the 2008 Beijing-Hong Kong International Doctoral Forum, Beijing) PDF Bib

Lei Xie and Yulian Yang, "Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News", Pacific-Rim Conference on Multimedia (PCM2008), LNCS 5353, Springer, pp248-258, 2008.

Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), LNCS 4993, Harbin, China, pp345-355, Springer, 2008.

Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510. PDF Bib

Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007, pp2325-2340. PDF Bib

Lei Xie, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159.

Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL), pp193-196, Rochester, NY, USA, April, 2007.

Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC), Huangshan, Anhui, China, 2007.

Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation", Interspeech, Belgium, 2007. PDF Bib

Lei Xie and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. I, pp593-596, Toulouse, France, 2006.

Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP2006), LNAI 4274, Singapore, pp627-639, Springer, Dec, 2006.

Lei Xie and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", Advances in Machine Learning and Cybernetics, LNAI 3930, Springer, pp99-114, April, 2006.

Lei Xie and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR), vol. I, pp1128-1131, Hong Kong, 2006.

Lei Xie and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services", International Conference on System, Man and Cybernetics (ICSMC), pp4331-4336, Taipei, Taiwan, 2006.

......

WARNING: This page contains links to pdf files whose contents may be covered by copyright. You may browse them at your convenience in the same spirit as you may read a journal or a conference proceedings article in a public library. Retrieving, copying, or distributing these files, however, may violate international copyright protection law.