aslp@npu

English Version

Lab Wiki

只搜索谢磊博士的网页

最新更新:2021年06月

隶属于

合作单位

tencent

alibaba

Huawei

sougou

XiaoMi

microsoft

iqiyi

jd.com

bytedance

Baidu

unisound

I2R

I2R

I2R

谢磊 Lei Xie

博士,教授,博导

西北工业大学计算机学院

陕西省语音与图像信息处理重点实验室

音频、语音与语言处理研究组

电邮: lxie (at) nwpu.edu.cn, xielei21st (at) gmail.com, lxie (at) nwpu-aslp.org (将 (at) 换为 @)

网页: http://lxie.npu-aslp.org

新闻:

履历:

谢磊,西北工业大学计算机学院教授、博士生导师,音频语音与语言处理研究组(ASLP@NPU)负责人。2001年至2002年在比利时布鲁塞尔自由大学(VUB)担任研究科学家。2004年至2007年先后在香港城市大学(CityU)创意媒体学院和香港中文大学(CUHK)系统工程与工程管理学系从事研究工作。2007年作为海外引进人才受聘于西北工业大学计算机学院。获得教育部“新世纪优秀人才支持计划”、陕西省青年科技新星、西安市青年科技奖、亚太信号与信息处理协会(APSIPA)杰出讲学专家等荣誉。研究领域包括音频语音与语言处理、多媒体技术、机器学习、人机交互等。在包括IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE Transactions on Multimedia, IEEE Journal of Selected Topics in Signal Processing, Interspeech, ICASSP, ASRU,ACL,ACM Multimedia在内的重要期刊和会议上发表论文200余篇,获得多项学术会议最佳论文奖和重要国际评测第一名。主持多项国家级与省部级科研项目,与华为、微软、腾讯、阿里巴巴、搜狗、小米、京东、百度、三星、出门问问、字节跳动、快手、美团、爱奇艺等十余家业界著名企业开展了广泛深入的技术合作,研究成果在企业中获得广泛应用。担任重要学术会议主席40余次,包括第十届国际中文口语语言处理学术会议(ISCSLP2016)程序委员会主席、第十一届和第十五届全国人机语音通讯学术会议(NCMMSC2011、NCMMSC2019)程序委员会主席、2018中国多媒体大会(ChinaMM2018)程序委员会主席、2021 IEEE口语语言技术研讨会(SLT2021)大会主席、第三届亚太信号与信息处理协会年度峰会(APSIPA ASC2011)组织主席等。谢磊教授目前担任语音领域顶级期刊IEEE/ACM Transactions on Audio, Speech and Language Processing的编委。谢磊教授是中国计算机学会(CCF)语音对话与听觉专业组常务委员、中国中文信息学会理事、中国中文信息学会语音信息专业委员会副主任,亚太信号与信息处理协会(APSIPA)语音语言与音频学术委员会委员、国际中文口语语言处理兴趣小组(SIG-CSLP)工作组主席、NCMMSC常设机构副主席、中国计算机学会多媒体专业委员会委员、IEEE高级会员、中国计算机学会高级会员等。

研究兴趣:

  • 音频、语音与语言处理
  • 多媒体信息处理
  • 模式识别与机器学习
  • 人机交互

了解ASLP实验室:

ASLP Lab

近期论文:

Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su, Controllable Context-aware Conversational Speech Synthesis, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Jian Cong, Shan Yang, Lei Xie, Dan Su, Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, Wendong Gan, Haitao Chen, Hai Li, Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion , Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Xiong Wang, Sining Sun, Lei Xie, Long Ma, Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition , Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie, Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Shimin Zhang, Yuxiang Kong, Shubo Lv, Yanxin Hu, Lei Xie, F-T-LSTM based Complex Network for Joint Acoustic Echo Cancellation and Speech Enhancement, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong,Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen, AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li. Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Hongqiang Du, Lei Xie, Dan Su, Improving robustness of one-shot voice conversion with deep discriminative speaker encoder, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Xiaochun An, Frank K. Soong, Lei Xie, Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Shubo Lv,Yanxin Hu,Shimin Zhang,Lei Xie, DCCRN+: Channel-wise Subband DCCRN with SNR Estimation for Speech Enhancement, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Jingsong Wang, Yuxuan He, Chunyu Zhao, Qijie Shao, Wei-Wei Tu, Tom Ko, Hung-yi Lee, Lei Xie, Auto-KWS 2021 Challenge: Task, Datasets, and Baselines, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Zhuoyuan Yao, Di Wu, Xiong Wang, Binbin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, WeNet: Production Oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit, Interspeech2021, Brno, Czech Republic, Aug 30 - Sept 3, 2021 PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, Factorized WaveNet for voice conversion with limited data, Speech Communication, 130 (2021), 45-54 PDF

Xiaochun An, Frank K. Soong, Shan Yang, Lei Xie, Effective and direct control of neural TTS prosody by removing interactions between different attributes, Neural Networks 143 (2021) 250–260 PDF

Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur, An asynchronous WFST-based decoder for automatic speech recognition, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur, Wake word detection with streaming transformers, ICASSP2021, Toronto, Canada, 6-11 June, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu, THE MULTI-SPEAKER MULTI-STYLE VOICE CLONING CHALLENGE 2021, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie, The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods, ICASSP2021, Toronto, Canada, 6-11 June, 2021 PDF

Hang Lv, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang, Lei Xei, Sanjeev Khudanpur, LET-Decoder: A WFST-based lazy-evaluation token-group decoder with exact lattice generation, IEEE Signal Processing Letters PDF

Jingyong Hou, Li Zhang, Yihui Fu, Qing Wang, Zhanheng Yang, Qijie Shao, Lei Xie, The NPU System for the 2020 Personalized Voice Trigger Challenge, ISCSLP2021 Personalized Voice Trigger Challenge PDF

Liumeng Xue, Shifeng Pan, Lei He, Lei Xie and Frank K. Soong, Cycle consistent network for end-to-end style transfer TTS training, Neural Networks, vol. 140, August 2021, pages 223-236 PDF

Xiong Wang, Zhuoyuan Yao, Xian Shi, Lei Xie, Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, Lei Xie, Multi-Channel Automatic Speech Recognition Using Deep Complex Unet, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Haoneng Luo, Shiliang Zhang, Ming Lei, Lei Xie, Simplified Self-Attention for Transformer-based End-to-End Speech Recognition, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yihui Fu, Jian Wu, Yanxin Hu, Mengtao Xing, Lei Xie, DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, Dongyan Huang, Hui Bu, Petr Motlicek, Jean-Marc Odobez, IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao, The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Yi Lei, Shan Yang, Lei Xie, Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Heyang Xue, Shan Yang, Yi Lei, Lei Xie, Xiulin Li, Learn2Sing: Target Speaker Singing Voice Synthesis by Learning from a Singing Teacher, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, Lei Xie, Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie, Conversational End-to-End TTS for Voice Agent, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, Optimizing voice conversion network with cycle consistency loss of speaker identity, IEEE SLT2021, January 19-22, Shenzhen, China PDF

Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li, The NUS & NWPU system for Voice Conversion Challenge 2020, Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 30 October 2020, Shanghai, China PDF

Tao Li, Shan Yang, Liumeng Xue, Lei Xie, Controllable Emotion Transfer For End-to-End Speech Synthesis, ISCSLP2021, January 24-26, Hong Kong, China PDF

Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, Hai Li, Lei Xie, Xiulin Li, Accent and Speaker Disentanglement in Many-to-many Voice Conversion, ISCSLP2021, January 24-26, Hong Kong, China PDF

Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie, Xiulin Li, Context-aware RNNLM Rescoring for Conversational Speech Recognition, ISCSLP2021, January 24-26, Hong Kong, China PDF

Qing Wang, Wei Rao, Pengcheng Guo, Lei Xie, Adversarial Training for Multi-domain Speaker Recognition, ISCSLP2021, January 24-26, Hong Kong, China PDF

Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, Bo Xu, Lei Xie, Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals, NeurlPS 2020, PDF

Shan Yang, Yuxuan Wang, Lei Xie, Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise, IEEE Signal Processing Letters, 2020 PDF

Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie, DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement, Interspeech2020, October 25-29, Shanghai, China PDF

Fengyu Yang, Shan Yang, Qinghua Wu, Yujun Wang, Lei Xie, Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis, Interspeech2020, October 25-29, Shanghai, China PDF

Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie, An End-to-end Architecture of Online Multi-channel Speech Separation, Interspeech2020, October 25-29, Shanghai, China PDF

Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie, Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition, Interspeech2020, October 25-29, Shanghai, China PDF

Qing Wang, Pengcheng Guo, Lei Xie, Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition, Interspeech2020, October 25-29, Shanghai, China PDF

Li Zhang, Jian Wu, Lei Xie, NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge, Interspeech2020, October 25-29, Shanghai, China PDF

Haohe Liu, Lei Xie, Jian Wu, Geng Yang, Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music, Interspeech2020, October 25-29, Shanghai, China PDF

Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan, Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training, Interspeech2020, October 25-29, Shanghai, China PDF

Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur, Wake Word Detection with Alignment-Free Lattice-Free MMI, Interspeech2020, October 25-29, Shanghai, China PDF

Jingsong Wang, Tom Ko, Zhen Xu, Xiawei Guo, Souxiang Liu, Wei-Wei Tu, Lei Xie, AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification, Interspeech2020, October 25-29, Shanghai, China PDF

Xian Shi, Qiangze Feng, Lei Xie, The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results, First Workshop on Speech Technologies for Code-switching in Multilingual Communities 2020, October 30-31, 2020 PDF

Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma, "Fast Query-by-example Speech Search using Attention-based Deep Binary Embeddings", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2020 PDF

Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, "MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, "EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li, "TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION", ICASSP2020, Barcelona, Spain, May 4-8, 2020 PDF

Shan Yang, Heng Lu, Shiyin Kang, Liumeng Xue, Jinba Xiao, Dan Su, Lei Xie, Dong Yu, "On the localness modeling for the self-attention based end-to-end speech synthesis", Neural Networks, Elsevier, 2020 PDF

Chenggang Mi, Lei Xie and Yanning Zhang, "Improving Adversarial Neural Machine Translation for Morphologically Rich Language", IEEE Transactions on Emerging Topics in Computational Intelligence, 2020 PDF

Chenggang Mi, Lei Xie and Yanning Zhang, "Loanword Identification in Low-resource Languages with Minimal Supervision", ACM Transactions on Asian and Low-Resource Language Information Processing, 2020 PDF

Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, Dong Yu, "Time Domain Audio Visual Speech Separation", ASRU2019, 14-18 December 2019, Singapore PDF

Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, "Wavenet Factorization with Singular Value Decomposition for Voice Conversion", ASRU2019, 14-18 December 2019, Singapore PDF

Fengyu Yang, Shan Yang, Pengcheng Zhu, Pengju Yan, Lei Xie, "Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias", ASRU2019, 14-18 December 2019, Singapore PDF

Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie, "Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings", ASRU2019, 14-18 December 2019, Singapore PDF

Xiaolian Zhu, Shan Yang, Geng Yang, Lei Xie, "Controlling Emotion Strength with Relative Attribute for End-To-End Speech Synthesis", ASRU2019, 14-18 December 2019, Singapore PDF

Xiaochun An, Yuxuan Wang, Shan Yang, Zejun Ma, Lei Xie, "Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis", ASRU2019, 14-18 December 2019, Singapore PDF

Xiong Wang, Sining Sun, Lei Xie, "Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting", ASRU2019, 14-18 December 2019, Singapore PDF

Yiming Wang, Tongfei Chen,Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur, "ESPRESSO: A FAST END-TO-END NEURAL SPEECH RECOGNITION TOOLKIT", ASRU2019, 14-18 December 2019, Singapore PDF

Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur, "INCREMENTAL LATTICE DETERMINIZATION FOR WFST DECODERS", ASRU2019, 14-18 December 2019, Singapore PDF

Yougen Yuan, Wei Tang, Minhao Fan, Yue Chao, Peng Zhang, Lei Xie, "Deep Audio-visual System for Closed-set Word-level Speech Recognition", The 21st ACM International Conference on Multimodal Interaction (ICMI 2019), Suzhou, China (Top 1 system in the 1st Mandarin Audio-Visual Speech Recognition Challenge) PDF

Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie, "Exploring RNN-Transducer for Chinese Speech Recognition", APSIPA ASC 2019, 18-21 November, 2019, Lanzhou, China PDF

Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, Xin Lei, "Multiple Fixed Beamformers with a Spacial Wiener-form Postfilter for Far-Field Speech Recognition", APSIPA ASC 2019, 18-21 November, 2019, Lanzhou, China PDF

Sining Sun, Pengcheng Guo, Lei Xie and Mei-Yuh Hwang, Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, vol. 27, no. 11, November 2019 PDF

Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, Region Proposal Network Based Small-Footprint Keyword Spotting, IEEE Signal Processing Letters, 2019 PDF

Xiaolian Zhu, Yuchao Zhang, Shan Yang, Liumeng Xue, Lei Xie, Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis, IEEE Access, vol. 7, 2019 PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, "Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context", IEEE Access, vol. 7, 2019 PDF

Haohan Guo, Frank K. Soong, Lei He, Lei Xie, "A New GAN-based End-to-End TTS Training Algorithm", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Haohan Guo, Frank K. Soong, Lei He, Lei Xie, "Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu, "Building a mixed-lingual neural TTS system with only monolingual data", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Pengcheng Guo, Sining Sun, Lei Xie, "Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Jian Wu, Yong Xu, Shi-Xiong Zhang, Lian-Wu Chen, Meng Yu, Lei Xie, Dong Yu, "Improved Speaker-Dependent Separation for CHiME-5 Challenge", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Qing Wang, Pengcheng Guo, Sining Sun, Lei Xie1, John H.L. Hansen, "Adversarial Regularization for End-to-end Robust Speaker Verification", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Shiliang Zhang, Yuan Liu, Ming Lei, Bin Ma, Lei Xie, "Towards Language-Universal Mandarin-English Speech Recognition", Interspeech2019, 16-19 September, 2019, Graz, Austria PDF

Shan Yang, Heng Lu, Shiying Kang, Lei Xie, Dong Yu, "ENHANCING HYBRID SELF-ATTENTION STRUCTURE WITH RELATIVE-POSITION-AWARE BIAS FOR SPEECH SYNTHESIS", ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie, “INVESTIGATING END-TO-END SPEECH RECOGNITION FOR MANDARIN-ENGLISH CODE-SWITCHING”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie, “COMPONENT FUSION: LEARNING REPLACEABLE LANGUAGE MODEL COMPONENT FOR END-TO-END SPEECH RECOGNITION SYSTEM”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Ke Wang, Frank Soong, Lei Xie, “A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Jingyong Hou, Pengcheng Guo, Sining Sun, Frank K. Soong, Wenping Hu, Lei Xie, “DOMAIN ADVERSARIAL TRAINING FOR IMPROVING KEYWORD SPOTTING PERFORMANCE OF ESL SPEECH”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Xiang Hao, Changhao Shan, Yong Xu, Sining Sun, Lei Xie, “AN ATTENTION-BASED NEURAL NETWORK APPROACH FOR SINGLE CHANNEL SPEECH ENHANCEMENT”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Xiong Wang, Sining Sun, Changhao Shan, Jingyong Hou, Lei Xie, Shen Li, Xin Lei, “ADVERSARIAL EXAMPLES FOR IMPROVING END-TO-END ATTENTION-BASED SMALL-FOOTPRINT KEYWORD SPOTTING”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Shiliang Zhang, Ming Lei, Bin Ma, Lei Xie, “ROBUST AUDIO-VISUAL SPEECH RECOGNITION USING BIMODAL DFSMN WITH MULTI-CONDITION TRAINING AND DROPOUT REGULARIZATION”, ICASSP2019, 12-17 May, 2019, Brighton, UK PDF

Zhiwei Zhao, Jian Wu, Lei Xie, "The NWPU System for CHiME-5 Challenge", The 5th CHiME Speech Separation and Recognition Challenge (CHiME-5), September 7, 2019, Hyderabad, India PDF

Sining Sun, Yangyang Shi, Ching-Feng Yeh, Suliang Bu, Mei-Yuh Hwang, Lei Xie, "Multiple Beamformers with ROVER for the CHiME-5 Challenge", The 5th CHiME Speech Separation and Recognition Challenge (CHiME-5), September 7, 2018, Hyderabad, India PDF

Jingyong Hou, Wenping Hu, Frank K. Soong, Lei Xie, "A Refined Query-by-Example Approach to Spoken Term Detection on ESL Learners' Speech", International Symposium on Chinese Spoken Language Processing (ISCSLP2018), November 26-29, 2018, Taipei, Taiwan PDF

Xiaochun An, Yuchao Zhang, Bing Liu, Liumeng Xue, Lei Xie, "A Kullback-Leibler Divergence Based Recurrent Mixture Density Network for Acoustic Modeling in Emotional Statistical Parametric Speech Synthesis", ACM Multimedia ASMMC Workshop, 26 October 2018, Seoul, Korea PDF

Liumeng Xue, Xiaolian Zhu, Xiaochun An, Lei Xie, "A Comparison of Expressive Speech Synthesis Approaches based on Neural Network", ACM Multimedia ASMMC Workshop, 26 October 2018, Seoul, Korea PDF

Sining Sun, Ching-Feng Yeh, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie, "Training Augmentation with Adversarial Examples for Robust Speech Recognition", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie, "Attention-based End-to-End Models for Small-Footprint Keyword Spotting", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Ke Wang, Junbo Zhang, Sining Sun, Yujun Wang, Fei Xiang, Lei Xie, "Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Ke Wang, Junbo Zhang, Yujun Wang, Lei Xie, "Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie1, Hongjie Chen, Bin Ma, Haizhou Li, "Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search", Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng,"Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition",Interspeech2018, September 2-6, 2018, Hyderabad, India PDF

Lei Xie, Tan Lee, Man-Wai Mak, "Guest Editorial: Advances in Deep Learning for Speech Processing", Journal of Signal Processing Systems, 2018 PDF

Sining Sun, Ching-Feng Yeh, Mei-Yuh Hwang, Mari Ostendorf, Lei Xie, "DOMAIN ADVERSARIAL TRAINING FOR ACCENTED SPEECH RECOGNITION", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, Haizhou Li, "UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie, "ATTENTION-BASED END-TO-END SPEECH RECOGNITION ON VOICE SEARCH", ICASSP2018, 15-20 April 2018, Calgary, Alberta, Canada PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Multi-Task Feature Learning for Low-Resource Query-by-Example Spoken Term Detection", IEEE Journal of Selected Topics in Signal Processing, 2017 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "MULTILINGUAL BOTTLE-NECK FEATURE LEARNING FROM UNTRANSCRIBED SPEECH", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li, "Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "EXTRACTING BOTTLENECK FEATURES AND WORD-LIKE PAIRS FROM UNTRANSCRIBED SPEECH FOR FEATURE REPRESENTATION ", 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2017), December 16-20, 2017, Okinawa, Japan PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "An End-to-End Neural Network Approach to Story Segmentation", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "Topic Embedding of Sentences for Story Segmentation", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Jie Yan, Lei Xie, Guangsen Wang, Zhong-Hua Fu, "A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification", 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2017), December 12-15, 2017, Kuala Lumpur, Malaysia PDF

Chenglin Xu, Lei Xie, Xiong Xiao, "A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection", Journal of Signal Processing Systems, Springer, 2017 PDF

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, "Learning Distributed Sentence Representations for Story Segmentation", Signal Processing, 2017 PDF

Wenpeng Li, BinBin Zhang, Lei Xie, Dong Yu, "Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling", Interspeech2017, August 20-24, Stockholm, Sweden. PDF

Jie Wu, Dongyan Huang, Lei Xie and Haizhou Li, "Denoising Recurrent Neural Network for Deep Bidirectional LSTM based Voice Conversion", Interspeech2017, August 20-24, Stockholm, Sweden. PDF

Yanfeng Lu, Zhengchen Zhang, Chenyu Yang, Huaiping Ming, Xiaolian Zhu, Yuchao Zhang, Shan Yang, Dongyan Huang, Lei Xie, Minghui Dong, "The I2R-NWPU Text-to-Speech System for Blizzard Challenge 2017", Blizzard Challenge 2017 Workshop, August 2017, Stockholm, Sweden pdf

Yougen Yuan, Lei Xie, Zhong-Hua Fu, Qi Cong, "Sound image externalization for headphone based real-time 3D audio", Frontiers of Computer Science, June 2017, Volume 11, Issue 3, pp 419-428.

Lei Xie, Lijuan Wang and Shan Yang, "Visual Speech Animation", Book Chapter in Handbook of Human Motion, Springer, 2017 PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection",ICASSP 2017, March 5-9, 2017, New Orleans, USA. PDF

Hongjie Chen, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News", IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 25, no. 1, January 2017 PDF

Sining Sun, Binbin Zhang, Lei Xie and Yanning Zhang, An unsupervised deep domain adaptation approach for robust speech recognition, Neurocomputing, 2017 PDF

Jingyong Hou, Lei Xie, Zhonghua Fu, "Investigating Neural Network based Query-by-Example Keyword Spotting Approach for Personalized Wake-up Word Detection in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Changhao Shan, Lei Xie, Kaisheng Yao, "A Bi-directional LSTM Approach for Polyphone Disambiguation in Mandarin Chinese", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Kaituo Xu, Lei Xie, Kaisheng Yao, "Investigating LSTM for Punctuation Prediction", the 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016), October 17-20, 2016, Tianjin, China PDF

Zhengchen Zhang, Mei Li, Yuchao Zhang, Weini Zhang, Yang Liu, Shan Yang, Yanfeng Lu,Van Tung Pham, Lei Xie, Minghui Dong, "The I2R-NWPU-NTU Text-to-Speech System at Blizzard Challenge 2016", Blizzard Challenge 2016 Workshop, September 16, 2016, Apple Inc., Cupertino, CA, USA PDF

Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Quy Hy Nguyen, Minghui Dong, Haizhou Li, "An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity", the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13th -15th, 2016, Sunnyvale, CA, USA PDF

Mei Li, Zhizheng Wu, Lei Xie, "On the impact of phoneme alignment in DNN-based speech synthesis", Mei Li, Zhizheng Wu, Lei Xie, the 9th ISCA Workshop on Speech Synthesis (SSW9), September 13th -15th, 2016, Sunnyvale, CA, USA PDF

Jie Wu, Zhizheng Wu, Lei Xie, "On the Use of I-vectors and Average Voice Model for Voice Conversion without Parallel Data", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Shan Yang, Zhizheng Wu, Lei Xie, "On the training of DNN-based average voice model for speech synthesis", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Zhen Wei, Zhizheng Wu, Lei Xie, "Predicting Articulatory Movement from Text Using Deep Architecture with Stacked Bottleneck Features", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2016), December 13-16, 2016, Jeju, Korea PDF

Xiong Xiao, Chenglin Xu, Zhaofeng Zhang, Shengkui Zhao, Sining Sun, Shinji Watanabe, Longbiao Wang, Lei Xie, Douglas L. Jones, Eng Siong Chng, Haizhou Li, Investigation of Neural Networks Based Beamforming Approaches for Speech Recognition: The NTU Systems for CHiME-4 Evaluation, the 4th International Workshop on Speech Processing in Everyday Environments (CHiME), San Francisco, September 13, 2016 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Learning Neural Network Representations using Cross-lingual Bottleneck Features with Word-pair Information", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng and Haizhou Li, "A DNN-HMM Approach to Story Segmentation", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong and Haizhou Li, "Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li,"Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis", Interspeech2016, September 8-12, 2016, San Francisco, USA PDF

Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang and Mingli Song, "Deep neural network derived bottleneck features for accurate audio classification", ICME2016, S July 11-15, 2016, Seattle, USA PDF

Huaiping Ming, Dongyan Huang, Lei Xie, Shaofei Zhang, Minghui Dong and Haizhou Li, "Exemplar-based Sparse Representation of Timbre and Prosody for Voice Conversion", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Approximate Search of Audio Queries using DTW with Phone Time Boundary and Data Augmentation", ICASSP2016, March 20-25, 2016, Shanghai, China PDF

Chuang Ding, Lei Xie, Jie Yan, Weini Zhang and Yang Liu, "Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features",2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2015), Dec 13-17, 2015, Scottsdale, Arizona PDF

Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Eng Siong Chng, Haizhou Li,"The NNI Query-by-Example System for MediaEval 2015", MediaEval 2015 Workshop, Wurzen, Germany, Sept 14-15, 2015 PDF  (Best performing system in the MediaEval2015 QUESST Evaluation)

Xiangzeng Zhou, Lei Xie, Peng Zhang and Yanning Zhang, "Online Object Tracking based on CNN with Metropolis-Hasting Re-sampling", ACM Multimedia 2015, Brisbane, Australia, Oct 26-30, 2015  PDF

Bo Fan, Lei Xie, Shan Yang, Lijuan Wang and Frank K. Soong, "A Deep Bidirectional LSTM Approach for Video-Realistic Talking Head", Multimedia Tools and Applications, Springer, 2015PDF

Bo Fan, Sui Wa Lee, Xiaohai Tian, Lei Xie and Minghua Dong, "A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Jia Yu, Lei Xie, Xiao Xiong, Eng Siong Chng, Haizhou Li, "A Density Peak Clustering Approach to Unsupervised Acoustic Subword Units Discovery", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li and Minghui Dong, "Non-negative Matrix Factorization using Stable Alternating Direction Method of Multipliers for Source Separation", APSIPA ASC 2015, Hong Kong, China, Dec 16-19, 2015 PDF

Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study, Interspeech2015, September 6-10, Dresden, Germany PDF (Interspeech2015 Zerospeech Challenge Best Paper Award)

Huaiping Ming, Dongyan Huang, Lei Xie, Haizhou Li and Minghui Dong, An Alternating Optimization Approach for Phase Retrieval Interspeech2015, September 6-10, Dresden, Germany PDF

Pengcheng Zhu, Lei Xie, Yunlin Chen, Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks andWord/Phone Embeddings,Interspeech2015, September 6-10, Dresden, Germany PDF

Shaofei Zhang, Dongyan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong, Regularized Non-negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source SeparationInterspeech2015, September 6-10, Dresden, Germany PDF

Xiangzeng Zhou, Lei Xie, Qiang Huang, Stephen Cox and Yanning Zhang, "Tennis Ball Tracking using a Two-Layered Data Association Approach", IEEE Transactions on Multimedia, 2014 PDF

Bo Fan, Lijuan Wang, Frank K. Soong and Lei Xie, Photo-real Talking Head with Deep Bidirectional LSTM, ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, Language Independent Query-by-Example Spoken Term Detection using N-Best Phone Sequences and Partial Matching, ICASSP2015, 19-24 April 2015, Brisbane, Australia PDF

Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, Haizhou Li, "The NNI Query-by-Example System for MediaEval 2014", MediaEval 2014 Workshop, Barcelona, Spain, Oct 16-17, 2014 PDF

Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li, " Multi-View Features in a DNN-CRF Model for Improved Sentence Unit Detection on English Broadcast News", APSIPA ASC 2014, Siem Reap, Cambodia, December 9-12, 2014

Chuang Ding, Pengcheng Zhu, Lei Xie, Dongmei Jiang and Zhonghua Fu, "Speech-Driven Head Motion Synthesis Using Neural Networks," Interspeech, Singapore, 14-18, September 2014 PDF

Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng and Haizhou Li, "A Deep Neural Network Approach for Sentence Boundary Detection in Broadcast News," Interspeech, Singapore, 14-18, September 2014 PDF

Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Intrinsic Spectral Analysis Based on Temporal Context Features for Query by Example Spoken Term Detection," Interspeech, Singapore, 14-18, September 2014 (Best Student Paper Finalist) PDF

Zhong-hua Fu, Lei Xie, "Stereo Acoustic Echo Suppression Using Widely Linear Filtering in the Frequency Domain," Interspeech, Singapore, 14-18, September 2014

Shaofei Zhang, Lei Xie, Zhong-hua Fu, "A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency,” ISCSLP, Singapore, 12-14, September 2014

Zhong-hua Fu, Lei Xie, "Experimental Study on Dereverberation and Noise Reduction for Distant Speech Recognition,” ISCSLP, Singapore, 12-14, September 2014

Hongjie Chen, Lei Xie, Wei Feng, Lilei Zheng and Yanning Zhang, "Topic Segmentation on Spoken Documents Using Self-Validated Acoustic Cuts,” Soft Computing, Springer, accepted, June 2014

Xiangzeng Zhou, Lei Xie, Peng Zhang, Yanning Zhang, "An Ensemble of Deep Neural Networks for Object Tracking", ICIP2014, October 27-30, 2014, Paris, France PDF

Chuang Ding, Lei Xie, Pengcheng Zhu, " "Head Motion Synthesis From Speech Using Deep Neural Networks", Multimedia Tools and Applications, Springer, accepted, 2014

Chao Yang, Lei Xie and Xiangzeng Zhou, "Unsupervised Broadcast News Story Segmentation Using Distance Dependent Chinese Restaurant Processes", ICASSP2014, May 4-9, 2014, Florence, Italy PDF

Huaiping Ming, Dongyan Huang, Lei Xie and Haizhou Li, "Learning Optimal Features for Music Transcription", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Chenglin Xu, Lei Xie and Zhonghua Fu, "Sentence Boundary Detection in Chinese Broadcast News using Conditional Random Fields and Prosodic Features", the 2nd IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP2014), July 9-13, 2014, Xi'an, China

Huaiping Ming, Lei Xie and Haizhou Li, "Filter Bank Design for Automatic Music Transcription", the 2013 Young Engineers and Scientists Conference on Multimedia, Communication and Mobile Application Technologies (YES2013), Nov. 8, 2013, Singapore

Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions", ACL2013, 4-9 August, 2013, Sofia, Bulgaria. PDF

Jianwei Niu, Lei Xie, Lei Jia and Na Hu, "Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013. PDF

Haoran Liang, Mingli Song, Lei Xie and Ronghua Liang, "Personalized 3-D Facial Expression Synthesis based on Landmark Constraint", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Ling Tang, Zhong-Hua Fu and Lei Xie, "Numerical Calculation of the Head-Related Transfer Functions with Chinese Dummy Head", APSIPA Annual Summit and Conference (APSIPA ASC 2013), Kaohsiung, Taiwan, Oct. 29 - Nov. 1, 2013.

Lei Xie, Zhigang Deng and Stephen Cox, "Multimodal joint information processing in human machine interaction: recent advances", Multimedia Tools and Applications, Guest Editorial, Springer, November, 2013.

Lei Xie, Naicai Sun and Bo Fan, "A Statistical Parametric Approach to Video-Realistic Text-driven Talking Avatar", Multimedia Tools and Applications, Springer, August 2013.

Peng Yang, Lei Xie, Qiao Luan and Wei Feng, "A Tighter Lower Bound Estimate for Dynamic Time Warping", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiangzeng Zhou, Qiang Huang, Lei Xie and Stephen Cox, "A Two Layered Data Association Approach for Ball Tracking", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Broadcast News Story Segmentation Using Latent Topics on Data Manifold", ICASSP2013, May 26-31, 2013, Vancouver, Canada PDF

Xuecheng Nie, Wei Feng, Liang Wan, Lei Xie, "Measuring Similarity by Contextual Word Connections in Chinese News Story Segmentation", ICASSP2013, May 26-31, 2013, Vancouver, Canada

李冰锋,谢磊,朱鹏程,樊博,语音驱动虚拟说话人的自然头动生成,第12届全国人机语音通讯学术会议,《清华大学学报》,2013年第6期 PDF

杨鹏,谢磊,陈虹洁,基于SDTW和后验特征的中文语音模式发现,第12届全国人机语音通讯学术会议,《清华大学学报》,2013年第6期 PDF

Lei Xie, Lilei Zheng, Zihan Liu and Yanning Zhang, "Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp 264-277, January 2012. PDF Bib

Lei Xie, Yinqing Xu, Lilei Zheng, Qiang Huang and Bingfeng Li, "Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Yali Zhao, Lei Xie and Zhonghua Fu, "A Two Stage Mask Estimation Approach to Robust Speaker Verification", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib Poster

Wei Feng, Xuecheng Nie, Liang Wan, Lei Xie and Jianmin Jiang, "Lexical Story Co-Segmentation of Chinese Broadcast News", Interspeech, Portland, Oregon, USA, September 9-13, 2012. PDF Bib

Lei Xie, Chenglin Xu and Xiaoxuan Wang, "Prosody-based Sentence Boundary Detection in Chinese Broadcast News", The 8th International Symposium on Chinese Spoken Language Processing (ISCSLP2012) , Hong Kong, China, December 5-8, 2012 PDF Bib

Qiang Huang, Stephen Cox, Xiangzeng Zhou and Lei Xie, "Detection of Ball Hits in a Tennis Game Using Audio and Visual Information", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Yang Liang, Mingli Song, Lei Xie, Jiajun Bu and Chun Chen,"Face Sketch-to-Photo Synthesis from Simple Line Drawing", APSIPA Annual Summit and Conference (APSIPA ASC 2012), Hollywood, California, USA, Dec 3-6, 2012

Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Acoustic Texttiling For Story Segmentation Of Spoken Documents", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2012), March 25 - 30, Kyoto, Japan, 2012. PDF Bib Poster

Yali Zhao, Zhong-Hua Fu, Lei Xie, Jian Zhang, Yanning Zhang, "Dual-microphone based binary mask estimation for robust speaker verification", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16 -18, 2012.

Dan Li, Zhong-Hua Fu and Lei Xie, "Comprehensive Comparison of the Least Mean Square Algorithm and the Fast Deconvolution Algorithm for Crosstalk Cancellation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, July 16 -18, 2012.

Lei Xie, Yulian Yang and Zhi-Qiang Liu, "On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News", Information Sciences, 181(13):2873–2891, Elsevier, 2011. PDF Bib

Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation", Interspeech2011, Florence, Italy, August, 2011. (Interspeech Grant) PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng and Haizhou Li, "Broadcast News Story Segmentation Using Conditional Random Fields and Multi-modal Features", IEICE Transactions on Information and Systems, invited paper, to appear, 2012. PDF Bib

Mimi Lu, Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, "Broadcast News Story Segmentation Using Probabilistic Latent Semantic Analysis and Laplacian Eigenmaps", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011. PDF Bib

Xiaoyu Chen, Zhonghua Fu and Lei Xie, "Multiple Sparse Sources Separation Based on Multichannel Frequency Domain Adaptive Filtering", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

Jian Zhang, Zhonghua Fu and Lei Xie, "A Block-Based Blind Source Separation Approach with Equilateral Triangular Microphone Array", APSIPA Annual Summit and Conference (APSIPA ASC 2011), Xi'an, China, 2011

李冰锋,谢磊, 周祥增, 付中华,张艳宁,实时语音驱动的虚拟说话人,第11届全国人机语音通讯学术会议(《清华大学学报》),2011 PDF Bib

张健,付中华,谢磊,赵亚丽,基于目标声源方位已知的双麦克风噪声抑制,第11届全国人机语音通讯学术会议(《清华大学学报》),2011

赵亚丽,付中华,谢磊,张健, 张艳宁,双麦克风语音增强和杂混模型训练相结合的顽健说话人确认,第11届全国人机语音通讯学术会议,2011

Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo,"Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News", ACM/Springer Multimedia Systems Journal, 17(2):101-112 , 2011. PDF Bib

Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei-Jiang, "Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010. PDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation", APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010. PDF Bib

郑李磊,谢磊,芦咪咪,王晓暄,杨玉莲,张艳宁,全自动中文新闻字幕生成系统的设计与实现,《电子学报》, 2011 PDF Bib

Zihan Liu, Lei Xie, Wei Feng, "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010.最佳学生论文FinalistPDF Bib

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, "Phoneme Lattice based TextTiling towards Multilingual Story Segmentation," Interspeech2010, Makuhari, Japan, 26-30 September, 2010. PDF Bib

Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, "Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach," International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Zihan Liu, Lei Xie and Lilei Zheng, "Laplacian Eigenmaps for Automatic News Story Segmentation", International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.

Lei Xie et al., "Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications," Demo for The 7th International Conference on Ubiquitous Intelligence and Computing(UIC), October 26-29, 2010, Xi'an, China

Xiaohai Tian, Zhonghua Fu and Lei Xie, "An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects," The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26 - 29, September 2010, Beijing, China.

Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, " Minimizing the Expected Complete Influence Time of a Social Network," Information Sciences, 180(13): 2514-2527, 2010.

Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, "A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.

Wei Feng, Lei Xie and Zhi-Qiang Liu, "Multicue Graph Mincut for Image Segmentation", Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.

Wei Feng, Lei Xie, Jia Zeng and Zhi-Qiang Liu, "Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models," Journal of Visual languages and Computing , invited paper, 20(3):188-195, 2009.

Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Cascade Markov random fields for stroke extraction of Chinese characters," Information Sciences, 180(2):301-311, 2009.

Lilei Zheng, Lei Xie, Xiaoxuan Wang, Mimi Lu, Yulian Yang and Yanning Zhang, "An Antomatic Caption Generator for Mandarin Broadcast News," 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009 (最佳论文)

Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, "Anchor Labeling System for Broadcast News using Alize toolkit", 5th Joint Conference on Harmonious Human Machine Environment (HHME2009), Xi'an, China, Oct 28-30, 2009

Zhonghua Fu, Jhing-Fa Wang and Lei Xie, "Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication", IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.

Lei Xie, "Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news", ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.

Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models" Pattern Recognition, 41, 2008, pp 3636-3643.

Lei Xie and Guangsen Wang, "A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008. PDF Bib

Yulian Yang and Lei Xie, "Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News", International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008. (Microsoft Student Grant. This paper is also presented in the 2008 Beijing-Hong Kong International Doctoral Forum, Beijing) PDF Bib

Lei Xie and Yulian Yang, "Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News", Pacific-Rim Conference on Multimedia (PCM2008), LNCS 5353, Springer, pp248-258, 2008.

Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), LNCS 4993, Harbin, China, pp345-355, Springer, 2008.

Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510. PDF Bib

Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007, pp2325-2340. PDF Bib

Lei Xie, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159.

Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference /North American chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL), pp193-196, Rochester, NY, USA, April, 2007.

Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC), Huangshan, Anhui, China, 2007.

Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation" Interspeech, Belgium, 2007. PDF Bib

Lei Xie, and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. I, pp593-596, Toulouse, France, 2006.

Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP2006), LNAI 4274, Singapore, pp627-639, Springer, Dec, 2006.

Lei Xie and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", Advances in Machine Learning and Cybernetics, LNAI 3930, Springer, pp99-114, April, 2006.

Lei Xie, and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR), vol. I, pp1128-1131, Hong Kong, 2006.

Lei Xie, and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services" , International Conference on System, Man and Cybernetics (ICSMC) , pp4331-4336, Taipei, Taiwan, 2006.

陕ICP备15008649号