Publications

2011

  • Lei Xie, Yulian Yang, Zhi-Qiang Liu and Wei Feng, On the Effectiveness of Subwords for Lexical Cohesion Based Story Segmentation of Chinese Broadcast News, Information Sciences, accepted, in press, 2011.
  • Lei Xie, Zhong-hua Fu, Wei Feng and Yong Luo, Pitch-Density-based Features and an SVM Binary Tree Approach for Multi-Class Audio Classification in Broadcast News, ACM/Springer Multimedia Systems Journal, 17(2):101-112, 2011.
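  • Lilei Zheng, Lei Xie, Mimi Lu, Xiaoxuan Wang, Yulian Yang, Yanning Zhang, Design and Implementation of a Fully Automatic Chinese News Caption Generation System, Acta Electronica Sinica, Vol. 39, No. 3A, March 2011. (in Chinese)
  • Hongchen Zhou, Dongmei Jiang, Hichem Sahli, Werner Verhelst, Harmonic-Based Music Fingerprint Extraction and Music Retrieval, Computer Engineering and Applications, accepted. (in Chinese)
  • Peng Wu, Dongmei Jiang, Fengna Wang, Hichem Sahli, Werner Verhelst, An Audio-Visual Fusion Speech Recognition Model Based on Articulatory Features, Computer Engineering, accepted. (in Chinese)
  • Lanlan Lv, Dongmei Jiang, Fengna Wang, Hichem Sahli, Werner Verhelst, Audio-Visual Emotion Recognition Based on a Triple-Stream DBN Model, Computer Engineering, accepted. (in Chinese)
  • Xuan Kang, Zhonghua Fu, Dongmei Jiang, Xiaohai Tian, Lei Xie, A 3D Virtual Sound Synthesis System Based on the TMS320C6414, Computer Engineering & Science, accepted. (in Chinese)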

2010

  • Yaodong Ni, Lei Xie, and Zhi-Qiang Liu, Minimizing the Expected Complete Influence Time of a Social Network, Information Sciences, 180(13): 2514-2527, 2010.
  • Xie Lei, Fu Zhong-hua, Feng Wei, Luo Yong, Pitch-density-based features and an SVM binary tree approach for multi-class audio classification in broadcast news, [J] Multimedia Systems, 2010. DOI 10.1007/s00530-010-0205-x.
  • Wei Feng, Lei Xie and Zhi-Qiang Liu, Multicue Graph Mincut for Image Segmentation, Ninth Asian Conference on Computer Vision (ACCV2009), LNCS 5995, pp. 707-717, Springer, 2010.
  • Zihan Liu, Lei Xie, Wei Feng, Maximum Lexical Cohesion for Fine-Grained News Story Segmentation, Interspeech, Makuhari, Japan, 26-30 September, 2010.
  • Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, Phoneme Lattice based TextTiling towards Multilingual Story Segmentation, Interspeech, Makuhari, Japan, 26-30 September, 2010.
  • Fu Zhong-Hua, Wang Jhing-Fa, Speech presence probability estimation based on integrated time-frequency minimum tracking for speech enhancement in adverse environments, International Conference on Acoustics, Speech and Signal Processing (ICASSP2010), pp. 4258-4261, 2010.
  • Dongmei Jiang, Ilse Ravyse, Peizhen Liu, Hichem Sahli, Werner Verhelst. Realistic Mouth Animation Based on an Articulatory DBN Model with Constrained Asynchrony, Proc. 35th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2010), March 14-19, Texas, USA, pp.2478-2481, 2010.
  • Mimi Lu, Lei Xie, Zhonghua Fu, Dongmei Jiang and Yan-ning Zhang, Multi-Modal Feature Integration for Story Boundary Detection in Broadcast News, International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010.
  • Zhong-hua Fu, Lei Xie and Dong-mei Jiang, Dual-microphone Noise Reduction Based on Semi-Blind DUET, International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Tainan, Taiwan, 2010.
  • Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation, APSIPA Annual Summit and Conference (APSIPA ASC 2010), Biopolis, Singapore, December 14-17, 2010.
  • Zihan Liu, Lei Xie and Lilei Zheng, Laplacian Eigenmaps for Automatic News Story Segmentation, International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.
  • Lei Xie, Yulian Yang, Zhi-Qiang Liu, Wei Feng and Zihan Liu, Integrating Acoustic and Lexical Features In Topic Segmentation of Chinese Broadcast News Using Maximum Entropy Approach, International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 23-25 November 2010.
  • Lei Xie et al., Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications, The 7th International Conference on Ubiquitous Intelligence and Computing (UIC), October 26-29, 2010, Xi'an, China.
  • Xiaohai Tian, Zhonghua Fu and Lei Xie, An Experimental Comparison on KEMAR and BHead210 Dummy Heads for HRTF-based Virtual Auditory on Chinese Subjects, The Third IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN2010), 26-29 September 2010, Beijing, China.
  • Dongmei Jiang, Peng Wu, Fengna Wang, Hichem Sahli, Audio Visual Speech Recognition Based on Multi-Stream DBN Models with Articulatory Features, 10th Int. Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 190-193, 2010.
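  • Peizhen Liu, Dongmei Jiang, Ilse Ravyse, Hichem Sahli, Speech-Driven Mouth Animation Synthesis Based on an Articulatory-Feature DBN Model, Science Technology and Engineering, 10(14), pp. 3335-3339, 2010. (in Chinese)
  • Danqi Chen, Dongmei Jiang, Ilse Ravyse, Hichem Sahli, Audio-Visual Fusion Emotion Recognition Based on Dynamic Bayesian Networks, Computer Simulation, accepted. (in Chinese)
  • Qing Li, Dongmei Jiang, Ping Fan, Ilse Ravyse, Hichem Sahli, Video Emotion Analysis and Recognition Based on Manifold Features, Computer Engineering & Science, 32(12), pp. 39-41, 2010. (in Chinese)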

2009

  • Wei Feng, Lei Xie and Zhi-Qiang Liu, Audio-Visual Human Recognition Using Semi-Supervised Spectral Learning and Hidden Markov Models, Journal of Visual Languages and Computing, invited paper, 20(3):188-195, 2009.
  • Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, Cascade Markov random fields for stroke extraction of Chinese characters, Information Sciences, 80(2):301-311, 2009.
  • Dongmei Jiang, Ilse Ravyse, Hichem Sahli, Werner Verhelst, Speech Driven Realistic Mouth Animation Based on Multi-modal Unit Selection, Journal on Multimodal User Interfaces, Vol.2, No.3, pp.157-169, 2009.
  • Jin Zhang, Lei Xie, Wei Feng and Yanning Zhang, A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News, Asia Information Retrieval Symposium (AIRS2009), LNCS 5839, Springer, pp136-148, 2009.
  • Zhonghua Fu, Jhing-Fa Wang and Lei Xie, Noise Robust Features for Speech/Music Discrimination in Real-time Telecommunication, IEEE International Conference on Multimedia and Expo (ICME 2009), pp 574-577, New York, USA.
  • Po-Yi Shih, Jhing-Fa Wang, Yuan-Ning Lin, Zhong-Hua Fu, Multi-Speaker Adaptation for Robust Speech Recognition under Ubiquitous Environment, O-COCOSDA 2009, pp.126-131, Aug. 2009.
  • Fu Zhong-Hua, Wang Jhing-Fa, Low-delay noise estimation based on spectrum ripples and minimum statistics in adverse environments, International Conf. Digital Signal Processing 2009, pp. 1-6, July 2009.
  • Fu Zhong-Hua, Wang Jhing-Fa, Xie Lei, Noise robust features for speech/music discrimination in real-time telecommunication, IEEE International Conf. Multimedia & Expo (ICME) 2009, pp. 574 - 577, June 28 2009-July 3 2009.
  • Dongmei Jiang, Peizhen Liu, Ilse Ravyse, Hichem Sahli, Werner Verhelst, Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony, Int. Conf. on Image and Graphics, Xi’an, China, Sept.19-22, pp.658-662, 2009.
  • Danqi Chen, Dongmei Jiang, Ilse Ravyse, Hichem Sahli, Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony, Int. Conf. on Image and Graphics, Xi’an, China, Sept.19-22, pp.912-916, 2009.
  • Ping Fan, Dongmei Jiang, Ilse Ravyse, Fengna Wang, Hichem Sahli, Manifold Analysis for Subject Independent Dynamic Emotion Recognition in Video Sequences, Int. Conf. on Image and Graphics, Xi’an, China, Sept.19-22, pp.896-901, 2009.
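  • Yue Wang, Lei Xie, Yulian Yang, A Real-Time Music Beat Tracking Algorithm Based on Adaptive Whitening, Application Research of Computers, 26(5):1676-1678, 2009. (in Chinese)
  • Yulian Yang, Lei Xie, Automatic Story Segmentation of Chinese Broadcast News Based on Subword Chains, Application Research of Computers, 26(2):582-586, 2009. (in Chinese)
  • Peiyan Song, Dongmei Jiang, Fengna Wang, An Audio/Video Dual-Stream Speech Recognition Model Based on Articulatory Features, Application Research of Computers, 26(7), pp. 2481-2483, 2009. (in Chinese)
  • Fengna Wang, Dongmei Jiang, Peiyan Song, A Dynamic Bayesian Network Speech Recognition Model Incorporating Articulatory Features, Computer Engineering and Applications, 45(8), pp. 178-181, 2009. (in Chinese)
  • Yonghui Meng, Dongmei Jiang, Zhonghua Fu, Lei Xie, A Novel Speech/Music Segmentation and Classification Method, Computer Engineering & Science, 31(4), pp. 106-109, 2009. (in Chinese)
  • Jie Bai, Dongmei Jiang, Application of the Normalized Amplitude Quotient in Speech Emotion Recognition, Computer Simulation, 26(2), pp. 183-186, 2009. (in Chinese)
  • Cuihong Ren, Dongmei Jiang, Zhonghua Fu, Research on Speech Enhancement Based on the α-Order GMMSE Algorithm, Microelectronics & Computer, 26(3), pp. 76-80, 2009. (in Chinese)
  • Lilei Zheng, Lei Xie, Mimi Lu, Yulian Yang, Yanning Zhang, Design and Implementation of an Automatic News Caption Generation System, The 5th Joint Conference on Harmonious Human Machine Environment, Xi'an, China, 2009. (in Chinese)
  • Mimi Lu, Lei Xie, Lilei Zheng, Yulian Yang, Yanning Zhang, An Automatic Announcer Labeling System for Broadcast Audio Based on the Alize Toolkit, The 5th Joint Conference on Harmonious Human Machine Environment, Xi'an, China, 2009. (in Chinese)
  • Yongtao Xing, Zhonghua Fu, Yanning Zhang, Research and Implementation of a Two-Dimensional Wiener Filtering Speech Enhancement Method, Computer Engineering and Applications, vol. 45(19), pp. 137-141, 2009. (in Chinese)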

2008

  • Lei Xie, Discovering Salient Prosodic Cues and their Interactions for Automatic Story Segmentation in Mandarin Broadcast News, ACM/Springer Multimedia Systems Journal, 14(4):237-253, 2008.
  • Jia Zeng, Lei Xie and Zhi-Qiang Liu, Type-2 Fuzzy Gaussian Mixture Models, Pattern Recognition, 41(12):3636-3643, 2008.
  • Lei Xie and Xi Tan, A Heuristic Approach to Caption Enhancement for Effective Video OCR, in Book Chapter, Advanced Intelligent Computing Theories and Applications, LNCS 5226, Springer, pp347-355, 2008.
  • Lei Xie and Yulian Yang, Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News, Pacific-Rim Conference on Multimedia (PCM), LNCS 5353, Springer, pp248-258, 2008.
  • Lei Xie, Jia Zeng and Wei Feng, Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News, in Book Chapter, Information Retrieval Technology, LNCS 4993, Springer, pp345-355, 2008.
  • Lei Xie and Guangsen Wang, A Two-stage Multi-feature integration approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting, International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 350-353, Yunnan, China, 2008.
  • Yulian Yang and Lei Xie, Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News, International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 358-361, Yunnan, China, 2008.
  • Lei Xie, Jia Zeng and Wei Feng, "Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News", Asia Information Retrieval Symposium (AIRS2008), Harbin, China, 2008.
  • Dongmei Jiang, Ilse Ravyse, Hichem Sahli, Yanning Zhang, Accurate Visual Speech Synthesis Based On Diviseme Unit Selection And Concatenation, Proc. the IEEE 10th Workshop on Multimedia Signal Processing (MMSP2008), Oct 8-10, 2008, Cairns, Queensland, Australia, pp. 906-909.
  • Ravyse I., Fan P., Sahli H., Jiang D., and Verhelst W., SPREEKPAD: Audiovisual technology for pronunciation training, 11th International Conference on Interactive Computer aided Learning, Villach, Austria, issue contribution 208, pp.1 - 8, 2008.
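  • Fu Zhong-Hua, Wang Jhing-Fa, Robust features for effective speech and music discrimination, ROCLing2008, pp. 209-215, Taipei, Sep. 2008.
  • Jie Bai, Dongmei Jiang, Research on Speech Emotion Recognition Based on NAQ, Application Research of Computers, 25(11), pp. 3243-3246, 2008. (in Chinese)
  • Yongchao Yang, Zhonghua Fu, Dongmei Jiang, Design and Implementation of DSP-Based Real-Time Speech Detection, Journal of Computer Applications, vol. 28(2), pp. 491-494, 2008. (in Chinese)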

2007

  • Dongmei Jiang, Guoyun Lv, Ilse Ravyse, Xiaoyue Jiang, Yanning Zhang, Hichem Sahli and Rongchun Zhao. "Audio Visual Speech Recognition and Segmentation Based on DBN Models". Chapter in Robust Speech Recognition and Understanding, pp: 139 - 156, eds: Michael Grimm, Kristian Kroschel, published by: I-Tech Education and Publishing, ISBN-ISSN: 978-3-902613-08-0, 2007.
  • Lei Xie and Zhi-Qiang Liu, "Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling", IEEE Transactions on Multimedia, 9(3), 2007, pp500-510.
  • Lei Xie and Zhi-Qiang Liu, "A Coupled HMM Approach for Video-Realistic Speech Animation", Pattern Recognition, 40(10), 2007.
  • Lei Xie and Hongwu Yang, "Dynamic Bayesian Network Inversion for Robust Speech Recognition", IEICE Transactions on Information and Systems, 2007, Vol. E90-D, No. 7, pp 156-159,2007.
  • Fu Zhonghua, Zhang Yanning, "Robust Bootstrapping Algorithm of Speaker Models for On-Line Unsupervised Speaker Indexing", Journal of Software, Vol:18(3), 2007.3.(in Chinese)
  • Fu Zhonghua, Zhang Yanning, "Design and Implementation of Real Time Pitch Scaling", Computer Engineering and Applications, Vol: 43(8), 2007.3.(in Chinese)
  • Guoyun Lv, Rongchun Zhao, Dongmei Jiang, Xiaoyue Jiang, Yunshu Hou, Hichem Sahli. "Lip Reading and Viseme Segmentation Based on BTSM and DBN Models". In Computer Engineering and Application, 43(14), pp. 21-24, 2007.(in Chinese)
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao, Xiaoyue Jiang, Yunshu Hou, Ali Sun, Hichem Sahli, Werner Verhelst. "Audio Visual Continuous Speech Recognition and Phone Segmentation Based on Dynamic Bayesian Networks". Accepted by Computer Application, 2007.(in Chinese)
  • Guoyun Lv, Dongmei Jiang. "DBN Models for Audio Visual Speech Recognition and Phone Segmentation". Technical Report. Department ETRO, Vrije Universiteit Brussels, 2007.
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao, Xiaoyue Jiang, Yunshu Hou, Ali Sun, Hichem Sahli, Werner Verhelst. "Audio Visual Continuous Speech Recognition and Phone Segmentation Based on Dynamic Bayesian Networks". Computer Application, 27 (07), pp.1670~1673, 2007(in Chinese).
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao, Hichem Sahli, Werner Verhelst. "Large Vocabulary Continuous Audio Visual Speech Recognition and Phone Segmentation Based on Dynamic Bayesian Networks". Accepted by Journal of Northwestern Polytechnical University, 2007. (in Chinese).
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao, Ilse Ravyse, Yunshu Hou, Hichem Sahli, Werner Verhelst. “Large Vocabulary Audio Visual Continuous Speech Recognition Based on Multi-stream Multi-state Dynamic Bayesian Networks”. Accepted by Journal of Electronics & Information Technology, 2007 (in Chinese).
  • Guoyun Lv, Dongmei Jiang, Werner Verhelst, Rongchun Zhao and Hichem Sahli. “Multi-stream Asynchronous DBN Models for Audio-visual Speech Recognition and Phone Segmentation”, IEEE transactions on audio, speech and language processing, submitted, 2007.
  • Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Type-2 Fuzzy Gaussian Mixture Models", Pattern Recognition, 2007, submitted.
  • Jia Zeng, Wei Feng, Lei Xie and Zhi-Qiang Liu, "Markov Random Fields for Stroke Extraction of Chinese Characters", Pattern Recognition, 2007, submitted.
  • Lei Xie, Chuan Liu and Helen Meng, "Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News", Human Language Technology Conference /North American chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL2007), pp193-196, Rochester, NY, USA, April, 2007.
  • Fu Zhonghua, "Robust Bootstrapping of Speaker Models for Unsupervised Speaker Indexing", MCAM 2007: 122-129.
  • Chuan Liu, Lei Xie, Helen Meng, "Classification of Music and Speech in Mandarin News Broadcasts", 9th National Conference on Man-Machine Speech Communication (NCMMSC2007), Huangshan, Anhui, China, 2007.
  • Shing-kai Chan, Lei Xie and Helen Mei-ling Meng, "Modeling the Statistical Behavior of Lexical Chains to Capture Word Cohesiveness for Automatic Story Segmentation", Interspeech2007, Belgium, 2007.
  • Jia Zeng, Lei Xie and Zhi-Qiang Liu, "Gaussian Mixture Models with Uncertain Parameters", accepted by International Conference on Machine Learning and Cybernetics (ICMLC07), pp2761-2766, Hong Kong, 2007.
  • Guoyun Lv, Dongmei Jiang, Hichem Sahli, Rongchun Zhao, Werner Verhelst. "A Novel DBN Model For Large Vocabulary Continuous Speech Recognition and Phone Segmentation", Proceedings Of The 2007 International Conference On Artificial Intelligence and Pattern Recognition, pp: 397 - 402, eds: Dimitrios A. Karras, Chunping Li, Zoran Majkic and S.R.M. Prasanna, published by: ISRST U.S, ISBN-ISSN: 978-0-9727412-3-1, 2007.
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao, Xiaoyue Jiang, H. Sahli. “Multi-Stream Asynchrony Dynamic Bayesian Network Model for Audio-Visual Continuous Speech Recognition”. 14th International Conference on systems, Signals and Image Processing (IWSSIP 2007) and 6th Eurasip conference Focused on Speech and Image Processing, Multimedia Communications and Services (ECSIPMCS 2007), June 27-30, 2007, vol. 1, pp.437-440, Maribor, Slovenia.
  • Guoyun Lv, Dongmei Jiang, RongChun Zhao, Yunshu Hou. “Multi-stream Asynchrony Modeling for Audio-Visual Speech Recognition”. IEEE International Symposium on Multimedia 2007 (ISM2007), Dec. 10-12, 2007, Taiwan, China.
  • Guoyun Lv, Dongmei Jiang, Rongchun Zhao. “Single Stream DBN Model Based on Triphone for Large Vocabulary Continuous Speech Recognition”. IEEE International Symposium on Multimedia 2007 (ISM2007) & Workshop on Multimedia Audio and Speech Processing: advancing the state-of-the-art, Dec. 10-12, 2007, Taiwan, China.
  • Lei Xie, and Zhi-Qiang Liu, "An Articulatory Approach to Video-Realistic Mouth Animation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2006), vol. I, pp593-596, Toulouse, France, 2006. 
  • Lei Xie, and Zhi-Qiang Liu, "Speech Animation Using Coupled Hidden Markov Models", International Conference on Pattern Recognition (ICPR2006), vol. I, pp1128-1131, Hong Kong, 2006. 
  • Lei Xie, and Zhi-Qiang Liu, "Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services", International Conference on System, Man and Cybernetics (ICSMC2006), pp4331-4336, Taipei, Taiwan, 2006.
  • Lei Xie, and Zhi-Qiang Liu, "A Comparative Study of Audio Features for Audio-to-Video Conversion in MPEG-4 Compliant Facial Animation", International Symposium on Media Computing (ISMC2006), ICMLC2006, pp4359-4364, Dalian, China, 2006.
  • Ilse Ravyse, Dongmei Jiang, Xiaoyue Jiang, Guoyun Lv, Yunshu Hou, Hichem Sahli, Werner Verhelst, Rongchun Zhao. "DBN Based Models for Audio-Visual Speech Analysis and Recognition". Proc. Int. Conf. PCM2006, LNCS, Advances in Multimedia Information Processing, pp. 19-30, 2006.
  • Guoyun Lv, Dongmei Jiang, Pengjuan Guo, Ali Sun, Rongchun Zhao, H. Sahli, W. Verhelst, "Single Stream DBN model for Continuous Speech Recognition and Phone Segmentation", 2006 International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2006), Hangzhou, China, vol.1:277-280, 2006.
  • Yi Wang, Lei Xie, Zhi-Qiang Liu and Li-Zhu Zhou, "Supervised Learning of Motion Style for Real-time Synthesis of 3D Character Animations", International Conference on System, Man and Cybernetics (ICSMC2006), pp4321-4325, Taipei, Taiwan, 2006.
  • Yi Wang, Lei Xie, Zhi-Qiang Liu and Li-Zhu Zhou, "The SOMN-HMM Model and Its Application to Automatic Synthesis of 3D Character Animations", International Conference on System, Man and Cybernetics (ICSMC2006), pp4948-4952, Taipei, Taiwan, 2006.
  • Yi Wang, Li-Zhu Zhou, Jiang-hua Feng, Lei Xie and Chun Yuan, "2D/3D Web Visualization on Mobile Devices", 7th International Conference on Web Information Systems Engineering (WISE'06), LNCS 4255, pp536-547, Wuhan, China, 2006.
  • Ali Sun, Dongmei Jiang, Guoyun Lv, Hichem Sahli, Werner Verhelst. "Research on DBN Based Continuous Speech Recognition and Segmentation". In Application Research of Computers. 24(10), pp.104-106, 2007.(in Chinese)
  • Pengjuan Guo, Dongmei Jiang, Hichem Sahli, Werner Verhelst. "Speech Emotion Recognition Based on Pitch Features". In Application Research of Computers. 24(10), pp.101-103, 2007.(in Chinese)

2006

  • Lei Xie, and Zhi-Qiang Liu, "Multi-Stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition", in Book Chapter, Advances in Machine Learning and Cybernetics, Springer, pp99~114, April, 2006.
  • Lei Xie, Helen Meng and Zhi-Qiang Liu, "A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion", in Book Chapter, Advances in Chinese Spoken Language Processing , pp627-639, Dec, 2006.
  • Hou Yunshu, Fu Zhonghua, et al, "Face Feature Points Extraction Based on Refined ASM". Application Research of Computers, 2006.11, Vol:23(11), pp:255-257.(in Chinese)

2005

  • Lei Xie, Rong-Chun Zhao and Zhi-Qiang Liu, "Adaptive Stream Reliability Modelling based on Local Measures for Audio Visual Speech Recognition", The 4th IEEE International Conference on Machine Learning and Cybernetics (ICMLC2005), 2005, pp4852-4857.
  • Chen Hao, Fu ZhongHua, Zhao RongChun. "Text-independent speaker identification method based on common Gaussian bases". International Symposium on Computer Science & Technology, NingBo, 2005.10.
  • Xie Lei, Fu Zhonghua, Jiang Dongmei, Zhao Rongchun, Werner Verhelst, Hichem Sahli, Jan Cornelis. "A Robust Dynamic Mouth Feature Based on Visemic LDA for Audio Visual Speech Recognition", Journal of Electronics & Information Technology, 27(1): 64~68, 2005.(in Chinese)
  • Chen Hao, Fu Zhonghua, et al, "A Score Compensation Approach to Speaker Verification in Different Speech Coding Channels", Journal of Northwestern Polytechnical University, 2005, Vol:23(4).(in Chinese)
  • Chen Hao, Fu Zhonghua, et al, "Feature extraction in speaker recognition applications based on G.729 codec parameters", Science Journal of Northwest University, Vol.35(3), pp.266-269, 2005.(in Chinese)

2004

  • Lei Xie, Xiu-li Cai and Rong-Chun Zhao, "A robust hierarchical lip tracking approach for lipreading and audio visual speech recognition". The 3rd IEEE International Conference on Machine Learning and Cybernetics (ICMLC2004), vol.6, pp3620~3624. Shanghai, China, August 26~29,2004. 
  • Lei Xie, Xiu-li Cai and Rong-Chun Zhao, "Lip Temporal Pattern Analysis for Automatic Visual Speech Recognition", the 7th International Conference on Signal Processing (ICSP2004), vol. 1, pp703~706. Beijing, China, August 31~Sept. 4, 2004.
  • Lei Xie, Xiu-li Cai and Rong-Chun Zhao, "Dynamic Visual Features Based on Discriminative Speech Class Projection for Visual Speech Recognition". 2004 International Symposium on Intelligent Multimedia, Video & Speech Processing (ISIMP 2004), Hong Kong, 2004, pp687-690.
  • Fu Zhonghua, Xie Lei, Zhao Rong-Chun, "Channel robust speaker verification via extended feature mapping", the 7th International Conference on Signal Processing (ICSP2004), vol. 3, pp2417~2420, Beijing, China, August 31~Sept. 4, 2004.
  • Fu Zhonghua, Xie Lei, Zhao Rong-Chun, "RC-MES: A novel speaker modelling technique based on regression class for speaker identification", 2004 International Symposium on Intelligent Multimedia, Video & Speech Processing (ISIMP 2004), Hong Kong, 2004, pp214-217.
  • Fu Zhonghua, Zhao Rongchun, "Speaker modeling technique based on regression class for speaker identification with sparse training", Sinobiometrics 2004, LNCS 3338, pp. 610-616, 2004.
  • D. Jiang, X. Lei, I. Ravyse, Z. Rongchun, H. Sahli, J. Cornelis. "The Viseme Based Continuous Speech Recognition System for a Talking Head", Journal of Electronics & Information Technology, 26(3), pp.375-381, 2004.(in Chinese)
  • Xie Lei, Feng Wei, Zhao Rong Chun. "A Lip Contour Extraction Method Based on Multiple Active Shape Model (MASM) for Audio Visual Speech Recognition", Journal of Northwestern Polytechnical University, 22(5): 674-678,2004.(in Chinese)
  • Xie Lei, Jiang Dongmei, Ilse Ravyse, Zhao Rongchun, Hichem Sahli, Werner Verhelst, Jan Cornelis. "Experimental Research on Audio Visual Fusion and on Model Asynchrony for Raising Speech Recognition Rate", Journal of Northwestern Polytechnical University, 22(2): 171~174, 2004.(in Chinese)
  • Xie Lei, Zhao Rongchun, Jiang Dongmei, Ilse Ravyse, Hichem Sahli, Werner Verhelst, Jan Cornelis, Ignace Lemahieu. "A Viseme Based Speech Recognition System For Talking Head Animation", Computer Applications and Software, 22(5): 22~24, 2004.(in Chinese)

2003

  • Lei Xie, Rong-Chun Zhao, Dong-mei Jiang, Ilse Ravyse, Hichem Sahli, Werner Verhelst and Jan Cornelis, "Triseme Decision Trees in the Continuous Speech Recognition System for Talking Head Animation". The 2nd International Conference on Active Media Technology (ICAMT'03), pp 389-395, Chongqing, China, May, 2003.
  • Lei Xie, Dong-mei Jiang, Ilse Ravyse, Rong-Chun Zhao, Hichem Sahli, Werner Verhelst and Jan Cornelis, "Context dependent viseme models for voice driven animation". The 4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications, (EC-VIP-MC 2003), pp 649-654, Zagreb, Croatia, July 2-4, 2003.
  • Lei Xie, Dong-mei Jiang, Ilse Ravyse, Rong-Chun Zhao, Hichem Sahli, Werner Verhelst and Jan Cornelis, "Visualize Speech: A continuous Speech Recognition System for Facial Animation Using Acoustic Visemes". IEEE International Conference on Neural Networks and Signal Processing (ICNNSP 2003), pp872~875, Nanjing, China, Dec 14-17, 2003.
  • Lei Xie, Ilse Ravyse, Dong-Mei Jiang et al., "A Multi-Stream Bimodal Continuous Speech Recognition System Using Datasieve Based Features". The 2nd IEEE International Conference on Machine Learning and Cybernetics (ICMLC2003), pp2287~2290. Xi'an, China, Nov. 02-05, 2003.
  • Fu Zhonghua, Zhao RongChun,"An overview of modeling technology of speaker recognition", ICNNSP, 2003, Vol:1, pp: 887-891.
  • Xie Lei, Ilse Ravyse, Jiang Dongmei, Zhao Rongchun, Hichem Sahli, Werner Verhelst, Jan Cornelis, Ignace Lemahieu. "An Eigen Mouth Based Audio Visual Continuous Speech Recognition System in Noisy Environments", Computer Engineering and Applications, 39(16): 3~5, 2003.(in Chinese)
  • Xie Lei, Ilse Ravyse, Jiang Dongmei, Zhao Rongchun, Werner Verhelst, Hichem Sahli, Jan Cornelis. "A Datasieve-based Audiovisual Continuous Speech Recognition System", Journal of Computer Applications, 23(7): 1-3, 2003.(in Chinese)
  • Fu Zhonghua, Wang Dawei, Zhao Rongchun, Xie Lei,"A Scheme of Low-frequency Sampled Audio Compression Based on Mp3 Frame", Computer Engineering, Vol:19, 2003.(in Chinese)
  • Fu Zhonghua, Zhao Rongchun, Jiang Dongmei, "A More Accurate Pitch Detection Algorithm", Journal of Northwestern Polytechnical University, Vol:5, 2003.(in Chinese)

1999-2002

  • Dong-Mei Jiang, Lei Xie, Rong-Chun Zhao, Hichem Sahli and Werner Verhelst, "Acoustic Viseme Modelling for Speech Driven Animation: A Case Study", Proc. 1st IEEE Benelux Workshop on Model based Processing and Coding of Audio (MPCA-2002), Leuven, Belgium, Nov., 2002.
  • Jiang Dongmei, Xie Lei, Ilse Ravyse, Zhao Rongchun, Hichem Sahli, Jan Cornelis. "Triseme Decision Trees in the Continuous Speech Recognition System for a Talking Head", Proc. 1st IEEE International Conference on Machine Learning & Cybernetics, pp.2097-2100, 2002.
  • Jiang Dongmei, Xie Lei, Zhao Rongchun, Werner Verhelst, Ilse Ravyse, Hichem Sahli. "Acoustic Viseme Modelling for Speech Driven Animation: A Case Study", Proc. 1st IEEE Benelux Workshop on Model Based Processing and Coding of Audio (MPCA-2002), pp.1-4, 2002.
  • D. Jiang, X. Lei, I. Ravyse, W. Verhelst, Z. Rongchun, H. Sahli, I. Lemahieu. "Viseme Based Continuous Speech Recognition System for a Talking Head", 24th Annual meeting of the German Association for Pattern Recognition, DAGM 2002, 16-18/9/2002, Zurich, Switzerland, Sep. 2002.
  • Jiang Dongmei, Zhao Rongchun, "Speaker Normalization Based on the Generalized Time-Frequency Representation and Mellin Transform", Proc. 5th International Conference on Signal Processing, pp. 782-785, 2000.
  • Fu Zhonghua, Zhao Rongchun, "Implementing dynamic time-warping in small RAM", Journal of Northwestern Polytechnical University, Vol: 20(4), 2002.(in Chinese)
  • Dongmei Jiang, Rongchun Zhao. "Generalized Time-Frequency Distribution With Cone-Shaped Kernel And Its Describing Voiced Speech", in Computer Applications and Software. 19(1), pp.7-10, 2002. (in Chinese)
  • Dongmei Jiang, Rongchun Zhao. "A Novel Speaker Normalization Method Based on Formant Recovery and Mellin Transform", Journal of Data Acquisition & Processing, 16(1), pp.58-62, 2001. (in Chinese)
  • Dongmei Jiang, Guokang Fu, Rongchun Zhao. "An Improved Viterbi Algorithm and Speech Recognition with State Duration Considered", Journal of Northwestern Polytechnical University, 18(4), pp.595-599, 2000. (in Chinese)
  • Guokang Fu, Rongchun Zhao, Zhiqiang Liu. "Study on Markov Random Field in Speech Recognition". Journal of Data Acquisition & Processing, 14(4), pp.433-437. 1999. (in Chinese)
  • Guokang Fu, Rongchun Zhao. "On Improving the Clustering Ability of Fuzzy Self Organizing Neural Network in Speech Recognition". Journal of Northwestern Polytechnical University, 17(4), pp.599-602, 1999. (in Chinese)

Before 1999

  • Available upon request.

    ©2011 ASLP@NPU Chang'an Campus, Northwestern Polytechnical University, Xi'an, 710129, China Tel: +8629-88431532 Email: webmaster@nwpu-aslp.org
