Study of Speaker-Independent Tone Recognition Based on SVM

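English title: Study of speaker-independent tone recognition based on support vector machine
Journal (Chinese name): 计算机工程与应用
Journal (English name): Computer Engineering and Applications
Authors (Chinese): 肖汉光; 蔡从中
Authors (English): XIAO Han-guang (1); CAI Cong-zhong (2). Affiliations: (1) School of Mathematics and Physics, Chongqing Institute of Technology, Chongqing 400054, China; (2) School of Mathematics and Physics, Chongqing University, Chongqing 400044, China
Keywords (Chinese): 声调识别 (tone recognition); 特征提取 (feature extraction); Mel频率倒谱系数(MFCC) (Mel-frequency cepstral coefficients); 支持向量机 (support vector machine)
Keywords (English): tone recognition; feature extraction; Mel-frequency cepstral coefficients (MFCCs); support vector machine (SVM)
Publication date: 2009-03-21
Institutions: School of Mathematics and Physics, Chongqing Institute of Technology; School of Mathematics and Physics, Chongqing University
Year: 2009
Issue: 09
Publisher: Computer Engineering and Applications (计算机工程与应用)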

Abstract

A speaker-independent speech database of the four tones of Mandarin Chinese (Putonghua) is established. Mel-frequency cepstral coefficients (MFCCs) are extracted from the speech data as tone feature parameters. Four recognition models for the four tones are trained with a support vector machine (SVM) and evaluated on separate test data. The experimental results show that combining MFCCs with SVM achieves an average recognition rate of 97.6%.
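
The MFCC feature-extraction step named in the abstract can be illustrated with a minimal, standard-library-only sketch for a single speech frame. Everything specific here is an assumption, since the abstract gives no parameters: the mel formula (the common 2595·log10(1 + f/700) variant), the Hamming window, the 20 triangular filters, the 12 cepstral coefficients, and the function names `hz_to_mel`, `mel_to_hz`, `power_spectrum`, `mel_filterbank`, and `mfcc_frame` are illustrative choices, not the authors' implementation.

```python
import cmath
import math


def hz_to_mel(f_hz):
    """Common mel-scale formula (assumed; the abstract does not specify one)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Hamming-window a frame and return |DFT|^2 for bins 0..N/2 (naive O(N^2) DFT)."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(windowed))
        spec.append(abs(s) ** 2)
    return spec


def mel_filterbank(n_filters, n_bins, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    nyquist = sample_rate / 2.0
    top = hz_to_mel(nyquist)
    edges_hz = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    # Convert edge frequencies to (fractional) DFT bin positions.
    edges = [f / nyquist * (n_bins - 1) for f in edges_hz]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        row = []
        for b in range(n_bins):
            if lo < b <= mid:
                row.append((b - lo) / (mid - lo))   # rising slope
            elif mid < b < hi:
                row.append((hi - b) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def mfcc_frame(frame, sample_rate, n_filters=20, n_ceps=12):
    """Return n_ceps cepstral coefficients (c1..c_n_ceps) for one frame."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(spec), sample_rate)
    # Log filterbank energies; the small floor avoids log(0).
    log_e = [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10)
             for row in bank]
    # DCT-II of the log energies; coefficient 0 is skipped in this sketch.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_e))
            for c in range(1, n_ceps + 1)]
```

For example, `mfcc_frame(frame, 8000)` on a 256-sample frame yields a 12-element feature vector. In a pipeline like the one the abstract describes, such per-frame vectors (or statistics of them over a syllable) would then be passed to an SVM classifier, for instance scikit-learn's `sklearn.svm.SVC`; that choice of library is likewise an assumption.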
