欢迎来到天天文库
浏览记录
ID:31853647
大小:1.88 MB
页数:60页
时间:2019-01-21
《基于语音反演机器学习方法的声道模型分析》由会员上传分享,免费在线阅读,更多相关内容在应用文档-天天文库。
1、AbstractTheperformanceoftheautomaticspeechrecognition(ASR)systemsisaffectedbecauseofcoarticulation.Existingrelatedstudieshaveclaimedthatarticulatoryinformationcanbeusedtoimprovetheperformanceofautomaticspeechrecognitionsystems.However,sucharticulatoryinformationisnotsoeasytobeobt
2、ainedintypicalspeaker-listenersituations.Thatiswhytheacoustic-to-articulatoryspeechinversionisproposed.Acoustic-to-articulatoryspeechinversion(speechinversion)isamethodofestimatingarticulatorytrajectoriesorvocaltractconfigurationsfromthespeechsignal.Ifarticulatoryinformationcanbe
3、estimatedaccurately,itwillbeusefulforspeechsynthesis,languageacquisition,speechvisualizationandsoon.Firstly,tractvariables(insteadoftraditionalpellettrajectories)areusedasarticulatoryinformationtomodelspeechdynamicsandtheestimationperformanceandnon-uniquenessoftractvariablesandpe
4、llettrajectoriesarecomparedinthispaper.Thespeechsignalsareparameterizedasmel-frequencycepstralcoefficients(MFCC),perceptuallinearpredictioncepstralcoefficients(PLPCC)andlinearpredictioncepstralcoefficients(LPCC),andmixturedensitynetworks(MDN)isusedtoestimatetractvariablesandpelle
5、ttrajectories.Theresultsindicatethattractvariablescanprovideabetterestimationperformancethanpellettrajectories.Furthermore,amodel-basedstatisticalparadigmisusedtocalculatetheNormalizedNon-Uniqueness(NNU)andtheresultsshowthatuniquenessintheTV-basedinversemodeliscomparativelylowert
6、hanthepellet-basedmodelforthesamesixconsonants.Secondly,fourdifferentmachinelearningmethodsareusedforspeechinversion,whicharefeedforwardartificialneuralnetwork(FF-ANN),autoregressiveartificialneuralnetwork(AR-ANN),distalsupervisedlearning(DSL)andtrajectorymixturedensitynetwork(TM
7、DN),tocomparetractvariablesandpellettrajectories.Theresultsindicatethattractvariableshavebetterperformancethanpellettrajectoriesandaremorefitforarticulatoryfeature-basedASRsystems.Inaddition,theestimationperformanceofthesemachinelearningmethodsfortractvariableswhenspeechsignalisp
8、arameterizedasMFCCandacousticparameters(
此文档下载收益归作者所有