欢迎来到天天文库
浏览记录
ID:34432292
大小:251.01 KB
页数:14页
时间:2019-03-06
《e s e a r c h r e p r o r t i d inew》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、ORTIDIAPMartigny-Valais-SuisseEPRMulti-StreamSpeechRecognitionabHerveBourlardStephaneDupontbESEARCHChristopheRisRIDIAP{RR96-07IDIAPDecember1996DalleMolleInstituteforPerceptiveArtificialInteligenceP.O.Box592MartignyValaisSwitzerlandphone+41?27?721771
2、1fax+41?27?7217712e-mailsecretariat@idiap.chaIDIAPinternethttp://www.idiap.chbFacultePolytechniquedeMons,BelgiumIDIAPResearchReport96-07Multi-StreamSpeechRecognitionHerveBourlardStephaneDupontChristopheRisDecember1996Abstract.Inthispaper,wediscussanewa
3、utomaticspeechrecognition(ASR)approachbasedonindependentprocessingandrecombinationofseveralfeaturestreams.Inthisframework,itisassumedthatthespeechsignalisrepresentedintermsofmultipleinputstreams,eachinputstreamrepresentingadierentcharacteristicofthesigna
4、l.Ifthestreamsareentirelysynchronous,theymaybeaccommodatedsimply(astheyusuallyareinstate-of-the-artsystems).However,asdiscussedinthepaper,itmayberequiredtopermitsomedegreeofasynchronybetweenstreams.Thispaperintroducesthebasicframeworkofastatisticalstructu
5、rethatcanaccommodatemultiple(asynchronous)observationstreams(possiblyexhibitingdierentframerates).Thisapproachwillthenbeappliedtotheparticularcaseofmulti-bandspeechrecognitionandwillbeshowntoyieldsignicantlybetternoiserobustness.2IDIAP{RR96-071Introduct
6、ionIncurrentautomaticspeechrecognition(ASR)systems,theacousticprocessingmoduletypicallyemploysfeatureextractiontechniquesinwhich20to30millisecondsofspeechisanalyzedoncepercentisecond,leadingtoasequenceofacoustic(feature)vectorsthateachdescribelocalcompone
7、ntsofthespeechsignal.Eachacousticvectoristypicallyasmoothedspectrumorcepstrum.HiddenMarkovModel(HMM)states,whicharetypicallyassociatedwithcontext-dependentphonessuchastriphones,arethencharacterizedbyastationaryprobabilitydensityfunctionoverthespaceofthese
8、acousticvectors.WordsandsentencesarethenassumedtobepiecewisestationaryandrepresentedintermsofasequenceofHMMstates.Instate-of-the-artASRsystems,each10-msspeechsegmentisoftendescribedintermsofseveral(dependentorindepe
此文档下载收益归作者所有