欢迎来到天天文库
浏览记录
ID:34104137
大小:672.36 KB
页数:34页
时间:2019-03-03
《convolutional neural networks for sentence》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、ConvolutionalNeuralNetworksforSentenceClassicationConvolutionalNeuralNetworksforSentenceClassicationYoonKimNewYorkUniversity1/34ConvolutionalNeuralNetworksforSentenceClassicationAgendaWordEmbeddingsClassicationRecursiveNeuralTensorNetworksConvolutionalNeuralNetwor
2、ksExperimentsConclusion2/34ConvolutionalNeuralNetworksforSentenceClassicationWordEmbeddingsDeeplearninginNaturalLanguageProcessingIDeeplearninghasachievedstate-of-the-artresultsincomputervision(Krizhevskyetal.,2012)andspeech(Gravesetal.,2013).INLP:fastbecoming(alread
3、yis)ahotareaofresearch.IMuchoftheworkinvolveslearningwordembeddingsandperformingcompositionoverthelearnedembeddingsforNLPtasks.3/34ConvolutionalNeuralNetworksforSentenceClassicationWordEmbeddingsWordEmbeddings(orWordVectors)ITraditionalNLP:Wordsaretreatedasindices(or
4、one-hot"vectorsinRV)IEverywordisorthogonaltooneanother.Iwmotherwfather=0ICanweembedwordsinRDwithDVsuchthatsemanticallyclosewordsarelikewise`close'inRD?(i.e.wmotherwfather>0)IYes!IDon't(necessarily)needdeeplearningforthis:LatentSemanticAnalysis,LatentDirichletAlloc
5、ation,orsimplecontextcountsallgivedenserepresentations.4/34ConvolutionalNeuralNetworksforSentenceClassicationWordEmbeddingsNeuralLanguageModels(NLM)IAnotherwaytoobtainwordembeddings.IWordsareprojectedfromRVtoRDviaahiddenlayer.IDisahyperparametertobetuned.IVariousarch
6、itecturesexist.Simpleonesarepopularthesedays(right).IVeryfast
7、cantrainonbillionsoftokensinonedaywithasinglemachine.Figure1:SkipgramarchitectureofMikolovetal.(2013)5/34ConvolutionalNeuralNetworksforSentenceClassicationWordEmbeddingsLinguisticregularitiesintheobtainede
8、mbeddingsIThelearnedembeddingsencodesemanticandsyntacticregularities:Iwbig wbiggerwslow wslowerIwfrance wpariswkorea wseoulIThesearecool,butnotnecessarilyuniquetoneurallanguagemodels.[...]theneuralembeddingprocessisnotdiscoveringnovelpatterns,butratherisdoingaremar
9、kablejobatpreservingthepatternsinherentintheword-contextco-occurrencematrix."LevyandGoldberg,LinguisticRegularitiesinSparse
此文档下载收益归作者所有