欢迎来到天天文库
浏览记录
ID:34047546
大小:5.25 MB
页数:73页
时间:2019-03-03
《微博主题数据的抓取与情感分析》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、浙江大学硕士学位论文AbstractWiththeexplosivedevelopmentoftheInternetinrecentyears,peopleareexperiencingvariouswaystoobtainandpublishinformation.Thenewlybornmicro-blogstandsoutamongalltheplatformsforitssimplicity,authenticityandconvenience.Accordingtostatistics,thenumberofthemicro-blogusershasalreadyr
2、eached330millionattheendofthefirsthalfof2013.Notsurprisingly,theeverydayinformationpublicobtainedfromthemicro—blogismuchlargerthanthatfromanytraditionalmedianow.Themicro—blogconveyseveryindividual’Ssentimentaboutatopic.Asaresult,howtoefficientlycrawlthetheme—relatedmicro—blogdataanddiscernt
3、hepublic’Ssentimentstowardagiventopicisveryinterestingchallenge.Thisthesisdevelopsasystemtocrawletheme-relatedmicro—blogdataforsentimentanalysis.Themainpansofthesisarelistedasfollows:Firstly,atheme-relatedcrawlerformicro—blogdataisproposedandimplemented.Secondly,theperformanceofthesegmentat
4、ionofwordsintheme-relatedmicro-blogdataisimprovedaccordingtotheinnatecharacteristicsofmicro-blogdata.Intheend,thisthesisproposesanapproachforsentimentanalysisoftheme-relatedmicro-blogdata.Keywords:micro—blog,topiccrawler,textpreprocessing,emotionaldictionary,sentimentanalysis浙江大学硕士学位论文目录摘要⋯
5、⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯。iAbstract⋯.⋯⋯.⋯⋯.⋯⋯⋯.⋯⋯⋯.⋯⋯⋯⋯⋯..⋯⋯.⋯⋯..⋯⋯⋯.⋯⋯..⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯..ii目录⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯一I图目录⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯IV表目录⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯.V第1章绪论⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯11.1研究背景及意义⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯11.2本文的主要工作⋯⋯⋯⋯⋯
6、⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯21.3本文的组织结构⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯31.4本章小结⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯4第2章相关理论与研究综述⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯52.1网络爬虫简介⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯52.1.1网络爬虫现状⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯62.1.2网络爬虫框架⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯72.2微博与文本预处理技术简介⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯92.2.1微博相关概述⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯一1
7、02.2.2中文分词技术简介⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯112.3中外微博情感分析现状⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯一122.3.1基于词典和规则集及文本互信息的方法⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯..132.3.2基于机器学习的方法⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯一152.4本章小结⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯..16第3章微博数据实时抓取与预处理⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯。.173.1针对微博主题的网络爬虫⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯..173.1.
此文档下载收益归作者所有