Acta Geodaetica et Cartographica Sinica (测绘学报), Vol. 45, No. 5, May 2016

Citation: YU Li, LU Feng, LIU Xiliang. A Bootstrapping Based Approach for Open Geo-entity Relation Extraction [J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(5): 616-622. DOI: 10.11947/j.AGCS.2016.20150181.

A Bootstrapping Based Approach for Open Geo-entity Relation Extraction

YU Li (余丽) 1,2, LU Feng (陆锋) 1,3, LIU Xiliang (刘希亮) 1

1. State Key Lab of Resources and Environmental Information System, The Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
2. University of Chinese Academy of Sciences, Beijing 100101, China
3. Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China

Abstract: Extracting spatial and semantic relations between geo-entities from Web texts requires robust and effective solutions. This paper puts forward a novel approach. Firstly, the characteristics of terms (part-of-speech, position and distance) are analyzed by means of bootstrapping. Secondly, the weight of each term is calculated, and the keyword is picked out as the clue to the geo-entity relation. Thirdly, the geo-entity pairs and their keywords are organized into structured information. Finally, an experiment is conducted with Baidu Baike and Stanford CoreNLP. The study shows that the presented method can automatically explore part of the lexical features and find additional relational terms, requiring neither domain expert knowledge nor large-scale corpora. Moreover, compared with three classical frequency statistics methods, namely Frequency, TF-IDF and PPMI, precision and recall are improved by about 5% and 23%, respectively.

Keywords: text mining; geo-entities; relation extraction; quantitative evaluation; bootstrapping

Foundation support: The National Natural Science Foundation of China (No. 41271408); The National High-Tech Research and Development Program of China (863 Program) (No. 2013AA120305)

Abstract (translated from the Chinese): Extracting spatial and semantic relations between geo-entities from Web texts demands high timeliness
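The three baselines named in the abstract (Frequency, TF-IDF and PPMI) are standard term-weighting schemes. The sketch below shows how each might score candidate relation keywords. It is only an illustration under stated assumptions, not the paper's implementation: the toy contexts, the relation labels, the function names, and the choice to treat each relation type as one "document" for TF-IDF are all hypothetical.

```python
import math

# Hypothetical toy data: each item pairs an assumed relation label with the
# terms found in the context of one geo-entity pair.
contexts = [
    ("located_in", ["located", "in", "north"]),
    ("located_in", ["located", "in", "city"]),
    ("flows_through", ["flows", "through", "city"]),
    ("flows_through", ["flows", "in", "north"]),
]

def frequency(term, relation):
    """Raw count of `term` in all contexts of `relation`."""
    return sum(terms.count(term) for rel, terms in contexts if rel == relation)

def tf_idf(term, relation):
    """TF-IDF, treating each relation type as one document (an assumption)."""
    relations = {rel for rel, _ in contexts}
    tf = frequency(term, relation)
    df = sum(1 for rel in relations if frequency(term, rel) > 0)
    return tf * math.log(len(relations) / df) if df else 0.0

def ppmi(term, relation):
    """Positive pointwise mutual information between a term and a relation."""
    total = sum(len(terms) for _, terms in contexts)
    joint = frequency(term, relation) / total
    if joint == 0:
        return 0.0
    p_term = sum(terms.count(term) for _, terms in contexts) / total
    p_rel = sum(len(terms) for rel, terms in contexts if rel == relation) / total
    return max(0.0, math.log2(joint / (p_term * p_rel)))

def top_keyword(relation, score):
    """Highest-scoring term for `relation`; ties go to the alphabetically first."""
    vocab = sorted({t for rel, terms in contexts if rel == relation for t in terms})
    return max(vocab, key=lambda t: score(t, relation))

for name, score in [("Frequency", frequency), ("TF-IDF", tf_idf), ("PPMI", ppmi)]:
    print(name, top_keyword("located_in", score))
```

On this toy data, raw frequency cannot separate the function word "in" from the more informative "located", whereas TF-IDF and PPMI both down-weight "in" because it also occurs with the other relation. All three rely on counts alone and ignore the part-of-speech, position and distance features that the abstract says the proposed method analyzes.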