资源描述:
《MINECoP An Integrated Visualization Tool for Corpus Mining》由会员上传分享,免费在线阅读,更多相关内容在行业资料-天天文库。
1、*MINECoP:AnIntegratedVisualizationToolforCorpusMiningAsaneeKawtrakul,PatchareeVarasai,SuteeSudprasert,PrachyaBoonkwanandDusadeeThamavijitSpecialtyResearchUnitofNaturalLanguageProcessingandIntelligentInformationSystemTechnology(NAiST)DepartmentofComputerEngineering,Facult
2、yofEngineering,KasetsartUniversity,Bangkok.Email:{ak,pom,sutee,arm,ton}@vivaldi.cpe.ku.ac.thexclusivelywordsenseannotation.OneofthefirstAbstractmajoreffortsatwordsenseannotationwasSEMCOR[8],whereabout230,000wordoccur-InordertohaveagoodlanguagemodelrencesoftheBrowncorpusw
3、eretaggedwithforcreatingcosteffectivesolutionstotheWordNetsenses.AnotherexampleFrameNetpro-practicalproblemsindevelopingNLPject[9]isstartedbyBerkeley,whichproducedtheapplications,weneedtolearnfromob-frame-semanticdescriptionsofseveralthousandserveddataofnaturallyoccurrin
4、gtext.Englishlexicalitemsandbackedupthesedescrip-tionswithsemanticallyannotatedfromcontempo-ThispaperpresentsadesignofpackagedraryEnglish.SALSAcorpus[10]isdesignedtotoolcalledMINECoPforannotatingandcreatealargesemanticallyannotatedGermancor-mininglinguisticphenomena.Theu
5、serpusandtoinvestigatemethodsforitsutilization,couldlearnthelanguagebehaviorsbylikeimprovingstatisticalparsers,andextendingidentifyingspecific-needquerypatternsmethodsforinformationextractionandmachinetoobservetheproblemsandtodeducetranslation.ForChinese,manyattentionsha
6、veknow-howtodesignlanguagemodelbeennaturallypaidtoresearchesonsemantics,fromthenaturallyoccurringtext.becauseChineseisameaning-combinedlanguage,itssyntaxisveryflexible,andsemanticrulesaremorestablethansyntacticrules[11].1IntroductionSinceourcurrentresearchisaimedtodevelo
7、pLargecorporaforanumberoflanguages,whichacomputationalmodelforknowledgemanage-carefullyandquitecomprehensivelyannotatedmentwhichincludingontologyextraction,infor-withsyntacticinformation,areavailable,suchas,mationextraction,textsummarizationandthePennTree-bank[2],theNEGR
8、Acorpus[3]knowledgediscovery,weneedtolearnfromob-andTigercorpus[4],thePennChineseTree-bankserveddataofn