资源描述:
《Automatic learning for semantic collocation》由会员上传分享,免费在线阅读,更多相关内容在行业资料-天天文库。
1、AutomaticLearningforSemanticCollocationSatoshlSEKINE*TokyoInformationandCommunicationsResearchLaboratoryMatsushitaElectricIndustrialCo.,Ltd.3-10-1,higashimita,tama-ku,kawasaki214JAPANJeremyJ.CARROLLSofiaANANIADOUJun'ichiTSUJIICentreforComputationalLinguisticsUniv
2、ersityofManchesterInstituteofScienceandTechnologyP.O.Box88,ManchesterM601QD,UnitedKingdomAbstractguisticorextra-linguistic.Inparticular,ithasbeenre-ported[Ananiadou,1990]thatnotonlyextra-linguistic,Therealdifficultyindevelopmentofpracticaldomainknowledgebutalsoli
3、nguisticknowledgerequiredNLPsystemscomesfromthefactthatwedoforapplicationsystemsvaries,dependingontext-typenothaveeffectivemeansforgathering"knowl-(technicalreports,scientificpapers,manuals,etc.),sub-edge".Inthispaper,weproposeanalgorithmjectdomain,typeofapplicat
4、ion(MT,automaticab-whichacquiresautomaticallyknowledgeofse-straction,etc.)etc.Thismeansthatwehavetohaveef-manticcollocationsamong"words"fromsam-fectiveandefficientmethodseitherforadaptingalreadyplecorpora.existingknowledgeforaspecific"sublanguage"orforac-Thealgor
5、ithmproposedinthispapertriestoquiringknowledgeautomatically,forexamplefromsam-discoversemanticcollocationswhichwillbeplecorporaofgivenapplications.usefulfordisambiguatingstructurallyambigu-Inthispaper,weproposeanalgorithmwhichauto-oussentences,byastatisticalappro
6、ach.Thematicallyacquiresknowledgeofsemanticcollocationsalgorithmrequiresacorpusandminimumlin-among"words"."Semantic"heremeansthatthecol-guisticknowledge(parts-of-speechofwords,locationsthealgorithmdiscoversarenotcollocationssimpleinflectionrules,andasmallnumberof
7、amongwordsinthesenseoftraditionallinguisticsbutgeneralsyntacticrules).collocationsthatreflectontologicalrelationsamongen-Weconductedtwoexperimentsofapplyingthetitiesingivensubjectdomains.Weexpectthatthealgorithmtodifferentcorporatoextractdif-knowledgetobeextracte
8、dwillnotonlybeusefulforferenttypesofsemanticcollocations.Thoughdisambiguatingsentencesbutalsowillcontributetodis-therearesomeunsolvedproblems,theresultscoverin