ANVIL: a System for the Retrieval of Captioned Images using NLP Techniques

Tony Rose, David Elworthy, Aaron Kotcheff and Amanda Clare
Canon Research Centre Europe
Guildford, Surrey

Petros Tsonis
Dept. of Computing Science, University of Glasgow
Scotland

Abstract

ANVIL is a system designed for the retrieval of images annotated with short captions. It uses NLP techniques to extract dependency structures from captions and queries, and then applies a robust matching algorithm to recursively explore and compare them. There are currently two main interfaces to ANVIL: a list-based display and a 2D spatial layout that allows users to interact with and navigate between similar images. ANVIL was designed to operate as part of a publicly accessible, WWW-based image retrieval server. Consequently, product-level engineering standards were required. This paper examines both the research aspects of the system and also looks at some of the design and evaluation issues.

1 Introduction

A fundamental aim of many Information Retrieval (IR) systems is to match a user’s statement of their information need with the contents of a document collection. Since queries and documents are often predominantly textual, it seems reasonable to suggest that interpretation of their underlying linguistic structure would be beneficial. However, in practice, this has rarely proved to be the case [1]. One notable exception is the Intermezzo system [2], in which NLP techniques were applied to the retrieval of images that had been annotated with short captions. This system returned a precision of around 90% for 50 queries on a 500,000-image database. This result would tend to indicate that the combination of a short caption (as opposed to a larger document) with some control over its content (as opposed to minimal editorial control over large document collections) may facilitate the application of specific NLP techniques to improve the retrieval process. This paper, and the ANVIL image retrieval system it describes, constitutes a further investigation of this hypothesis.

1.1 Design Goals

ANVIL is designed for the retrieval of captioned images, using fast and accurate natural language techniques. As with Intermezzo, we use a database of images with phrasal captions, their length ranging from 1 to 20 words (mean = ~9 words). Some examples are as follows:

Golden colum