资源描述:
《Statistical Consistency of Top-k Ranking》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、StatisticalConsistencyofTop-kRankingFenXiaTie-YanLiuHangLiInstituteofAutomationMicrosoftResearchAsiaMicrosoftResearchAsiaChineseAcademyofSciencestyliu@microsoft.comhanglig@microsoft.comfen.xia@ia.ac.cnAbstractThispaperisconcernedwiththeconsistencyanalysison
2、listwiserankingmeth-ods.Amongvariousrankingmethods,thelistwisemethodshavecompetitiveper-formancesonbenchmarkdatasetsandareregardedasoneofthestate-of-the-artapproaches.Mostlistwiserankingmethodsmanagetooptimizerankingonthewholelist(permutation)ofobjects,howe
3、ver,inpracticalapplicationssuchasin-formationretrieval,correctrankingatthetopkpositionsismuchmoreimportant.Thispaperaimstoanalyzewhetherexistinglistwiserankingmethodsarestatisti-callyconsistentinthetop-ksetting.Forthispurpose,wedefineatop-krankingframework,w
4、herethetrueloss(andthustherisks)aredefinedonthebasisoftop-ksubgroupofpermutations.Thisframeworkcanincludethepermutation-levelrankingframeworkproposedinpreviousworkasaspecialcase.Basedonthenewframework,wederivesufficientconditionsforalistwiserankingmethodtobec
5、onsistentwiththetop-ktrueloss,andshowaneffectivewayofmodify-ingthesurrogatelossfunctionsinexistingmethodstosatisfytheseconditions.Experimentalresultsshowthatafterthemodifications,themethodscanworksig-nificantlybetterthantheiroriginalversions.1IntroductionRank
6、ingisthecentralprobleminmanyapplicationsincludinginformationretrieval(IR).Inrecentyears,machinelearningtechnologieshavebeensuccessfullyappliedtoranking,andmanylearningtorankmethodshavebeenproposed,includingthepointwise[12][9][6],pairwise[8][7][2],andlistwis
7、emethods[13][3][16].Empiricalresultsonbenchmarkdatasetshavedemonstratedthatthelistwiserankingmethodshaveverycompetitiverankingperformances[10].Toexplainthehighrankingperformancesofthelistwiserankingmethods,atheoreticalframeworkwasproposedin[16].Intheframewo
8、rk,existinglistwiserankingmethodsareinterpretedasmakinguseofdifferentsurrogatelossfunctionsofthepermutation-level0-1loss.Theoreticalanalysisshowsthatthesesurrogatelossfunctionsareallstatisticallyconsis