资源描述:
《Cross-Task Crowdsourcing跨任务众包》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、Cross-TaskCrowdsourcingKaixiangMo1,ErhengZhong1andQiangYang1,21:HongKongUniversityofScienceandTechnology,HongKong;2:HuaweiNoah’sArkLab,ScienceandTechnologyPark,Shatin,HongKong{kxmo,ezhong,qyang}@cse.ust.hkABSTRACTKeywordsCrowdsourcingisaneffectivemethodforcollectinglabeledCrowdsourcing,
2、TransferLearningdataforvariousdataminingtasks.Itiscriticaltoensuretheveracityoftheproduceddatabecauseresponsescollected1.INTRODUCTIONfromdifferentusersmaybenoisyandunreliable.PreviousworkssolvethisveracityproblembyestimatingboththeProducinglarge-scaletraining,validationandtestsetsisuser
3、abilityandquestiondifficultybasedontheknowledgevitalformanymachinelearninganddataminingapplica-ineachtaskindividually.Inthiscase,eachsingletaskneedstions.Mostoftenthistaskhastobecarriedout“byhand”largeamountsofdatatoprovideaccurateestimations.How-andthusitisdelicate,expensive,andtedious.
4、Crowd-1ever,inpractice,budgetsprovidedbycustomersforagivensourcingsystemssuchasAmazonMechanicalTurk,re-23targettaskmaybelimited,andhenceeachquestioncanbeCAPTCHAandtheESPgamehavemadeiteasytodis-presentedtoonlyafewuserswhereeachusercananswertributesimplelabelingtaskstohundredsofusers(ref
5、erredonlyafewquestions.ThisdatasparsityproblemcancausetoasworkerinMechanicalTurk).Crowdsourcingiswidelypreviousapproachestoperformpoorlyduetotheoverfit-usedintaskssuchassentimentclassification[12],objecttingproblemonraredataandeventuallydamagethedatarecognition[7],ranking[13]andclusterin
6、g[6],etc.veracity.Fortunately,inreal-worldapplications,userscanAtypicalcrowdsourcingpipelinecanbedividedintothreeanswerquestionsfrommultiplehistoricaltasks.Forexam-mainsteps:1:Taskdesign,2:Datadistributionand3:An-ple,onecanannotateimagesaswellaslabelthesentimentsweraggregation.Intheans
7、weraggregationstep,oneneedsofagiventitle.Inthispaper,weemploytransferlearning,toaggregatethesenoisyandunreliableresponsesintoatruewhichborrowsknowledgefromauxiliaryhistoricaltaskstoanswer.Answeraggregationismostimportantstepbecauseimprovethedataveracityinagiventargettask.Themoti-itis