Data Quality from Crowdsourcing(conference)

ID:18429910

Size: 880.03 KB

Pages: 73

Date: 2018-09-17


NAACL HLT 2009

Active Learning for Natural Language Processing (ALNLP-09)

Proceedings of the Workshop

June 5, 2009
Boulder, Colorado

Production and Manufacturing by
Omnipress Inc.
2600 Anderson Street
Madison, WI 53707
USA

Endorsed by the following ACL Special Interest Groups:
• SIGNLL, Special Interest Group for Natural Language Learning
• SIGANN, Special Interest Group for Annotation

© 2009 The Association for Computational Linguistics

Order copies of this and other ACL proceedings from:
Association for Computational Linguistics (ACL)
209 N. Eighth Street
Stroudsburg, PA 18360
USA
Tel: +1-570-476-8006
Fax: +1-570-476-0860
acl@aclweb.org

ISBN 978-1-932432-40-4

Introduction

Welcome to the workshop on Active Learning for Natural Language Processing! We started organizing this workshop in mid-2008 after strong encouragement in response to some of our own work in the area. As we gathered members of the program committee, the timeliness of the topic resonated with several of them: the growing body of knowledge on active learning and on active learning for NLP in particular makes this topic one worth exploring in a focused workshop rather than in isolated papers in occasional, far-flung conferences.

Labeled data is a prerequisite for many popular algorithms in natural language processing and machine learning. While it is possible to obtain large amounts of annotated data for well-studied languages in well-studied domains and well-studied problems, labeled data are rarely available for less common languages, domains, or problems. Unfortunately, obtaining human annotations for linguistic data is labor-intensive and typically the costliest part of the acquisition of an annotated corpus. It has been shown before that active learning can be employed to reduce annotation costs but not at the expense of quality. While diverse work over the past decade has demonstrated the possible advantages of active learning for corpus annotation and NLP applications, active learning is not widely used in many ongoing data annotation tasks. Much of the machine learning literature on the topic has focused on active learning for classification problems with less attention devoted to the kinds of problems encountered in NLP.

Related topics such as distributed “human computation”, cost-sensitive machine learning, and semi-supervise
