wang_managing and mining billion-node graphsnew

wang_managing and mining billion-node graphsnew

ID:34470692

大小:2.36 MB

页数:137页

时间:2019-03-06

wang_managing and mining billion-node graphsnew_第1页
wang_managing and mining billion-node graphsnew_第2页
wang_managing and mining billion-node graphsnew_第3页
wang_managing and mining billion-node graphsnew_第4页
wang_managing and mining billion-node graphsnew_第5页
资源描述:

《wang_managing and mining billion-node graphsnew》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库

1、KDD2012SummerSchoolManagingandMiningBillion-NodeGraphsHaixunWangMicrosoftResearchAsia1OurFocusSystem&GraphProcessing2Outline•LargeGraphChallenges•SystemsforLargeGraphs–RDBMS,MapReduce,Pregel,Pegasus,Trinity•KeyGraphAlgorithms–GraphPartitioning,Traversal,Query,Analytics3Out

2、line•LargeGraphChallenges•SystemsforLargeGraphs–RDBMS,MapReduce,Pregel,Pegasus,Trinity•KeyAlgorithms–GraphPartitioning,Traversal,Query,Analytics4Graphsencoderichrelationshipstrillion8#ofEdgesDeBrujinGraphtrillion1theWebbillion104Facebookbillion31LinkedDatabillion5.6million

3、58USRoadMap#ofNodes245048001.4501millionmillionmillionbillionbilliontrillion5DiversityofGraphsP(k)~k-aScaleFreeGraphsCommunityStructureSmallWorld6ALargeVarietyofGraphOperations•Onlinequeryprocessing–Shortestpathquery–Subgraphmatchingquery–SPARQLquery–…•Offlinegraphanalytic

4、s–PageRank–Communitydetection–…•Othergraphoperations–Graphgeneration,visualization,interactiveexploration,etc.7CurrentStatus•Goodsystemsforprocessinggraphs:–PBGL,Neo4j•Goodsystemsforprocessinglargedata:–Map/Reduce,Hadoop•Goodsystemsforprocessingspecializedlargegraphdata:–S

5、pecializedsystemsforpagerank,etc.8Thisishard.GeneralityGraphLargeDataDataNogoodsystemforprocessinggenerallargegraphs9Graphprocessingwithoutasystemishard!FundamentalissuesDifferentprogrammingmodelsscheduling,datadistribution,synchronization,inter-processcommunication,robust

6、ness,faulttolerance,MessagePassingSharedMemory…MemoryArchitecturalissuesP1P2P3P4P5P1P2P3P4P5Flynn’staxonomy(SIMD,MIMD,etc.),networktypology,bisectionbandwidthUMAvs.NUMA,cachecoherenceCommonproblemslivelock,deadlock,datastarvation,priorityinversion…diningphilosophers,sleepi

7、ngbarbers,cigarettesmokers,…producerconsumerDifferentprogrammingconstructsmastermutexes,conditionalvariables,barriers,…masters/slaves,producers/consumers,workqueues,…slavesworkqueueproducerconsumerProgrammershoulderstheburdenofmanagingallthesesubtleissues…Adaptedfrom:Jimmy

8、Lin,SIKS/BigGridBigDataTutorial(2011)Benefitsofageneralpurposesystem•Enableapplicationsto

当前文档最多预览五页,下载文档查看全文

此文档下载收益归作者所有

当前文档最多预览五页,下载文档查看全文
温馨提示:
1. 部分包含数学公式或PPT动画的文件,查看预览时可能会显示错乱或异常,文件下载后无此问题,请放心下载。
2. 本文档由用户上传,版权归属用户,天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容,确认文档内容符合您的需求后进行下载,若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误,付费完成后未能成功下载的用户请联系客服处理。