Design and Implementation of a Full-Text Search Engine: Foreign Literature Translation
Jianghan University Graduation Thesis (Design): Foreign Literature Translation
Original source: The Hadoop Distributed File System: Architecture and Design
Title of the Chinese translation: Hadoop Distributed File System: Architecture and Design
Name: XXXX
Student ID: 200708202137
Date: April 8, 2013

English original

The Hadoop Distributed File System: Architecture and Design
Source: http://hadoop.apache.org/docs/r0.18.3/hdfs_design.html

Introduction

The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. HDFS provides high throughput access to application data and is suitable for applications that have large data sets. HDFS relaxes a few POSIX requirements to enable streaming access to file system data. HDFS was originally built as infrastructure for the Apache Nutch web search engine project. HDFS is part of the Apache Hadoop Core project. The project URL is http://hadoop.apache.org/core/.

Assumptions and Goals

Hardware Failure

Hardware failure is the norm rather than the exception. An HDFS instance may consist of hundreds or thousands of server machines, each storing part of the file system's data. The fact that there are a huge number of components and that each component has a non-trivial probability of failure means that some component of HDFS is always non-functional. Therefore, detection of faults and quick, automatic recovery from them is a core architectural goal of HDFS.

Streaming Data Access

Applications that run on HDFS need streaming access to their data sets. They are not general purpose applications that typically run on general purpose file systems. HDFS is designed more for batch processing than for interactive use by users. The emphasis is on high throughput of data access rather than low latency of data access. POSIX imposes many hard requirements that are not needed for applications that are targeted for HDFS. POSIX semantics in a few key areas have been traded to increase data throughput rates.

Large Data Sets

Applications that run on HDFS have large data sets. A typical file in HDFS is gigabytes to terabytes in size. Thus, HDFS is tuned to support large files. It should provide high aggregate data bandwidth and scale to hundreds of nodes in a single cluster. It should support tens of milli
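
The Hardware Failure argument above (many components, each with a non-trivial probability of failure, so some component is always non-functional) can be illustrated with a little arithmetic. The following is a minimal sketch and not part of the original text: it assumes component failures are independent, and the function name `prob_any_failed` and the per-component probability of 0.001 are hypothetical values chosen only for illustration.

```python
def prob_any_failed(n_components: int, p_fail: float) -> float:
    """Probability that at least one of n components is down.

    Assumes (for illustration only) that each component is down
    independently with the same probability p_fail.
    """
    if n_components < 0:
        raise ValueError("n_components must be non-negative")
    if not 0.0 <= p_fail <= 1.0:
        raise ValueError("p_fail must be between 0 and 1")
    # P(all components up) = (1 - p)^n, so P(at least one down) = 1 - (1 - p)^n.
    return 1.0 - (1.0 - p_fail) ** n_components


if __name__ == "__main__":
    # Hypothetical per-component probability of being down at a given moment.
    p = 0.001
    for n in (1, 100, 1000, 10000):
        print(f"{n:>6} components: P(at least one down) = {prob_any_failed(n, p):.4f}")
```

Under these assumptions the probability climbs toward 1 as the component count grows, which is the intuition behind treating fault detection and automatic recovery as a core design goal rather than an edge case.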