欢迎来到天天文库
浏览记录
ID:11091977
大小:123.00 KB
页数:25页
时间:2018-07-10
《全文搜索引擎的设计与实现-外文翻译》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、江汉大学毕业论文(设计)外文翻译原文来源TheHadoopDistributedFileSystem:ArchitectureandDesign中文译文Hadoop分布式文件系统:架构和设计姓名XXXX学号2007082021372013年4月8日英文原文TheHadoopDistributedFileSystem:ArchitectureandDesignSource:http://hadoop.apache.org/docs/r0.18.3/hdfs_design.htmlIntroductionTheHadoopDistributed
2、FileSystem(HDFS)isadistributedfilesystemdesignedtorunoncommodityhardware.Ithasmanysimilaritieswithexistingdistributedfilesystems.However,thedifferencesfromotherdistributedfilesystemsaresignificant.HDFSishighlyfault-tolerantandisdesignedtobedeployedonlow-costhardware.HDFSpr
3、ovideshighthroughputaccesstoapplicationdataandissuitableforapplicationsthathavelargedatasets.HDFSrelaxesafewPOSIXrequirementstoenablestreamingaccesstofilesystemdata.HDFSwasoriginallybuiltasinfrastructurefortheApacheNutchwebsearchengineproject.HDFSispartoftheApacheHadoopCor
4、eproject.TheprojectURLishttp://hadoop.apache.org/core/.AssumptionsandGoalsHardwareFailureHardwarefailureisthenormratherthantheexception.AnHDFSinstancemayconsistofhundredsorthousandsofservermachines,eachstoringpartofthefilesystem’sdata.Thefactthatthereareahugenumberofcompon
5、entsandthateachcomponenthasanon-trivialprobabilityoffailuremeansthatsomecomponentofHDFSisalwaysnon-functional.Therefore,detectionoffaultsandquick,automaticrecoveryfromthemisacorearchitecturalgoalofHDFS.StreamingDataAccessApplicationsthatrunonHDFSneedstreamingaccesstotheird
6、atasets.Theyarenotgeneralpurposeapplicationsthattypicallyrunongeneralpurposefilesystems.HDFSisdesignedmoreforbatchprocessingratherthaninteractiveusebyusers.Theemphasisisonhighthroughputofdataaccessratherthanlowlatencyofdataaccess.POSIXimposesmanyhardrequirementsthatarenotn
7、eededforapplicationsthataretargetedforHDFS.POSIXsemanticsinafewkeyareashasbeentradedtoincreasedatathroughputrates.LargeDataSetsApplicationsthatrunonHDFShavelargedatasets.AtypicalfileinHDFSisgigabytestoterabytesinsize.Thus,HDFSistunedtosupportlargefiles.Itshouldprovidehigha
8、ggregatedatabandwidthandscaletohundredsofnodesinasinglecluster.Itshouldsupporttensofmilli
此文档下载收益归作者所有