欢迎来到天天文库
浏览记录
ID:23448822
大小:128.00 KB
页数:25页
时间:2018-11-07
《全文搜索引擎的设计与实现-外文翻译》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、江汉大学毕业论文(设计)外文翻译原文来源TheHadoopDistributedFileSystem:ArchitectureandDesign中文译文Hadoop分布式文件系统:架构和设计姓名XXXX学号2007082021372013年4月8日英文原文TheHadoopDistributedFileSystem:ArchitectureandDesignSource:http://hadoop.apache.org/docs/r0.18.3/hdfs_design.htmlIntroductionTheHadoopDistributedFileSystem(HDFS)isa
2、distributedfilesystemdesignedtorunoncommodityhardware.Ithasmanysimilaritieswithexistingdistributedfilesystems.However,thedifferencesfromotherdistributedfilesystemsaresignificant.HDFSishighlyfault-tolerantandisdesignedtobedeployedonlow-costhardware.HDFSprovideshighthroughputaccesstoapplicatio
3、ndataandissuitableforapplicationsthathavelargedatasets.HDFSrelaxesafewPOSIXrequirementstoenablestreamingaccesstofilesystemdata.HDFSwasoriginallybuiltasinfrastructurefortheApacheNutchwebsearchengineproject.HDFSispartoftheApacheHadoopCoreproject.TheprojectURLishttp://hadoop.apache.org/core/.As
4、sumptionsandGoalsHardwareFailureHardwarefailureisthenormratherthantheexception.AnHDFSinstancemayconsistofhundredsorthousandsofservermachines,eachstoringpartofthefilesystem’sdata.Thefactthatthereareahugenumberofcomponentsandthateachcomponenthasanon-trivialprobabilityoffailuremeansthatsomecomp
5、onentofHDFSisalwaysnon-functional.Therefore,detectionoffaultsandquick,automaticrecoveryfromthemisacorearchitecturalgoalofHDFS.StreamingDataAccessApplicationsthatrunonHDFSneedstreamingaccesstotheirdatasets.Theyarenotgeneralpurposeapplicationsthattypicallyrunongeneralpurposefilesystems.HDFSisd
6、esignedmoreforbatchprocessingratherthaninteractiveusebyusers.Theemphasisisonhighthroughputofdataaccessratherthanlowlatencyofdataaccess.POSIXimposesmanyhardrequirementsthatarenotneededforapplicationsthataretargetedforHDFS.POSIXsemanticsinafewkeyareashasbeentradedtoincreasedatathroughputrates.
7、LargeDataSetsApplicationsthatrunonHDFShavelargedatasets.AtypicalfileinHDFSisgigabytestoterabytesinsize.Thus,HDFSistunedtosupportlargefiles.Itshouldprovidehighaggregatedatabandwidthandscaletohundredsofnodesinasinglecluster.Itshouldsupporttensofmilli
此文档下载收益归作者所有