Shanghai Jiao Tong University, Master of Engineering Thesis

DESIGN AND IMPLEMENTATION OF ETL SYSTEM ON ODS IN THE INSURANCE INDUSTRY

ABSTRACT

Based on a practical project at an insurance company, this paper analyzes ETL-related research and techniques and studies the design and implementation of ETL in depth. The resulting ETL system has been put into the production environment.
On modeling and design architecture, this paper first puts forward, according to the project, a design model architecture based on the common data warehouse model. On that basis, it then sets up a job schedule metamodel by analyzing the logic.
On extracting data, this paper puts forward an extraction-transfer-staging-merge approach to solve the problem of extracting and merging data in a distributed heterogeneous environment. On ETL system performance, this paper improves performance by applying the ideas of pipelining and partitioning.

On conforming duplicate customer data, this paper first puts forward a sorting and equal-matching algorithm. It then shows that the algorithm performs effectively when matching keys exist. The paper also uses business rules to put forward an algorithm for processing the duplicate data, which has not been demonstrated before.

On detecting erroneous data, this paper puts forward an approach in which business rule objects are used to express the detection of erroneous data, and shows that the approach is effective and efficient.

On data quality, this paper uses a system of quantitative measures to determine the data quality dimensions and their importance weights, and then applies a weighted-average approach to evaluate the data quality of the system as a whole. The paper makes data quality a part of ETL design, which enriches the ETL design model and improves its usability.

KEY WORDS: ETL, Metamodel, data quality, erroneous data detection, duplicate data conforming

List of Symbols

Abbreviation  Full spelling                         Chinese explanation
BI            Business Intelligence                 商业智能
CIF           Common Interface File                 普通接口文件
CWM           Common Warehouse Metamodel            公共数据仓库元模型
DSEE          DataStage 7.5.1A Enterprise Edition   DataStage企业版
DSS           Decision Support System               决策支持系统
ETL           Extract, Transform, and Load          抽取、转换和加载
EXF           Extract File                          抽取文件
GQM           goal-question