欢迎来到天天文库
浏览记录
ID:57055698
大小:2.53 MB
页数:104页
时间:2020-07-30
《chap8_basic_cluster_analysis基本聚类分析课件.ppt》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、DataMiningClusterAnalysis:BasicConceptsandAlgorithmsLectureNotesforChapter8IntroductiontoDataMiningbyTan,Steinbach,Kumar©Tan,Steinbach,KumarIntroductiontoDataMining4/18/20041WhatisClusterAnalysis?Findinggroupsofobjectssuchthattheobjectsinagroupwillbesimila
2、r(orrelated)tooneanotheranddifferentfrom(orunrelatedto)theobjectsinothergroupsInter-clusterdistancesaremaximizedIntra-clusterdistancesareminimizedApplicationsofClusterAnalysisUnderstandingGrouprelateddocumentsforbrowsing,groupgenesandproteinsthathavesimilarf
3、unctionality,orgroupstockswithsimilarpricefluctuationsSummarizationReducethesizeoflargedatasetsClusteringprecipitationinAustraliaWhatisnotClusterAnalysis?SupervisedclassificationHaveclasslabelinformationSimplesegmentationDividingstudentsintodifferentregistra
4、tiongroupsalphabetically,bylastnameResultsofaqueryGroupingsarearesultofanexternalspecificationGraphpartitioningSomemutualrelevanceandsynergy,butareasarenotidenticalNotionofaClustercanbeAmbiguousHowmanyclusters?FourClustersTwoClustersSixClustersTypesofCluster
5、ingsAclusteringisasetofclustersImportantdistinctionbetweenhierarchicalandpartitionalsetsofclustersPartitionalClusteringAdivisiondataobjectsintonon-overlappingsubsets(clusters)suchthateachdataobjectisinexactlyonesubsetHierarchicalclusteringAsetofnestedcluster
6、sorganizedasahierarchicaltreePartitionalClusteringOriginalPointsAPartitionalClusteringHierarchicalClusteringTraditionalHierarchicalClusteringNon-traditionalHierarchicalClusteringNon-traditionalDendrogramTraditionalDendrogramOtherDistinctionsBetweenSetsofClus
7、tersExclusiveversusnon-exclusiveInnon-exclusiveclusterings,pointsmaybelongtomultipleclusters.Canrepresentmultipleclassesor‘border’pointsFuzzyversusnon-fuzzyInfuzzyclustering,apointbelongstoeveryclusterwithsomeweightbetween0and1Weightsmustsumto1Probabilisticc
8、lusteringhassimilarcharacteristicsPartialversuscompleteInsomecases,weonlywanttoclustersomeofthedataHeterogeneousversushomogeneousClusterofwidelydifferentsizes,shapes,anddensitiesTypesofClustersW
此文档下载收益归作者所有