资源描述:
《统计学文献19new》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、ComparativeStatisticsforDNAandProteinSequences:SingleSequenceAnalysisSamuelKarlin,andGhassanGhandourPNAS1985;82;5800-5804doi:10.1073/pnas.82.17.5800ThisinformationiscurrentasofDecember2006.Thisarticlehasbeencitedbyotherarticles:www.pnas.org#otherarticlesE-mailAlertsReceivef
2、reeemailalertswhennewarticlescitethisarticle-signupintheboxatthetoprightcornerofthearticleorclickhere.Rights&PermissionsToreproducethisarticleinpart(figures,tables)orinentirety,see:www.pnas.org/misc/rightperm.shtmlReprintsToorderreprints,see:www.pnas.org/misc/reprints.shtml
3、Notes:Proc.Nati.Acad.Sci.USAVol.82,pp.5800-5804,September1985EvolutionComparativestatisticsforDNAandproteinsequences:Singlesequenceanalysis(repeatoccurrencecounts/high-frequencyrepeats/randomsequencemodels)SAMUELKARLINANDGHASSANGHANDOURDepartmentofMathematics,StanfordUniver
4、sity,Stanford,CA94305ContributedbySamuelKarlin,March8,1985ABSTRACTFourcategoriesofdatarepresentationsarerandommodelthataccountsforfirstorder(Markovian)usedtohelpinterpretstructuresandsimilaritiesofnucleicaciddependenciesbetweenneighboringlettersprescribesp,asandproteinseque
5、nces.Statisticalsignificanceoftheobservedtheconditionalprobabilityofsamplingletterljfollowingrelationshipsrevealedbytheserepresentationsareassessedbyletterli.[ForDNA,p.,J)generallycorrespondtotheahierarchyofpermutationproceduresandbycomparisonsdinucleotidefrequenciesofthevt
6、hsequence.]withtheoreticalrandommodels.ApplicationsarepresentedforTHEOREMI.TheexpectedlengthofthelongestcommonvariousDNAsequencesincludingpapovaviruses,Epstein-wordpresentinatleastroutofssequences,Krs,fortheBarrvirus,mitochondrialgenomes,andseveralglobinandindependencemodel
7、hasgenerallyordergrowthimmunoglobulingenes.[logn(N1,...,Ns)+logX(1-X)+0.577]/(-logX)whereDistinguishingnonrandomsequencerelationshipsfromchanceconfigurationsisimportantinnucleicacidandproteinsequencecomparisons.Specificword(asetofcontiguousn(N,Ns)=l-ij8、tionshipsareknowntobebiologicallymeaning-...