资源描述:
《Next-generation transcriptome assembly》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、REVIEWSSTUDYDESIGNSNext-generationtranscriptomeassemblyJeffreyA. MartinandZhongWangAbstract
2、Transcriptomicsstudiesoftenrelyonpartialreferencetranscriptomesthatfailtocapturethefullcatalogueoftranscriptsandtheirvariations.Recentadvancesinsequencingtechnologiesandassemblyalgorith
3、mshavefacilitatedthereconstructionoftheentiretranscriptomebydeepRNAsequencing(RNA-seq),evenwithoutareferencegenome.However,transcriptomeassemblyfrombillionsofRNA-seqreads,whichareoftenveryshort,posesasignificantinformaticschallenge.ThisReviewsummarizestherecentdevelopmentsintr
4、anscriptomeassemblyapproaches—reference-based,de novoandcombinedstrategies—alongwithsomeperspectivesontranscriptomeassemblyinthenearfuture.Identifyingthefullsetoftranscripts—includinglargeReconstructingacomprehensivetranscriptomefromRNAsequencing(RNA-seq).Anexperimentalandsmal
5、lRNAs,noveltranscriptsfromunannotatedshortreadshasmanyinformaticschallenges.Similarprotocolthatusesnext-genes,splicingisoformsandgene-fusiontranscripts—toshort-readgenomeassembly,transcriptomeassemblygenerationsequencingservesasthefoundationforacomprehensivestudyofinvolvespiec
6、ingtogethershort,low-qualityreads.Typicaltechnologiestosequencethetranscriptome.Foralongtime,ourknowledgeNGSdatasetsareverylarge(severalgigabasestotera-theRNAmoleculeswithinaofthetranscriptomewaslargelyderivedfromgenebases),whichrequirescomputingsystemstohavelargebiologicalsam
7、pleinanefforttodeterminetheprimarypredictionsandlimitedESTevidenceandhasthere-memoriesand/ormanycorestorunparallelalgorithms.sequenceandrelativeforebeenpartialandbiased.Recently,however,whole-Severalshort-readassemblershavebeendevelopedtoabundanceofeachRNA.transcriptomesequenc
8、ingusingnext-generationtacklethesechallenges6–9,includingVelvet6,ABYSS7andsequencing(NGS)technologies,orRNAsequencingALLPATHS8.Althoughthesetoolshaveachievedreason-Sequencingdepth(RNA-seq),hasstartedtorevealthecomplexlandscapeablesuccessintheassemblyofgenomes9,10,theycannotThe
9、averagenumberofreadsrepresentingagivennucleotideanddynamicsof