Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species

Bräutigam A, Mullick T, Schliesky S, Weber APM (2011)
Journal of Experimental Botany 62(9): 3093-3102.

Zeitschriftenaufsatz | Veröffentlicht | Englisch
OA 481.83 KB
Bräutigam, AndreaUniBi ; Mullick, Thomas; Schliesky, Simon; Weber, Andreas P. M.
Abstract / Bemerkung
Next-generation sequencing enables the study of species without a sequenced genome at the 'omics' level. Custom transcriptome databases are generated and global expression profiles can be compared. However, the assembly of transcriptome sequence reads into contigs remains a daunting task. In this study, five different assembly programs, both traditional overlap-based, 'read-centric' assemblers and de Bruijn graph data structure-based assemblers, were compared. To this end, artificial read libraries with and without simulated sequencing errors were constructed from Arabidopsis thaliana, based on quantitative profiles of mature leaf tissue. The open source TGICL pipeline and the commercial CLC bio genomics workbench produced the best assemblies in terms of contig length, hybrid assemblies, redundancy reduction, and error tolerance. The mature leaf transcriptomes of the C-3 species Cleome spinosa and the C-4 species Cleome gynandra were assembled and analysed. The pathways and cellular processes tagged in the transcriptome assemblies reflect processes of a mature leaf. The databases are useful for extracting transcripts related to C-4 processes as full-length or nearly full-length sequences.
Assembly; C-4; next-generation sequencing; transcriptome
Journal of Experimental Botany
Page URI


Bräutigam A, Mullick T, Schliesky S, Weber APM. Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany. 2011;62(9):3093-3102.
Bräutigam, A., Mullick, T., Schliesky, S., & Weber, A. P. M. (2011). Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany, 62(9), 3093-3102. doi:10.1093/jxb/err029
Bräutigam, Andrea, Mullick, Thomas, Schliesky, Simon, and Weber, Andreas P. M. 2011. “Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species”. Journal of Experimental Botany 62 (9): 3093-3102.
Bräutigam, A., Mullick, T., Schliesky, S., and Weber, A. P. M. (2011). Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany 62, 3093-3102.
Bräutigam, A., et al., 2011. Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany, 62(9), p 3093-3102.
A. Bräutigam, et al., “Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species”, Journal of Experimental Botany, vol. 62, 2011, pp. 3093-3102.
Bräutigam, A., Mullick, T., Schliesky, S., Weber, A.P.M.: Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany. 62, 3093-3102 (2011).
Bräutigam, Andrea, Mullick, Thomas, Schliesky, Simon, and Weber, Andreas P. M. “Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species”. Journal of Experimental Botany 62.9 (2011): 3093-3102.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Access Level
OA Open Access
Zuletzt Hochgeladen
MD5 Prüfsumme

43 Zitationen in Europe PMC

Daten bereitgestellt von Europe PubMed Central.

Transcriptome analysis and codominant markers development in caper, a drought tolerant orphan crop with medicinal value.
Mercati F, Fontana I, Gristina AS, Martorana A, El Nagar M, De Michele R, Fici S, Carimi F., Sci Rep 9(1), 2019
PMID: 31320697
Comparative Transcriptome and iTRAQ Proteome Analyses Reveal the Mechanisms of Diapause in Aphidius gifuensis Ashmead (Hymenoptera: Aphidiidae).
Zhang HZ, Li YY, An T, Huang FX, Wang MQ, Liu CX, Mao JJ, Zhang LS., Front Physiol 9(), 2018
PMID: 30555341
Issues with RNA-seq analysis in non-model organisms: A salmonid example.
Sundaram A, Tengs T, Grimholt U., Dev Comp Immunol 75(), 2017
PMID: 28223254
De novo transcriptomic analysis and development of EST-SSRs for Sorbus pohuashanensis (Hance) Hedl.
Liu C, Dou Y, Guan X, Fu Q, Zhang Z, Hu Z, Zheng J, Lu Y, Li W., PLoS One 12(6), 2017
PMID: 28614366
Elevated auxin biosynthesis and transport underlie high vein density in C4 leaves.
Huang CF, Yu CP, Wu YH, Lu MJ, Tu SL, Wu SH, Shiu SH, Ku MSB, Li WH., Proc Natl Acad Sci U S A 114(33), 2017
PMID: 28761000
Transcriptome Profile of the Asian Giant Hornet (Vespa mandarinia) Using Illumina HiSeq 4000 Sequencing: De Novo Assembly, Functional Annotation, and Discovery of SSR Markers.
Patnaik BB, Park SY, Kang SW, Hwang HJ, Wang TH, Park EB, Chung JM, Song DK, Kim C, Kim S, Lee JB, Jeong HC, Park HS, Han YS, Lee YS., Int J Genomics 2016(), 2016
PMID: 26881195
RNA-seq-based evaluation of bicolor tepal pigmentation in Asiatic hybrid lilies (Lilium spp.).
Suzuki K, Suzuki T, Nakatsuka T, Dohra H, Yamagishi M, Matsuyama K, Matsuura H., BMC Genomics 17(1), 2016
PMID: 27516339
Functional insights into the testis transcriptome of the edible sea urchin Loxechinus albus.
Gaitán-Espitia JD, Sánchez R, Bruning P, Cárdenas L., Sci Rep 6(), 2016
PMID: 27805042
Comparative transcriptional profile of the fish parasite Cryptocaryon irritans.
Mo ZQ, Li YW, Wang HQ, Wang JL, Ni LY, Yang M, Lao GF, Luo XC, Li AX, Dan XM., Parasit Vectors 9(1), 2016
PMID: 27923398
A de novo floral transcriptome reveals clues into Phalaenopsis orchid flower development.
Huang JZ, Lin CP, Cheng TC, Chang BC, Cheng SY, Chen YW, Lee CY, Chin SW, Chen FC., PLoS One 10(5), 2015
PMID: 25970572
Global Reprogramming of Transcription in Chinese Fir (Cunninghamia lanceolata) during Progressive Drought Stress and after Rewatering.
Hu R, Wu B, Zheng H, Hu D, Wang X, Duan H, Sun Y, Wang J, Zhang Y, Li Y., Int J Mol Sci 16(7), 2015
PMID: 26154763
Discovering New Biology through Sequencing of RNA.
Weber AP., Plant Physiol 169(3), 2015
PMID: 26353759
Transcriptome Analysis of Syringa oblata Lindl. Inflorescence Identifies Genes Associated with Pigment Biosynthesis and Scent Metabolism.
Zheng J, Hu Z, Guan X, Dou D, Bai G, Wang Y, Guo Y, Li W, Leng P., PLoS One 10(11), 2015
PMID: 26587670
Comparative Transcriptomic Analyses of Vegetable and Grain Pea (Pisum sativum L.) Seed Development.
Liu N, Zhang G, Xu S, Mao W, Hu Q, Gong Y., Front Plant Sci 6(), 2015
PMID: 26635856
Physiological and transcriptional analyses of developmental stages along sugarcane leaf.
Mattiello L, Riaño-Pachón DM, Martins MC, da Cruz LP, Bassi D, Marchiori PE, Ribeiro RV, Labate MT, Labate CA, Menossi M., BMC Plant Biol 15(), 2015
PMID: 26714767
454 pyrosequencing-based analysis of gene expression profiles in the amphipod Melita plumulosa: transcriptome assembly and toxicant induced changes.
Hook SE, Twine NA, Simpson SL, Spadaro DA, Moncuquet P, Wilkins MR., Aquat Toxicol 153(), 2014
PMID: 24434169
Transcriptome analysis of Houttuynia cordata Thunb. by Illumina paired-end RNA sequencing and SSR marker discovery.
Wei L, Li S, Liu S, He A, Wang D, Wang J, Tang Y, Wu X., PLoS One 9(1), 2014
PMID: 24392108
RNA sequencing read depth requirement for optimal transcriptome coverage in Hevea brasiliensis.
Chow KS, Ghazali AK, Hoh CC, Mohd-Zainuddin Z., BMC Res Notes 7(), 2014
PMID: 24484543
Azolla domestication towards a biobased economy?
Brouwer P, Bräutigam A, Külahoglu C, Tazelaar AO, Kurz S, Nierop KG, van der Werf A, Weber AP, Schluepmann H., New Phytol 202(3), 2014
PMID: 24494738
Comparative studies of C3 and C4 Atriplex hybrids in the genomics era: physiological assessments.
Oakley JC, Sultmanis S, Stinson CR, Sage TL, Sage RF., J Exp Bot 65(13), 2014
PMID: 24675672
Evolution of the Phosphoenolpyruvate Carboxylase Protein Kinase Family in C3 and C4 Flaveria spp.
Aldous SH, Weise SE, Sharkey TD, Waldera-Lupa DM, Stühler K, Mallmann J, Groth G, Gowik U, Westhoff P, Arsova B., Plant Physiol 165(3), 2014
PMID: 24850859
Transcriptome profiling of pyrethroid resistant and susceptible mosquitoes in the malaria vector, Anopheles sinensis.
Zhu G, Zhong D, Cao J, Zhou H, Li J, Liu Y, Bai L, Xu S, Wang MH, Zhou G, Chang X, Gao Q, Yan G., BMC Genomics 15(), 2014
PMID: 24909924
Next generation sequencing and de novo transcriptomics to study gene evolution.
Jayasena AS, Secco D, Bernath-Levin K, Berkowitz O, Whelan J, Mylne JS., Plant Methods 10(1), 2014
PMID: 25364374
Biochemical and molecular changes associated with heteroxylan biosynthesis in Neolamarckia cadamba (Rubiaceae) during xylogenesis.
Zhao X, Ouyang K, Gan S, Zeng W, Song L, Zhao S, Li J, Doblin MS, Bacic A, Chen XY, Marchant A, Deng X, Wu AM., Front Plant Sci 5(), 2014
PMID: 25426124
RNA-seq analysis of Quercus pubescens Leaves: de novo transcriptome assembly, annotation and functional markers development.
Torre S, Tattini M, Brunetti C, Fineschi S, Fini A, Ferrini F, Sebastiani F., PLoS One 9(11), 2014
PMID: 25393112
Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing.
Zhang J, Ruhlman TA, Mower JP, Jansen RK., BMC Plant Biol 13(), 2013
PMID: 24373163
Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq.
Feng C, Chen M, Xu CJ, Bai L, Yin XR, Li X, Allan AC, Ferguson IB, Chen KS., BMC Genomics 13(), 2012
PMID: 22244270
SNP markers retrieval for a non-model species: a practical approach.
Shahin A, van Gurp T, Peters SA, Visser RG, van Tuyl JM, Arens P., BMC Res Notes 5(), 2012
PMID: 22284269
Next-generation sequencing-based transcriptomic and proteomic analysis of the common reed, Phragmites australis (Poaceae), reveals genes involved in invasiveness and rhizome specificity
He R, Kim MJ, Nelson W, Balbuena TS, Kim R, Kramer R, Crow JA, May GD, Thelen JJ, Soderlund CA, Gang DR., Am J Bot 99(2), 2012
PMID: IND44687691
CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.
Li P, Ji G, Dong M, Schmidt E, Lenox D, Chen L, Liu Q, Liu L, Zhang J, Liang C., Bioinformatics 28(18), 2012
PMID: 22789590
The protein composition of the digestive fluid from the venus flytrap sheds light on prey digestion mechanisms.
Schulze WX, Sanggaard KW, Kreuzer I, Knudsen AD, Bemm F, Thøgersen IB, Bräutigam A, Thomsen LR, Schliesky S, Dyrlund TF, Escalante-Perez M, Becker D, Schultz J, Karring H, Weber A, Højrup P, Hedrich R, Enghild JJ., Mol Cell Proteomics 11(11), 2012
PMID: 22891002
RNA-Seq Assembly - Are We There Yet?
Schliesky S, Gowik U, Weber AP, Bräutigam A., Front Plant Sci 3(), 2012
PMID: 23056003
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.
Shahin A, van Kaauwen M, Esselink D, Bargsten JW, van Tuyl JM, Visser RG, Arens P., BMC Genomics 13(), 2012
PMID: 23167289
Exploiting the engine of C(4) photosynthesis.
Sage RF, Zhu XG., J Exp Bot 62(9), 2011
PMID: 21652533

38 References

Daten bereitgestellt von Europe PubMed Central.

Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development.
Alagna F, D'Agostino N, Torchia L, Servili M, Rao R, Pietrella M, Giuliano G, Chiusano ML, Baldoni L, Perrotta G., BMC Genomics 10(), 2009
PMID: 19709400
Comparison of the transcriptomes of American chestnut (Castanea dentata) and Chinese chestnut (Castanea mollissima) in response to the chestnut blight infection.
Barakat A, DiLoreto DS, Zhang Y, Smith C, Baier K, Powell WA, Wheeler N, Sederoff R, Carlson JE., BMC Plant Biol. 9(), 2009
PMID: 19426529
An mRNA blueprint for C4 photosynthesis derived from comparative transcriptomics of closely related C3 and C4 species.
Brautigam A, Kajala K, Wullenweber J, Sommer M, Gagneul D, Weber KL, Carr KM, Gowik U, Mass J, Lercher MJ, Westhoff P, Hibberd JM, Weber AP., Plant Physiol. 155(1), 2010
PMID: 20543093
The future of C4 research--maize, Flaveria or Cleome?
Brown NJ, Parsley K, Hibberd JM., Trends Plant Sci. 10(5), 2005
PMID: 15882653
Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs.
Chevreux B, Pfisterer T, Drescher B, Driesel AJ, Muller WE, Wetter T, Suhai S., Genome Res. 14(6), 2004
PMID: 15140833
MIRA: an automated genome and EST assembler. PhD Thesis
Chevreux, 2006
Shedding light on an extremophile lifestyle through transcriptomics.
Dassanayake M, Haas JS, Bohnert HJ, Cheeseman JM., New Phytol. 183(3), 2009
PMID: 19549131
Sense from sequence reads: methods for alignment and assembly.
Flicek P, Birney E., Nat. Methods 6(11 Suppl), 2009
PMID: 19844229
C-4 photosynthesis—a unique blend of modified biochemistry, anatomy and ultrastructure
Hatch, Biochimica et Biophysica Acta 895(), 1987
CAP3: A DNA sequence assembly program.
Huang X, Madan A., Genome Res. 9(9), 1999
PMID: 10508846
Comparing de novo assemblers for 454 transcriptome data.
Kumar S, Blaxter ML., BMC Genomics 11(), 2010
PMID: 20950480
Consequences of C4 differentiation for chloroplast membrane proteomes in maize mesophyll and bundle sheath cells.
Majeran W, Zybailov B, Ytterberg AJ, Dunsmore J, Sun Q, van Wijk KJ., Mol. Cell Proteomics 7(9), 2008
PMID: 18453340
Cleome, a genus closely related to Arabidopsis, contains species spanning a developmental progression from C(3) to C(4) photosynthesis.
Marshall DM, Muhaidat R, Brown NJ, Liu Z, Stanley S, Griffiths H, Sage RF, Hibberd JM., Plant J. 51(5), 2007
PMID: 17692080
Differential biogenesis of photosystem II in mesophyll and bundle-sheath cells of monocotyledonous NADP-malic enzyme-type C-4 plants—the nonstoichiometric abundance of the subunits of photosystem-II in the bundle-sheath chloroplasts and the translational activity of the plastome-encoded genes
Meierhoff, Planta 191(), 1993
Sequencing technologies - the next generation.
Metzker ML., Nat. Rev. Genet. 11(1), 2009
PMID: 19997069
Diversity of Kranz anatomy and biochemistry in C₄ eudicots
Muhaidat R, Sage RF, Dengler NG., Am. J. Bot. 94(3), 2007
PMID: IND43890999
Agrobacterium tumefaciens-mediated transformation of Cleome gynandra L., a C(4) dicotyledon that is closely related to Arabidopsis thaliana.
Newell CA, Brown NJ, Liu Z, Pflug A, Gowik U, Westhoff P, Hibberd JM., J. Exp. Bot. 61(5), 2010
PMID: 20150516
High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome.
Novaes E, Drost DR, Farmerie WG, Pappas GJ Jr, Grattapaglia D, Sederoff RR, Kirst M., BMC Genomics 9(), 2008
PMID: 18590545
Mapping accuracy of short reads from massively parallel sequencing and the implications for quantitative expression profiling
Palmieri, PLoS ONE 4(), 2009
The Sorghum bicolor genome and the diversification of grasses.
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang H, Wang X, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang L, Carpita NC, Freeling M, Gingle AR, Hash CT, Keller B, Klein P, Kresovich S, McCann MC, Ming R, Peterson DG, Mehboob-ur-Rahman , Ware D, Westhoff P, Mayer KF, Messing J, Rokhsar DS., Nature 457(7229), 2009
PMID: 19189423
TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.
Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J., Bioinformatics 19(5), 2003
PMID: 12651724
The evolution of C4 photosynthesis.
Sage RF., New Phytol. 161(2), 2004
PMID: IND43668189
A gene expression map of Arabidopsis thaliana development.
Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Scholkopf B, Weigel D, Lohmann JU., Nat. Genet. 37(5), 2005
PMID: 15806101
The B73 maize genome: complexity, diversity, and dynamics.
Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du F, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, Chen W, Yan L, Higginbotham J, Cardenas M, Waligorski J, Applebaum E, Phelps L, Falcone J, Kanchi K, Thane T, Scimone A, Thane N, Henke J, Wang T, Ruppert J, Shah N, Rotter K, Hodges J, Ingenthron E, Cordes M, Kohlberg S, Sgro J, Delgado B, Mead K, Chinwalla A, Leonard S, Crouse K, Collura K, Kudrna D, Currie J, He R, Angelova A, Rajasekar S, Mueller T, Lomeli R, Scara G, Ko A, Delaney K, Wissotski M, Lopez G, Campos D, Braidotti M, Ashley E, Golser W, Kim H, Lee S, Lin J, Dujmic Z, Kim W, Talag J, Zuccolo A, Fan C, Sebastian A, Kramer M, Spiegel L, Nascimento L, Zutavern T, Miller B, Ambroise C, Muller S, Spooner W, Narechania A, Ren L, Wei S, Kumari S, Faga B, Levy MJ, McMahan L, Van Buren P, Vaughn MW, Ying K, Yeh CT, Emrich SJ, Jia Y, Kalyanaraman A, Hsia AP, Barbazuk WB, Baucom RS, Brutnell TP, Carpita NC, Chaparro C, Chia JM, Deragon JM, Estill JC, Fu Y, Jeddeloh JA, Han Y, Lee H, Li P, Lisch DR, Liu S, Liu Z, Nagel DH, McCann MC, SanMiguel P, Myers AM, Nettleton D, Nguyen J, Penning BW, Ponnala L, Schneider KL, Schwartz DC, Sharma A, Soderlund C, Springer NM, Sun Q, Wang H, Waterman M, Westerman R, Wolfgruber TK, Yang L, Yu Y, Zhang L, Zhou S, Zhu Q, Bennetzen JL, Dawe RK, Jiang J, Jiang N, Presting GG, Wessler SR, Aluru S, Martienssen RA, Clifton SW, McCombie WR, Wing RA, Wilson RK., Science 326(5956), 2009
PMID: 19965430
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.
Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E., Nucleic Acids Res. 36(Database issue), 2007
PMID: 17986450
Role of indigenous leafy vegetables in combating hunger and malnutrition
van, South African Journal of Botany 70(), 2004
Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing.
Weber AP, Weber KL, Carr K, Wilkerson C, Ohlrogge JB., Plant Physiol. 144(1), 2007
PMID: 17351049
Velvet: algorithms for de novo short read assembly using de Bruijn graphs.
Zerbino DR, Birney E., Genome Res. 18(5), 2008
PMID: 18349386

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Web of Science

Dieser Datensatz im Web of Science®

PMID: 21398430
PubMed | Europe PMC

Suchen in

Google Scholar