Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species

Bräutigam A, Mullick T, Schliesky S, Weber APM (2011)
Journal of Experimental Botany 62(9): 3093-3102.

Journal Article | Original Article | Published | English
Author
Bräutigam, Andrea; Mullick, Thomas; Schliesky, Simon; Weber, Andreas P. M.
Abstract
Next-generation sequencing enables the study of species without a sequenced genome at the 'omics' level. Custom transcriptome databases are generated and global expression profiles can be compared. However, the assembly of transcriptome sequence reads into contigs remains a daunting task. In this study, five different assembly programs, both traditional overlap-based, 'read-centric' assemblers and de Bruijn graph data structure-based assemblers, were compared. To this end, artificial read libraries with and without simulated sequencing errors were constructed from Arabidopsis thaliana, based on quantitative profiles of mature leaf tissue. The open source TGICL pipeline and the commercial CLC bio genomics workbench produced the best assemblies in terms of contig length, hybrid assemblies, redundancy reduction, and error tolerance. The mature leaf transcriptomes of the C-3 species Cleome spinosa and the C-4 species Cleome gynandra were assembled and analysed. The pathways and cellular processes tagged in the transcriptome assemblies reflect processes of a mature leaf. The databases are useful for extracting transcripts related to C-4 processes as full-length or nearly full-length sequences.
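The artificial read libraries mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `simulate_reads`, its parameters, the uniform substitution error model, and the toy sequences in the usage note are all assumptions made for illustration. Reads are sampled from transcripts in proportion to an abundance value, and each base is optionally substituted at a fixed error rate.

```python
import random


def simulate_reads(transcripts, abundances, n_reads, read_len,
                   error_rate=0.0, seed=0):
    """Sample fixed-length reads from transcripts, weighted by abundance.

    transcripts: dict mapping transcript name -> nucleotide sequence
    abundances:  dict mapping transcript name -> non-negative weight
    error_rate:  per-base substitution probability (assumed error model)
    Returns a list of (source_name, read_sequence) tuples.
    """
    rng = random.Random(seed)

    # Keep only transcripts that are expressed and long enough for one read.
    names = [n for n, seq in transcripts.items()
             if len(seq) >= read_len and abundances.get(n, 0) > 0]
    if not names:
        raise ValueError("no transcript is long enough and expressed")
    weights = [abundances[n] for n in names]

    reads = []
    for _ in range(n_reads):
        # Pick a source transcript in proportion to its abundance.
        name = rng.choices(names, weights=weights, k=1)[0]
        seq = transcripts[name]
        # Pick a uniformly random start position for the read.
        start = rng.randrange(len(seq) - read_len + 1)
        read = list(seq[start:start + read_len])
        # Introduce substitution errors independently at each base.
        for i, base in enumerate(read):
            if rng.random() < error_rate:
                read[i] = rng.choice([b for b in "ACGT" if b != base])
        reads.append((name, "".join(read)))
    return reads
```

Calling `simulate_reads({"t1": "ACGTACGTACGTACGTACGT"}, {"t1": 1}, n_reads=5, read_len=8)` gives an error-free library in which every read is an exact substring of its source transcript, while setting `error_rate` above zero gives the corresponding library with errors. Weighting by abundance alone, without accounting for transcript length, is a simplification chosen here for brevity.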
Publishing Year
2011

Cite this

Bräutigam A, Mullick T, Schliesky S, Weber APM. Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany. 2011;62(9):3093-3102.
Bräutigam, A., Mullick, T., Schliesky, S., & Weber, A. P. M. (2011). Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany, 62(9), 3093-3102. doi:10.1093/jxb/err029
Bräutigam, A., Mullick, T., Schliesky, S., and Weber, A. P. M. (2011). Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany 62, 3093-3102.
Bräutigam, A., et al., 2011. Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany, 62(9), p 3093-3102.
A. Bräutigam, et al., “Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species”, Journal of Experimental Botany, vol. 62, 2011, pp. 3093-3102.
Bräutigam, A., Mullick, T., Schliesky, S., Weber, A.P.M.: Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species. Journal of Experimental Botany. 62, 3093-3102 (2011).
Bräutigam, Andrea, Mullick, Thomas, Schliesky, Simon, and Weber, Andreas P. M. “Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C-3 and C-4 species”. Journal of Experimental Botany 62.9 (2011): 3093-3102.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
Access Level
OA Open Access
Last Uploaded
2017-12-19T09:15:45Z


37 Citations in Europe PMC

Data provided by Europe PubMed Central.

Elevated auxin biosynthesis and transport underlie high vein density in C4 leaves.
Huang CF, Yu CP, Wu YH, Lu MJ, Tu SL, Wu SH, Shiu SH, Ku MSB, Li WH., Proc. Natl. Acad. Sci. U.S.A. 114(33), 2017
PMID: 28761000
De novo transcriptomic analysis and development of EST-SSRs for Sorbus pohuashanensis (Hance) Hedl.
Liu C, Dou Y, Guan X, Fu Q, Zhang Z, Hu Z, Zheng J, Lu Y, Li W., PLoS ONE 12(6), 2017
PMID: 28614366
Issues with RNA-seq analysis in non-model organisms: A salmonid example.
Sundaram A, Tengs T, Grimholt U., Dev. Comp. Immunol. 75, 2017
PMID: 28223254
Functional insights into the testis transcriptome of the edible sea urchin Loxechinus albus.
Gaitan-Espitia JD, Sanchez R, Bruning P, Cardenas L., Sci Rep 6, 2016
PMID: 27805042
RNA-seq-based evaluation of bicolor tepal pigmentation in Asiatic hybrid lilies (Lilium spp.).
Suzuki K, Suzuki T, Nakatsuka T, Dohra H, Yamagishi M, Matsuyama K, Matsuura H., BMC Genomics 17(1), 2016
PMID: 27516339
Comparative Transcriptomic Analyses of Vegetable and Grain Pea (Pisum sativum L.) Seed Development.
Liu N, Zhang G, Xu S, Mao W, Hu Q, Gong Y., Front Plant Sci 6, 2015
PMID: 26635856
Global Reprogramming of Transcription in Chinese Fir (Cunninghamia lanceolata) during Progressive Drought Stress and after Rewatering.
Hu R, Wu B, Zheng H, Hu D, Wang X, Duan H, Sun Y, Wang J, Zhang Y, Li Y., Int J Mol Sci 16(7), 2015
PMID: 26154763
A de novo floral transcriptome reveals clues into Phalaenopsis orchid flower development.
Huang JZ, Lin CP, Cheng TC, Chang BC, Cheng SY, Chen YW, Lee CY, Chin SW, Chen FC., PLoS ONE 10(5), 2015
PMID: 25970572


Sources

PMID: 21398430