An overview of the wcd EST clustering tool

Hazelhurst S, Hide W, Lipták Z, Nogueira R, Starfield R (2008)
BIOINFORMATICS 24(13): 1542-1546.

Journal Article | Original Article | Published | English

The wcd system is an open-source tool for clustering expressed sequence tags (ESTs) and other DNA and RNA sequences. wcd allows efficient all-versus-all comparison of ESTs using either the d(2) distance function or edit distance, improving on existing implementations of d(2). It supports merging, refinement and reclustering of clusters. It is drop-in compatible with the StackPack clustering package. wcd supports parallelization under both shared memory and cluster architectures. It is distributed with an EMBOSS wrapper, allowing wcd to be installed as part of an EMBOSS installation (and so provided by a web server).
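
As a rough illustration of the idea behind d(2)-based clustering, the sketch below computes a word-count distance between two sequences, takes the minimum over all pairs of fixed-length windows, and then links any two sequences whose distance falls at or below a threshold (single linkage via union-find). This is not wcd's implementation: the function names, the word length `k`, the window size, and the threshold are all assumptions chosen for readability, and the naive all-windows loop makes no attempt at the efficiency or parallelization the abstract describes.

```python
from collections import Counter
from itertools import combinations


def word_counts(s, k):
    """Count every overlapping word of length k in s."""
    return Counter(s[i:i + k] for i in range(len(s) - k + 1))


def d2(u, v, k):
    """Sum of squared differences in k-word counts between u and v."""
    cu, cv = word_counts(u, k), word_counts(v, k)
    return sum((cu[w] - cv[w]) ** 2 for w in cu.keys() | cv.keys())


def windows(s, size):
    """All substrings of the given size (the whole string if it is shorter)."""
    if len(s) <= size:
        return [s]
    return [s[i:i + size] for i in range(len(s) - size + 1)]


def d2_windowed(x, y, k=3, window=8):
    """Minimum d2 score over every pair of windows (naive, quadratic)."""
    return min(d2(u, v, k) for u in windows(x, window) for v in windows(y, window))


def cluster(seqs, threshold, k=3, window=8):
    """Single-linkage clustering: link sequences whose distance <= threshold."""
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All-versus-all comparison, skipping pairs already in the same cluster.
    for i, j in combinations(range(len(seqs)), 2):
        if find(i) != find(j) and d2_windowed(seqs[i], seqs[j], k, window) <= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical toy input: the first two sequences share an 8-base window.
seqs = ["ACGTACGTAC", "GTACGTACGT", "TTTTTTTTTT"]
clusters = cluster(seqs, threshold=5)
```

Skipping pairs that already belong to the same cluster is one simple way to avoid redundant comparisons in an all-versus-all scan; the toy parameters here are far smaller than anything suited to real EST data.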
Publishing Year: 2008

Cite this

Hazelhurst S, Hide W, Lipták Z, Nogueira R, Starfield R. An overview of the wcd EST clustering tool. BIOINFORMATICS. 2008;24(13):1542-1546.
Hazelhurst, S., Hide, W., Lipták, Z., Nogueira, R., & Starfield, R. (2008). An overview of the wcd EST clustering tool. BIOINFORMATICS, 24(13), 1542-1546. doi:10.1093/bioinformatics/btn203
Hazelhurst, S., Hide, W., Lipták, Z., Nogueira, R., and Starfield, R. (2008). An overview of the wcd EST clustering tool. BIOINFORMATICS 24, 1542-1546.
Hazelhurst, S., et al., 2008. An overview of the wcd EST clustering tool. BIOINFORMATICS, 24(13), p 1542-1546.
S. Hazelhurst, et al., “An overview of the wcd EST clustering tool”, BIOINFORMATICS, vol. 24, 2008, pp. 1542-1546.
Hazelhurst, S., Hide, W., Lipták, Z., Nogueira, R., Starfield, R.: An overview of the wcd EST clustering tool. BIOINFORMATICS. 24, 1542-1546 (2008).
Hazelhurst, Scott, Hide, Winston, Lipták, Zsuzsanna, Nogueira, Ramon, and Starfield, Richard. “An overview of the wcd EST clustering tool”. BIOINFORMATICS 24.13 (2008): 1542-1546.

18 Citations in Europe PMC

Data provided by Europe PubMed Central.

Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms.
Mbandi SK, Hesse U, van Heusden P, Christoffels A., BMC Bioinformatics 16(), 2015
PMID: 25880035
EasyCluster2: an improved tool for clustering and assembling long transcriptome reads.
Bevilacqua V, Pietroleonardo N, Giannino E, Stroppa F, Simone D, Pesole G, Picardi E., BMC Bioinformatics 15 Suppl 15(), 2014
PMID: 25474441
Development of EST-based SNP and InDel markers and their utilization in tetraploid cotton genetic mapping.
Li X, Gao W, Guo H, Zhang X, Fang DD, Lin Z., BMC Genomics 15(), 2014
PMID: 25442170
Construction of a public CHO cell line transcript database using versatile bioinformatics analysis pipelines.
Rupp O, Becker J, Brinkrolf K, Timmermann C, Borth N, Puhler A, Noll T, Goesmann A., PLoS ONE 9(1), 2014
PMID: 24427317
A de novo assembly of the newt transcriptome combined with proteomic validation identifies new protein families expressed during tissue regeneration.
Looso M, Preussner J, Sousounis K, Bruckskotten M, Michel CS, Lignelli E, Reinhardt R, Hoffner S, Kruger M, Tsonis PA, Borchardt T, Braun T., Genome Biol. 14(2), 2013
PMID: 23425577
Analysis of the leaf transcriptome of Musa acuminata during interaction with Mycosphaerella musicola: gene assembly, annotation and marker development.
Passos MA, de Cruz VO, Emediato FL, de Teixeira CC, Azevedo VC, Brasileiro AC, Amorim EP, Ferreira CF, Martins NF, Togawa RC, Junior GJ, da Silva OB Jr, Miller RN., BMC Genomics 14(), 2013
PMID: 23379821
Evolution of saxitoxin synthesis in cyanobacteria and dinoflagellates.
Hackett JD, Wisecaver JH, Brosnahan ML, Kulis DM, Anderson DM, Bhattacharya D, Plumley FG, Erdner DL., Mol. Biol. Evol. 30(1), 2013
PMID: 22628533
KABOOM! A new suffix array based algorithm for clustering expression data.
Hazelhurst S, Liptak Z., Bioinformatics 27(24), 2011
PMID: 21984769
SEED: efficient clustering of next-generation sequences.
Bao E, Jiang T, Kaloshian I, Girke T., Bioinformatics 27(18), 2011
PMID: 21810899
Revealing impaired pathways in the an11 mutant by high-throughput characterization of Petunia axillaris and Petunia inflata transcriptomes.
Zenoni S, D'Agostino N, Tornielli GB, Quattrocchio F, Chiusano ML, Koes R, Zethof J, Guzzo F, Delledonne M, Frusciante L, Gerats T, Pezzotti M., Plant J. 68(1), 2011
PMID: 21623977
High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species.
Grattapaglia D, Silva-Junior OB, Kirst M, de Lima BM, Faria DA, Pappas GJ Jr., BMC Plant Biol. 11(), 2011
PMID: 21492434
Clustering algorithms in biomedical research: a review.
Xu R, Wunsch DC 2nd., IEEE Rev Biomed Eng 3(), 2010
PMID: 22275205
PEACE: Parallel Environment for Assembly and Clustering of Gene Expression.
Rao DM, Moler JC, Ozden M, Zhang Y, Liang C, Karro JE., Nucleic Acids Res. 38(Web Server issue), 2010
PMID: 20522511
SolEST database: a "one-stop shop" approach to the study of Solanaceae transcriptomes.
D'Agostino N, Traini A, Frusciante L, Chiusano ML., BMC Plant Biol. 9(), 2009
PMID: 19948013
EasyCluster: a fast and efficient gene-oriented clustering tool for large-scale transcriptome data.
Picardi E, Mignone F, Pesole G., BMC Bioinformatics 10 Suppl 6(), 2009
PMID: 19534735
PMID: 18480101