Geoseq: a tool for dissecting deep-sequencing datasets
Gurtowski J, Cancio A, Shah H, Levovitz C, George A, Homann R, Sachidanandam R (2010)
BMC Bioinformatics 11(1): 506.
Zeitschriftenaufsatz
| Veröffentlicht | Englisch
Download
Autor*in
Gurtowski, James;
Cancio, Anthony;
Shah, Hardik;
Levovitz, Chaya;
George, Ajish;
Homann, RobertUniBi;
Sachidanandam, Ravi
Einrichtung
Abstract / Bemerkung
Background
Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (ddbj). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest.
Results
Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment.
Conclusions
Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a) identify differential isoform expression in mRNA-seq datasets, b) identify miRNAs (microRNAs) in libraries, and identify mature and star sequences in miRNAS and c) to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.
Erscheinungsjahr
2010
Zeitschriftentitel
BMC Bioinformatics
Band
11
Ausgabe
1
Art.-Nr.
506
ISSN
1471-2105
Page URI
https://pub.uni-bielefeld.de/record/2316624
Zitieren
Gurtowski J, Cancio A, Shah H, et al. Geoseq: a tool for dissecting deep-sequencing datasets. BMC Bioinformatics. 2010;11(1): 506.
Gurtowski, J., Cancio, A., Shah, H., Levovitz, C., George, A., Homann, R., & Sachidanandam, R. (2010). Geoseq: a tool for dissecting deep-sequencing datasets. BMC Bioinformatics, 11(1), 506. https://doi.org/10.1186/1471-2105-11-506
Gurtowski, James, Cancio, Anthony, Shah, Hardik, Levovitz, Chaya, George, Ajish, Homann, Robert, and Sachidanandam, Ravi. 2010. “Geoseq: a tool for dissecting deep-sequencing datasets”. BMC Bioinformatics 11 (1): 506.
Gurtowski, J., Cancio, A., Shah, H., Levovitz, C., George, A., Homann, R., and Sachidanandam, R. (2010). Geoseq: a tool for dissecting deep-sequencing datasets. BMC Bioinformatics 11:506.
Gurtowski, J., et al., 2010. Geoseq: a tool for dissecting deep-sequencing datasets. BMC Bioinformatics, 11(1): 506.
J. Gurtowski, et al., “Geoseq: a tool for dissecting deep-sequencing datasets”, BMC Bioinformatics, vol. 11, 2010, : 506.
Gurtowski, J., Cancio, A., Shah, H., Levovitz, C., George, A., Homann, R., Sachidanandam, R.: Geoseq: a tool for dissecting deep-sequencing datasets. BMC Bioinformatics. 11, : 506 (2010).
Gurtowski, James, Cancio, Anthony, Shah, Hardik, Levovitz, Chaya, George, Ajish, Homann, Robert, and Sachidanandam, Ravi. “Geoseq: a tool for dissecting deep-sequencing datasets”. BMC Bioinformatics 11.1 (2010): 506.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Name
Access Level
Open Access
Zuletzt Hochgeladen
2019-09-06T08:57:54Z
MD5 Prüfsumme
02a72b3fd0f5f33c9f9d1a2f9363e0c0
Daten bereitgestellt von European Bioinformatics Institute (EBI)
3 Zitationen in Europe PMC
Daten bereitgestellt von Europe PubMed Central.
MPI depletion enhances O-GlcNAcylation of p53 and suppresses the Warburg effect.
Shtraizent N, DeRossi C, Nayar S, Sachidanandam R, Katz LS, Prince A, Koh AP, Vincek A, Hadas Y, Hoshida Y, Scott DK, Eliyahu E, Freeze HH, Sadler KC, Chu J., Elife 6(), 2017
PMID: 28644127
Shtraizent N, DeRossi C, Nayar S, Sachidanandam R, Katz LS, Prince A, Koh AP, Vincek A, Hadas Y, Hoshida Y, Scott DK, Eliyahu E, Freeze HH, Sadler KC, Chu J., Elife 6(), 2017
PMID: 28644127
A computational approach for identifying microRNA-target interactions using high-throughput CLIP and PAR-CLIP sequencing.
Chou CH, Lin FM, Chou MT, Hsu SD, Chang TH, Weng SL, Shrestha S, Hsiao CC, Hung JH, Huang HD., BMC Genomics 14 Suppl 1(), 2013
PMID: 23368412
Chou CH, Lin FM, Chou MT, Hsu SD, Chang TH, Weng SL, Shrestha S, Hsiao CC, Hung JH, Huang HD., BMC Genomics 14 Suppl 1(), 2013
PMID: 23368412
MiST: a new approach to variant detection in deep sequencing datasets.
Subramanian S, Di Pierro V, Shah H, Jayaprakash AD, Weisberger I, Shim J, George A, Gelb BD, Sachidanandam R., Nucleic Acids Res 41(16), 2013
PMID: 23828039
Subramanian S, Di Pierro V, Shah H, Jayaprakash AD, Weisberger I, Shim J, George A, Gelb BD, Sachidanandam R., Nucleic Acids Res 41(16), 2013
PMID: 23828039
15 References
Daten bereitgestellt von Europe PubMed Central.
BLAST+: architecture and applications.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL., BMC Bioinformatics 10(), 2009
PMID: 20003500
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL., BMC Bioinformatics 10(), 2009
PMID: 20003500
deepBase: a database for deeply annotating and mining deep sequencing data.
Yang JH, Shao P, Zhou H, Chen YQ, Qu LH., Nucleic Acids Res. 38(Database issue), 2009
PMID: 19966272
Yang JH, Shao P, Zhou H, Chen YQ, Qu LH., Nucleic Acids Res. 38(Database issue), 2009
PMID: 19966272
Galaxy: a platform for interactive large-scale genome analysis.
Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, Zhang Y, Blankenberg D, Albert I, Taylor J, Miller W, Kent WJ, Nekrutenko A., Genome Res. 15(10), 2005
PMID: 16169926
Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, Zhang Y, Blankenberg D, Albert I, Taylor J, Miller W, Kent WJ, Nekrutenko A., Genome Res. 15(10), 2005
PMID: 16169926
Lightweight genome viewer: portable software for browsing genomics data in its chromosomal context.
Faith JJ, Olson AJ, Gardner TS, Sachidanandam R., BMC Bioinformatics 8(), 2007
PMID: 17877794
Faith JJ, Olson AJ, Gardner TS, Sachidanandam R., BMC Bioinformatics 8(), 2007
PMID: 17877794
Analysis of large-scale sequencing of small RNAs.
Olson AJ, Brennecke J, Aravin AA, Hannon GJ, Sachidanandam R., Pac Symp Biocomput (), 2008
PMID: 18229681
Olson AJ, Brennecke J, Aravin AA, Hannon GJ, Sachidanandam R., Pac Symp Biocomput (), 2008
PMID: 18229681
miRBase: tools for microRNA genomics.
Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ., Nucleic Acids Res. 36(Database issue), 2007
PMID: 17991681
Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ., Nucleic Acids Res. 36(Database issue), 2007
PMID: 17991681
mkESA: enhanced suffix array construction tool.
Homann R, Fleer D, Giegerich R, Rehmsmeier M., Bioinformatics 25(8), 2009
PMID: 19246510
Homann R, Fleer D, Giegerich R, Rehmsmeier M., Bioinformatics 25(8), 2009
PMID: 19246510
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.
Langmead B, Trapnell C, Pop M, Salzberg SL., Genome Biol. 10(3), 2009
PMID: 19261174
Langmead B, Trapnell C, Pop M, Salzberg SL., Genome Biol. 10(3), 2009
PMID: 19261174
Fast and accurate short read alignment with Burrows-Wheeler transform.
Li H, Durbin R., Bioinformatics 25(14), 2009
PMID: 19451168
Li H, Durbin R., Bioinformatics 25(14), 2009
PMID: 19451168
SOAP2: an improved ultrafast tool for short read alignment.
Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J., Bioinformatics 25(15), 2009
PMID: 19497933
Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J., Bioinformatics 25(15), 2009
PMID: 19497933
Computation for ChIP-seq and RNA-seq studies.
Pepke S, Wold B, Mortazavi A., Nat. Methods 6(11 Suppl), 2009
PMID: 19844228
Pepke S, Wold B, Mortazavi A., Nat. Methods 6(11 Suppl), 2009
PMID: 19844228
Archiving next generation sequencing data.
Shumway M, Cochrane G, Sugawara H., Nucleic Acids Res. 38(Database issue), 2009
PMID: 19965774
Shumway M, Cochrane G, Sugawara H., Nucleic Acids Res. 38(Database issue), 2009
PMID: 19965774
Export
Markieren/ Markierung löschen
Markierte Publikationen
Quellen
PMID: 20939882
PubMed | Europe PMC
Suchen in