TopHat: discovering splice junctions with RNA-Seq.

C. Trapnell, L. Pachter, and S. Salzberg.
Bioinformatics, 25 (9): 1105--1111 (May 2009)
DOI: 10.1093/bioinformatics/btp120

Abstract

A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or 'reads', can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20,000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. TopHat is free, open-source software available from http://tophat.cbcb.umd.edu. Supplementary data are available at Bioinformatics online.

BibTeX key: Trapnell2009TopHatdiscoveringsplice
entry type: article
year: 2009
month: May
institution: Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA. cole@cs.umd.edu
journal: Bioinformatics
number: 9
pages: 1105--1111
volume: 25
medline-pst: ppublish
pii: btp120
pmid: 19289445
file: :Trapnell2009TopHat\:discoveringsplice.pdf:PDF
owner: gwo
language: eng
DOI: 10.1093/bioinformatics/btp120
url: http://dx.doi.org/10.1093/bioinformatics/btp120

Cite this publication

@article{Trapnell2009TopHatdiscoveringsplice,
  abstract = {A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or 'reads', can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72\% of the splice junctions reported by the annotation-based software from that study, along with nearly 20,000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. TopHat is free, open-source software available from http://tophat.cbcb.umd.edu. Supplementary data are available at Bioinformatics online.},
  added-at = {2014-05-13T15:48:44.000+0200},
  author = {Trapnell, Cole and Pachter, Lior and Salzberg, Steven L},
  biburl = {https://www.bibsonomy.org/bibtex/2f3cd88eb80c939a4c4c01b9b2d961057/gwotto},
  doi = {10.1093/bioinformatics/btp120},
  file = {:Trapnell2009TopHat\:discoveringsplice.pdf:PDF},
  institution = {Center for Bioinformatics and Computational Biology{\,} University of Maryland{\,} College Park{\,} MD 20742{\,} USA. cole@cs.umd.edu},
  interhash = {f94ee4f12ae15c59f561aed00a7e0ed1},
  intrahash = {f3cd88eb80c939a4c4c01b9b2d961057},
  journal = {Bioinformatics},
  keywords = {Algorithms; Alignment; Analysis, Expression Gene Genetic; Messenger; Models, Profiling, RNA RNA, RNA; Sequence Software Splicing, genetics; methods;},
  language = {eng},
  medline-pst = {ppublish},
  month = May,
  number = 9,
  owner = {gwo},
  pages = {1105--1111},
  pii = {btp120},
  pmid = {19289445},
  timestamp = {2014-05-13T15:48:44.000+0200},
  title = {TopHat: discovering splice junctions with RNA-Seq.},
  url = {http://dx.doi.org/10.1093/bioinformatics/btp120},
  volume = 25,
  year = 2009
}
