A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.HTSeq is released as an open-source software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.
Description
HTSeq--a Python framework to work with high-throughput sequencing data. - PubMed - NCBI
%0 Journal Article
%1 Anders:2015:Bioinformatics:25260700
%A Anders, Simon
%A Pyl, Paul Theodor
%A Huber, Wolfgang
%D 2015
%J Bioinformatics
%K EMBL GDC MUSTREAD bioinformatics fulltext python rna-seq rpkm sequencing software
%N 2
%P 166-169
%R 10.1093/bioinformatics/btu638
%T HTSeq - a Python framework to work with high-throughput sequencing data.
%U https://www.ncbi.nlm.nih.gov/pubmed/25260700
%V 31
%X A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.HTSeq is released as an open-source software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.
@article{Anders:2015:Bioinformatics:25260700,
abstract = {A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.HTSeq is released as an open-source software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.},
added-at = {2017-02-18T19:58:28.000+0100},
author = {Anders, Simon and Pyl, Paul Theodor and Huber, Wolfgang},
biburl = {https://www.bibsonomy.org/bibtex/207834b135758e7b4269c8491b10ce70b/marcsaric},
description = {HTSeq--a Python framework to work with high-throughput sequencing data. - PubMed - NCBI},
doi = {10.1093/bioinformatics/btu638},
interhash = {201177813081081c9e8aeebe2afbf203},
intrahash = {07834b135758e7b4269c8491b10ce70b},
journal = {Bioinformatics},
keywords = {EMBL GDC MUSTREAD bioinformatics fulltext python rna-seq rpkm sequencing software},
month = jan,
number = 2,
pages = {166-169},
pmid = {25260700},
timestamp = {2019-05-05T11:34:11.000+0200},
title = {HTSeq - a Python framework to work with high-throughput sequencing data.},
url = {https://www.ncbi.nlm.nih.gov/pubmed/25260700},
volume = 31,
year = 2015
}