Background
A central challenge to understanding the ecological and biogeochemical roles of microorganisms in natural and human engineered ecosystems is the reconstruction of metabolic interaction networks from environmental sequence information. The dominant paradigm in metabolic reconstruction is to assign functional annotations using BLAST. Functional annotations are then projected onto symbolic representations of metabolism in the form of KEGG pathways or SEED subsystems.
Results
Here we present MetaPathways, an open source pipeline for pathway inference that uses the PathoLogic algorithm to map functional annotations onto the MetaCyc collection of reactions and pathways, and construct environmental Pathway/Genome Databases (ePGDBs) compatible with the editing and navigation features of Pathway Tools. The pipeline accepts assembled or unassembled nucleotide sequences, performs quality assessment and control, predicts and annotates noncoding genes and open reading frames, and produces inputs to PathoLogic. In addition to constructing ePGDBs, MetaPathways uses MLTreeMap to build phylogenetic trees for selected taxonomic anchor and functional gene markers, converts General Feature Format (GFF) files into concatenated GenBank files for ePGDB construction based on third-party annotations, and generates useful file formats including Sequin files for direct GenBank submission and gene feature tables summarizing annotations, MLTreeMap trees, and ePGDB pathway coverage summaries for statistical comparisons.
Conclusions
MetaPathways provides users with a modular annotation and analysis pipeline for predicting metabolic interaction networks from environmental sequence information using an alternative to KEGG pathways and SEED subsystems mapping. It is extensible to genomic and transcriptomic datasets from a wide range of sequencing platforms, and generates useful data products for microbial community structure and function analysis. The MetaPathways software package, installation instructions, and example data can be obtained from http://hallam.microbiology.ubc.ca/MetaPathways.
%0 Journal Article
%1 Konwar2013MetaPathways
%A Konwar, KishoriM
%A Hanson, NielsW
%A Pagé, AntoineP
%A Hallam, StevenJ
%B BMC Bioinformatics
%D 2013
%I BioMed Central
%J BMC Bioinformatics
%K pathway-database tool
%N 1
%P 1--10
%R 10.1186/1471-2105-14-202
%T MetaPathways: a modular pipeline for constructing pathway/genome databases from environmental sequence information
%U http://dx.doi.org/10.1186/1471-2105-14-202
%V 14
%X Background
A central challenge to understanding the ecological and biogeochemical roles of microorganisms in natural and human engineered ecosystems is the reconstruction of metabolic interaction networks from environmental sequence information. The dominant paradigm in metabolic reconstruction is to assign functional annotations using BLAST. Functional annotations are then projected onto symbolic representations of metabolism in the form of KEGG pathways or SEED subsystems.
Results
Here we present MetaPathways, an open source pipeline for pathway inference that uses the PathoLogic algorithm to map functional annotations onto the MetaCyc collection of reactions and pathways, and construct environmental Pathway/Genome Databases (ePGDBs) compatible with the editing and navigation features of Pathway Tools. The pipeline accepts assembled or unassembled nucleotide sequences, performs quality assessment and control, predicts and annotates noncoding genes and open reading frames, and produces inputs to PathoLogic. In addition to constructing ePGDBs, MetaPathways uses MLTreeMap to build phylogenetic trees for selected taxonomic anchor and functional gene markers, converts General Feature Format (GFF) files into concatenated GenBank files for ePGDB construction based on third-party annotations, and generates useful file formats including Sequin files for direct GenBank submission and gene feature tables summarizing annotations, MLTreeMap trees, and ePGDB pathway coverage summaries for statistical comparisons.
Conclusions
MetaPathways provides users with a modular annotation and analysis pipeline for predicting metabolic interaction networks from environmental sequence information using an alternative to KEGG pathways and SEED subsystems mapping. It is extensible to genomic and transcriptomic datasets from a wide range of sequencing platforms, and generates useful data products for microbial community structure and function analysis. The MetaPathways software package, installation instructions, and example data can be obtained from http://hallam.microbiology.ubc.ca/MetaPathways.
@article{Konwar2013MetaPathways,
abstract = {Background
A central challenge to understanding the ecological and biogeochemical roles of microorganisms in natural and human engineered ecosystems is the reconstruction of metabolic interaction networks from environmental sequence information. The dominant paradigm in metabolic reconstruction is to assign functional annotations using {BLAST}. Functional annotations are then projected onto symbolic representations of metabolism in the form of {KEGG} pathways or {SEED} subsystems.
Results
Here we present {MetaPathways}, an open source pipeline for pathway inference that uses the {PathoLogic} algorithm to map functional annotations onto the {MetaCyc} collection of reactions and pathways, and construct environmental {Pathway/Genome} Databases ({ePGDBs}) compatible with the editing and navigation features of Pathway Tools. The pipeline accepts assembled or unassembled nucleotide sequences, performs quality assessment and control, predicts and annotates noncoding genes and open reading frames, and produces inputs to {PathoLogic}. In addition to constructing {ePGDBs}, {MetaPathways} uses {MLTreeMap} to build phylogenetic trees for selected taxonomic anchor and functional gene markers, converts General Feature Format ({GFF}) files into concatenated {GenBank} files for {ePGDB} construction based on third-party annotations, and generates useful file formats including Sequin files for direct {GenBank} submission and gene feature tables summarizing annotations, {MLTreeMap} trees, and {ePGDB} pathway coverage summaries for statistical comparisons.
Conclusions
{MetaPathways} provides users with a modular annotation and analysis pipeline for predicting metabolic interaction networks from environmental sequence information using an alternative to {KEGG} pathways and {SEED} subsystems mapping. It is extensible to genomic and transcriptomic datasets from a wide range of sequencing platforms, and generates useful data products for microbial community structure and function analysis. The {MetaPathways} software package, installation instructions, and example data can be obtained from {http://hallam.microbiology.ubc.ca/MetaPathways}.},
added-at = {2018-12-02T16:09:07.000+0100},
author = {Konwar, KishoriM and Hanson, NielsW and Pag\'{e}, AntoineP and Hallam, StevenJ},
biburl = {https://www.bibsonomy.org/bibtex/27fc599db89138119dee00e260060537d/karthikraman},
booktitle = {BMC Bioinformatics},
citeulike-article-id = {12442847},
citeulike-linkout-0 = {http://dx.doi.org/10.1186/1471-2105-14-202},
citeulike-linkout-1 = {http://view.ncbi.nlm.nih.gov/pubmed/23800136},
citeulike-linkout-2 = {http://www.hubmed.org/display.cgi?uids=23800136},
citeulike-linkout-3 = {http://link.springer.com/article/10.1186/1471-2105-14-202},
day = 21,
doi = {10.1186/1471-2105-14-202},
interhash = {71f3a6e1d5b2d0fd4ff969c591570f10},
intrahash = {7fc599db89138119dee00e260060537d},
issn = {1471-2105},
journal = {BMC Bioinformatics},
keywords = {pathway-database tool},
month = jun,
number = 1,
pages = {1--10},
pmid = {23800136},
posted-at = {2013-09-02 09:49:28},
priority = {2},
publisher = {BioMed Central},
timestamp = {2018-12-02T16:09:07.000+0100},
title = {{MetaPathways}: a modular pipeline for constructing pathway/genome databases from environmental sequence information},
url = {http://dx.doi.org/10.1186/1471-2105-14-202},
volume = 14,
year = 2013
}