Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework

S. Lewis, A. Csordas, S. Killcoyne, H. Hermjakob, M. Hoopmann, R. Moritz, E. Deutsch, and J. Boyle. BMC Bioinformatics, 13 (1): 324 (2012)
DOI: 10.1186/1471-2105-13-324

Abstract

BACKGROUND: For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed.

RESULTS: We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed.

CONCLUSION: The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources.

@legaultdenis's tags highlighted

proteomics

Cite this publication

@article{23216909,
  abstract = {BACKGROUND: For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. RESULTS: We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. CONCLUSION: The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources.},
  added-at = {2013-04-30T22:59:09.000+0200},
  author = {Lewis, Steven and Csordas, Attila and Killcoyne, Sarah and Hermjakob, Henning and Hoopmann, Michael and Moritz, Robert and Deutsch, Eric and Boyle, John},
  biburl = {https://www.bibsonomy.org/bibtex/256047065e713cfabad52ebac8149ff3a/legaultdenis},
  doi = {10.1186/1471-2105-13-324},
  interhash = {150e0b9dfa9aa300a2390e9ca312d5fd},
  intrahash = {56047065e713cfabad52ebac8149ff3a},
  issn = {1471-2105},
  journal = {BMC Bioinformatics},
  keywords = {proteomics},
  number = 1,
  pages = 324,
  pubmedid = {23216909},
  timestamp = {2013-04-30T22:59:09.000+0200},
  title = {Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework},
  url = {http://www.biomedcentral.com/1471-2105/13/324},
  volume = 13,
  year = 2012
}

BibSonomy
