Article,

Fast and accurate genomic analyses using genome graphs

G. Rakocevic, V. Semenyuk, W. Lee, J. Spencer, J. Browning, I. Johnson, V. Arsenijevic, J. Nadj, K. Ghose, M. Suciu, S. Ji, G. Demir, L. Li, B. Toptaş, A. Dolgoborodov, B. Pollex, I. Spulber, I. Glotova, P. Kómár, A. Stachyra, Y. Li, M. Popovic, M. Källberg, A. Jain, and D. Kural.
Nature Genetics, 51 (2): 354--362 (2019)
DOI: 10.1038/s41588-018-0316-4

Abstract

The human reference genome serves as the foundation for genomics by providing a scaffold for alignment of sequencing reads, but currently only reflects a single consensus haplotype, thus impairing analysis accuracy. Here we present a graph reference genome implementation that enables read alignment across 2,800 diploid genomes encompassing 12.6 million SNPs and 4.0 million insertions and deletions (indels). The pipeline processes one whole-genome sequencing sample in 6.5 h using a system with 36 CPU cores. We show that using a graph genome reference improves read mapping sensitivity and produces a 0.5% increase in variant calling recall, with unaffected specificity. Structural variations incorporated into a graph genome can be genotyped accurately under a unified framework. Finally, we show that iterative augmentation of graph genomes yields incremental gains in variant calling accuracy. Our implementation is an important advance toward fulfilling the promise of graph genomes to radically enhance the scalability and accuracy of genomic analyses.

BibTeX key: rakocevic2019accurate
entry type: article
year: 2019
journal: Nature Genetics
number: 2
pages: 354--362
volume: 51
issn: 15461718
refid: Rakocevic2019
DOI: 10.1038/s41588-018-0316-4
url: https://doi.org/10.1038/s41588-018-0316-4

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{rakocevic2019accurate, abstract = {The human reference genome serves as the foundation for genomics by providing a scaffold for alignment of sequencing reads, but currently only reflects a single consensus haplotype, thus impairing analysis accuracy. Here we present a graph reference genome implementation that enables read alignment across 2,800 diploid genomes encompassing 12.6 million SNPs and 4.0 million insertions and deletions (indels). The pipeline processes one whole-genome sequencing sample in 6.5 h using a system with 36 CPU cores. We show that using a graph genome reference improves read mapping sensitivity and produces a 0.5% increase in variant calling recall, with unaffected specificity. Structural variations incorporated into a graph genome can be genotyped accurately under a unified framework. Finally, we show that iterative augmentation of graph genomes yields incremental gains in variant calling accuracy. Our implementation is an important advance toward fulfilling the promise of graph genomes to radically enhance the scalability and accuracy of genomic analyses.}, added-at = {2020-06-29T22:11:44.000+0200}, author = {Rakocevic, Goran and Semenyuk, Vladimir and Lee, Wan-Ping and Spencer, James and Browning, John and Johnson, Ivan J. and Arsenijevic, Vladan and Nadj, Jelena and Ghose, Kaushik and Suciu, Maria C. and Ji, Sun-Gou and Demir, Gülfem and Li, Lizao and Toptaş, Berke Ç. and Dolgoborodov, Alexey and Pollex, Björn and Spulber, Iosif and Glotova, Irina and Kómár, Péter and Stachyra, Andrew L. and Li, Yilong and Popovic, Milos and Källberg, Morten and Jain, Amit and Kural, Deniz}, biburl = {https://www.bibsonomy.org/bibtex/22e69e8c44713a3c8a18a5d7be690c726/peter.ralph}, doi = {10.1038/s41588-018-0316-4}, interhash = {031f784b8f6469048d440bebdae5f885}, intrahash = {2e69e8c44713a3c8a18a5d7be690c726}, issn = {15461718}, journal = {Nature Genetics}, keywords = {data_structures genomic_data graph_genome methods}, number = 2, pages = {354--362}, refid = {Rakocevic2019}, timestamp = {2020-06-29T22:11:44.000+0200}, title = {Fast and accurate genomic analyses using genome graphs}, url = {https://doi.org/10.1038/s41588-018-0316-4}, volume = 51, year = 2019 }

BibSonomy

Fast and accurate genomic analyses using genome graphs

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on