Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets
S. Purushotham, C. Meng, Z. Che, и Y. Liu. (2017)cite arxiv:1710.08531Comment: Submitted to Journal of Biomedical Informatics (JBI). First two authors have equal contributions.
Аннотация
Deep learning models (aka Deep Neural Networks) have revolutionized many
fields including computer vision, natural language processing, speech
recognition, and is being increasingly used in clinical healthcare
applications. However, few works exist which have benchmarked the performance
of the deep learning models with respect to the state-of-the-art machine
learning models and prognostic scoring systems on publicly available healthcare
datasets. In this paper, we present the benchmarking results for several
clinical prediction tasks such as mortality prediction, length of stay
prediction, and ICD-9 code group prediction using Deep Learning models,
ensemble of machine learning models (Super Learner algorithm), SAPS II and SOFA
scores. We used the Medical Information Mart for Intensive Care III (MIMIC-III)
(v1.4) publicly available dataset, which includes all patients admitted to an
ICU at the Beth Israel Deaconess Medical Center from 2001 to 2012, for the
benchmarking tasks. Our results show that deep learning models consistently
outperform all the other approaches especially when the `raw' clinical time
series data is used as input features to the models.
%0 Generic
%1 purushotham2017benchmark
%A Purushotham, Sanjay
%A Meng, Chuizheng
%A Che, Zhengping
%A Liu, Yan
%D 2017
%K daniel toread
%T Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets
%U http://arxiv.org/abs/1710.08531
%X Deep learning models (aka Deep Neural Networks) have revolutionized many
fields including computer vision, natural language processing, speech
recognition, and is being increasingly used in clinical healthcare
applications. However, few works exist which have benchmarked the performance
of the deep learning models with respect to the state-of-the-art machine
learning models and prognostic scoring systems on publicly available healthcare
datasets. In this paper, we present the benchmarking results for several
clinical prediction tasks such as mortality prediction, length of stay
prediction, and ICD-9 code group prediction using Deep Learning models,
ensemble of machine learning models (Super Learner algorithm), SAPS II and SOFA
scores. We used the Medical Information Mart for Intensive Care III (MIMIC-III)
(v1.4) publicly available dataset, which includes all patients admitted to an
ICU at the Beth Israel Deaconess Medical Center from 2001 to 2012, for the
benchmarking tasks. Our results show that deep learning models consistently
outperform all the other approaches especially when the `raw' clinical time
series data is used as input features to the models.
@misc{purushotham2017benchmark,
abstract = {Deep learning models (aka Deep Neural Networks) have revolutionized many
fields including computer vision, natural language processing, speech
recognition, and is being increasingly used in clinical healthcare
applications. However, few works exist which have benchmarked the performance
of the deep learning models with respect to the state-of-the-art machine
learning models and prognostic scoring systems on publicly available healthcare
datasets. In this paper, we present the benchmarking results for several
clinical prediction tasks such as mortality prediction, length of stay
prediction, and ICD-9 code group prediction using Deep Learning models,
ensemble of machine learning models (Super Learner algorithm), SAPS II and SOFA
scores. We used the Medical Information Mart for Intensive Care III (MIMIC-III)
(v1.4) publicly available dataset, which includes all patients admitted to an
ICU at the Beth Israel Deaconess Medical Center from 2001 to 2012, for the
benchmarking tasks. Our results show that deep learning models consistently
outperform all the other approaches especially when the `raw' clinical time
series data is used as input features to the models.},
added-at = {2019-04-17T16:49:30.000+0200},
author = {Purushotham, Sanjay and Meng, Chuizheng and Che, Zhengping and Liu, Yan},
biburl = {https://www.bibsonomy.org/bibtex/22adc177d7cd152bf32900e1173e9e468/hotho},
interhash = {a5762d5ae4117f0530e28c2fd554bd33},
intrahash = {2adc177d7cd152bf32900e1173e9e468},
keywords = {daniel toread},
note = {cite arxiv:1710.08531Comment: Submitted to Journal of Biomedical Informatics (JBI). First two authors have equal contributions},
timestamp = {2019-04-17T16:49:30.000+0200},
title = {Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets},
url = {http://arxiv.org/abs/1710.08531},
year = 2017
}