Inbook,

Interpreting Random Forest Classification Models Using a Feature Contribution Method

A. Palczewska, J. Palczewski, R. Marchese Robinson, and D. Neagu.
Pages 193--218. Springer International Publishing, Cham (2014)
DOI: 10.1007/978-3-319-04717-1_9

Abstract

Model interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the availability of model parameters and their statistical significance. For "black box" models, such as random forest, this information is hidden inside the model structure. This work presents an approach for computing feature contributions for random forest classification models. It allows for the determination of the influence of each variable on the model prediction for an individual instance. By analysing feature contributions for a training dataset, the most significant variables can be determined and their typical contribution towards predictions made for individual classes, i.e., class-specific feature contribution "patterns", are discovered. These patterns represent a standard behaviour of the model and allow for an additional assessment of the model reliability for new data. Interpretation of feature contributions for two UCI benchmark datasets shows the potential of the proposed methodology. The robustness of results is demonstrated through an extensive analysis of feature contributions calculated for a large number of generated random forest models.

BibTeX key: palczewska2014interpreting
entry type: inbook
address: Cham
booktitle: Integration of Reusable Systems
year: 2014
pages: 193--218
publisher: Springer International Publishing
isbn: 978-3-319-04717-1
DOI: 10.1007/978-3-319-04717-1_9
url: https://doi.org/10.1007/978-3-319-04717-1_9

Cite this publication

@inbook{palczewska2014interpreting,
  abstract = {Model interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the availability of model parameters and their statistical significance. For ``black box'' models, such as random forest, this information is hidden inside the model structure. This work presents an approach for computing feature contributions for random forest classification models. It allows for the determination of the influence of each variable on the model prediction for an individual instance. By analysing feature contributions for a training dataset, the most significant variables can be determined and their typical contribution towards predictions made for individual classes, i.e., class-specific feature contribution ``patterns'', are discovered. These patterns represent a standard behaviour of the model and allow for an additional assessment of the model reliability for new data. Interpretation of feature contributions for two UCI benchmark datasets shows the potential of the proposed methodology. The robustness of results is demonstrated through an extensive analysis of feature contributions calculated for a large number of generated random forest models.},
  added-at = {2022-08-26T17:03:23.000+0200},
  address = {Cham},
  author = {Palczewska, Anna and Palczewski, Jan and Marchese Robinson, Richard and Neagu, Daniel},
  biburl = {https://www.bibsonomy.org/bibtex/2ef2c416f6f16041d9ee7f70a07d5aac6/msteininger},
  booktitle = {Integration of Reusable Systems},
  description = {Interpreting Random Forest Classification Models Using a Feature Contribution Method | SpringerLink},
  doi = {10.1007/978-3-319-04717-1_9},
  editor = {Bouabana-Tebibel, Thouraya and Rubin, Stuart H.},
  interhash = {80d7d71cb4236854e4eaa50114884f70},
  intrahash = {ef2c416f6f16041d9ee7f70a07d5aac6},
  isbn = {978-3-319-04717-1},
  keywords = {lursurvey openlur},
  pages = {193--218},
  publisher = {Springer International Publishing},
  timestamp = {2022-08-26T17:03:23.000+0200},
  title = {Interpreting Random Forest Classification Models Using a Feature Contribution Method},
  url = {https://doi.org/10.1007/978-3-319-04717-1_9},
  year = 2014
}
