An improved random forest-based rule extraction method for breast cancer diagnosis
Wang S, Wang Y, Wang D, Yin Y, Wang Y, Jin Y (2020)
Applied Soft Computing 86: 105941.
Journal article
| Published | English
Authors
Wang, Sutong;
Wang, Yuyan;
Wang, Dujuan;
Yin, Yunqiang;
Wang, Yanzhang;
Jin, Yaochu (UniBi)
Abstract / Note
Breast cancer has become the main cause of death in women around the world. Effective treatment requires a diagnostic method that is both accurate and interpretable. Many ensemble methods, such as Random Forest, are now widely applied to breast cancer diagnosis and achieve high accuracy. However, they are black-box methods that cannot explain the reasons behind a diagnosis. To overcome this limitation, an improved Random Forest (RF)-based rule extraction (IRFRE) method is developed to derive accurate and interpretable classification rules from a decision tree ensemble for breast cancer diagnosis. First, a large number of decision trees are constructed using Random Forest to generate an abundant pool of decision rules. Next, a rule extraction approach is devised to separate the decision rules from the trained trees. Finally, an improved multi-objective evolutionary algorithm (MOEA) is employed to search for an optimal rule predictor whose constituent rule set offers the best trade-off between accuracy and interpretability. The developed method is evaluated on three breast cancer datasets: the Wisconsin Diagnostic Breast Cancer (WDBC) dataset, the Wisconsin Original Breast Cancer (WOBC) dataset, and the Surveillance, Epidemiology and End Results (SEER) breast cancer dataset. The experimental results demonstrate that the developed method explains the black-box methods well and outperforms several popular single algorithms, ensemble learning methods, and rule extraction methods in terms of accuracy and interpretability. Moreover, the proposed method can be extended to the diagnosis of other cancers in practice, offering an option for a more interpretable and more accurate cancer diagnosis process.
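The three stages named in the abstract (build trees, extract rules, select a rule subset that trades accuracy against interpretability) can be illustrated with a minimal sketch. This is not the authors' IRFRE implementation. Everything in it is an assumption made for illustration: the two hand-written trees stand in for a trained Random Forest, the feature names, thresholds, and five samples are invented, interpretability is approximated by the total number of rule conditions, and the improved MOEA is replaced by exhaustive enumeration of rule subsets, which is only feasible because the toy rule pool is tiny.

```python
from itertools import combinations

# Hypothetical toy "forest": an internal node is (feature, threshold, left, right),
# a leaf is a class label (0 = benign, 1 = malignant). Values are invented.
TREES = [
    ("radius", 15.0, 0, ("texture", 20.0, 0, 1)),
    ("texture", 18.0, 0, 1),
]

# Hypothetical labelled samples, not taken from WDBC, WOBC, or SEER.
DATA = [
    ({"radius": 12.0, "texture": 15.0}, 0),
    ({"radius": 14.0, "texture": 22.0}, 0),
    ({"radius": 17.0, "texture": 19.0}, 0),
    ({"radius": 18.0, "texture": 25.0}, 1),
    ({"radius": 20.0, "texture": 23.0}, 1),
]


def extract_rules(node, conditions=()):
    """Turn every root-to-leaf path into a (conditions, label) rule."""
    if not isinstance(node, tuple):  # leaf reached: emit one rule
        return [(conditions, node)]
    feature, threshold, left, right = node
    return (
        extract_rules(left, conditions + ((feature, "<=", threshold),))
        + extract_rules(right, conditions + ((feature, ">", threshold),))
    )


def covers(conditions, sample):
    """True if the sample satisfies every condition of a rule."""
    return all(
        sample[f] <= t if op == "<=" else sample[f] > t
        for f, op, t in conditions
    )


def predict(rules, sample, default=0):
    """Vote among the rules that fire; class 1 needs a strict majority."""
    votes = [label for conditions, label in rules if covers(conditions, sample)]
    if not votes:
        return default  # no rule fires: fall back to the default class
    return int(sum(votes) * 2 > len(votes))


def objectives(rules, data):
    """Two objectives to minimise: error rate and total number of conditions."""
    errors = sum(predict(rules, x) != y for x, y in data)
    n_conditions = sum(len(conditions) for conditions, _ in rules)
    return errors / len(data), n_conditions


def dominates(a, b):
    """a dominates b if it is no worse on every objective and differs from b."""
    return all(x <= y for x, y in zip(a, b)) and a != b


def pareto_front(all_rules, data):
    """Enumerate every non-empty rule subset and keep the non-dominated ones."""
    candidates = [
        (objectives(subset, data), subset)
        for k in range(1, len(all_rules) + 1)
        for subset in combinations(all_rules, k)
    ]
    return [
        (score, subset)
        for score, subset in candidates
        if not any(dominates(other, score) for other, _ in candidates)
    ]


rules = [rule for tree in TREES for rule in extract_rules(tree)]
front = pareto_front(rules, DATA)
for (error, n_conditions), subset in front:
    print(f"error={error:.2f} conditions={n_conditions} rules={len(subset)}")
```

With a realistic forest the number of rule subsets grows exponentially, which is why a search heuristic such as an evolutionary algorithm is used instead of enumeration.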
Year of publication
2020
Journal title
Applied Soft Computing
Volume
86
Article no.
105941
ISSN
1568-4946
Page URI
https://pub.uni-bielefeld.de/record/2978410
Cite
Wang S, Wang Y, Wang D, Yin Y, Wang Y, Jin Y. An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing. 2020;86: 105941.
Wang, S., Wang, Y., Wang, D., Yin, Y., Wang, Y., & Jin, Y. (2020). An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing, 86, 105941. https://doi.org/10.1016/j.asoc.2019.105941
Wang, Sutong, Wang, Yuyan, Wang, Dujuan, Yin, Yunqiang, Wang, Yanzhang, and Jin, Yaochu. 2020. “An improved random forest-based rule extraction method for breast cancer diagnosis”. Applied Soft Computing 86: 105941.
Wang, S., Wang, Y., Wang, D., Yin, Y., Wang, Y., and Jin, Y. (2020). An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing 86:105941.
Wang, S., et al., 2020. An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing, 86: 105941.
S. Wang, et al., “An improved random forest-based rule extraction method for breast cancer diagnosis”, Applied Soft Computing, vol. 86, 2020, Art. no. 105941.
Wang, S., Wang, Y., Wang, D., Yin, Y., Wang, Y., Jin, Y.: An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing. 86, 105941 (2020).
Wang, Sutong, Wang, Yuyan, Wang, Dujuan, Yin, Yunqiang, Wang, Yanzhang, and Jin, Yaochu. “An improved random forest-based rule extraction method for breast cancer diagnosis”. Applied Soft Computing 86 (2020): 105941.