Enhancing Very Fast Decision Trees with Local Split-Time Predictions

Losing V, Wersing H, Hammer B (2018)
In: 2018 IEEE International Conference on Data Mining (ICDM). IEEE: 287-296.

Konferenzbeitrag | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Abstract / Bemerkung
An increasing number of industrial areas recognize the opportunities of Big Data, requiring highly efficient algorithms which enable real-time processing to reduce the burden of data storage and maintenance. Decision trees are extremely fast, highly accurate and easy to use in practice. Merging multiple decision trees to an ensemble leads to one of the most powerful machine learning methods. The Very Fast Decision Tree is the state-of-the-art incremental decision tree induction algorithm, capable of learning from massive data streams. It is successful due to its theoretical guarantees based on the Hoeffding bound as well as its competitive performance in terms of classification accuracy and time / space efficiency. In this paper, we increase the efficiency even further by replacing its global splitting scheme, which periodically tries to split every n min examples. Instead, we utilize local statistics to predict the split-time, thus, avoiding unnecessary split-attempts, usually dominating the computational cost. Concretely, we use the class distributions of previous split-attempts to approximate the minimum number of examples until the Hoeffding bound is met. This cautious approach yields by design a low delay and reduces the number of split-attempts at the same time. We extensively evaluate our method using common stream-learning benchmarks also considering non-stationary environments. The experiments confirm a substantially reduced run-time without a loss in classification performance.
Erscheinungsjahr
2018
Titel des Konferenzbandes
2018 IEEE International Conference on Data Mining (ICDM)
Seite(n)
287-296
Konferenz
2018 IEEE International Conference on Data Mining (ICDM)
Konferenzort
Singapore
eISBN
978-1-5386-9159-5
Page URI
https://pub.uni-bielefeld.de/record/2982088

Zitieren

Losing V, Wersing H, Hammer B. Enhancing Very Fast Decision Trees with Local Split-Time Predictions. In: 2018 IEEE International Conference on Data Mining (ICDM). IEEE; 2018: 287-296.
Losing, V., Wersing, H., & Hammer, B. (2018). Enhancing Very Fast Decision Trees with Local Split-Time Predictions. 2018 IEEE International Conference on Data Mining (ICDM), 287-296. IEEE. https://doi.org/10.1109/ICDM.2018.00044
Losing, Viktor, Wersing, Heiko, and Hammer, Barbara. 2018. “Enhancing Very Fast Decision Trees with Local Split-Time Predictions”. In 2018 IEEE International Conference on Data Mining (ICDM), 287-296. IEEE.
Losing, V., Wersing, H., and Hammer, B. (2018). “Enhancing Very Fast Decision Trees with Local Split-Time Predictions” in 2018 IEEE International Conference on Data Mining (ICDM) (IEEE), 287-296.
Losing, V., Wersing, H., & Hammer, B., 2018. Enhancing Very Fast Decision Trees with Local Split-Time Predictions. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE, pp. 287-296.
V. Losing, H. Wersing, and B. Hammer, “Enhancing Very Fast Decision Trees with Local Split-Time Predictions”, 2018 IEEE International Conference on Data Mining (ICDM), IEEE, 2018, pp.287-296.
Losing, V., Wersing, H., Hammer, B.: Enhancing Very Fast Decision Trees with Local Split-Time Predictions. 2018 IEEE International Conference on Data Mining (ICDM). p. 287-296. IEEE (2018).
Losing, Viktor, Wersing, Heiko, and Hammer, Barbara. “Enhancing Very Fast Decision Trees with Local Split-Time Predictions”. 2018 IEEE International Conference on Data Mining (ICDM). IEEE, 2018. 287-296.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar